Display smol-developer-results (#103)

This commit is contained in:
merwanehamadi
2023-07-14 18:26:17 -07:00
committed by GitHub
parent 7de965ab3f
commit 66fc7ccb31
2 changed files with 19 additions and 4 deletions

View File

@@ -3,7 +3,7 @@
A repo built for the purpose of benchmarking the performance of agents far and wide, regardless of how they are set up and how they work
## Scores:
Spider chart for each agent coming soon !
Radio chart for each agent coming soon !
## Detailed results
:warning: These results are constantly evolving at the moment. We will publish an official benchmark result very soon.
@@ -42,7 +42,7 @@ Interface
| Task | Results |
|-------------|--------------------|
| Write File | :white_check_mark: |
| Read File | :white_check_mark: |
| Read File | :x: |
| Search File | :x: |
Code
@@ -58,4 +58,19 @@ Code
Coming Soon!
### smol-developer
Coming Soon!
Interface
| Task | Results |
|-------------|--------------------|
| Write File | :white_check_mark: |
| Read File | :x: |
| Search File | :x: |
Code
| Task | Results |
|-----------------------------------|----------------------|
| Debug Simple Typo With Guidance | :x: |
| Debug Simple Typo Without Guidance| :x: |
| Basic Code Generation | :white_check_mark: |
| Create Simple Web Server | :x: |