Benchmarks
$ python scripts/benchmark.py
| Benchmark | Ran | Works | Perfect |
|---|---|---|---|
| currency_converter | ❌ | ❌ | ❌ |
| image_resizer | ✅ | ❌ | ❌ |
| pomodoro_timer | ❌ | ❌ | ❌ |
| url_shortener | ❌ | ❌ | ❌ |
| file_explorer | ✅ | ✅ | ✅ |
| markdown_editor | ❌ | ❌ | ❌ |
| timer_app | ✅ | ❌ | ❌ |
| weather_app | ❌ | ❌ | ❌ |
| file_organizer | ✅ | ✅ | ✅ |
| password_generator | ✅ | ✅ | ✅ |
| todo_list | ✅ | ❌ | ❌ |
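The table above can be tallied per column with a few lines of Python. This is only a sketch for reading the results: the `tally` helper is hypothetical and is not part of `scripts/benchmark.py`.

```python
# Hypothetical helper: count the check marks per column of the results table.
TABLE = """\
| Benchmark | Ran | Works | Perfect |
|---|---|---|---|
| currency_converter | ❌ | ❌ | ❌ |
| image_resizer | ✅ | ❌ | ❌ |
| pomodoro_timer | ❌ | ❌ | ❌ |
| url_shortener | ❌ | ❌ | ❌ |
| file_explorer | ✅ | ✅ | ✅ |
| markdown_editor | ❌ | ❌ | ❌ |
| timer_app | ✅ | ❌ | ❌ |
| weather_app | ❌ | ❌ | ❌ |
| file_organizer | ✅ | ✅ | ✅ |
| password_generator | ✅ | ✅ | ✅ |
| todo_list | ✅ | ❌ | ❌ |
"""


def split_row(line):
    """Split one markdown table row into stripped cell values."""
    return [cell.strip() for cell in line.strip().strip("|").split("|")]


def tally(table):
    """Return ({column: number of check marks}, number of benchmark rows)."""
    lines = [line for line in table.splitlines() if line.strip()]
    columns = split_row(lines[0])[1:]  # skip the "Benchmark" name column
    rows = lines[2:]                   # skip the header and separator rows
    counts = {name: 0 for name in columns}
    for line in rows:
        for name, cell in zip(columns, split_row(line)[1:]):
            if cell == "✅":
                counts[name] += 1
    return counts, len(rows)


if __name__ == "__main__":
    counts, total = tally(TABLE)
    for name, passed in counts.items():
        print(f"{name}: {passed}/{total}")
```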
Notes on the errors
timer_app almost works with unit tests config
- failure mode: undefined import/conflicting names
file_explorer works
file_organizer works
image_resizer almost works with unit tests config
- failure mode: undefined import
todo_list runs, but doesn't really work with unit tests config: Uncaught ReferenceError: module is not defined
- failure mode: placeholder text
url_shortener starts but fails with the error: SQLite objects created in a thread can only be used in that same thread. The object was created in thread id 8636125824 and this is thread id 13021003776.
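This error is raised by Python's `sqlite3` module when a connection opened in one thread is used from another. A minimal sketch of one common workaround follows, assuming a single shared connection is acceptable: open it with `check_same_thread=False` and serialize access with a lock. The table and function names are hypothetical and are not taken from the generated url_shortener code.

```python
import sqlite3
import threading

# check_same_thread=False turns off sqlite3's same-thread check, so the
# caller has to serialize access; a lock does that here.
conn = sqlite3.connect(":memory:", check_same_thread=False)
conn.execute("CREATE TABLE urls (short TEXT PRIMARY KEY, target TEXT)")
db_lock = threading.Lock()


def save_url(short, target):
    """Insert a mapping; safe to call from any thread."""
    with db_lock:
        conn.execute("INSERT INTO urls VALUES (?, ?)", (short, target))
        conn.commit()


def lookup_url(short):
    """Return the target for a short code, or None if it is unknown."""
    with db_lock:
        row = conn.execute(
            "SELECT target FROM urls WHERE short = ?", (short,)
        ).fetchone()
    return row[0] if row else None


# Write from a worker thread, as a threaded web server would.
worker = threading.Thread(target=save_url, args=("abc", "https://example.com"))
worker.start()
worker.join()
```

Opening a separate connection per request or per thread is the other usual option; which one fits depends on how the generated app structures its request handling.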
markdown_editor: failing tests, 'WebDriver' object has no attribute 'find_element_by_id'
pomodoro_timer: doesn't run, it only runs tests
currency_converter: backend doesn't return anything
weather_app: only runs tests, no code existed
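On the markdown_editor failure: `find_element_by_id` and the other `find_element_by_*` helpers were removed in Selenium 4, where lookups go through `find_element(By.ID, ...)`. The sketch below shows the replacement call shape. To stay runnable without a browser it uses a hypothetical `FakeDriver` stand-in and the plain string `"id"`, which is the value of `By.ID`.

```python
BY_ID = "id"  # same value as selenium.webdriver.common.by.By.ID


def find_by_id(driver, element_id):
    """Selenium 4 style replacement for driver.find_element_by_id(element_id)."""
    return driver.find_element(BY_ID, element_id)


class FakeDriver:
    """Hypothetical stand-in for a WebDriver so the sketch runs offline."""

    def __init__(self, elements):
        self._elements = elements

    def find_element(self, by, value):
        # A real WebDriver raises NoSuchElementException when nothing
        # matches; this stand-in raises KeyError instead.
        return self._elements[(by, value)]


if __name__ == "__main__":
    driver = FakeDriver({("id", "editor"): "<textarea id='editor'>"})
    print(find_by_id(driver, "editor"))
```

With a real driver, the generated tests would import `By` from `selenium.webdriver.common.by` and call `driver.find_element(By.ID, "editor")` directly.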