Commit Graph

35 Commits

Author SHA1 Message Date
Silen Naihin
4be22ae5ab mini agi attempt 2023-06-26 09:27:20 -04:00
Silen Naihin
7604ae07bb can now put file extensions or names in files data 2023-06-25 19:30:04 -04:00
Silen Naihin
adc6b225a6 update regression tests info 2023-06-25 11:12:33 -04:00
Silen Naihin
31c1192719 other was non solution, solution is pytest-depends 2023-06-25 08:48:16 -04:00
Silen Naihin
d1c5e0a91a finally figured out right way to do dependencies 2023-06-25 00:22:53 -04:00
Silen Naihin
f895d54e02 more elegant marking & dependency solution 2023-06-24 14:42:35 -04:00
Silen Naihin
4fa9f72083 adding dependencies on other challenges 2023-06-24 12:24:17 -04:00
Silen Naihin
66c9e68b04 file creation from within file before server :) 2023-06-24 12:15:53 -04:00
Silen Naihin
a5073ab577 basic challenges, more ChallengeData structure 2023-06-24 09:42:36 -04:00
Silen Naihin
b6562f3420 Update README.md 2023-06-23 09:31:21 -04:00
Silen Naihin
ffd1d15a0e MockManager, mock_func in data.json (#39) 2023-06-23 07:53:57 -04:00
Silen Naihin
15c5469bb1 Add automatic regression markers (#38) 2023-06-22 08:18:22 -04:00
Silen Naihin
e5974ca3ea Delete file_to_check.txt 2023-06-21 11:44:59 -04:00
Silen Naihin
b7deb984f7 start click, fixtures, types, challenge creation, mock run -stable (#37) 2023-06-21 11:43:18 -04:00
Silen Naihin
04536e92a5 Merge pull request #34 from Significant-Gravitas/dsl 2023-06-20 18:32:58 -04:00
Silen Naihin
1eb278f3cc Update README.md 2023-06-19 09:53:30 -04:00
scarletpan
f37981c388 init first challenge template 2023-06-19 12:39:34 +00:00
Silen Naihin
51f2295971 init agbenchmark 2023-06-18 11:14:54 -04:00
Douglas Schonholtz
dfb73204bf Update readme to suggest people check out challenges 2023-05-05 16:33:39 -04:00
Douglas Schonholtz
04722e7fc5 EvalNames with dates for the eval run filename and compatibility with 0.3.0 (#26)
* EvalNames with dates and the eval run

* Ignore .idea files, update readme to use 3.10, updates for 0.3.0
2023-05-03 10:14:44 -04:00
Douglas Schonholtz
b8c7c05dd5 windows docs make workspace if not there (#25)
* windows docs make workspace if not there

* small fixes
2023-04-22 19:17:28 -04:00
Media
ef5c4f8a11 Graphs for evals (#20)
* Update README.md

* Jupyter Notebook for evaluating eval results

---------

Co-authored-by: Douglas Schonholtz <15002691+dschonholtz@users.noreply.github.com>
2023-04-20 19:04:34 -04:00
Douglas Schonholtz
011ed2f2b9 Update README.md (#17)
remove -m
2023-04-20 15:47:15 -04:00
Douglas Schonholtz
625d6e72ec Remove the submodule, reference OpenAI directly rather than running it on the command line, fix logging (#16)
* Removed submodule, refactor, docker on pip, async docker logging, running our own tool on CLI rather than OpenAIs
2023-04-20 15:41:29 -04:00
Douglas Schonholtz
f00ced6612 Update README.md 2023-04-18 11:59:42 -04:00
Douglas Schonholtz
486c7e3a5e Update README.md
Adding set up info
2023-04-18 11:10:24 -04:00
Douglas Schonholtz
dad4804b4e Update README.md 2023-04-18 10:29:05 -04:00
Douglas Schonholtz
2fbb03dc6c Update README.md 2023-04-18 10:27:47 -04:00
Douglas Schonholtz
63c8e4da84 Merge pull request #2 from ambujpawar/typo_in_readme
Typo in README.md
2023-04-18 09:18:14 -04:00
Ambuj Pawar
3b0091c231 Typo in README.md 2023-04-18 09:25:25 +02:00
Douglas Schonholtz
22d997d088 Merge pull request #1 from dschonholtz/master
First commit for AutoGPT Benchmarks
2023-04-17 19:07:49 -04:00
douglas
59ff485253 Prompt engineering fixes 2023-04-17 18:14:09 -04:00
douglas
7212c3876d Cleanup 2023-04-17 17:34:45 -04:00
douglas
89081d942c First commit for AutoGPT Benchmarks 2023-04-17 17:22:31 -04:00
Toran Bruce Richards
0b899eb4cf Initial commit 2023-04-06 13:59:45 +01:00