Commit Graph

86 Commits

Author SHA1 Message Date
merwanehamadi
41909f0de7 Tic tac toe challenge (#345)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-31 20:45:31 -07:00
Luke
595e04def1 Updating Turbo (#343)
Co-authored-by: Luke <2609441+lc0rp@user.noreply.github.com>
2023-08-31 07:09:41 -04:00
Silen Naihin
b6ad300eda restructure library, deprecate challenges (#336)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
Co-authored-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-30 22:38:31 -07:00
merwanehamadi
7c49b0f29c Fix tests (#338)
2023-08-30 20:31:10 -07:00
merwanehamadi
afb59a0778 Support agent protocol (#337)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-30 19:44:39 -07:00
Luke
16a1d884f1 Update TestPasswordGenerator_Easy to mention ValueError (#335)
Co-authored-by: Luke <2609441+lc0rp@user.noreply.github.com>
2023-08-30 19:08:36 -04:00
Reinier van der Leer
d86ed40f83 Improve TestRevenueRetrieval_1.1 task specification (#329)
Co-authored-by: Luke <2609441+lc0rp@users.noreply.github.com>
2023-08-28 09:37:26 -07:00
merwanehamadi
a8dd079d4c Updatee cutoff for SuperAGI (#330)
2023-08-28 08:13:18 -07:00
Silen Naihin
59655a8d96 adding backend and a basic ui (#309)
2023-08-27 03:18:30 -04:00
Fluder-Paradyne
2176e1179a Fix for TestWrite6Files and TestWrite5FilesWithArray (#328)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-08-24 09:14:03 -04:00
Luke
9f1631719c Fix "code.py" conflict with Python's code module, and fix TestReturnCode_Simple conflict between two test.py files. (#321)
Co-authored-by: Luke <2609441+lc0rp@user.noreply.github.com>
Co-authored-by: merwanehamadi <merwanehamadi@gmail.com>
2023-08-19 09:04:18 -07:00
merwanehamadi
62c52643b4 Remove build a nuke challenge (#316)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-16 15:58:17 -07:00
merwanehamadi
82ed4a136a Remove submodule (#314)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-16 14:57:52 -07:00
Silen Naihin
8bc3710e23 init backend, fix frontend module (#307)
2023-08-15 14:14:35 +01:00
Silen Naihin
c59e5fb7d8 new frontend connections (#306)
2023-08-15 13:16:07 +01:00
merwanehamadi
1129e6b426 Add safety challenge (#300)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-13 10:15:58 -07:00
merwanehamadi
8bf2f3fe5d Fix all tests skipped (#296)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-12 17:35:55 -07:00
merwanehamadi
6dc713059c Remember goal loss (#291)
2023-08-11 18:44:18 -07:00
merwanehamadi
1560892c58 Sync skill tree to a versioned website (#289)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-11 17:28:53 -07:00
Erik Peterson
79be5cd70f Update challenges submodule
2023-08-11 14:45:36 -07:00
Silen Naihin
1a61c66898 mock flag, workspace io fixes, mark fixes
2023-08-11 13:22:21 +01:00
merwanehamadi
47c6062092 Cleanup skill tree (#287)
2023-08-10 16:29:58 -07:00
Rob
fb67c3aaf1 add updated challenges
2023-08-10 21:45:58 +02:00
Rob
a2380a7bdd feat: ethereum price challenge
2023-08-10 21:09:04 +02:00
merwanehamadi
7d60ce5f44 See the task when clicking in the skill tree (#279)
2023-08-09 09:37:17 -07:00
merwanehamadi
305f3a6138 Add web app creation challenge (#272)
2023-08-08 13:08:51 -07:00
merwanehamadi
db48e7849b Add product advisor tests (#267)
2023-08-06 20:59:53 -07:00
merwanehamadi
f157f46a07 Fix test write file (#266)
2023-08-06 18:44:42 -07:00
Silen Naihin
3c20191156 updating challenges commit sha
2023-08-06 23:02:35 +01:00
Silen Naihin
19848f362d remove pytest-depends, rerouting functions (#250)
2023-08-06 22:35:22 +01:00
merwanehamadi
5232522e47 Remove space challenges (#262)
2023-08-06 10:10:58 -07:00
merwanehamadi
53ec3337f3 Add all agent protocol tests (#260)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-06 09:52:46 -07:00
merwanehamadi
530eb61f25 Add agent protocol interface test (#259)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-05 18:00:05 -07:00
merwanehamadi
fb13a83d15 Add more coding challenge (#254)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-05 09:51:53 -07:00
merwanehamadi
6309bc9c3d Update submodule (#219)
2023-07-30 20:03:53 -07:00
merwanehamadi
c4554225bd Update submodules (#212)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-29 10:18:35 -07:00
Silen Naihin
f07e7b60d4 Advanced LLM Evaluation Implementation (#205)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
2023-07-29 10:26:19 +01:00
merwanehamadi
80bd0c4260 Fix tests not being run (#207)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-27 20:50:53 -07:00
merwanehamadi
5df710fd35 Add helicone dynamic headers (#199)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-26 16:03:13 -07:00
Silen Naihin
66d1fec07e attempting more logs
2023-07-26 23:36:45 +01:00
merwanehamadi
01b118e590 Add llm eval (#197)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-26 14:00:24 -07:00
Silen Naihin
80506e9a3b report # bug, adding submodule challenges (#193)
2023-07-26 13:53:10 +01:00
merwanehamadi
a1e02f243c Add safety suite (#196)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-25 20:13:01 -07:00
Silen Naihin
b82277515f hotfix reports (#191)
2023-07-25 19:07:24 +01:00
Silen Naihin
d9b3d7da37 Safety challenges, adaptability challenges, suite same_task (#177)
2023-07-24 13:57:44 -07:00
Erik Peterson
5a3b4f3d1d Kill subprocesses when test ends (#172)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
Co-authored-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-20 15:41:59 -07:00
Silen Naihin
12c5d54583 Fixing memory challenges, naming, testing mini-agi, smooth retrieval scaling (#166)
2023-07-17 19:41:58 -07:00
Silen Naihin
9f3a2d4f05 Dynamic cutoff and other quality of life (#101)
2023-07-15 22:10:20 -04:00
merwanehamadi
5886d75059 Add three sum challenge (#108)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-15 19:52:42 -04:00
merwanehamadi
7bc7d9213d Replace hidden files with custom python (#99)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-14 14:39:47 -07:00