Commit Graph

18 Commits

Author SHA1 Message Date
Silen Naihin
59655a8d96 adding backend and a basic ui (#309) 2023-08-27 03:18:30 -04:00
Silen Naihin
1a61c66898 mock flag, workspace io fixes, mark fixes 2023-08-11 13:22:21 +01:00
Jakub Novák
c2269397f1 Use agent protocol (#278)
Signed-off-by: Jakub Novak <jakub@e2b.dev>
2023-08-11 09:04:08 +02:00
merwanehamadi
1b20e45ec1 Implement the 'explore' mode (#284) 2023-08-09 17:59:48 -07:00
Silen Naihin
19848f362d remove pytest-depends, rerouting functions (#250) 2023-08-06 22:35:22 +01:00
merwanehamadi
ec262f0667 Fix more attempted metrics not working (#252)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-04 15:07:15 -07:00
merwanehamadi
34814d837a Fix "attempted" metric being incorrect (#251)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-04 11:28:45 -07:00
merwanehamadi
e3562a4b66 Add attempted metrics (#244)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-08-02 13:27:57 -07:00
merwanehamadi
eeb68858d7 Only run mini-agi on tests (#232) 2023-08-01 16:50:41 -07:00
Silen Naihin
f9fea473f5 Refactoring for TDD (#222) 2023-07-31 21:59:47 +01:00
Silen Naihin
2ec306e850 linter fixes 2023-07-31 13:28:01 +01:00
Silen Naihin
14c49fa7ea handling helicone errors 2023-07-31 12:54:27 +01:00
merwanehamadi
ad00a0634e Get helicone costs (#220)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-30 21:33:09 -07:00
Silen Naihin
19db3151dd Feature: Visualize Test Results (#211) 2023-07-30 23:51:17 +01:00
Silen Naihin
ecc386ec7b returning scores (#210)
Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co>
2023-07-29 11:43:22 +01:00
merwanehamadi
80bd0c4260 Fix tests not being run (#207)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-27 20:50:53 -07:00
Silen Naihin
0e6be16d07 helicone and llm eval fixes 2023-07-27 14:07:46 +01:00
Silen Naihin
80506e9a3b report # bug, adding submodule challenges (#193) 2023-07-26 13:53:10 +01:00