Commit Graph

130 Commits

Author SHA1 Message Date
merwanehamadi
2cfafcfbf0 Fix cutoff errors (#116)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-16 07:54:49 -07:00
merwanehamadi
2704bcee5e Allow change location of reports (#115)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-16 07:26:36 -07:00
Silen Naihin
9f3a2d4f05 Dynamic cutoff and other quality of life (#101) 2023-07-15 22:10:20 -04:00
merwanehamadi
757baba3ff Remove cache true on pr (#111)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-15 18:09:29 -07:00
merwanehamadi
02dce41937 Fix ci (#110) 2023-07-15 18:00:37 -07:00
merwanehamadi
5886d75059 Add three sum challenge (#108)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-15 19:52:42 -04:00
Erik Peterson
cbd2e49d97 Clean up workspace between each test (#109) 2023-07-15 16:23:49 -07:00
merwanehamadi
dab4e90e15 Update Auto-GPT score (#106)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-15 09:53:56 -07:00
merwanehamadi
bb65473416 Update Auto-GPT to current version of master (#105) 2023-07-15 08:57:28 -07:00
merwanehamadi
8be2a0b2e1 Display results per category (#104) 2023-07-14 18:45:24 -07:00
merwanehamadi
66fc7ccb31 Display smol-developer-results (#103) 2023-07-14 18:26:17 -07:00
merwanehamadi
7de965ab3f Show Auto-GPT results (#102) 2023-07-14 18:04:35 -07:00
merwanehamadi
281cb0ef37 Start showing benchmark results (#100) 2023-07-14 17:56:56 -04:00
merwanehamadi
7bc7d9213d Replace hidden files with custom python (#99)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-14 14:39:47 -07:00
merwanehamadi
a9702e4629 Add basic code generation challenge (#98) 2023-07-14 13:27:48 -04:00
merwanehamadi
3a9dfa4c59 Update submodules and upload artifacts (#97)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-13 20:47:55 -07:00
merwanehamadi
78df4915cf Remove dependencies if a specific test is asked by the user (#95)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-12 14:35:12 -07:00
merwanehamadi
48ac1c91cd Remove dependencies cache (#94)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-12 14:30:06 -07:00
merwanehamadi
e0b16cf4ac Fix Smol developer and gpt engineer (#93)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-12 10:54:50 -07:00
Silen Naihin
8d0c5179ed fixing backslashes, adding basic metrics (#89) 2023-07-12 01:37:59 -04:00
merwanehamadi
e292ffebaf Enable cache (#92) 2023-07-11 21:37:49 -07:00
merwanehamadi
504634b4a6 Add custom properties to Helicone (#91) 2023-07-11 20:50:56 -07:00
merwanehamadi
b3c506cd94 Fix Auto-GPT looping forever (#87) 2023-07-11 20:02:29 -04:00
merwanehamadi
4ecb70c5e3 Fix Auto-GPT integration by adding python module as entrypoint (#86)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-11 15:11:24 -04:00
merwanehamadi
22295350a6 All Agents log to helicone automatically (#85)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
Co-authored-by: Justin <justintorre75@gmail.com>
2023-07-11 09:57:53 -07:00
merwanehamadi
0799be7e28 Fix tests ci (#82) 2023-07-10 21:54:25 -07:00
Silen Naihin
8df82909b2 Added --test, consolidate files, reports working (#83) 2023-07-10 19:25:19 -07:00
merwanehamadi
437e066a66 Add "Simple web server" challenge (#74)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-10 20:46:03 -04:00
merwanehamadi
30ba51593f Add Helicone (#81) 2023-07-10 12:19:12 -04:00
Silen Naihin
b8830f8625 Adding search interface challenge and cleaning repo (#80) 2023-07-09 18:33:08 -07:00
merwanehamadi
0fa5286ad0 Combine all agents into one ci.yml (#79)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-09 18:06:26 -07:00
Silen Naihin
3d43117554 Just json, no test files (#77) 2023-07-09 17:27:21 -07:00
merwanehamadi
573130549f Add gpt engineer to ci (#78) 2023-07-09 13:31:31 -07:00
merwanehamadi
d89264998d Fix debug code challenge (#76)
Co-authored-by: Silen Naihin <silen.naihin@gmail.com>
2023-07-08 21:46:37 -04:00
Silen Naihin
69bd41f741 Quality of life improvements & fixes (#75) 2023-07-08 18:43:38 -07:00
Silen Naihin
db86ccdcb4 removing agentgpt 2023-07-08 13:02:47 -04:00
Silen Naihin
2d05c3ec56 reverting accidental previous changes 2023-07-08 12:50:39 -04:00
Silen Naihin
a35569a77b submodule integration 2023-07-08 12:47:48 -04:00
Silen Naihin
082a876612 fixing the incorrect addition of superagi (#73) 2023-07-08 05:04:06 -04:00
Silen Naihin
e56b112aab i/o workspace, adding superagi (#60) 2023-07-08 03:27:31 -04:00
merwanehamadi
487f99f8f2 Use artifacts out insted of python code (#72) 2023-07-07 15:49:37 -07:00
merwanehamadi
f0f7d2be90 Fix memory challenge 2 (#71) 2023-07-07 15:38:50 -07:00
merwanehamadi
e34c83ca1c Add .txt to memory challenges (#70) 2023-07-07 15:34:57 -07:00
Erik Peterson
3defe044bd Print out all of stdout on each process poll. (#69) 2023-07-07 15:02:08 -07:00
Silen Naihin
4562bc6caf Update data.json remove text 2023-07-07 17:54:09 -04:00
merwanehamadi
e61523e59e Get rid of get file path by using the data.json convention to store the challenge information (#67)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-07 13:58:17 -07:00
merwanehamadi
6ef32a9b1f Add "Debug code without guidance" challenge (#66)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-07 13:55:59 -07:00
merwanehamadi
9ede17891b Add 'Debug simple typo with guidance' challenge (#65)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-07-07 13:50:53 -07:00
Silen Naihin
bfd0d5c826 Fix home_path, local mini-agi run works (#64)
Co-authored-by: merwanehamadi <merwanehamadi@gmail.com>
2023-07-06 18:00:45 -07:00
merwanehamadi
0b4ae5ea78 Add 'remember phrases with noise' challenge (#63) 2023-07-06 17:19:12 -04:00