merwanehamadi
|
bcb24c1a58
|
Fix challenges (#5561)
Fix challenges and CI
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-10-05 10:59:50 -07:00 |
|
merwanehamadi
|
a30cbcc2ce
|
Fix benchmark ci (#5478)
Fix benchmark CI
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-10-02 12:41:32 -07:00 |
|
merwanehamadi
|
37fbb52d19
|
Add more challenges + cleanup (#5368)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-27 17:58:58 -07:00 |
|
merwanehamadi
|
f4e7b1c61c
|
Add eval_id and sync Skill Tree with Frontend(#5287)
Add eval_id to skill tree
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-21 13:36:17 -07:00 |
|
merwanehamadi
|
ff4c76ba00
|
Make agbenchmark a proxy of the evaluated agent (#5279)
Make agbenchmark a Proxy of the evaluated agent
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-20 16:06:00 -07:00 |
|
merwanehamadi
|
c09a0e7afa
|
Implement old polling mechanism (#5248)
Implement old polling mechanism
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-18 16:23:06 -07:00 |
|
merwanehamadi
|
f4d319cee4
|
Refactor benchmark (#5247)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-17 06:55:20 -07:00 |
|
merwanehamadi
|
f76d45cd9e
|
Remove start from agbenchmark (#5241)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-16 17:22:49 -07:00 |
|
Reinier van der Leer
|
b21d68a8ab
|
Migrate AutoGPT agent to poetry (#5219)
Inspired by #1102
* Migrate AutoGPT agent to poetry
Co-authored-by: rickythefox <richard@ginzburg.se>
* Rewrite automatic dependency check (check_requirements.py) for poetry
* Sort dependencies
* Add instructions for poetry to README
|
2023-09-15 05:18:44 +02:00 |
|
Merwane Hamadi
|
cd4589d4d9
|
Add CI to the forge
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-11 16:12:44 -07:00 |
|
Merwane Hamadi
|
b512808653
|
Name agents like their github repos
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-07 17:25:50 -07:00 |
|
Merwane Hamadi
|
8ccd2fd367
|
Benchmark agents without submodule + ability to pin a specific commit.
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-07 16:58:22 -07:00 |
|
Merwane Hamadi
|
fa888bfafa
|
Add back api mode
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-06 22:51:45 -07:00 |
|
Merwane Hamadi
|
d901d01be8
|
Benchmark all agents
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-06 22:00:03 -07:00 |
|
Merwane Hamadi
|
bc14028294
|
Add benchmark CI
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-06 19:50:24 -07:00 |
|