merwanehamadi
|
f4e7b1c61c
|
Add eval_id and sync Skill Tree with Frontend (#5287)
Add eval_id to skill tree
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-21 13:36:17 -07:00 |
|
merwanehamadi
|
ff4c76ba00
|
Make agbenchmark a proxy of the evaluated agent (#5279)
Make agbenchmark a Proxy of the evaluated agent
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-20 16:06:00 -07:00 |
|
merwanehamadi
|
c09a0e7afa
|
Implement old polling mechanism (#5248)
Implement old polling mechanism
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-18 16:23:06 -07:00 |
|
merwanehamadi
|
f4d319cee4
|
Refactor benchmark (#5247)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-17 06:55:20 -07:00 |
|
merwanehamadi
|
ece9e85b41
|
Add agent protocol within agbenchmark (#5239)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-16 15:31:12 -07:00 |
|
merwanehamadi
|
b101fec16b
|
Add ability to run multiple tests (#5233)
Add multiple tests
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-16 13:01:11 -07:00 |
|
merwanehamadi
|
295702867a
|
Ability to run by categories (#5229)
* Ability to run by categories
* always use Path.cwd()
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-15 20:04:12 -07:00 |
|
SwiftyOS
|
d44a4f591d
|
Added ability to keep answers
|
2023-09-13 11:56:31 +02:00 |
|
Merwane Hamadi
|
1b14d304d4
|
Benchmark changes
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-12 12:13:39 -07:00 |
|
SwiftyOS
|
c73e90c4e6
|
Fixing benchmarks
|
2023-09-11 17:41:27 -07:00 |
|
Merwane Hamadi
|
fa888bfafa
|
Add back api mode
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-06 22:51:45 -07:00 |
|
Auto-GPT-Bot
|
45c15e370f
|
Auto-GPT-20230905085638
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-05 10:10:03 -07:00 |
|