merwanehamadi
|
ff4c76ba00
|
Make agbenchmark a proxy of the evaluated agent (#5279)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-20 16:06:00 -07:00 |
|
merwanehamadi
|
c09a0e7afa
|
Implement old polling mechanism (#5248)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-18 16:23:06 -07:00 |
|
merwanehamadi
|
f4d319cee4
|
Refactor benchmark (#5247)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-17 06:55:20 -07:00 |
|
merwanehamadi
|
f76d45cd9e
|
Remove start from agbenchmark (#5241)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-16 17:22:49 -07:00 |
|
merwanehamadi
|
b101fec16b
|
Add ability to run multiple tests (#5233)
Add multiple tests
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-16 13:01:11 -07:00 |
|
merwanehamadi
|
295702867a
|
Ability to run by categories (#5229)
* Ability to run by categories
* always use Path.cwd()
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-15 20:04:12 -07:00 |
|
merwanehamadi
|
b4401cd409
|
add benchmark endpoints mock (#5221)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-15 08:48:12 -07:00 |
|
merwanehamadi
|
4bb86c0cb5
|
Support agent protocol in benchmark (#5213)
Benchmark/Forge/Agent Protocol
|
2023-09-13 18:50:39 -07:00 |
|
merwanehamadi
|
52c8b53122
|
Fix API Mode (#5209)
|
2023-09-13 07:30:46 -07:00 |
|
SwiftyOS
|
9eb01d85a3
|
fixed multiple report folder bug
|
2023-09-13 12:18:04 +02:00 |
|
SwiftyOS
|
d44a4f591d
|
Added ability to keep answers
|
2023-09-13 11:56:31 +02:00 |
|
Merwane Hamadi
|
1b14d304d4
|
Benchmark changes
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
|
2023-09-12 12:13:39 -07:00 |
|