Commit Graph

12 Commits

Author SHA1 Message Date
merwanehamadi
ff4c76ba00 Make agbenchmark a proxy of the evaluated agent (#5279)
Make agbenchmark a Proxy of the evaluated agent

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-20 16:06:00 -07:00
merwanehamadi
c09a0e7afa Implement old polling mechanism (#5248)
Implement old polling mechanism

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-18 16:23:06 -07:00
merwanehamadi
f4d319cee4 Refactor benchmark (#5247)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-17 06:55:20 -07:00
merwanehamadi
f76d45cd9e Remove start from agbenchmark (#5241)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-16 17:22:49 -07:00
merwanehamadi
b101fec16b Add ability to run multiple tests (#5233)
Add multiple tests

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-16 13:01:11 -07:00
merwanehamadi
295702867a Ability to run by categories (#5229)
* Ability to run by categories

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>

* always use Path.cwd()

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>

---------

Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-15 20:04:12 -07:00
merwanehamadi
b4401cd409 add benchmark endpoints mock (#5221)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-15 08:48:12 -07:00
merwanehamadi
4bb86c0cb5 Support agent protocol in benchmark (#5213)
Benchmark/Forge/Agent Protocol
2023-09-13 18:50:39 -07:00
merwanehamadi
52c8b53122 Fix API Mode (#5209) 2023-09-13 07:30:46 -07:00
SwiftyOS
9eb01d85a3 fixed multiple report folder bug 2023-09-13 12:18:04 +02:00
SwiftyOS
d44a4f591d Added ability to keep answers 2023-09-13 11:56:31 +02:00
Merwane Hamadi
1b14d304d4 Benchmark changes
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-12 12:13:39 -07:00