Author | Commit | Message | Date
Silen Naihin | 19db3151dd | Feature: Visualize Test Results (#211) | 2023-07-30 23:51:17 +01:00
Silen Naihin | ecc386ec7b | returning scores (#210); Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co> | 2023-07-29 11:43:22 +01:00
Silen Naihin | f07e7b60d4 | Advanced LLM Evaluation Implementation (#205); Co-authored-by: Auto-GPT-Bot <github-bot@agpt.co> | 2023-07-29 10:26:19 +01:00
merwanehamadi | 80bd0c4260 | Fix tests not being run (#207); Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com> | 2023-07-27 20:50:53 -07:00
Silen Naihin | 71e0c598d6 | forcing AGENT_NAME to be defined from repo | 2023-07-27 14:28:11 +01:00
Silen Naihin | 0e6be16d07 | helicone and llm eval fixes | 2023-07-27 14:07:46 +01:00
merwanehamadi | 01b118e590 | Add llm eval (#197); Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com> | 2023-07-26 14:00:24 -07:00
Silen Naihin | 80506e9a3b | report # bug, adding submodule challenges (#193) | 2023-07-26 13:53:10 +01:00