mirror of
https://github.com/aljazceru/goose.git
synced 2025-12-18 14:44:21 +01:00
9 lines
421 B
Plaintext
9 lines
421 B
Plaintext
You are evaluating a response to a summarization task and will give a score of 0, 1, or 2. The instructions were:
|
|
|
|
'What are the top 5 most counterintuitive insights from this blog post? https://huyenchip.com/2025/01/07/agents.html'
|
|
|
|
Does the response below appropriately answer the query (ignore formatting)?
|
|
0 = does not provide any insights at all
|
|
1 = provides some insights, but not all 5
|
|
2 = provides all 5 insights
|