Jarrod Sibbison
|
0a00b0f588
|
feat: Structured output for recipes (#3188)
|
2025-07-02 12:16:57 +10:00 |
|
Lifei Zhou
|
9e6247d9ed
|
feat: created sub recipe tools (#2982)
|
2025-06-25 09:29:26 +10:00 |
|
Max Novich
|
180b1df25d
|
Mnovich/temporal foreground tasks (#2895)
Co-authored-by: Carlos M. Lopez <carlopez@squareup.com>
|
2025-06-20 16:19:58 -07:00 |
|
Chaitanya Rahalkar
|
4e97ffacd6
|
feat(cli): Add --quiet /-q flag to goose run (#2939)
|
2025-06-18 12:23:12 -07:00 |
|
Douwe Osinga
|
78afaded33
|
Goose recipes have settings now (#2397)
Co-authored-by: Douwe Osinga <douwe@squareup.com>
Co-authored-by: Lifei Zhou <lifei@squareup.com>
|
2025-06-13 11:32:01 +10:00 |
|
Michael Neale
|
030e8a82b5
|
feat/fix: don't stop cli starting if MCPs don't load (#2860)
|
2025-06-12 14:46:56 +10:00 |
|
Max Novich
|
4e1b091d91
|
feat: add tool repetition monitoring to prevent infinite loops (#2527)
|
2025-05-14 14:46:37 -07:00 |
|
Max Novich
|
78e4de7893
|
allow running goose run with no session persistence (#2517)
|
2025-05-13 09:48:34 -07:00 |
|
Kalvin C
|
d1c124c28d
|
feat: add recipes, a custom goose agent configuration (#2115)
|
2025-04-09 18:57:24 -07:00 |
|
marcelle
|
8fbd9eb327
|
feat: efficient benching (#1921)
Co-authored-by: Tyler Rockwood <rockwotj@gmail.com>
Co-authored-by: Kalvin C <kalvinnchau@users.noreply.github.com>
Co-authored-by: Alice Hau <110418948+ahau-square@users.noreply.github.com>
|
2025-04-08 14:43:43 -04:00 |
|
Jim Bennett
|
050a8f2f42
|
Add -with-remote-extension (#2062)
|
2025-04-07 16:42:38 -04:00 |
|
marcelle
|
4c03b34058
|
feat: refactor register eval (#1713)
|
2025-03-18 15:18:09 -04:00 |
|
Salman Mohammed
|
ea0960f645
|
refactor: clean up log usage (#1704)
|
2025-03-17 15:18:21 -04:00 |
|
marcelle
|
1f8d45984c
|
feat: write eval results to eval dir (#1620)
|
2025-03-11 15:05:52 -04:00 |
|
marcelle
|
c23be1eb19
|
fix: ensure repeating benches return to initial run-dir (#1617)
|
2025-03-11 11:44:57 -04:00 |
|
Zaki Ali
|
c0e719eaba
|
fix: merge error logging in goose bench (#1545)
|
2025-03-10 15:45:00 -07:00 |
|
Alice Hau
|
bb4feacf03
|
feat: add additional goosebench evals (#1571)
Co-authored-by: Alice Hau <alice.a.hau@gmail.com>
|
2025-03-10 15:11:44 -04:00 |
|
Lily Delalande
|
5df2875c1c
|
feat: update config endpoints for use with providers (#1563)
|
2025-03-10 09:51:54 -07:00 |
|
marcelle
|
00fc3a5de8
|
Feat: support auto-including dirs in binary/bench-work-dir (#1576)
|
2025-03-07 17:53:39 -05:00 |
|
Kalvin C
|
7b37ab0b52
|
feat(cli): add --debug flag to goose session / run (#1564)
|
2025-03-06 18:58:25 -08:00 |
|
marcelle
|
798d657e7e
|
bugfix: refactor workdirs to be async-safe, and simpler (#1558)
|
2025-03-06 21:11:35 -05:00 |
|
Zaki Ali
|
ebf7cb1231
|
feat: split required_extensions in bench to builtin/external (#1547)
|
2025-03-06 17:12:21 -08:00 |
|
marcelle
|
49dee048e4
|
feat: goose bench framework for functional and regression testing
Co-authored-by: Zaki Ali <zaki@squareup.com>
|
2025-03-05 21:23:00 -05:00 |
|