Commit Graph

23 Commits

Author SHA1 Message Date
Jarrod Sibbison
0a00b0f588 feat: Structured output for recipes (#3188) 2025-07-02 12:16:57 +10:00
Lifei Zhou
9e6247d9ed feat: created sub recipe tools (#2982) 2025-06-25 09:29:26 +10:00
Max Novich
180b1df25d Mnovich/temporal foreground tasks (#2895)
Co-authored-by: Carlos M. Lopez <carlopez@squareup.com>
2025-06-20 16:19:58 -07:00
Chaitanya Rahalkar
4e97ffacd6 feat(cli): Add --quiet /-q flag to goose run (#2939) 2025-06-18 12:23:12 -07:00
Douwe Osinga
78afaded33 Goose recipes have settings now (#2397)
Co-authored-by: Douwe Osinga <douwe@squareup.com>
Co-authored-by: Lifei Zhou <lifei@squareup.com>
2025-06-13 11:32:01 +10:00
Michael Neale
030e8a82b5 feat/fix: don't stop cli starting if MCPs don't load (#2860) 2025-06-12 14:46:56 +10:00
Max Novich
4e1b091d91 feat: add tool repetition monitoring to prevent infinite loops (#2527) 2025-05-14 14:46:37 -07:00
Max Novich
78e4de7893 allow running goose run with no session persistence (#2517) 2025-05-13 09:48:34 -07:00
Kalvin C
d1c124c28d feat: add recipes, a custom goose agent configuration (#2115) 2025-04-09 18:57:24 -07:00
marcelle
8fbd9eb327 feat: efficient benching (#1921)
Co-authored-by: Tyler Rockwood <rockwotj@gmail.com>
Co-authored-by: Kalvin C <kalvinnchau@users.noreply.github.com>
Co-authored-by: Alice Hau <110418948+ahau-square@users.noreply.github.com>
2025-04-08 14:43:43 -04:00
Jim Bennett
050a8f2f42 Add -with-remote-extension (#2062) 2025-04-07 16:42:38 -04:00
marcelle
4c03b34058 feat: refactor register eval (#1713) 2025-03-18 15:18:09 -04:00
Salman Mohammed
ea0960f645 refactor: clean up log usage (#1704) 2025-03-17 15:18:21 -04:00
marcelle
1f8d45984c feat: write eval results to eval dir (#1620) 2025-03-11 15:05:52 -04:00
marcelle
c23be1eb19 fix: ensure repeating benches return to initial run-dir (#1617) 2025-03-11 11:44:57 -04:00
Zaki Ali
c0e719eaba fix: merge error logging in goose bench (#1545) 2025-03-10 15:45:00 -07:00
Alice Hau
bb4feacf03 feat: add additional goosebench evals (#1571)
Co-authored-by: Alice Hau <alice.a.hau@gmail.com>
2025-03-10 15:11:44 -04:00
Lily Delalande
5df2875c1c feat: update config endpoints for use with providers (#1563) 2025-03-10 09:51:54 -07:00
marcelle
00fc3a5de8 Feat: support auto-including dirs in binary/bench-work-dir (#1576) 2025-03-07 17:53:39 -05:00
Kalvin C
7b37ab0b52 feat(cli): add --debug flag to goose session / run (#1564) 2025-03-06 18:58:25 -08:00
marcelle
798d657e7e bugfix: refactor workdirs to be async-safe, and simpler (#1558) 2025-03-06 21:11:35 -05:00
Zaki Ali
ebf7cb1231 feat: split required_extensions in bench to builtin/external (#1547) 2025-03-06 17:12:21 -08:00
marcelle
49dee048e4 feat: goose bench framework for functional and regression testing
Co-authored-by: Zaki Ali <zaki@squareup.com>
2025-03-05 21:23:00 -05:00