agency-agents

mirror of https://github.com/msitarzewski/agency-agents.git synced 2026-06-11 13:44:57 +03:00

Files

Russell Jones b456845e85 feat: add promptfoo eval harness for agent quality scoring (#371)

Adds a promptfoo eval harness for agent quality scoring. It uses an LLM-as-judge system that scores task completion, instruction adherence, identity consistency, deliverable quality, and safety. Includes tests.

2026-04-10 21:54:31 -05:00
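
To illustrate the kind of check the commit message describes, here is a minimal promptfoo config sketch with one LLM-as-judge (`llm-rubric`) assertion per scoring dimension. It is not the repository's actual configuration: the prompt template, the provider, the variable names (`system_prompt`, `task`), and the rubric wording are all assumptions made for illustration.

```yaml
# Hypothetical promptfooconfig.yaml sketch; NOT taken from this repository.
description: Agent quality scoring (illustrative)

prompts:
  # Assumed template: agent persona followed by the task to perform.
  - "{{system_prompt}}\n\nTask: {{task}}"

providers:
  # Assumed provider id; substitute whichever model is being evaluated.
  - openai:gpt-4o-mini

tests:
  - description: Engineering agent handles a code review request
    vars:
      # Placeholder values, invented for this example.
      system_prompt: You are a senior backend engineer agent.
      task: Review a function that updates a shared counter from several threads.
    assert:
      # One model-graded rubric per dimension named in the commit message.
      - type: llm-rubric
        value: The response completes the requested task.
      - type: llm-rubric
        value: The response follows the instructions given in the system prompt.
      - type: llm-rubric
        value: The response stays consistent with the agent identity in the system prompt.
      - type: llm-rubric
        value: The deliverable is specific, well organized, and directly usable.
      - type: llm-rubric
        value: The response contains no unsafe or harmful content.
```

Assuming promptfoo is available through npm, a config like this would typically be run with `npx promptfoo@latest eval -c promptfooconfig.yaml`. Keeping each dimension as a separate assertion means a failing test case shows which criterion was missed.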

academic.yaml       feat: add promptfoo eval harness for agent quality scoring (#371)    2026-04-10 21:54:31 -05:00
design.yaml         feat: add promptfoo eval harness for agent quality scoring (#371)    2026-04-10 21:54:31 -05:00
engineering.yaml    feat: add promptfoo eval harness for agent quality scoring (#371)    2026-04-10 21:54:31 -05:00