5.9 Experiment tracking
Canary results, including *negative* results, feed into a learnings doc that future agents can query.
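A minimal sketch of what such a learnings doc might look like as a data structure, assuming an in-memory list and plain tag matching. The names `Learning`, `LearningsLog`, `record`, and `query`, and the sample entries, are hypothetical illustrations and do not come from any existing tool.

```python
from dataclasses import dataclass, field


@dataclass
class Learning:
    """One canary result. All field names are illustrative assumptions."""
    experiment: str
    outcome: str                      # "win" or "loss"
    summary: str
    tags: set = field(default_factory=set)


class LearningsLog:
    """Hypothetical store that keeps wins and losses side by side."""

    def __init__(self):
        self._entries = []

    def record(self, learning):
        # Negative results are stored exactly like positive ones.
        if learning.outcome not in ("win", "loss"):
            raise ValueError(f"unknown outcome: {learning.outcome!r}")
        self._entries.append(learning)

    def query(self, *tags, outcome=None):
        # Return entries carrying every requested tag, optionally
        # filtered by outcome, in the order they were recorded.
        wanted = set(tags)
        return [
            e for e in self._entries
            if wanted <= e.tags and (outcome is None or e.outcome == outcome)
        ]


# Made-up sample data, for illustration only.
log = LearningsLog()
log.record(Learning("cache-ttl-60s", "win", "p95 latency dropped",
                    {"cache", "latency"}))
log.record(Learning("cache-ttl-5s", "loss", "origin load spiked",
                    {"cache", "load"}))

# An agent planning a new cache experiment can pull the past failures first.
past_failures = log.query("cache", outcome="loss")
```

A real system would likely persist entries and use richer retrieval (full-text or embedding search); the point here is only that losses are first-class records an agent can ask for.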
Levels
Level 0
None
Level 1
Wins logged, losses forgotten
Level 2
Both wins and losses captured, indexed, retrievable as agent context
Level 3
Learnings retrieval rate measured; recurring-decision rate trends downward; experiment-design quality improves by learning from past failures
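The two Level 3 rates could be computed along the following lines. This is a sketch under stated assumptions: each agent decision is logged with a hypothetical `decision_key` (the question being decided) and a `retrieved_learnings` flag. The definitions used here, retrieval rate as the share of decisions that consulted at least one learning and recurring-decision rate as the share of decisions whose key was already seen, are one plausible reading and are not prescribed by this criterion.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    """One logged agent decision. Field names are illustrative assumptions."""
    decision_key: str            # identifies the question being decided
    retrieved_learnings: bool    # did the agent consult past learnings?


def retrieval_rate(decisions):
    # Share of decisions where at least one learning was retrieved.
    if not decisions:
        return 0.0
    return sum(d.retrieved_learnings for d in decisions) / len(decisions)


def recurring_decision_rate(decisions):
    # Share of decisions that re-open a question already decided earlier.
    if not decisions:
        return 0.0
    seen = set()
    repeats = 0
    for d in decisions:
        if d.decision_key in seen:
            repeats += 1
        seen.add(d.decision_key)
    return repeats / len(decisions)


# Made-up sample log, for illustration only.
sample = [
    Decision("cache-ttl", True),
    Decision("retry-budget", False),
    Decision("cache-ttl", True),
    Decision("batch-size", True),
]

print(retrieval_rate(sample))           # 0.75
print(recurring_decision_rate(sample))  # 0.25
```

Tracking these per time window (for example, per week) would show whether the recurring-decision rate is actually trending downward.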
Recipes that advance this criterion
No recipes yet.