5.2 CI/CD pipeline health
This criterion scores whether the CI/CD pipeline *itself* (as distinct from 2.1–2.4, which score *what* CI runs) is fast, reliable, observable, versioned, and environment-matched to production. A slow or flaky CI pipeline makes downstream scores meaningless.
Levels
Level 0
Slow (>30min), flaky (>10% infrastructure-caused failures), opaque logs, pipeline config untested
Level 1
Config as code and versioned, but pipeline is slow or flaky; infrastructure failures mixed with real failures in ways agents can't distinguish
Level 2
Fast (<15min for PR validation), reliable (<2% infrastructure failures), agent-readable logs that distinguish infrastructure failures from real failures, config reviewed like production code, environment matches production
Level 3
Pipeline performance and flakiness tracked over time; regressions auto-investigated; slow stages auto-flagged for optimisation; new stages emerge from observed task patterns
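The numeric thresholds in Levels 0 and 2 can be checked mechanically from run history. The following is a minimal sketch, not a prescribed implementation: the `Run` record, its field names, and `check_thresholds` are hypothetical, and it assumes that the infrastructure-failure percentage means the share of all runs that failed for infrastructure reasons, and that "fast" is judged on median PR-validation duration. Both readings are assumptions; the criterion does not specify them.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Run:
    """One PR-validation run (hypothetical record shape)."""
    duration_min: float   # wall-clock duration in minutes
    failed: bool          # True if the run did not pass
    infra_failure: bool   # True if the failure was infrastructure-caused


def infra_failure_rate(runs: list[Run]) -> float:
    """Share of all runs that failed because of infrastructure (assumed definition)."""
    infra = sum(1 for r in runs if r.failed and r.infra_failure)
    return infra / len(runs)


def check_thresholds(runs: list[Run]) -> dict:
    """Compare run history against the Level 0 and Level 2 numeric thresholds."""
    if not runs:
        raise ValueError("need at least one run to evaluate")
    duration = median(r.duration_min for r in runs)  # median is an assumption
    rate = infra_failure_rate(runs)
    return {
        "median_duration_min": duration,
        "infra_failure_rate": rate,
        "level0_slow": duration > 30,               # Level 0: >30min
        "level0_flaky": rate > 0.10,                # Level 0: >10% infra failures
        "meets_level2_speed": duration < 15,        # Level 2: <15min
        "meets_level2_reliability": rate < 0.02,    # Level 2: <2% infra failures
    }


if __name__ == "__main__":
    history = [Run(10.0, False, False)] * 99 + [Run(12.0, True, True)]
    for key, value in check_thresholds(history).items():
        print(f"{key}: {value}")
```

These numbers cover only the speed and reliability parts of each level. Log readability, config review, and production parity still need separate evidence, and the sketch presupposes that each failure has already been labelled as infrastructure-caused or real.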
Recipes that advance this criterion
No recipes yet.