Pages in This Section
- Evals as Outer Loop: how evals fit into AI development and post-deployment monitoring.
- Eval Patterns: common measurement approaches and how they fit together.
- Where to Start: how to choose the first eval that can change a decision.
- Static Evals vs. Judges: why static evals and LLM judges solve different parts of the AI measurement problem.