CKO-005Validation & EvaluationStrong evidence

What is validation?

Validation is the process of determining whether an AI system performs reliably and accurately for its intended task.

In more detail

Validation involves testing a system against known standards or human judgements to determine whether it performs as expected. Validation may involve benchmarking, external testing, SWARs and real-world implementation studies. Validation should be task-specific and context-specific.

Why it matters

Without validation there is no basis for trusting AI outputs.

Decision rule

Do not use AI systems for important tasks unless appropriate validation evidence exists.

Common misconception

  • “A tool works because the vendor says it works.”

At a glance

Evidence strength
Strong

Related concepts

Generalisability Robustness Stability
Key takeaway

Trust should be earned through evidence, not marketing.

More on Validation & Evaluation