Definition
Training data
Training data is the data used to fit the learnable parameters of an AI system. It determines which patterns the model can learn, so gaps, bias, duplication, and outdated examples in the data can surface as model behavior, while unclear rights can carry over as compliance risk.
Last updated: 25 June 2026
Why it matters
Because the model learns directly from this data, data governance has a concrete role in model quality, compliance, and measurable performance.
Signals to watch
- Examples are used to fit model parameters
- Data quality affects outputs
- Rights and provenance must be checked
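
The signals above can be turned into simple automated checks. The following is a minimal sketch, not a definitive implementation: the `Example` record, its `collected` and `license` fields, the `audit` function, and the cutoff date are all hypothetical names and values chosen for illustration. It counts near-verbatim duplicates, examples older than a cutoff, and examples with no recorded license.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Example:
    """Hypothetical training record; field names are assumptions."""
    text: str
    collected: date
    license: Optional[str]  # None means the rights are unknown


def audit(examples, cutoff):
    """Count duplicates, outdated examples, and examples with unclear rights."""
    seen = set()
    report = {"duplicates": 0, "outdated": 0, "unclear_rights": 0}
    for ex in examples:
        # Normalize case and whitespace so trivial variants count as duplicates.
        key = " ".join(ex.text.lower().split())
        if key in seen:
            report["duplicates"] += 1
        seen.add(key)
        # Anything collected before the cutoff is treated as outdated.
        if ex.collected < cutoff:
            report["outdated"] += 1
        # A missing or empty license is treated as unclear rights.
        if not ex.license:
            report["unclear_rights"] += 1
    return report


examples = [
    Example("The cat sat.", date(2024, 1, 5), "CC-BY-4.0"),
    Example("the  cat sat.", date(2024, 2, 1), "CC-BY-4.0"),
    Example("Old pricing page.", date(2015, 3, 1), None),
]
report = audit(examples, cutoff=date(2020, 1, 1))
print(report)
```

In this toy input, the second record is a duplicate of the first after normalization, and the third is both outdated and missing a license. Real pipelines would likely need fuzzier duplicate detection and a proper provenance record, which this sketch does not attempt.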