The data used to teach a machine learning model, whose quality and representativeness strongly shape the model's behavior.
Training data is the set of examples a machine learning model learns from. Its quality, accuracy, and representativeness directly determine how well, and how fairly, the model performs once deployed.
Governance of training data, covering its sourcing, quality, bias, and any personal data it contains, is central to both ISO/IEC 42001 and the EU AI Act, the latter of which sets data quality expectations for high-risk AI systems.
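As an illustration of what checking quality and representativeness can look like in practice, here is a minimal Python sketch. It is not a method prescribed by ISO/IEC 42001 or the EU AI Act. The function name `audit_training_data`, the `age_band` field, the reference shares, and the 10-point tolerance are all hypothetical choices made for the example.

```python
from collections import Counter


def audit_training_data(rows, group_key, reference_shares, tolerance=0.10):
    """Return basic quality and representativeness findings for a list of dict records.

    All names and thresholds here are illustrative assumptions.
    """
    total = len(rows)

    # Quality: count records that have at least one missing (None or empty) field.
    missing = sum(1 for r in rows if any(v is None or v == "" for v in r.values()))

    # Quality: count exact duplicates (records with identical keys and values).
    seen = Counter(tuple(sorted(r.items())) for r in rows)
    duplicates = sum(n - 1 for n in seen.values())

    # Representativeness: compare each group's share of the data
    # with the share expected in the population the model will serve.
    counts = Counter(r.get(group_key) for r in rows)
    skewed = {}
    for group, expected in reference_shares.items():
        actual = counts.get(group, 0) / total if total else 0.0
        if abs(actual - expected) > tolerance:
            skewed[group] = round(actual, 3)

    return {
        "rows": total,
        "missing": missing,
        "duplicates": duplicates,
        "skewed_groups": skewed,
    }


# Hypothetical toy dataset and reference population shares.
rows = [
    {"age_band": "18-34", "label": 1},
    {"age_band": "18-34", "label": 0},
    {"age_band": "18-34", "label": 1},    # exact duplicate of the first record
    {"age_band": "35-64", "label": None}, # missing label
]
reference = {"18-34": 0.4, "35-64": 0.4, "65+": 0.2}

report = audit_training_data(rows, "age_band", reference)
print(report)
```

In this toy dataset the report flags one incomplete record, one duplicate, and all three age bands as out of line with the reference shares, including the "65+" group, which is absent entirely. A real audit would also need to look at label accuracy, data provenance, and any personal data, which a simple count like this cannot assess.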