Browse by letter


Data & Model Concepts

We currently have 10 entries in this glossary.

Bias

Bias – Systematic errors in AI models that lead to unfair or inaccurate predictions.

Big Data

Big Data – Large, complex datasets that require advanced analytics techniques.

Dimensionality Reduction

Dimensionality Reduction – Techniques like PCA that reduce the number of input variables in a model.

Embeddings

Embeddings – A representation of data (e.g., words or images) in a lower-dimensional space to capture relationships.

Feature Engineering

Feature Engineering – The process of selecting and transforming variables to improve model performance.

Overfitting

Overfitting – When an ML model learns noise instead of patterns, performing well on training data but poorly on new data.

Test Data

Test Data – Data used to evaluate the performance of an AI model.

Training Data

Training Data – Data used to teach an AI model how to recognize patterns.

Underfitting

Underfitting – When an ML model is too simple to capture the underlying patterns in data.

Variance

Variance – A model’s sensitivity to fluctuations in training data, leading to overfitting.

[ Return | Return to module index ]