Section outline
-

Introduction
This section builds the essential vocabulary you’ll use to understand and communicate about machine learning work in a data science setting. You’ll learn how datasets are represented for modeling, what it means to “train” a model, and why performance can change outside the lab. These concepts form the foundation for reading ML documentation, collaborating with teams, and validating results responsibly.
Learning Objectives
-
Identify features, targets/labels, and training examples in a supervised ML dataset.
-
Distinguish models, parameters, and hyperparameters, and describe what happens during training.
-
Explain generalization, basic evaluation metrics, and why validation matters before production use.
-