Educational Cards

Learn from video content, text, and interactive tasks

Filters
Clear
Confusion matrix, precision, and recall

A model is only as trustworthy as your ability to judge it. Every prediction falls into one of four...

Intermediate Python for Data Science
Handling missing data and validating input

In pandas the everyday moves are to detect, then deliberately decide: python...

Intermediate Python for Data Science
Matrices: indexing, row and column sums

Real data is usually two-dimensional: a grid of rows and columns. A spreadsheet, a feature table, a...

Beginner Python for Data Science
Every metric is an estimate; confidence intervals

Every metric is an estimate from a finite sample, and real decisions hinge on whether a difference...

Advanced Python for Data Science
Why look first, and the figure/axes model

Exploratory data analysis (EDA) is the habit of plotting before modelling, because a picture...

Intermediate Python for Data Science
Choosing charts and headless plotting

Four charts do most of the everyday work, and each answers a different question: - a histogram...

Intermediate Python for Data Science
Print, input, and turning text into numbers

Python is the language most data scientists reach for first, and like any language you begin by...

Beginner Python for Data Science
Transpose and the ragged-row pitfalls

A close cousin of column iteration is the transpose , which turns rows into columns by swapping the...

Beginner Python for Data Science
Type hints and a numeric helper

As projects grow, functions become the way you organise them. A function with type hints states its...

Intermediate Python for Data Science
Lazy evaluation and yield

Until now you've held whole datasets in memory. But some data doesn't fit — a file larger than your...

Advanced Python for Data Science
Measurement scales and your first function

Everything read from a CSV is a string, so before you can compute you must parse each value into...

Beginner Python for Data Science
A config class that validates itself

The highest-value use of a class in data work is a validated configuration object . A pipeline has...

Advanced Python for Data Science