Glicko-2

🌿 Budding· last tended 22 Jan 2026ratingsmathssports

Elo gives every competitor a single number. Glicko-2 gives three: a rating, a rating deviation (how uncertain we are about it), and a volatility (how erratic their results have been). That extra structure matters.

A new player with a high RD moves a lot per game — we're still learning who they are.
A veteran with a low RD barely budges from a single upset.
RD grows during inactivity: certainty decays, so a player returning after a long break is treated as less known again.

I computed Glicko-2 ratings for 100+ football leagues by chronologically replaying every match through the algorithm — a clean way to turn raw results into a strength signal a model can actually use, and a feature with real expected value for match prediction.

The general lesson: a point estimate without its uncertainty is half a measurement. Same reason I won't act on a model until I validate before automating.