Glicko-2
๐ฟ Buddingยท last tended 22 Jan 2026ratingsmathssports
Elo gives every competitor a single number. Glicko-2 gives three: a rating, a rating deviation (how uncertain we are about it), and a volatility (how erratic their results have been). That extra structure matters.
- A new player with a high RD moves a lot per game โ we're still learning who they are.
- A veteran with a low RD barely budges from a single upset.
- RD grows during inactivity: certainty decays, so a player returning after a long break is treated as less known again.
I computed Glicko-2 ratings for 100+ football leagues by chronologically replaying every match through the algorithm โ a clean way to turn raw results into a strength signal a model can actually use, and a feature with real expected value for match prediction.
The general lesson: a point estimate without its uncertainty is half a measurement. Same reason I won't act on a model until I validate before automating.