The big picture
The Pivot Pipeline runs in five sequential stages. Each stage takes the output of the previous one and adds value — collecting raw data, normalizing it into master files, modeling per-player attributes, layering on scouting intelligence, reconciling players who appear in multiple leagues, and finally producing the EHM-importable database.
nhlmisc_*, edgeSpeed_*, ehgar_*; SHL uses shl_*; etc.). The master build joins every available data source by player ID, normalizes column types, applies hard-fail checks for required signals, and produces one consolidated master file per league.
How the modeling actually shapes ratings
Inside each modeling phase, the same general flow applies. Raw data signals are converted to per-60 rates, normalized within position groups, blended with reliability weighting (small samples regress toward neutral averages), and mapped to the 1–20 EHM attribute scale via normal-CDF distribution shaping.
For the full phase-by-phase breakdown — what data each phase reads, what attributes it produces, what the formulas actually do — see the Pipeline Guide.
Where to dig deeper
This page is the architectural overview. Three places to go from here:
-
→ The Pipeline Guide
Full visual walkthrough of every modeling phase, the CA tier system, the role suggestion engine, and how it all comes together. -
→ Data Sources by League
What data each of the 17 league pipelines uses, with three real example players per league showing what the data actually drove. -
→ Design Specs
Technical reference docs — NHL gold standard, AHL/European tier-2 adaptations, gap-filling techniques, porting checklist.