Discussion about this post

The AI Architect

Super clear walkthrough of NDCG! The normalization insight is key here because most teams I've worked with initially freak out when comparing DCG across different k values. One thing worth noting: graded relevance scoring can get subjective real fast when building ground-truth sets, especially in domains where 'partial relevance' isn't clear-cut. We ended up needing three annotators per query just to keep scores consistent.
