Checking if a recommendation system is actually good in practice is kind of toug...

bradly · 2025-03-23T20:57:09 1742763429

> you still will struggle to make your dev loop productive enough without throwing similar amounts of compute that the ~FAANGs do so as to validate whether that 0.2% improvement you got really meant anything or not

And do not forget the incredible of number of actual humans FAANG pays every day to evaluate any changes in result sets for top x,000 queries.

lmeyerov · 2025-03-23T14:15:04 1742739304

As someone whose customers do this stuff, I'm 100% for most academics chasing harder and more important problems

Most of these papers are specialized increments on high baselines for a primarily commercial problem. Likewise, they focus on optimizing phenomena that occur in their product, which may not occur in others. Eg, Netflix sliding window is neato to see the result of, but I rather students user their freedom to explore bigger ideas like mamba, and leave sliding windows to a masters student who is experimenting with intentionally narrowly scoped tweaks.