Amazing post! I hadn't thought this through much, but since you are normalizing the vectors and calculating the Euclidean distance, you will get the same results using a simple matmul: over normalized vectors, squared Euclidean distance is a linear transform of cosine distance, so both produce the same ranking.
Since you are just interested in the ranking, not the actual distance, you could also consider skipping the sqrt. Because sqrt is monotonic, this gives the same ranking, but will be a little faster.
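To make the equivalence concrete, here's a minimal NumPy sketch (the array names and sizes are made up for illustration): for unit vectors, ‖d − q‖² = 2 − 2·(d·q), so ranking by a single matmul (cosine similarity, descending) matches ranking by Euclidean distance (ascending), with or without the sqrt.

    import numpy as np

    rng = np.random.default_rng(0)
    docs = rng.normal(size=(1000, 64))                    # hypothetical doc embeddings
    docs /= np.linalg.norm(docs, axis=1, keepdims=True)   # L2-normalize rows
    q = rng.normal(size=64)
    q /= np.linalg.norm(q)                                # L2-normalize query

    cos_sim = docs @ q                                    # one matmul
    sq_dist = ((docs - q) ** 2).sum(axis=1)               # squared Euclidean, no sqrt
    dist = np.sqrt(sq_dist)                               # full Euclidean

    # For unit vectors: ||d - q||^2 = 2 - 2 * (d . q), a linear transform.
    assert np.allclose(sq_dist, 2 - 2 * cos_sim)
    # Descending cosine similarity == ascending Euclidean distance.
    assert np.array_equal(np.argsort(-cos_sim), np.argsort(dist))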
Using the phrase "without the benefit of hindsight" is interesting. The hardest thing with any technology is knowing when to spend the effort/money on applying it. The real question is: do you want to spend your innovation tokens on things like this? If so, how many? And where?
Not knocking this, just saying that it is easy to claim improvements if you know there are improvements to be had.
I recognize it as a quote from Brian Eno's A Year With Swollen Appendices, which is a great read even if you aren't an Eno fan (although I am, which admittedly makes me biased :P).
I don't really believe this is a paradigm shift with regard to train/test splits.
Before LLMs you would do a lot of these things; it's just become much easier to get started without training a model yourself. What the author describes is very similar to the standard ML product loop in companies, including how difficult it is to "beat" the incumbent model, because the incumbent has effectively been overfit to the very test set used to compare it against your own model.
“Normal search” is generally called BM25 in retrieval papers. Many, if not all, retrieval papers about modeling will use or list BM25 as a baseline. Hope this helps!
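For readers who haven't seen it, here is a rough sketch of the classic BM25 scoring function in Python (k1 and b are set to common default values; this is a toy version for illustration, real systems like Lucene/Elasticsearch, or a library such as rank_bm25, add proper tokenization, stemming, and indexing):

    import math
    from collections import Counter

    def bm25_scores(query, docs, k1=1.5, b=0.75):
        """Score whitespace-tokenized docs against a query with classic BM25."""
        tokenized = [doc.lower().split() for doc in docs]
        N = len(tokenized)
        avgdl = sum(len(d) for d in tokenized) / N
        # Document frequency: in how many docs each term appears.
        df = Counter(term for d in tokenized for term in set(d))

        scores = []
        for d in tokenized:
            tf = Counter(d)
            score = 0.0
            for term in query.lower().split():
                if term not in tf:
                    continue
                idf = math.log((N - df[term] + 0.5) / (df[term] + 0.5) + 1)
                score += idf * tf[term] * (k1 + 1) / (
                    tf[term] + k1 * (1 - b + b * len(d) / avgdl)
                )
            scores.append(score)
        return scores

    docs = ["the cat sat on the mat", "dogs and cats", "a treatise on information retrieval"]
    print(bm25_scores("cat retrieval", docs))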
I fully agree, except that I think this will still be a very “power user” thing. Perhaps that's also what you mean, since you reference Linux. But traditional search will be very important for a long while yet, imo.