It doesn't. This is particularly tough to search for and i'm not on social media. I'd be surprised if Le Cunn somehow thought these reasoning models were somehow architecturally unique from a good old LLM. It's all in the training regime, right?
In any case I'll take your word for it, but that's still surprising to me.