It doesn't. This is particularly tough to search for and i'm not on social media. I'd be surprised if Le Cunn somehow thought these reasoning models were somehow architecturally unique from a good old LLM. It's all in the training regime, right?
In any case I'll take your word for it, but that's still surprising to me.
Would be nice if the author could cite even one example of this as it doesn't match my experience whatsoever.