I do a lot of human evaluations. Lots of Bayesian / statistical models that can ...

fc417fc802 · 2026-01-07T22:48:00 1767826080

> in many cases it’s easier for a model to learn how to persuade than actually learn the right answers

So we should expect the models to eventually tend toward the same behaviors that politicians exhibit?

c0balt · 2026-01-07T23:42:39 1767829359

Maybe a happy to deceive marketing/sales role would be more accurate.

RA_Fisher · 2026-01-07T23:41:07 1767829267

100% (am a Bayesian statistician).

Isn’t it fascinating how it comes down to quality of judgement (and the descriptions thereof)?

We need an LMArena rated by experts.

Lerc · 2026-01-08T03:14:21 1767842061

As a statistician, do you you think you could, given access to the data, identify the subset of LMArena users that are experts?

RA_Fisher · 2026-01-08T12:09:52 1767874192

Yes, for sure! I can think of a few ways.

zqy123007 · 2026-01-08T01:14:40 1767834880

they always know, they just have non-AGI incentive and asymetric upside to play along...