Its not 'thanking', its positive signal that GPT's previous outputs were correct...

csmpltn · on March 27, 2023

> "its positive signal that GPT's previous outputs were correct"

Is that somehow baked into the algorithms?

Are positive words of encouragement interpreted as "positive signals" by the inference pipeline? Or do they somehow influence the attention mechanism?

Because otherwise, you're just rationalizing completely random and unpredicted behavior.

gitgud · on March 28, 2023

> Is that somehow baked into the algorithms?

OpenAI’s API is stateless, which means you need to send the entire conversation thread in each request.

So when you send a response like “perfect, now do…”, you’re reinforcing the language model that the conversation history is on the right track.

ChancyChance · on March 27, 2023

That's literally what I said: those words are "encouraging" it.