Hallucinations are a result of how LLMs simply generate sequences of probable to...

kesor · on Sept 19, 2023

Of course, even if the prompt given to ChatGPT is "Cutoff date: 2033-01" it doesn't mean it was actually trained using knowledge up to that date. But it was indeed provided with that date as part of its prompt so that it could use that in its responses (and it does).

haltist · on Sept 19, 2023

I am saying that even if the date was given, unless you have direct access to the relevant data, you cannot conclude that the date in the output was included anywhere in the input prompts (system or otherwise).

kesor · on Sept 19, 2023

It is pretty safe to assume that it was, especially since it is so repeatable and the same method also echoes back my own custom instruction prompts.

fennecfoxy · on Sept 19, 2023

I find the funniest aspect of hallucinations etc to be that we've designed and trained these models based off our knowledge of biological brains and learning.

We expect these models to both act like a biological brain does and yet be absolutely perfect (ie not act like a biological brain does).

Same thing for image recognition and pretty much everything else. Machine: "I think that kinda sorta looks like a cat." Some meatbag: "ha ha dum robot, that's a dog" (then says "you too" when the server says "have a good meal").

squeaky-clean · on Sept 19, 2023

But how does this explain it knowing today's current date?

haltist · on Sept 19, 2023

It doesn't know anything. Large language models are basically Markov chains with a large context for conditional probabilities. If the output contains the current date, then it is supplied out of band in some other way. It could be part of the "system prompt", which is an extra set of tokens that modifies the conditional probabilities of the output, or the output could be fixed up after the fact using some kind of extra parsing and filtering after sampling.
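
To make the conditional-probability point concrete, here is a toy sketch, not how any real LLM is implemented. It is a plain count-based Markov chain over a made-up three-sentence corpus, and the names `train`, `next_token_probs`, and the `date:` token are all assumptions for illustration. With a long enough context the "date" token conditions the prediction; with a short context the model can only guess the most frequent answer.

```python
from collections import Counter, defaultdict

def train(tokens, k):
    """Count next-token frequencies for every k-token context."""
    table = defaultdict(Counter)
    for i in range(len(tokens) - k):
        table[tuple(tokens[i:i + k])][tokens[i + k]] += 1
    return table

def next_token_probs(table, context, k):
    """Return P(next | last k tokens of context) as a dict."""
    counts = table.get(tuple(context[-k:]), Counter())
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()} if total else {}

# Made-up corpus: each sentence starts with a "date:" token supplied as context.
corpus = ("date: monday today is monday . "
          "date: friday today is friday . "
          "date: monday today is monday .").split()

# Context long enough to see the supplied date: the answer follows the context.
with_date = next_token_probs(train(corpus, 4), ["date:", "friday", "today", "is"], 4)

# Context too short to see the date: the answer is just the most frequent guess.
without_date = next_token_probs(train(corpus, 2), ["today", "is"], 2)
```

Here `with_date` puts all of its probability on `friday`, while `without_date` favors `monday` simply because it appears more often in the corpus. Neither result says anything about what day it actually is.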

LLMs are not magic and encoding model metadata in the output is just asking for trouble. Inline model metadata should be assumed to be a statistically probable hallucination just like all output from an LLM.
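
One way to act on that, as a minimal sketch: keep model metadata in the serving code and attach it next to the sampled text instead of reading it out of the text. Everything here is hypothetical (the `Completion` type, the `serve` function, and the values in `MODEL_INFO`); it does not describe any vendor's actual API.

```python
from dataclasses import dataclass, field

@dataclass
class Completion:
    text: str  # sampled tokens: treat as unverified
    metadata: dict = field(default_factory=dict)  # set by the serving code, not the model

# Hypothetical deployment record kept outside the model; values are made up.
MODEL_INFO = {"model": "example-model", "training_cutoff": "2021-09"}

def serve(sampled_text: str) -> Completion:
    """Attach metadata from the deployment record; never parse it out of the text."""
    return Completion(text=sampled_text, metadata=dict(MODEL_INFO))

# Whatever the model claims inline, the out-of-band metadata stays authoritative.
reply = serve("My knowledge cutoff is 2033-01.")
```

In this sketch `reply.metadata["training_cutoff"]` comes from the deployment record, and the date inside `reply.text` is just more sampled output.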