Friday Fun Thread for April 24, 2026

Be advised: this thread is not for serious in-depth discussion of weighty topics (we have a link for that), and it is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.

Has anyone found LLM performance to actually improve with memory on? At least on ChatGPT, I find it overfits pretty severely to my previous chats and noticeably increases the rate of hallucination. For example, if I ask it to solve a geometry question in one chat, then ask it the exact same structure of question with different parameters in a new chat, it will sometimes give an incorrect answer that seems to have been poisoned by the output of the previous chat.
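If you wanted to check this more systematically, here's a minimal sketch. The `ask_model` function is a hypothetical stand-in for whatever chat interface you're using (run once with memory on, once with memory off); the parameterized question template and the answer check are the actual point.

```python
import math
import random

def make_question(a: int, b: int) -> tuple[str, float]:
    """Same question structure, different parameters: right-triangle hypotenuse."""
    prompt = (f"A right triangle has legs of length {a} and {b}. "
              "What is the hypotenuse? Answer with just the number.")
    return prompt, math.hypot(a, b)

def ask_model(prompt: str) -> str:
    """Hypothetical stand-in for your chat interface (memory on or off)."""
    raise NotImplementedError

def error_rate(trials: int = 20, tol: float = 1e-2) -> float:
    """Fraction of structurally identical questions answered wrong."""
    wrong = 0
    for _ in range(trials):
        a, b = random.randint(3, 20), random.randint(3, 20)
        prompt, truth = make_question(a, b)
        try:
            answer = float(ask_model(prompt))
        except ValueError:
            wrong += 1  # non-numeric answer counts as wrong
            continue
        if abs(answer - truth) > tol:
            wrong += 1
    return wrong / trials
```

Compare the two error rates; if memory is really poisoning answers, the gap should show up over a couple dozen trials.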

I keep going back and forth on how powerful I think these models are. There are moments where I'm impressed by a seemingly new thought that it must have extrapolated or reasoned out for itself, since it's unlikely anyone would have written it out (usually some combination of too niche and too obvious). Yet every time this happened, I eventually asked it for a source, and sure enough it linked to a page where someone at some point did indeed spell it out explicitly on the internet. Each new model produces outputs whose cracks take me longer to find, but the cracks are always there, and they are generally of the same type as the example above: the kind suggestive of a lack of world model and of simple stochastic interpolation over existing text. I especially get the feeling that their understanding of the relations between objects in 3D space is rather poor. This apparently asymptotic improvement makes me think that what's needed is a rather drastic change in the fundamental structure of LLMs. But I'm just a layman, so I'm interested to hear others' thoughts and experiences.

I especially get the feeling that their understanding of the relations between objects in 3D space is rather poor.

Yeah, they are basically the platonically perfect wordcels. You can get around it with skills: render the results to an image and give it the image, or have a tool that checks details about the 3D space for it.
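The "tool that checks the 3D space" can be pretty dumb and still work. A sketch, assuming you keep the object positions in your own code and only ask the model to reason over the tool's answers (all names here are made up for illustration):

```python
from dataclasses import dataclass

@dataclass
class Obj:
    name: str
    x: float  # +x is right
    y: float  # +y is forward
    z: float  # +z is up

def spatial_relation(a: Obj, b: Obj) -> str:
    """Ground-truth answer to 'where is a relative to b?' for the model to consume."""
    parts = []
    if a.x != b.x:
        parts.append("right of" if a.x > b.x else "left of")
    if a.y != b.y:
        parts.append("in front of" if a.y > b.y else "behind")
    if a.z != b.z:
        parts.append("above" if a.z > b.z else "below")
    if not parts:
        return f"{a.name} is at the same position as {b.name}"
    return f"{a.name} is {' and '.join(parts)} {b.name}"

# The model calls this instead of guessing:
print(spatial_relation(Obj("mug", 1, 0, 1), Obj("table", 0, 0, 0)))
# -> "mug is right of and above table"
```

The point is just to move the spatial judgment out of the token stream and into code, so the model only has to verbalize a result it was handed.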

Gemini with memory on seems to make reasonable guesses about why I'm asking a question. So far that's only a little useful to me, but it could plausibly make it a better source of answers than I often am for the sorts of XY-problem questions I sometimes get from others.