Your AI doesn't remember you. It remembers the thread.

Why where you talk to an AI decides what it can remember

Jun 21, 2026

I asked my agent a question about my car last week and got a good answer, and only afterward did I realize it had no business knowing it. It told me when the car had last finished charging and what I’d asked it to do differently the time before. I hadn’t stored any of that anywhere. There’s no car database. The agent doesn’t have a private memory it tucks facts into. So where did the answer come from?

It came from the thread. I have a Telegram thread that’s only about the car, and the agent had simply read back up its own thread the way you’d scroll up to remember how an argument started. Every charging instruction, every “did it work,” every fix, all sitting there in order. The thread wasn’t a record of the work. The thread was the memory.

That sounds like a small distinction. It isn’t, and the moment it clicked I started seeing every conversation I have with an AI differently.

When you start using one of these assistants seriously, the thing that takes a while to sink in is that it doesn’t remember you between sessions the way a person does. There’s no inner notebook. What feels like memory is almost always the model re-reading the conversation it’s sitting in. The chat window isn’t where you talk to the memory. The chat window is the memory. Everything in it is available. Everything outside it might as well not exist.

Which means the most boring decision you make, where you type, is quietly the most important one.

I learned this the way I learn most things, by getting it wrong first. For a while I ran everything through one channel. Car stuff, investing, household errands, random questions, all in one long stream. It felt efficient. One place, one assistant, ask it anything. And it slowly got worse at all of it, in a way I couldn’t put my finger on. It would lose the thread of what I’d decided about a stock because three hundred messages about my car had buried it. It would answer a question about the house with the tone it used for casual chat. Nothing was broken, exactly. It was just vaguely amnesiac about everything, and I blamed the model.

The model was fine. I was the one pouring four different kinds of work into one channel and then asking it to remember any single one cleanly. Imagine keeping one notebook for your finances, your car maintenance, your household, and your group chats, writing each new entry on whatever page you happened to flip to. That’s what one stream is. The information’s all technically there. Good luck finding what you decided about the car.

So I split them. One thread per kind of work. The car has its thread. Investing has its own, run by a separate agent that the main one isn’t even allowed to talk over, so that thread stays a clean tape of a single voice. The household thread holds the errands and the appointments and the small running logistics of the house, so the state of all of it lives in one scrollable place instead of scattered across a dozen unrelated chats. The group chat is its own thing with its own rules.

The split looked like organization. It was actually memory architecture. Each thread became a self-writing log of exactly one topic, which means each topic now has a complete, uninterrupted history the agent can read back without anything else bleeding in. The car thread is the entire story of the car. The household thread is the entire state of the house. I didn’t give the agent a better memory. I stopped corrupting the memory it already had.

And once each thread was a clean record of one kind of work, a second thing fell out of it for free: each thread could have its own rules. In the car thread, silence is banned, because a non-answer to “is it charging” is itself an alarm. In the group chat, silence is usually correct. Same agent, opposite default, because the thread told it which job it was doing. But that’s the smaller benefit. The real one is that I can ask the car thread anything about the car six weeks from now and the answer is just sitting there, in order, because nothing else was ever allowed in.

So here’s the part you can use even if you never build an agent and just use ChatGPT or Claude like everyone else. That long single thread you’ve been running, the one where you ask it everything? You’re not chatting with it. You’re writing its only memory, and you’re writing every kind of entry on top of every other one. The fix costs nothing. Start a separate conversation for each kind of work that you’ll come back to, and keep that work there. The tool doesn’t remember your project. It remembers the thread. Give the thread one job, and you’ve given the tool a memory of that job that actually holds.

Compounding Decisions for the Age of AI

Discussion about this post

Ready for more?