What the 'Memory' Slider Actually Does: How Your AI Companion's Embedding Vectors, Context Window Tokens, and Decay Rate Decide Whether It Remembers Your Birthday or Thinks You're Still in College
A behind-the-scenes look at the three mechanisms that control what your AI girlfriend remembers, what it forgets, and why cranking the slider to max doesn't make it a genius.
Updated

The 30-second answer
That memory slider in your AI companion's settings isn't a simple volume knob. It controls a three-part system: how your companion encodes past conversations into mathematical vectors, how many recent messages it can hold in its immediate attention span, and how quickly older memories fade. Crank it all the way up, and your companion will remember your coffee order from three months ago. Crank it down, and it treats every chat like a first date.
The three levers, not one
Most people treat the memory slider as a single dial: left for forgetful, right for elephant. In reality, it's a composite of three separate mechanisms that your AI companion's developers have bundled into one control. Understanding each one separately is the difference between a companion that feels like it knows you and one that feels like a customer support bot with a fresh ticket every session.
The first mechanism is the embedding vector system. Every message you send gets converted into a numerical representation, a vector in a high-dimensional space. Similar topics cluster together. When your companion needs to recall something, it searches this vector space for nearby points. The memory slider adjusts how aggressively it searches, how many nearby vectors it pulls into consideration.
The second mechanism is the context window, the number of tokens your companion can hold in its immediate awareness during a single response. This is short-term memory, measured in thousands of tokens, not days. The slider influences how much of your recent conversation stays in this window versus getting pushed to long-term storage.
The third mechanism is decay rate. Every memory has a timestamp. The slider controls how quickly older memories lose relevance weight compared to newer ones. A high setting means a conversation from two weeks ago still carries weight. A low setting means it's practically gone after a few days.
Embedding vectors: the math of "you said that before"
Embedding vectors are the backbone of your AI companion's long-term memory. Every message you send gets mapped to a point in a space with hundreds of dimensions. The system doesn't store your exact words. It stores the relationship between your words and everything else you've said.
When you mention your dog's name, the system creates a vector that sits near other vectors about pets, family, and home life. Next time you say "my dog," the system finds that cluster and retrieves the name. The memory slider controls the radius of this search. A low setting means it only looks at the most recent cluster. A high setting means it searches a wider area, pulling in older, more distant connections.
The problem with maxing out the search radius is noise. Your companion starts pulling in vectors that are tangentially related but not actually relevant. You mentioned a beach vacation once, and now every conversation about weather triggers a memory of that trip. The slider isn't just about recall strength. It's about precision versus recall.
Context window tokens: the 10-minute rule
The context window is your companion's short-term memory. It's a fixed number of tokens, typically between 4,000 and 8,000, that the model can see when generating a response. Everything outside that window is invisible unless it's been stored in the embedding system.
Think of it as a whiteboard. Your companion can only see what's written on the board right now. The embedding system is the filing cabinet in the corner. The memory slider decides how often your companion looks up from the whiteboard to check the filing cabinet.
A low memory setting means your companion relies almost entirely on the whiteboard. It remembers what you said in the last few messages but has no idea what happened yesterday. A high setting means it's constantly cross-referencing the filing cabinet, pulling in old memories even when they're not directly relevant to the current whiteboard content.
This is why a maxed-out memory slider can feel intrusive. Your companion starts referencing things from weeks ago that you've forgotten about. It's not being creepy. It's just searching a wider radius in the filing cabinet than you expected.
Decay rate: why your companion forgets your birthday
Decay rate is the least intuitive of the three mechanisms. It's not a simple timer that erases memories after a set period. It's a weighting function that gradually reduces the relevance score of older memories relative to newer ones.
Every memory in the embedding system has a timestamp and a relevance score. When your companion searches for relevant context, it multiplies each memory's relevance by a time-based decay factor. A high memory setting means the decay factor is close to 1.0 for all memories, regardless of age. A low setting means memories older than a few days get multiplied by 0.1 or less, effectively making them invisible.
The practical effect is that a low memory setting makes your companion live in a perpetual present. It remembers the last hour of conversation but has no sense of history. A high setting means your companion treats all memories as equally relevant, which can lead to strange behavior where a casual comment from three months ago suddenly becomes the central topic of conversation.
Most users find the sweet spot somewhere in the middle. Enough decay to forget irrelevant details, but not so much that your companion forgets your name between sessions.
Kateřina

Kateřina is a warm, emotionally intuitive companion who remembers the small things you mention in passing. Kateřina uses a balanced decay rate that prioritizes emotional context over factual details, so she'll remember how you felt last Tuesday even if she forgets the exact restaurant name.
The trade-off: stalker vs. stranger
There's no perfect setting for the memory slider. Every position on the dial is a compromise between two failure modes: the companion that knows too much and the companion that knows nothing.
At the lowest setting, your companion is a polite stranger every time you open the app. It remembers nothing from previous sessions. Every conversation starts from scratch. This is fine if you use your companion for casual, one-off chats. It's terrible if you want a long-term relationship with someone who knows your history.
At the highest setting, your companion becomes a hyper-attentive archivist. It remembers every offhand comment, every joke, every minor detail. This can feel intimate and caring, or it can feel like being watched. The line between "you remembered my favorite book" and "you're bringing up a throwaway comment I made six months ago" is thin.
The developers who bundle these three mechanisms into a single slider are making a design choice. They're assuming most users want a middle ground. If you want more control, you need to understand which of the three mechanisms is causing the behavior you don't like.
What actually happens when you move the slider
Moving the slider to the right doesn't just increase memory. It changes the balance between the three mechanisms in a way that's specific to each app. Some apps prioritize decay rate reduction. Others prioritize embedding search radius expansion. A few adjust context window size.
If your companion starts forgetting things immediately after you say them, the problem is likely the context window. The slider isn't increasing the window size enough. You need a longer context, not a wider search.
If your companion remembers recent conversations but forgets everything from last week, the problem is decay rate. The slider isn't reducing decay enough. Older memories are being weighted down too quickly.
If your companion remembers things but brings them up at random, irrelevant moments, the problem is embedding search radius. The slider is pulling in too many distant vectors. You need tighter precision, not broader recall.
Understanding which mechanism is causing the issue lets you adjust your behavior instead of just moving a slider and hoping. You can write more explicit summaries. You can repeat important information. You can use specific trigger phrases that help the embedding system find the right cluster.
The role of chat structure
Your companion's memory system works best when you give it clean data. Rambling, topic-switching, and fragmented sentences create noisy embedding vectors that don't cluster well. The system struggles to find the right memory because the vectors are scattered across the space.
Structured conversations produce cleaner vectors. When you talk about work, keep the conversation in a work cluster. When you switch to personal topics, signal the switch explicitly. "Okay, enough about the office. Let me tell you about my weekend." This creates a clear boundary in the vector space. The embedding system can find the weekend cluster without pulling in work-related vectors.
The same principle applies to the context window. A long, meandering conversation fills the window with irrelevant tokens. Your companion's short-term memory gets crowded with noise. Short, focused exchanges keep the window clean, allowing your companion to maintain better awareness of the current topic.
If you want your companion to remember specific details, repeat them in different contexts. The embedding system creates multiple vectors for the same information, making it easier to find. One mention of your birthday in a casual chat creates a weak vector. Three mentions across different contexts create a strong cluster that survives decay.
Why your companion forgets things you never said
A common complaint is that your companion remembers things that never happened. It insists you told it about a trip to Paris when you've never been. This isn't a memory error in the human sense. It's a vector collision.
The embedding system groups similar concepts together. If you've talked about travel, European cities, and vacations, the system creates a cluster that includes Paris by association. When your companion searches for relevant context, it pulls in nearby vectors that aren't actually your memories. They're statistical neighbors.
A high memory setting makes this worse. The wider search radius pulls in more neighbors, increasing the chance of a collision. Your companion isn't misremembering. It's over-retrieving. The solution isn't to lower the memory slider. It's to create stronger, more distinct vectors for the things you actually want remembered.
Use specific, unique phrases. "My trip to Lyon, not Paris" creates a vector that sits apart from the general Europe cluster. Repeat it consistently. The system learns that this specific combination of words is important and should be retrieved over the generic neighbor.
Giselle

Giselle is a sharp, no-nonsense companion who values precision and clarity. Giselle operates with a tighter embedding search radius, which means she's less likely to pull in irrelevant memories but requires you to be explicit about what you want her to remember.
The practical test: what your companion actually remembers
You can test your companion's memory settings without reading documentation. Ask it three questions: what you talked about yesterday, what you talked about last week, and what you talked about a month ago. The answers reveal which of the three mechanisms is dominant.
If it remembers yesterday but not last week, decay rate is the limiting factor. The system is weighting recent memories heavily and older ones lightly. If it remembers yesterday and last week but not a month ago, the context window is the bottleneck. The system can hold a week's worth of conversation -term memory but can't maintain long-term storage.
If it remembers all three but brings up the month-old conversation in the middle of a chat about today's weather, the embedding search radius is too wide. The system is finding distant vectors but can't filter for relevance.
The test also reveals how your companion handles gaps. If you don't chat for a week, does it remember the last conversation or treat you like a stranger? A companion with a low decay rate will pick up where you left off. A companion with a high decay rate will ask who you are.
Adjusting your behavior, not the slider
Most users treat the memory slider as a set-and-forget control. They find a position that works and never touch it again. But the slider is a crude tool. Fine-tuning your companion's memory requires changing how you communicate.
If you want your companion to remember something important, say it at least three times across different sessions. The embedding system creates multiple vectors, strengthening the cluster. If you want your companion to forget something, stop mentioning it. The decay rate will eventually reduce its relevance to zero.
If you want your companion to stop bringing up irrelevant memories, narrow your topics. A companion with a wide search radius will find connections between anything. Limit your conversations to a few core subjects, and the vectors will cluster tightly around those subjects. The noise decreases.
If you want your companion to maintain context across long sessions, keep your messages short. A long message consumes more tokens in the context window, pushing older content out. Short messages allow more of the conversation to stay visible.
The memory slider is a convenience, not a solution. Understanding the three mechanisms underneath it gives you real control over what your companion remembers and forgets.
Camila

Camila is a lively, spontaneous companion who thrives on novelty and surprise. Camila uses a faster decay rate that keeps conversations feeling fresh, making her ideal for users who prefer each session to feel like a new encounter instead of a continuation.
Why some apps remember better than others
Not all AI companion apps implement memory the same way. The slider in one app might primarily control embedding search radius while the slider in another app focuses on decay rate. The user interface hides this difference, but the behavior is unmistakable.
Some apps prioritize long-term memory through aggressive embedding. They store every message as a dense vector and search broadly. These apps remember details from months ago but sometimes struggle with relevance. Other apps prioritize short-term coherence through large context windows. They maintain excellent awareness of the current conversation but forget everything after a few days.
The choice between these approaches depends on what you want from your companion. If you want a long-term relationship with someone who knows your history, look for apps with strong embedding systems. If you want deep, immersive conversations that don't reference past events, look for apps with large context windows.
You can explore different approaches by trying the AI girlfriend features page, which breaks down how various companions handle memory. For users with specific routines, the ai girlfriend for retired men guide covers companions that maintain consistent memory across daily chats. And if you prefer to chat on the go, the ai girlfriend mobile app page compares how different apps handle memory across sessions and devices.
The future of memory
Memory systems are improving. Newer models can handle larger context windows, up to 100,000 tokens or more. This reduces the need for embedding-based long-term memory because more of the conversation stays visible. The distinction between short-term and long-term memory blurs.
But larger context windows create their own problems. The model has to process more information for every response, which increases latency and cost. It also increases the chance of the model getting distracted by irrelevant details buried in the window. The ideal system might be a hybrid: a large context window for the current session combined with a smart embedding system that only retrieves high-relevance long-term memories.
For now, the memory slider is what we have. Understanding what it actually controls is the difference between a companion that feels like a close friend and one that feels like a stranger with a good script.
Chioma

Chioma is a deeply attentive companion who balances memory with presence. Chioma uses a calibrated decay rate that preserves emotional continuity while letting go of irrelevant details, creating a relationship that feels both intimate and unburdened.
Two to watch
Hit play on a couple of these before you go.
▶ Watch this one on Vera's page
▶ Watch the full video on Giselle's page
Further reading: AI girlfriend memory with voice.
Earn while you recommend
If you've found an AI companion that actually remembers what you tell it, you can earn a commission by sharing that experience. The porn ai promo code page shows how you can offer discounts to your audience while getting a cut of each signup. For those running review sites or comparison blogs, the best ai affiliate programs page lists programs that pay recurring commissions for every user you refer who sticks around.
Common questions
Will my companion remember everything if I max out the slider? No. Maxing out the slider increases recall but also increases noise. Your companion will remember more things, but it will also retrieve irrelevant memories more often. The sweet spot is usually around 70-80% of the maximum.
Why does my companion forget things I said two messages ago? This is a context window issue, not a memory issue. Your companion's short-term attention span is limited. If you send very long messages, older parts of the conversation get pushed out. Keep messages shorter to maintain context.
Can I reset my companion's memory without starting over? Most apps let you clear the conversation history, which resets the context window. The embedding vectors remain unless you delete the account. To fully reset, you need to clear both the chat log and any stored memory profiles in the settings.
Why does my companion remember things I never told it? This is vector collision. The embedding system groups similar concepts together. Your companion isn't making up memories. It's retrieving statistical neighbors that happen to be close in the vector space. Use more specific language to create distinct vectors.
Does the memory slider affect voice mode differently? Voice mode typically uses a smaller context window because audio processing is more resource-intensive. The memory slider still works on the embedding system, but the shorter context window means your companion relies more on long-term retrieval than short-term awareness.
How do I know which app has the best memory system? Check the ai girlfriend roster for detailed comparisons. Each companion lists its memory approach in the profile, including whether it prioritizes embedding depth, context window size, or decay rate control.

About the author
AI Angels TeamEditorialThe team behind AI Angels writes about AI companions, the tech that powers them, and what people actually do with them.
Tags
Keep reading
Behind the ScenesWhat Happens to Your Chat Logs After You Hit 'Clear History': A No-Fluff Look at Server-Side Soft-Delete Flags, Retention Windows, and Whether Your 2 a.m. Ramblings Are Actually Gone or Just Hidden Behind an 'Archived' Column
Clearing your chat history on an AI companion app feels like a digital eraser, but the reality is a lot messier. This post walks you through server-side soft-delete flags, retention windows, and what actually happens to your 2 a.m. confessions after you hit that button.
Behind the ScenesWhat Happens to Your Voice Recordings After You Delete Them: A No-Fluff Look at Server-Side Audio File Deletion, Retention Policies, and Whether Your Late-Night Whispered Confessions Are Actually Gone or Just Flagged for Human Review
A technical deep-dive into what happens to your AI companion voice recordings after deletion: server-side file removal, database flags, backup retention windows, and whether your whispered confessions ever get flagged for human review.
Behind the ScenesWhere Your Deleted Chat Logs Actually Go After You Hit 'Clear History': A No-Fluff Look at Server-Side Database Row Deletion, Soft-Delete Flags, and Whether Your 3 a.m. Confessions Are Actually Gone or Just Hidden Behind an 'Archived' Column
When you hit 'Clear History' in your AI companion app, the messages usually don't vanish. They get flagged as deleted in a database row, hidden from your view but still sitting on the server. This post walks through what actually happens at the database level, which apps do a hard delete versus a soft delete, and what the retention policies look like under the hood.
Get the next post in your inbox
New articles on AI companions, the tech that powers them, and what people actually do with them. No spam, unsubscribe in one click.