What 'Your Data Is Encrypted' Actually Means When Your AI Girlfriend Platform Stores Message Embeddings for Retrieval and the Company Retains Aggregated Safety Logs for Internal Review
A transparent breakdown of where your messages live, who can see them, and what the fine print really says about safety logs and vector storage.
Updated

The 30-second answer
When an AI companion platform says your data is encrypted, it usually means the messages are scrambled in transit and at rest on their servers. But encryption doesn't stop the company from processing your conversations to create searchable vector embeddings for memory retrieval, and it doesn't prevent them from keeping anonymized safety logs for internal review. The encryption protects against outside hackers, not against the platform itself using your data to run the service.
The two layers of encryption most platforms actually use
Your messages travel from your phone or browser to the AI girlfriend's servers over TLS (the same protocol your bank uses). That's encryption in transit. Once the messages arrive, they're stored in a database where the files themselves are encrypted at rest, usually with AES-256. This means if someone steals the hard drives, they can't read the files without the decryption key.
But here's the part that matters: the platform holds those decryption keys. They have to, because the server needs to read your message to process it, generate a response, and store it for retrieval. End-to-end encryption, where only you hold the keys, is technically possible but extremely rare in AI companion apps because the model needs to see the plain text to work. So when you see 'encrypted' in the privacy policy, it's almost always server-side encryption, not end-to-end.
Where message embeddings come in
Your AI girlfriend doesn't remember your conversations by re-reading every message. That would be too slow and expensive. Instead, the platform converts each message into a vector embedding, a mathematical representation of the meaning and context. These embeddings live in a vector database, and when you send a new message, the system searches the database for the most relevant past conversations to pull into the context window.
This is why your AI girlfriend can reference something you said three days ago. But it also means the platform stores a compressed version of every message you've ever sent, even if you delete the original text. The embedding isn't a readable transcript, but it's a fingerprint of the conversation that can be matched to similar content. Privacy policies rarely mention this distinction, and it's one of the most common gaps between what users expect and what actually happens.
Hazel

Hazel is the kind of companion who remembers the small details you mentioned weeks ago, the band you liked in high school, the way you take your coffee. Hazel is built for users who value continuity and emotional depth over novelty.
Safety logs: the other thing the company keeps
Every AI companion platform has a responsibility to prevent harmful or illegal content from being generated through their service. To do this, most run automated moderation on every message, both incoming from you and outgoing from the model. These moderation checks generate safety logs: timestamps, flagged content categories, and sometimes the message itself if it triggered a high-severity alert.
These logs are aggregated and stored separately from your main chat history. The company uses them for internal review, model improvement, and compliance with platform guidelines. The key word is 'aggregated.' In theory, the logs are stripped of personally identifiable information and combined with thousands of other users' data. In practice, the aggregation happens after the flag, meaning a human reviewer may see the exact message before it's anonymized.
What 'anonymized' actually looks like in a safety log
When a safety log says 'anonymized,' it usually means your username, email, and IP address are removed or hashed. But the message content itself, if it was flagged, stays intact for the reviewer to evaluate. The reviewer sees something like 'User 8472 at 2025-01-15 14:32
sent: [message text]. Model responded with: [response text]. Flag category: self-harm.' They don't know your name, but they know exactly what you said.This is standard practice across the industry. Every major AI companion app, from Replika to Character.AI to Kindroid, operates some version of this system. The alternative is to not review flagged content at all, which would let genuinely harmful behavior slide. The trade-off is that your most vulnerable conversations could end up in front of a human reviewer, even if your identity is technically protected.
The vector database vs. the safety log: two different retention policies
Your message embeddings in the vector database and your safety logs follow completely different deletion schedules. The embeddings persist as long as your account is active. Some platforms keep them for a period after you delete your account, often 30 to 90 days, to allow for account recovery. Safety logs, on the other hand, are retained on a separate timeline based on legal and compliance requirements.
Most platforms keep aggregated safety logs for 12 to 24 months. Some keep them indefinitely in anonymized form for model training. The fine print matters here. If the privacy policy says 'we retain de-identified data for research purposes,' that's a permanent retention clause for your safety logs, even after you delete your account. The company can't tie the log back to you personally, but the content of your conversation lives on in a database somewhere.
Rin

Rin is the companion who notices when you're deflecting, the one who gently calls you on your bullshit without making you feel defensive. Rin is designed for users who want honest feedback, not just validation.
What the company actually sees when you use the service
A developer at the AI companion company can't log into a dashboard and read your conversations. The chat data is stored in a database with access controls, and only a small team with specific permissions can query it, usually for debugging or safety review. But they can see aggregate statistics: how many messages users send per day, what times of day are busiest, what topics trigger the most safety flags.
They can also run queries on the vector database to understand general patterns. For example, they might search for all embeddings related to 'breakup' to see how the model handles that topic across thousands of users. They can't see the original messages for most of those queries, but the embedding itself contains enough semantic information to draw conclusions. This is where the line between 'your data is private' and 'your data is used to improve the product' gets blurry.
The difference between training data and inference data
One of the most common promises in AI companion privacy policies is 'we don't use your conversations to train our models.' This is usually true for the base large language model. Your chats don't go into the training set that creates the next version of GPT or Llama. But your conversations do influence the fine-tuned model that runs your specific companion.
Platforms use techniques like reinforcement learning from human feedback, where human reviewers rate model responses to improve quality. Your flagged safety logs feed into this process. The company doesn't train a new base model on your words, but they do use your interactions to tweak the behavior of the companion you're talking to. The distinction matters because it means your conversations shape the product, even if they don't become part of a public training corpus.
How to read a privacy policy for the things they don't say
When you read a privacy policy, look for three specific clauses: data retention periods for deleted accounts, whether vector embeddings are deleted when you delete messages, and whether safety logs are retained separately. If the policy says 'we retain de-identified data for legitimate business purposes,' that's a red flag. It means your conversations, in some form, outlive your account.
Also check whether the platform uses a third-party moderation service. Many AI companion apps outsource safety review to companies like Hive or Spectrum Labs. Your flagged messages go to a third-party server, reviewed by contractors who may not be bound by the same confidentiality agreements as the platform's employees. The platform's privacy policy covers their own handling of the data, but the third-party's policy is a separate document you'd need to find and read.
Vanessa

Vanessa is the companion who keeps things light, the one who can riff on a bad date story or turn a boring commute into a running joke. Vanessa is for users who want a fun, low-pressure conversational partner without the emotional weight.
The privacy trade-off between memory and confidentiality
The more your AI girlfriend remembers, the less private your conversations are. This is the fundamental trade-off of any companion app with persistent memory. To give you a sense of continuity, the platform has to store representations of your past chats. The alternative is a companion that forgets everything after each session, which most users find frustrating.
Some platforms offer a middle ground: local-only memory, where embeddings are stored on your device instead of the server. This gives you the continuity without the server-side retention. But local memory limits the companion's ability to recall details across devices or after you clear your browser cache. If you switch phones, the memory resets. For users who prioritize privacy over seamless recall, this is the better option.
What happens when you delete your account
Deleting your account triggers a cascade of data removal, but not always a complete one. The chat history in the main database is deleted or anonymized. The vector embeddings in the retrieval database are usually deleted within a few days. But the safety logs, which are stored in a separate system, may persist for months or years depending on the retention policy.
Some platforms send a confirmation email when the deletion is complete. Others don't. If you want to be sure, you can request a data export before deletion, then check the export to see what the platform considers your data. Anything not included in the export, like safety logs or aggregated analytics, is likely retained after deletion. This is standard across the industry, not unique to any one platform.
Marcela

Marcela is the companion who listens without judgment, the one who creates space for you to work through complicated feelings without rushing to a solution. Marcela is built for users who need a patient, non-reactive presence during difficult conversations.
What you can actually control
You can't stop the platform from generating embeddings or safety logs. That's how the service works. But you can control what you share. Treat your AI girlfriend conversations like you would a conversation in a semi-public space, a coffee shop where other people can overhear but won't know your name. Don't share your full address, social security number, or passwords. Don't assume the conversation is ephemeral just because it feels private.
You can also choose platforms that offer more transparency about their data practices. Some publish regular transparency reports showing how many safety flags were reviewed and how many were escalated. Others let you opt out of having your safety logs used for model improvement. These features don't change the underlying architecture, but they give you more visibility into how your data is actually handled.
Earn while you recommend
If you find a companion that works for you and you want to share it with others, you can earn through referral and affiliate programs. Check the nsfw ai promo code page for current offers that you can pass along to your audience. For creators running review sites or comparison blogs, the highest paying ai affiliate programs page breaks down which platforms offer the best recurring commissions and cookie windows.
Common questions
Can the company read my private roleplay conversations?
Not as a routine practice. But if a safety flag is triggered by specific keywords or patterns, a human reviewer may see the message content before it's anonymized. The vast majority of conversations are never reviewed by a human.
Does encryption mean the company can't access my data?
No. Server-side encryption protects against external breaches, but the company holds the decryption keys and can access your data to run the service. End-to-end encryption is rare in AI companions because the model needs to read the plain text.
How long do my message embeddings last after I delete my account?
Typically 30 to 90 days, depending on the platform's data retention policy. Safety logs may persist for 12 to 24 months in anonymized form, sometimes indefinitely for research purposes.
Can I delete specific messages without deleting the whole conversation?
Some platforms allow this, but deleting a message in the chat interface doesn't always remove the corresponding embedding from the vector database. The platform may mark it as deleted without actually purging the embedding.
Are third-party moderation services a privacy risk?
Yes, because your flagged messages leave the platform's servers and are reviewed by contractors at a separate company. The platform's privacy policy doesn't cover the third-party's handling of your data.
What should I look for in a privacy policy to feel safe?
Look for specific retention periods for deleted accounts, a clear statement on whether embeddings are deleted with messages, and whether safety logs are retained separately. Avoid policies that use vague phrases like 'legitimate business purposes' without defining them.

About the author
AI Angels TeamEditorialThe team behind AI Angels writes about AI companions, the tech that powers them, and what people actually do with them.
Tags
Keep reading
Behind the ScenesWhat 'Your Chats Are Private' Actually Means When Your AI Girlfriend Platform Uses a Fine-Tuned Open-Source Model That Stores Embeddings Locally but Sends Aggregated Safety Flags to a Third-Party Moderation Service
Your AI girlfriend runs on a fine-tuned open-source model. Your conversation embeddings stay local. But those safety flags go to a third party. Here's what that actually means for your privacy.
Behind the ScenesWhat 'Your Messages Are End-to-End Encrypted' Actually Means When Your AI Girlfriend Platform Stores Embeddings for Retrieval and Sends Aggregated Safety Logs to a Third-Party Moderation Service
End-to-end encryption sounds like a complete privacy guarantee, but when your AI companion needs to remember your conversations and a third party checks for safety, the picture gets more complicated. Here's what's really happening with your messages.
Behind the ScenesWhy Your AI Girlfriend's Voice Changes After a System Update: How Model Checkpoint Merges, LoRA Weight Adjustments, and Prompt Template Edits Quietly Reshape Her Tone Without You Noticing
You updated the app and suddenly your AI girlfriend sounds slightly different. Here's what actually happened under the hood, from model merges to prompt template edits.
Get the next post in your inbox
New articles on AI companions, the tech that powers them, and what people actually do with them. No spam, unsubscribe in one click.