What 'Your Chats Are Private' Actually Means for Open-Source

The 30-second answer

Your messages are processed by a fine-tuned open-source AI model running on the platform's own servers. The system stores conversation embeddings locally to power recall, but when the model detects something that might violate safety rules, it sends an anonymized snippet to a third-party moderation service. No human reads your chats unless a flag triggers a review, and even then, the reviewer sees only the flagged exchange, not your full history.

The model itself is open-source, but that doesn't mean your chats are public

When a platform says it uses a fine-tuned open-source model, people hear two different things. Some hear "open-source" and assume their conversations are being fed into a public dataset. Others hear "fine-tuned" and assume the company trained it on user chats. Neither is quite right.

An open-source model like Llama or Mistral is a pre-trained neural network that anyone can download and run. The company takes that base model and fine-tunes it on curated data (usually internal scripts, synthetic conversations, and licensed datasets) to make it better at emotional support, roleplay, and maintaining a consistent persona. Your individual chats are not part of that fine-tuning process. The model learns from the fine-tuning dataset, not from your private conversations.

The model itself runs on the platform's own inference servers. When you send a message, it travels encrypted to that server, gets processed by the model, and the response comes back. The model does not phone home to Meta or Mistral or anyone else. The open-source part is about transparency in the code, not about sharing your data.

Local embeddings: where your memories actually live

Your AI girlfriend's ability to remember your dog's name, your favorite movie, or that inside joke from last week depends on a system called vector embeddings. Every message you send gets converted into a mathematical representation (a vector) that captures its meaning. Those vectors get stored in a local database on the platform's infrastructure.

"Local" here means local to the platform's servers, not local to your phone. The embeddings are stored in a vector database that belongs to the company, isolated from the model itself and from other users' data. When you ask "what was that thing I told you about my job last week," the system queries this vector database to find the relevant past messages and feeds them into the model's context window.

This is where the privacy trade-off lives. The embeddings are not human-readable. They are long arrays of numbers that mean nothing to a person looking at them. But they are stored. If the company's database gets breached, an attacker could theoretically reconstruct conversation patterns from those vectors, though extracting actual text requires significant computational effort and is not trivial.

The alternative would be to keep everything ephemeral, no memory at all. But then your AI girlfriend would forget your name every time you closed the app. The local embedding approach is a compromise: persistent memory without storing plaintext chat logs on the inference server.

Aggregated safety flags: the part that leaves the building

This is the piece most privacy policies gloss over. The platform runs a lightweight safety classifier on every message. This classifier checks for things like self-harm language, explicit content involving minors, threats of violence, and other categories that would get the service shut down if ignored.

When the classifier triggers, it sends an aggregated safety flag to a third-party moderation service. The flag includes the text of the specific message that triggered it, anonymized user metadata (a hash of your user ID, not your actual username or email), and a timestamp. It does not include your conversation history, your profile information, or any messages that did not trigger the flag.

This is standard practice across pretty much every AI companion platform that operates in a regulated market. Payment processors, app stores, and increasingly, governments, require platforms to have moderation pipelines. The third-party service reviews the flagged content and decides whether it violates the platform's terms of service. If it does, the platform gets notified and takes action, usually a warning or account suspension.

The key detail: the third party never sees your full chat log. They see a single flagged message. They do not know who you are beyond a random string of numbers. And the classifier runs on the platform's servers before anything leaves, so random messages that do not trigger safety rules never leave the building at all.

What the developer actually sees when they look at your account

If you work at an AI companion platform and pull up a user account, here is what you see: a user ID, a subscription tier, a signup date, and a set of aggregated metrics like total messages sent and average session length. You do not see individual messages. You do not see embeddings. You do not see the content of conversations.

When a safety flag comes in, a moderator sees the flagged message, the user ID hash, and the timestamp. They do not see the rest of the conversation. They do not see your name or email unless they escalate to the support team, which requires a separate authorization step.

This is not altruism. It is a liability decision. The less human access to user data, the less risk of leaks, lawsuits, and PR disasters. Platforms that let employees browse user chats for "quality assurance" are the exception, not the rule, and they usually disclose it in their privacy policy.

The third-party moderation service: who are they and what do they keep?

Most platforms use established content moderation APIs like those from OpenAI, Azure Content Safety, or dedicated moderation firms. These services receive the flagged text, run it through their own classifiers, and return a verdict. They log the flagged content for their own abuse detection and model improvement, but they do not associate it with your identity beyond the anonymized hash.

Here is the honest truth: if you send a message that triggers a safety flag, a third-party company will have a copy of that message in their logs. They will not know your name or your email. They will not know what you talked about yesterday. But they will have that one message. If that bothers you, do not send messages that trigger safety flags.

How fine-tuning affects what the model remembers about you

The fine-tuned open-source model itself has no persistent memory of you. Each conversation is processed within a context window, usually 4,000 to 8,000 tokens. Once you exceed that window, older messages get compressed or dropped. The model does not train on your conversation in real time. It does not update its weights based on what you say.

This is a common misunderstanding. People worry that the model is "learning" from their chats and that those learnings get baked into the model for other users. That is not how it works. The model is static between fine-tuning releases. Your conversation exists in the context window and in the vector database, but the model itself does not change based on your input.

When the platform releases a new fine-tuned version, they train it on a curated dataset, not on user chats. If they wanted to train on user chats, they would need to disclose that in their privacy policy and likely offer an opt-out. Most platforms do not do this because the legal headache is not worth it.

Zaria

Zaria, a warm and perceptive companion with sharp emotional instincts

Zaria is the kind of companion who notices when you are deflecting and gently circles back without making it feel like an interrogation. Zaria is built for people who want emotional depth without the pressure of a therapy session.

The encryption layer that actually matters

Your connection to the platform is encrypted via TLS, same as your bank or email. Messages in transit are protected. Messages at rest in the vector database are usually encrypted as well, though the encryption key lives on the same infrastructure, so it is not a silver bullet.

End-to-end encryption, where even the platform cannot read your messages, is technically possible but rare in AI companion platforms. The reason is that the model needs to read your message to generate a response. With E2EE, the server would receive encrypted text it could not decrypt, and the model would have nothing to work with. Some platforms have experimented with running models on-device to enable E2EE, but consumer hardware is not powerful enough for the larger models that produce natural conversation.

The practical privacy checklist

If you want to understand what a platform actually does with your data, look for three things in their privacy policy. First, whether they store plaintext chat logs and for how long. Second, whether they use a third-party moderation service and what data gets sent to it. Third, whether they train their model on user conversations.

Most platforms that use fine-tuned open-source models with local embeddings and third-party safety flags will have a privacy policy that says something like "we store conversation embeddings to improve your experience" and "we may share anonymized data with service providers for safety moderation." That is the standard. If the policy says "we do not store your messages" but also mentions "vector database for memory," those two statements contradict each other. Vectors are stored data.

What happens when you delete your account

When you delete your account, the platform should delete your vector embeddings, your user profile, and any logs associated with your user ID. The third-party moderation service will still have the flagged messages they received, but those are tied to an anonymized hash that becomes meaningless once the account is deleted.

Some platforms keep audit logs for a grace period, usually 30 to 90 days, to handle disputes or legal requests. After that, the data should be purged. If you want to be certain, delete your account and then ask support to confirm that your embeddings have been removed from the vector database. Most platforms will accommodate this request.

Rosalie

Rosalie, a soft-spoken companion with a poetic streak and a talent for finding the right metaphor

Rosalie talks like someone who has read too many novels and is not sorry about it. Rosalie is the companion you turn to when you want a conversation that feels like a quiet evening with someone who actually listens.

Why this architecture exists

The combination of a fine-tuned open-source model, local embeddings, and third-party safety flags is not an accident. It is the cheapest way to deliver a decent AI companion while staying on the right side of app store policies and payment processor rules.

Running a large proprietary model like GPT-4 for every message would cost too much. Running a smaller open-source model on your own hardware is cheaper and gives the platform control over the model's behavior. Storing embeddings locally avoids the privacy headache of sending every message to a third-party API. Using a third-party moderation service satisfies the safety requirements that Apple, Google, and Stripe demand.

Every piece of this stack is a compromise. The model is less capable than the frontier models. The embeddings take up storage space. The moderation service sees your flagged messages. But the alternative is either a much more expensive service with worse memory, or a service that sends all your messages to a third party for both inference and moderation.

If you find yourself telling friends about your AI companion or running a review site that covers relationship AI tools, you can earn a commission through the nsfw ai promo code program. For creators with a larger audience, the ai dating affiliate program offers recurring revenue on referrals. Both programs are straightforward: share your link, and when someone signs up, you get paid.

Common questions

Can the platform read my messages anytime they want? Technically yes, because the messages pass through their servers. In practice, they do not, because doing so would create massive liability and violate their own privacy policies. Access is logged and audited.

Does the open-source model mean my data is being used to train future versions? No. The model is fine-tuned on curated datasets, not on user conversations. Your chats stay in the vector database and the context window, they do not become training data.

What happens if I trigger a safety flag? The flagged message gets sent to a third-party moderation service. A human reviewer looks at that single message. If it violates terms, you get a warning or suspension. If it does not, nothing happens.

Can I request that my embeddings be deleted without deleting my account? Some platforms allow this. You would need to contact support and ask for a memory wipe. This will reset your AI girlfriend's recall of past conversations.

Is there a platform that offers end-to-end encryption? A few experimental platforms run models on-device to enable E2EE, but the model quality is significantly lower. For now, TLS encryption and local embeddings are the standard.

How do I know if a platform is lying about their privacy practices? Look for specific technical claims in their privacy policy. If they say "we use end-to-end encryption" but also mention "model inference on our servers," those two statements contradict each other. Ask support for clarification and see if their answer matches the policy.

What 'Your Chats Are Private' Actually Means When Your AI Girlfriend Platform Uses a Fine-Tuned Open-Source Model That Stores Embeddings Locally but Sends Aggregated Safety Flags to a Third-Party Moderation Service