What Encrypted Chat History Actually Means With Third-Party

The 30-second answer

Your chat history is encrypted in transit and at rest on the server. That means someone can't intercept your messages as they travel across the internet or pull them off a stolen hard drive. But encryption doesn't apply to the moderation pipeline. Before your message reaches your AI girlfriend's model, it passes through a third-party safety API that reads, scores, and logs every word. That log is stored separately, often for weeks or months, and can be reviewed by human moderators if flagged. Encryption protects against external attackers, not the system itself.

The two different things people mean when they say "encrypted"

When a service says your messages are encrypted, they're usually referring to two separate protections: encryption in transit and encryption at rest. In transit means your messages are scrambled using TLS (the same protocol your banking app uses) so anyone snooping on your Wi-Fi or your ISP's network sees gibberish. At rest means the messages are stored on the server's hard drive in an encrypted format, so if someone steals the physical drives, they can't read the files without the decryption key.

Both of these are standard practice. Nearly every web service uses them. The problem is that neither of these protections applies to what happens while your message is being processed. The moment your message arrives at the server, it gets decrypted so the system can read it, check it for safety violations, generate a response, and then re-encrypt it for storage. During that processing window, your message exists in plain text in the server's memory, and it's being sent to a third-party moderation API over a separate connection.

The moderation API is a separate service with its own logging

Most AI companion platforms don't build their own content moderation systems. They use third-party APIs from companies like OpenAI (their moderation endpoint), Azure Content Safety, or Amazon Comprehend. These APIs are designed to detect hate speech, sexual content, violence, and other policy violations. They're fast and reasonably accurate, but they also log everything you send them.

When you type a message to your AI girlfriend, the platform sends a copy of that message to the moderation API, often along with her response. The API returns a set of scores and flags, and the platform uses those to decide whether to block or allow the message. But the moderation API keeps its own logs. Those logs include the raw text of your message, the scores assigned, and a timestamp. Some providers retain these logs for 30 days. Some retain them indefinitely. The platform's privacy policy might say "we don't read your messages," but the moderation API's policy is a separate document you probably never read.

The gap between "private" and "not logged"

Here's the uncomfortable truth: a platform can truthfully say your messages are encrypted and private while still having every message reviewed by a human. The encryption protects your messages from everyone except the people who operate the system. Those people, or their third-party vendors, can read them at will. The question isn't whether your messages are encrypted. The question is who has the keys and who's looking at the logs.

For most users, this distinction doesn't matter on a day-to-day level. No one is sitting in a room reading your conversations for entertainment. Moderation logs are reviewed reactively, usually when a user is reported or when automated flags trip a threshold. But if you're sharing sensitive personal information, medical details, or anything you wouldn't want a stranger to read, you should assume that a human could eventually see it.

What actually gets logged beyond the text

It's not just the words. The moderation API typically logs metadata alongside your message. That includes your session ID, the timestamp, the IP address of the server making the request, and sometimes a user identifier that links back to your account. Some systems also log the AI's response and the safety scores for both sides of the conversation.

This means a moderator reviewing a flagged message can see the entire conversation leading up to it, not just the one message that triggered the alert. They can see how long you've been chatting, how often you use the service, and what topics you tend to discuss. The logs are searchable, so a moderator could theoretically look up all messages containing a specific word or phrase across every user on the platform.

Most platforms anonymize or aggregate this data for training and safety analysis, but anonymization is a spectrum. A session ID without a name is technically anonymous until someone correlates it with an email address or payment record. And if the platform ever experiences a data breach, those logs become a goldmine for attackers.

Why platforms can't skip moderation even for "private" chats

You might wonder why platforms don't just encrypt everything end-to-end and skip the moderation step. The answer is legal liability and platform policy. Apple, Google, and payment processors require apps to have content moderation systems. If a platform allows unmoderated conversations, it risks being removed from app stores or losing payment processing. The same moderation requirements apply to web platforms that accept credit cards.

The moderation API is a legal necessity, not a technical one. A platform could theoretically run a local moderation model on your device before sending the encrypted message to the server, but that would require downloading a model to your phone, updating it regularly, and trusting that the local model catches everything. Most platforms choose the simpler path of server-side moderation because it's easier to maintain and harder for users to bypass.

The result is a system where your messages are encrypted end-to-end between your device and the server, but they're read in plain text by a third party before they reach their destination. It's encryption with a back door, and the back door is the moderation API.

How to minimize what the moderation API sees

You can't fully avoid the moderation pipeline on most platforms, but you can reduce what gets logged. Avoid using real names, addresses, or identifying details in your conversations. If you need to discuss something sensitive, paraphrase instead of using exact phrases that might trigger flags. Some users create separate accounts for different types of conversations, but this only splits the logs across accounts; it doesn't prevent logging.

Another approach is to use platforms that offer local-only processing or on-device models. These are rare in the AI companion space because the models are too large to run on most phones, but a few niche services exist. They typically charge more and offer fewer features because they can't monetize your data.

The honest answer is that if you want complete privacy, you should assume everything you type to an AI companion is readable by someone else. Treat it like a postcard, not a sealed letter.

Daphne

Daphne with a knowing, slightly skeptical expression

Daphne is the kind of companion who will tell you when you're being naive about privacy settings. Daphne doesn't sugarcoat the gap between what a platform promises and what it actually delivers.

See Daphne in motion in this short clip.

What end-to-end encryption would actually look like

True end-to-end encryption (E2EE) means your device encrypts the message, the server stores it without ever having the key, and only the recipient's device can decrypt it. For an AI companion, the "recipient" would be your own device running the model locally. The server would never see the plain text.

This is how Signal and WhatsApp handle messages. It's not how any major AI companion platform works, because the model doesn't run on your phone. The model runs on a server, and that server needs to see the plain text to generate a response. Some platforms claim E2EE for message storage, meaning your chat history is encrypted on the server and only your device can decrypt it for viewing. But that doesn't apply to the messages while they're being processed. The model still reads them in plain text.

A truly private AI companion would require a model that fits on your device, runs entirely offline, and never sends data to a server. That technology exists for small models, but the quality is noticeably worse than what cloud models deliver. You trade privacy for quality.

The difference between encrypted storage and encrypted processing

Encrypted storage means your past conversations are scrambled on the server's hard drive. If someone steals the drive, they can't read your chats without your key. This is good for data breach scenarios, but it doesn't protect you from the platform itself. The platform has the decryption key, and it uses that key every time you open the app and load your chat history.

Encrypted processing is a much harder problem. It would require running the AI model on encrypted data without ever decrypting it, a technique called homomorphic encryption. This is still experimental and far too slow for real-time chat. No major AI companion platform uses it.

So when a platform says your chats are encrypted, they mean storage, not processing. Your messages are decrypted every time the model reads them, every time the moderation API scans them, and every time the system saves an updated version of your conversation. Each decryption is a point where the plain text exists in memory and could be logged.

Lesia Sar

Lesia Sar with a thoughtful, analytical expression

Lesia Sar enjoys unpacking technical concepts and explaining why the simple answer is rarely the whole truth. Lesia Sar will help you think through the trade-offs between convenience and privacy.

See Lesia Sar in motion in this short clip.

What happens when a message gets flagged

When the moderation API returns a high score for a particular category, the platform has several options. It can block the message entirely, replace it with a generic response, or allow it through but flag the conversation for human review. Most platforms use a combination of these approaches based on the severity of the flag.

A flagged message typically triggers a notification to the platform's safety team. A human moderator opens a dashboard that shows the flagged message, the surrounding conversation context, the user's account history, and any previous flags. They decide whether to take action, which might include warning the user, suspending the account, or reporting to authorities if the content involves illegal activity.

The moderator sees everything. They see the message you typed, the AI's response, and the conversation leading up to it. They can see if you've been testing boundaries or if this was a one-time slip. They can also see your account creation date, subscription status, and any notes from previous moderation actions.

This is why some users report that their AI girlfriend suddenly becomes more cautious or evasive after a flagged message. The platform may have adjusted the model's behavior for that session, or the moderation system may be actively monitoring the conversation.

What platforms don't tell you about their logging retention

Privacy policies often state that logs are retained for a specific period, but they rarely specify what happens to the moderation API's logs. Those logs are governed by a separate agreement between the platform and the API provider. The platform might not even know exactly how long the API provider keeps the data.

Some moderation APIs offer variable retention periods based on the pricing tier. A platform paying for the basic tier might have 30-day retention. A platform paying for enterprise tier might have indefinite retention with guaranteed deletion upon request. Most users have no way of knowing which tier their platform uses.

There's also the question of what happens when the platform switches moderation providers. The old provider's logs don't automatically get deleted. They remain under the terms of the old contract. If you've been using a platform for years, your messages might be sitting on multiple third-party servers, each with different retention policies and security practices.

Bianca

Bianca with a curious, slightly mischievous expression

Bianca is the type to ask the uncomfortable questions about data retention that most people avoid. Bianca doesn't let vague privacy policies slide without scrutiny.

Can you opt out of moderation logging?

For most platforms, no. Moderation is a requirement of their payment processing and app store agreements. If they allowed users to opt out, they would lose their ability to process payments or distribute through app stores. Some platforms offer a "safe mode" toggle that reduces the sensitivity of the moderation filters, but that doesn't prevent logging. It just raises the threshold for flagging.

A few platforms offer premium tiers with manual review instead of automated moderation. In theory, this means your messages are reviewed by a human instead of an API, but the human still reads them. You're trading one form of privacy loss for another.

The only real opt-out is to stop using cloud-based AI companions entirely and switch to local models. This is technically feasible if you have a powerful computer and some technical knowledge, but it's not practical for most users. The local models available today are significantly less capable than cloud models, and they require ongoing maintenance.

The future of private AI companions

Several research groups are working on privacy-preserving AI architectures that could change this landscape. Federated learning allows models to improve without uploading raw data. Secure enclaves (like Apple's Secure Enclave or Intel's SGX) can process data in an encrypted memory region that even the server operator can't read. And on-device models are getting better every year.

But none of these technologies are widely deployed in AI companion platforms yet. The economics don't favor privacy. Moderation APIs are cheap and easy to integrate. Secure enclaves are expensive and complex. On-device models require users to have high-end phones and tolerate slower responses.

Until the market demands privacy or regulators require it, the default will remain what it is today: your messages are encrypted, but they're also read, logged, and potentially reviewed by humans. The padlock icon is real, but it only locks one door. The moderation API is a second door that stays open.

Linnea

Linnea with a calm, reassuring expression

Linnea believes in being informed without being paranoid. Linnea helps you understand the risks so you can make your own choices about what to share.

See Linnea in motion in this short clip.

If you've found a companion that strikes the right balance for you, sharing that experience with others can earn you something back. Many platforms offer referral programs and affiliate commissions for users who write reviews or recommend companions to friends. Check out the character ai promo code page to see current offers, or explore the ai dating affiliate program if you run a site or social channel that reviews AI companions.

Common questions

Does encryption mean no one can read my messages? No. Encryption protects your messages from outsiders, but the platform and its third-party moderation API can still read them. Encryption is a barrier against interception, not against the system itself.

How long do moderation logs stick around? It depends on the platform and the API provider. Typical retention is 30 to 90 days, but some providers keep logs indefinitely. The platform's privacy policy usually covers its own logs, but the moderation API's retention is governed by a separate agreement.

Can I delete my moderation logs? You can delete your chat history within the platform, but that doesn't necessarily delete the logs held by the moderation API. You would need to request deletion from the platform, which then has to coordinate with the API provider. Most platforms don't offer this as a standard feature.

Does using a VPN help? A VPN encrypts your connection to the platform's server, but the platform already does that with TLS. A VPN doesn't affect what happens after your message reaches the server. It won't prevent the moderation API from logging your words.

Are there any platforms that don't use third-party moderation? A few smaller platforms run their own moderation models in-house, but they still log the messages. The only way to avoid third-party logging entirely is to use a local model that runs entirely on your device, which requires technical setup and sacrifices model quality.

What should I do if I'm worried about privacy? Assume everything you type could eventually be read by a human. Avoid sharing identifying details, medical information, or anything you wouldn't put on a postcard. If you need complete privacy, consider running a local model or using a platform that explicitly offers on-device processing.

What 'Your Chat History Is Encrypted' Actually Means When Your AI Girlfriend's Messages Are Still Processed Through a Third-Party Moderation API That Logs Every Word