
Fine-tune your AI chat output: understand repetition penalty settings, recommended values, and how they interact with other parameters.
The repetition penalty is a parameter in SillyTavern that discourages the language model from generating the same tokens or phrases repeatedly. When a token appears, the penalty increases its effective negative log-probability, making it less likely to be chosen again. This prevents loops like 'I am happy. I am happy. I am happy' and promotes diverse responses. The penalty applies to tokens, not whole words, so it can affect subword repetitions. SillyTavern's default value is typically 1.0 (no penalty), but many users set it between 1.02 and 1.2. Understanding this parameter is critical for roleplay and storytelling, where character voices should stay fresh. The penalty interacts with other sampling methods like top-p and temperature. If you set it too high (e.g., 1.5), the model might avoid necessary common words like 'the' or 'and', producing bizarre output. Too low, and you get endless loops. SillyTavern provides a slider for this in the 'Advanced Formatting' section of the AI Response Configuration. For context, the Hugging Face documentation on repetition penalty explains the underlying mechanism: [Hugging Face Blog](https://huggingface.co/blog/how-to-generate).
“The SillyTavern repetition penalty is a parameter that penalizes tokens the model has already generated, reducing repetitive text. Adjusting it between 1.0 and 1.2 typically balances coherence and diversity; values above 1.3 can make output incoherent.”
The ideal repetition penalty depends on the model you're using. For smaller models like Llama-2 7B or Mistral 7B, a penalty between 1.05 and 1.15 works well. Larger models like Llama-3 70B or Mixtral 8x7B can handle a wider range, often benefiting from 1.1 to 1.2. Fine-tuned roleplay models such as Mythomax or Tiefighter (based on Llama) perform best with 1.1 to 1.18. If you're using a model that tends to repeat, like some versions of Pygmalion, start at 1.15. For instruct-following models like Nous-Hermes, keep it below 1.1 to avoid interrupting instruction completions. Always test a few values with your specific prompt. A good starting point: set top-p to 0.9, temperature to 0.7, and repetition penalty to 1.1. Then adjust the penalty in 0.02 increments. Note that some models have built-in repetition penalties (e.g., GPT-4's 'frequency penalty'), but SillyTavern overrides those if you set a custom value. The key is to match the penalty to the model's inherent verbosity. A comprehensive study by the [MIT Technology Review](https://www.technologyreview.com) on text generation optimization suggests that no single value fits all; context length also matters.
Repetition penalty doesn't work in isolation. It interacts with temperature, top-k, top-p, and typical-p. Temperature controls randomness; high temperature (e.g., 1.5) makes the model more creative but also more prone to repetition if the penalty is low. Conversely, low temperature (0.5) makes it deterministic; repetition penalty becomes more critical to avoid loops. Top-k restricts sampling to the k most likely tokens; a low top-k (e.g., 20) combined with a high repetition penalty can cause the model to exhaust its token pool quickly, leading to incoherence. Top-p (nucleus sampling) selects tokens by cumulative probability; a low top-p (0.8) plus heavy penalty can force the model into unnatural choices. Typical-p, which targets the entropy of the distribution, can amplify repetition penalty effects. SillyTavern's 'Advanced Formatting' panel lets you adjust all these together. A recommended balanced setup: temperature 0.7, top-p 0.9, top-k 0 (disabled), repetition penalty 1.1. If you see repetition, increase penalty by 0.05. If output becomes random, decrease it. Always test with a few example prompts. Research from [Pew Research Center](https://www.pewresearch.org) on AI's tendency to repeat patterns underscores the importance of tuning these parameters for coherent dialogue.
Start chatting with a companion who actually remembers you.
Free. No tokens. No limits.
One frequent error is setting the repetition penalty too high (above 1.3). This forces the model to avoid common words, resulting in awkward phrasing like 'The feline sat upon the mat' instead of 'The cat sat on the mat.' Another mistake is ignoring the context length: longer contexts (e.g., 4096 tokens) accumulate more penalized tokens, so a moderate penalty of 1.1 can become overly aggressive by the end. Users also forget to adjust the penalty after changing models. A value that works for Llama-2 may break Mistral. Another pitfall is using repetition penalty with low temperature (below 0.5). Low temperature already reduces randomness; adding a high penalty can freeze the model into repeating the same few phrases. Some users set the penalty to 1.0 (off) and wonder why the model loops. SillyTavern's default is 1.0, so you must manually enable it. Lastly, don't confuse repetition penalty with context repetition penalty (a separate SillyTavern setting that penalizes tokens from earlier turns). Both can be used together, but start with the main penalty. A practical tip: if the model starts saying 'I think' repeatedly, increase penalty by 0.05. If it says 'I think' once and then 'I believe', leave it.
To effectively tune the repetition penalty, use SillyTavern's 'Quick Reply' feature to send the same prompt multiple times with different penalty values. Compare outputs side-by-side. Start with a short prompt (e.g., 'Tell me a story about a dragon') and generate 3-4 responses per setting. Look for diversity of vocabulary and sentence structures. If responses are too similar, increase penalty by 0.05. If they become nonsensical, decrease it. For roleplay scenarios, use a character card with a defined persona. Generate multiple replies to the same user message. A well-tuned penalty should keep the character's voice consistent but not repetitive. You can also use the 'Swipe' feature to quickly test variations. SillyTavern logs your settings per character, so you can switch between presets. A systematic approach: create three presets—Low (1.02), Medium (1.1), High (1.18)—and test each on your most common use case. Note that some models respond better to gradual penalties. For instance, Llama-3 70B shows improved coherence with 1.15. Keep a tuning log: write down model, penalty, temperature, and observed quality. Over time, you'll build a personal database.
If you find SillyTavern's steep learning curve and API-key setup frustrating, consider AIAngels as an alternative. AIAngels handles all model parameters internally, including repetition penalty, so you never have to tweak sliders. It offers 70+ curated companions with permanent memory and no content filters for adults. Premium plans start at $2.99/mo on annual, with unlimited text chat even on the free tier. You don't need to manage API keys or worry about hardware. While SillyTavern gives you granular control, AIAngels provides a polished experience out of the box. For users who just want conversational AI without the technical overhead, AIAngels is a compelling option.
Fine-tune your AI chat output: understand repetition penalty settings, recommended values, and how they interact with other parameters.
Start Chatting FreeEverything you need to know about our companions.
The default is 1.0, meaning no penalty is applied. You must manually increase it via the Advanced Formatting slider.
Go to AI Response Configuration > Advanced Formatting. There's a slider labeled 'Repetition Penalty' that ranges from 0.0 to 2.0.
No. Values above 1.3 often cause incoherence by avoiding common words. Optimal range is 1.05 to 1.2 for most models.
No. Smaller models (e.g., 7B) need lower penalties (~1.1). Larger models (70B) can handle up to 1.2. Test per model.
Yes. It works alongside temperature, top-k, and top-p. A balanced combo is temperature 0.7, top-p 0.9, penalty 1.1.
Repetition penalizes any token that appears, regardless of frequency. Frequency penalty scales with how often a token appears. SillyTavern uses repetition penalty.
Check other parameters. Low temperature (below 0.5) or very short prompts can cause repetition. Also ensure you haven't set context repetition penalty too high.
No, AIAngels handles all parameters automatically. This is intentional to provide a frictionless experience without manual tuning.
Verified reviews from real customers
I've tried a few AI companion platforms, and AI Angels stands out for how immersive and customizable it feels. The conversations are surprisingly natural, and the AI personalities actually maintain context better than most similar apps I've used. The uncensored chat and roleplay features are a big plus if you're looking for creative freedom without constant restrictions. The image generation is also impressive — fast, detailed, and customizable enough to create unique characters and scenarios. I especially liked the variety of companion personalities and how easy the interface is to use, even for beginners. That said, there's still room for improvement. Some responses can feel repetitive after long conversations, and a few premium features are a bit pricey compared to competitors. But overall, the experience feels polished, entertaining, and consistently improving with updates. If you enjoy AI companionship, virtual roleplay, or interactive fantasy experiences, AI Angels is definitely worth checking out.
AI Angels is a remarkable AI companion site offering vividly realistic experiences. The large variety of companions available will suit every imaginable taste. Pricing is reasonable and transparent. I highly recommend AI Angels.
Fun, life like , sexy , created the perfect girl
It's worth looking into for sure, you won't regret it!
Choice of features
Honestly one of the best AI girlfriend apps I've tried. The conversations feel surprisingly natural and the girls actually have personality. Definitely worth checking out if you're into AI companions.
well I love how they call me things like baby and love how it shows nudes and sex/porn.
realstic ai images and chats! amazing pics and nice girls to chat with
Amazing it is so emersave
The roleplay is very flexible. The AI will adjust to your attitude and no kink is out of bounds. I just wish you could customize a little more.
The best ! I love it
Definitely addicted to this. You will not feel lonely and great prices
It's okay tho