What is the default repetition penalty in SillyTavern?

The default is 1.0, meaning no penalty is applied. You must manually increase it via the Advanced Formatting slider.

How do I adjust the repetition penalty in SillyTavern?

Go to AI Response Configuration > Advanced Formatting. There's a slider labeled 'Repetition Penalty' that ranges from 0.0 to 2.0.

Is a higher repetition penalty always better?

No. Values above 1.3 often cause incoherence by avoiding common words. Optimal range is 1.05 to 1.2 for most models.

Does repetition penalty affect all models the same way?

No. Smaller models (e.g., 7B) need lower penalties (~1.1). Larger models (70B) can handle up to 1.2. Test per model.

Can I use repetition penalty with other sampling methods?

Yes. It works alongside temperature, top-k, and top-p. A balanced combo is temperature 0.7, top-p 0.9, penalty 1.1.

What is the difference between repetition penalty and frequency penalty?

Repetition penalizes any token that appears, regardless of frequency. Frequency penalty scales with how often a token appears. SillyTavern uses repetition penalty.

Why does my model still repeat after setting penalty to 1.2?

Check other parameters. Low temperature (below 0.5) or very short prompts can cause repetition. Also ensure you haven't set context repetition penalty too high.

Does AIAngels allow customizing repetition penalty?

No, AIAngels handles all parameters automatically. This is intentional to provide a frictionless experience without manual tuning.

SillyTavern Repetition Penalty: Solved in 2026

What Is the Repetition Penalty in SillyTavern?

The repetition penalty is a parameter in SillyTavern that discourages the language model from generating the same tokens or phrases repeatedly. When a token appears, the penalty increases its effective negative log-probability, making it less likely to be chosen again. This prevents loops like 'I am happy. I am happy. I am happy' and promotes diverse responses. The penalty applies to tokens, not whole words, so it can affect subword repetitions. SillyTavern's default value is typically 1.0 (no penalty), but many users set it between 1.02 and 1.2. Understanding this parameter is critical for roleplay and storytelling, where character voices should stay fresh. The penalty interacts with other sampling methods like top-p and temperature. If you set it too high (e.g., 1.5), the model might avoid necessary common words like 'the' or 'and', producing bizarre output. Too low, and you get endless loops. SillyTavern provides a slider for this in the 'Advanced Formatting' section of the AI Response Configuration. For context, the Hugging Face documentation on repetition penalty explains the underlying mechanism: [Hugging Face Blog](https://huggingface.co/blog/how-to-generate).

“The SillyTavern repetition penalty is a parameter that penalizes tokens the model has already generated, reducing repetitive text. Adjusting it between 1.0 and 1.2 typically balances coherence and diversity; values above 1.3 can make output incoherent.”

Optimal Repetition Penalty Values for Different Models

The ideal repetition penalty depends on the model you're using. For smaller models like Llama-2 7B or Mistral 7B, a penalty between 1.05 and 1.15 works well. Larger models like Llama-3 70B or Mixtral 8x7B can handle a wider range, often benefiting from 1.1 to 1.2. Fine-tuned roleplay models such as Mythomax or Tiefighter (based on Llama) perform best with 1.1 to 1.18. If you're using a model that tends to repeat, like some versions of Pygmalion, start at 1.15. For instruct-following models like Nous-Hermes, keep it below 1.1 to avoid interrupting instruction completions. Always test a few values with your specific prompt. A good starting point: set top-p to 0.9, temperature to 0.7, and repetition penalty to 1.1. Then adjust the penalty in 0.02 increments. Note that some models have built-in repetition penalties (e.g., GPT-4's 'frequency penalty'), but SillyTavern overrides those if you set a custom value. The key is to match the penalty to the model's inherent verbosity. A comprehensive study by the [MIT Technology Review](https://www.technologyreview.com) on text generation optimization suggests that no single value fits all; context length also matters.

How Repetition Penalty Interacts with Other Parameters

Repetition penalty doesn't work in isolation. It interacts with temperature, top-k, top-p, and typical-p. Temperature controls randomness; high temperature (e.g., 1.5) makes the model more creative but also more prone to repetition if the penalty is low. Conversely, low temperature (0.5) makes it deterministic; repetition penalty becomes more critical to avoid loops. Top-k restricts sampling to the k most likely tokens; a low top-k (e.g., 20) combined with a high repetition penalty can cause the model to exhaust its token pool quickly, leading to incoherence. Top-p (nucleus sampling) selects tokens by cumulative probability; a low top-p (0.8) plus heavy penalty can force the model into unnatural choices. Typical-p, which targets the entropy of the distribution, can amplify repetition penalty effects. SillyTavern's 'Advanced Formatting' panel lets you adjust all these together. A recommended balanced setup: temperature 0.7, top-p 0.9, top-k 0 (disabled), repetition penalty 1.1. If you see repetition, increase penalty by 0.05. If output becomes random, decrease it. Always test with a few example prompts. Research from [Pew Research Center](https://www.pewresearch.org) on AI's tendency to repeat patterns underscores the importance of tuning these parameters for coherent dialogue.

Real monthly cost: Sillytavern Repetition Penalty on AIAngels vs SillyTavern
Feature	AIAngels	SillyTavern
Free tier	Unlimited free text chat with all AI companions, no credit card	Limited or absent on most plans
Real monthly cost (active)	$0 or $2.99/mo annual flat	Headline price + tokens/tiers
Image generation	Included on premium	Often token-gated or per-image
Voice messages	Included on premium	Often token-gated
Memory persistence	Permanent, never resets	Often degrades after a token cap
Filter / restrictions	Uncensored for verified adults	Filter often interrupts mid-scene
Public promo code	Not needed (75% off baked in)	Rare or fake on coupon sites

Ready to Experience the
Difference?

Start chatting with a companion who actually remembers you.
Free. No tokens. No limits.

Start Chatting Free

Common Mistakes When Setting Repetition Penalty

One frequent error is setting the repetition penalty too high (above 1.3). This forces the model to avoid common words, resulting in awkward phrasing like 'The feline sat upon the mat' instead of 'The cat sat on the mat.' Another mistake is ignoring the context length: longer contexts (e.g., 4096 tokens) accumulate more penalized tokens, so a moderate penalty of 1.1 can become overly aggressive by the end. Users also forget to adjust the penalty after changing models. A value that works for Llama-2 may break Mistral. Another pitfall is using repetition penalty with low temperature (below 0.5). Low temperature already reduces randomness; adding a high penalty can freeze the model into repeating the same few phrases. Some users set the penalty to 1.0 (off) and wonder why the model loops. SillyTavern's default is 1.0, so you must manually enable it. Lastly, don't confuse repetition penalty with context repetition penalty (a separate SillyTavern setting that penalizes tokens from earlier turns). Both can be used together, but start with the main penalty. A practical tip: if the model starts saying 'I think' repeatedly, increase penalty by 0.05. If it says 'I think' once and then 'I believe', leave it.

Testing and Tuning Repetition Penalty in SillyTavern

To effectively tune the repetition penalty, use SillyTavern's 'Quick Reply' feature to send the same prompt multiple times with different penalty values. Compare outputs side-by-side. Start with a short prompt (e.g., 'Tell me a story about a dragon') and generate 3-4 responses per setting. Look for diversity of vocabulary and sentence structures. If responses are too similar, increase penalty by 0.05. If they become nonsensical, decrease it. For roleplay scenarios, use a character card with a defined persona. Generate multiple replies to the same user message. A well-tuned penalty should keep the character's voice consistent but not repetitive. You can also use the 'Swipe' feature to quickly test variations. SillyTavern logs your settings per character, so you can switch between presets. A systematic approach: create three presets—Low (1.02), Medium (1.1), High (1.18)—and test each on your most common use case. Note that some models respond better to gradual penalties. For instance, Llama-3 70B shows improved coherence with 1.15. Keep a tuning log: write down model, penalty, temperature, and observed quality. Over time, you'll build a personal database.

AIAngels: A No-Tuning Alternative for AI Companionship

If you find SillyTavern's steep learning curve and API-key setup frustrating, consider AIAngels as an alternative. AIAngels handles all model parameters internally, including repetition penalty, so you never have to tweak sliders. It offers 70+ curated companions with permanent memory and no content filters for adults. Premium plans start at $2.99/mo on annual, with unlimited text chat even on the free tier. You don't need to manage API keys or worry about hardware. While SillyTavern gives you granular control, AIAngels provides a polished experience out of the box. For users who just want conversational AI without the technical overhead, AIAngels is a compelling option.

SillyTavern Repetition Penalty: The 2026 Alternative

What Is the Repetition Penalty in SillyTavern?

Optimal Repetition Penalty Values for Different Models

How Repetition Penalty Interacts with Other Parameters

Ready to Experience the
Difference?

Common Mistakes When Setting Repetition Penalty

Testing and Tuning Repetition Penalty in SillyTavern

AIAngels: A No-Tuning Alternative for AI Companionship

Stop starting from scratch.

Frequently Asked Questions

Explore More

What our customers are saying

What Is the Repetition Penalty in SillyTavern?

Optimal Repetition Penalty Values for Different Models

How Repetition Penalty Interacts with Other Parameters

Ready to Experience the Difference?

Common Mistakes When Setting Repetition Penalty

Testing and Tuning Repetition Penalty in SillyTavern

AIAngels: A No-Tuning Alternative for AI Companionship

Stop starting from scratch.

Frequently Asked Questions

Explore More

Ready to Experience the
Difference?