See exactly how temperature, top-p, top-k, min-p, and repetition penalty reshape a model's next-token probabilities
Prompt
The weather today is▌
12 candidate next-tokens with fixed logits. Move the sliders to see how each sampling stage reshapes the distribution.
Sampling parameters
Flattens (high) or sharpens (low) the whole distribution.
Keep only the k most-likely tokens. 0 = disabled.
Keep the smallest set whose probabilities sum to p. 1.0 = disabled.
Drop tokens below p × (the top token's probability). 0 = disabled.
Demotes already-used tokens (tagged rep↓): sunny, warm.
Order of operations
Next-token distribution
—Bar width = final sampling probability after all filters. Filtered tokens are greyed.
As-is, no warranty. These apps are free under their listed license and run entirely in your browser. Use at your own risk — don't blame me if your PC catches fire, your dog runs away, or the math turns out wrong. Verify anything that actually matters. None of this is professional financial, medical, legal, or engineering advice.
When a language model generates text, it produces a probability for every possible next token. Sampling parameters decide which of those tokens are actually allowed to be picked, and how the odds are weighted. Most people treat them as vague vibes - “turn temperature down to make it less crazy.” This tool shows the actual mechanism.
Start from a fixed example distribution (the candidate next-tokens after “The weather today is”), then move the sliders and watch the bars change in real time.
The parameters are applied in a specific order, and order matters:
min_p x (the top token's probability).For informational purposes only. Not financial, medical, or legal advice. You are solely responsible for how you use these tools.