What is Top-K Sampling?

Top-K sampling limits the AI to choosing from the K most likely next words (strictly, tokens) at each step. Top-K=1: always picks the single most likely token (deterministic, also called greedy decoding). Top-K=50: samples from the top 50 options (more random). Higher K = more creative but less focused. Lower K = more predictable. Often combined with Temperature. Most developers use Top-P (nucleus sampling) instead, because it adapts the cutoff to how confident the model is; Top-K is simpler but cruder.
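
Under the hood, Top-K is just a filter on the model's output scores (logits) applied before sampling. Here is a minimal sketch in Python with NumPy, using a made-up five-token vocabulary (the scores are invented for illustration):

    import numpy as np

    def top_k_sample(logits, k, rng=None):
        """Sample one token index from the k highest-scoring logits."""
        if rng is None:
            rng = np.random.default_rng()
        # Keep only the indices of the k largest logits.
        top_indices = np.argsort(logits)[-k:]
        top_logits = logits[top_indices]
        # Softmax over the surviving candidates only.
        probs = np.exp(top_logits - top_logits.max())
        probs /= probs.sum()
        return rng.choice(top_indices, p=probs)

    logits = np.array([2.0, 1.5, 0.3, -1.0, -2.5])  # toy vocabulary of 5 "words"
    print(top_k_sample(logits, k=1))  # always 0: greedy, deterministic
    print(top_k_sample(logits, k=3))  # 0, 1, or 2: the bottom two can never appear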

When Should You Use This?

Most people don't adjust Top-K; Temperature and Top-P are better levers for most jobs. If you do use Top-K, set it low (1-10) for factual tasks and high (50-100) for creative writing. OpenAI's API exposes Top-P but not Top-K; Anthropic exposes Top-K (see the sketch below). Don't tweak it unless you understand the tradeoffs (Temperature is easier to reason about), and only adjust if the default outputs aren't working.
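
Since Anthropic exposes Top-K directly, one concrete way to set it is through the Anthropic Python SDK. A sketch (the model name is a placeholder; check the current model list), using a low K for a factual prompt:

    import anthropic

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
    message = client.messages.create(
        model="claude-3-5-sonnet-20241022",  # placeholder; use a current model
        max_tokens=200,
        top_k=5,  # only the 5 most likely tokens are candidates at each step
        messages=[{"role": "user", "content": "List three facts about the moon."}],
    )
    print(message.content[0].text)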

Common Mistakes to Avoid

  • Adjusting without testing—changing Top-K can make outputs worse, test first
  • Using with Temperature: they interact in compounding ways, so adjust one at a time (see the sketch after this list)
  • Too low—Top-K=1 makes AI repetitive and boring
  • Too high—Top-K=1000 lets AI pick random nonsense words
  • Not using Top-P instead—Top-P (nucleus sampling) is usually better
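
Why adjust one at a time? Temperature rescales the scores first, and Top-K then truncates the rescaled distribution, so the two knobs compound rather than act independently. A small NumPy sketch with the same toy logits as above:

    import numpy as np

    logits = np.array([2.0, 1.5, 0.3, -1.0, -2.5])

    def truncated_probs(logits, k, temperature):
        # Temperature reshapes the distribution; Top-K then cuts it off.
        scaled = logits / temperature
        top = np.argsort(scaled)[-k:]
        probs = np.zeros_like(scaled)
        probs[top] = np.exp(scaled[top] - scaled[top].max())
        return probs / probs.sum()

    print(truncated_probs(logits, k=3, temperature=0.7))  # sharper within the top 3
    print(truncated_probs(logits, k=3, temperature=1.5))  # flatter within the same 3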

Real-World Examples

  • Factual: Top-K=1, Temperature=0 → deterministic, same output every time
  • Balanced: Top-K=40, Temperature=0.7 → creative but coherent
  • Creative: Top-K=100, Temperature=1.0 → very random, might be nonsense
  • Default: Most APIs use Top-P instead, ignore Top-K
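
The first three settings translate directly into request parameters. A sketch (the preset names are invented for illustration):

    # Hypothetical preset names; the values mirror the examples above.
    PRESETS = {
        "factual":  {"top_k": 1,   "temperature": 0.0},
        "balanced": {"top_k": 40,  "temperature": 0.7},
        "creative": {"top_k": 100, "temperature": 1.0},
    }

    # e.g. client.messages.create(..., **PRESETS["balanced"])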

Category

AI Vocabulary

Tags

top-k-sampling, ai-parameters, creativity-control, llm-tuning
