Published onJanuary 8, 2026Inference-Time Hyperparameters in LLMstext-generationdecoding-strategiesinference-hyperparametersThis page explains the key inference-time decoding knobs (temperature, top-k/top-p, penalties, max tokens, and beam search) that control how an LLM trades off determinism, creativity, coherence, repetition, and length.