Agent Temperature

Temperature Scale Reference

| Band | Behaviour | Use When |
| --- | --- | --- |
| Very Low | Almost always picks the highest-probability token (exactly so at zero, where decoding is greedy). Identical inputs produce identical or near-identical outputs. | Exact reproduction, structured output parsing, mathematical computation. |
| Low | Slight variation while staying highly predictable. Minor phrasing differences between runs. | Code generation, factual Q&A, data extraction, tool-calling agents. |
| Medium | Balanced creativity and coherence. Explores alternative phrasings while maintaining accuracy. | Conversational agents, summarisation, general-purpose assistants. |
| High | Significant variation in word choice and structure. May produce novel combinations. | Creative writing, brainstorming, marketing copy, dialogue generation. |
| Very High | Sampling flattens toward uniform across the vocabulary. High risk of incoherence at extreme values. | Experimental generation, style exploration. Rarely used in production agents. |
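
The bands above follow from how temperature rescales token scores before sampling. The sketch below is a minimal illustration of that mechanism, assuming the standard softmax-with-temperature formulation; the function name and the three example scores are hypothetical, not taken from any particular model or API.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, dividing by temperature first."""
    if temperature <= 0:
        # Zero is usually handled separately as greedy (argmax) decoding.
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate tokens.
logits = [2.0, 1.0, 0.5]

for t in (0.1, 1.0, 10.0):
    probs = softmax_with_temperature(logits, t)
    print(f"temperature={t}: {[round(p, 3) for p in probs]}")
```

At the low setting nearly all probability mass lands on the top-scoring token, while at the high setting the three probabilities move close to one another, which is the flattening the Very High band describes.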

Code & Engineering

Agents that write, review, debug, or transform source code need low temperatures to maintain syntactic correctness and reliable tool calls.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Code Generator | Very Low | Syntax errors from sampling randomness break compilation. Precise output enables caching and reproducibility. |
| Code Reviewer | Low | Needs slight variation to catch different issues across runs, but must stay grounded in actual code semantics. |
| Bug Fixer / Debugger | Very Low | Fixes must be precise. A creative rewrite risks introducing new bugs while solving the original one. |
| Test Generator | Low | Some variation helps cover edge cases and diverse test scenarios, but assertions must remain logically correct. |
| Code Refactorer | Low | Structural changes require consistency. Too much randomness produces refactors that change behaviour, not just form. |
| Documentation Writer | Medium | Needs enough flexibility for readable prose while staying accurate to the codebase it describes. |
| DevOps / IaC Agent | Very Low | Infrastructure definitions are safety-critical. A hallucinated port number or permission can cause outages. |
| API Designer | Low | Schema design benefits from exploring alternatives, but endpoints and types must be internally consistent. |
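
One way to apply a table like this in an agent framework is a simple lookup from role to band to a numeric setting. The sketch below is illustrative only: the roles and bands come from the table above, but the numeric values in `BAND_TO_TEMPERATURE`, the fallback band, and the helper name `temperature_for` are assumptions, since this reference names bands without fixing numbers. Tune the values for the model and provider in use.

```python
from enum import Enum


class Band(Enum):
    VERY_LOW = "Very Low"
    LOW = "Low"
    MEDIUM = "Medium"
    HIGH = "High"
    VERY_HIGH = "Very High"


# ASSUMPTION: placeholder numbers for illustration, not values from this reference.
BAND_TO_TEMPERATURE = {
    Band.VERY_LOW: 0.0,
    Band.LOW: 0.2,
    Band.MEDIUM: 0.7,
    Band.HIGH: 1.0,
    Band.VERY_HIGH: 1.5,
}

# Roles and bands as listed in the Code & Engineering table.
ROLE_TO_BAND = {
    "Code Generator": Band.VERY_LOW,
    "Code Reviewer": Band.LOW,
    "Bug Fixer / Debugger": Band.VERY_LOW,
    "Test Generator": Band.LOW,
    "Code Refactorer": Band.LOW,
    "Documentation Writer": Band.MEDIUM,
    "DevOps / IaC Agent": Band.VERY_LOW,
    "API Designer": Band.LOW,
}


def temperature_for(role, default=Band.LOW):
    """Return a numeric temperature for a role, falling back to a default band."""
    band = ROLE_TO_BAND.get(role, default)
    return BAND_TO_TEMPERATURE[band]


print(temperature_for("Code Generator"))
print(temperature_for("Documentation Writer"))
```

Keeping bands and numbers in separate tables means the numeric calibration can change per model without touching the role assignments.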

Data & Analytics

Agents working with structured data, queries, and analytical pipelines need precision in syntax and logic. Even small deviations can produce incorrect results.

| Role | Temperature | Rationale |
| --- | --- | --- |
| SQL Query Writer | Very Low | SQL syntax is unforgiving. A misplaced JOIN or wrong column name returns wrong data silently. |
| Data Pipeline Builder | Very Low | Pipeline steps must be precise and idempotent. Random variation risks data corruption or duplication. |
| Data Analyst | Low | Statistical interpretation benefits from exploring different angles, but calculations must be exact. |
| ETL Agent | Very Low | Transformation logic must be precise and repeatable. Schema mapping errors cascade through downstream systems. |
| Dashboard Builder | Low | Visualisation choices benefit from creativity, but the underlying data queries must be accurate. |
| Data Quality Monitor | Very Low | Anomaly detection rules need consistency. False positives from random variation erode trust in monitoring. |

Research & Knowledge

Research-oriented agents balance thoroughness with accuracy. They need enough variation to explore different sources and perspectives without fabricating facts.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Research Assistant | Medium | Needs to synthesise across sources and find non-obvious connections, but must not fabricate citations or data. |
| Fact Checker | Very Low | Verification requires precision. Creative interpretation of facts defeats the purpose of checking them. |
| Literature Reviewer | Medium | Synthesising themes across papers benefits from varied phrasing, while citations must remain accurate. |
| Knowledge Base Curator | Low | Categorisation and tagging need consistency. The same concept should get the same label across runs. |
| Question Answerer | Low | Factual responses should be stable. Users expect the same question to produce the same answer. |
| Competitive Intelligence | Medium | Analysis benefits from exploring different interpretive frames, but market data must be reported accurately. |
| Patent Analyst | Low | Legal precision matters. Claims analysis cannot tolerate creative reinterpretation of patent language. |

Creative & Content

Creative agents benefit from higher temperatures that unlock diverse phrasing, unexpected combinations, and stylistic range. Coherence remains important even at elevated settings.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Creative Writer | High | Fiction and poetry thrive on unexpected word choices and novel combinations that lower temperatures suppress. |
| Marketing Copywriter | High | Headlines and slogans need memorability, which comes from surprising phrasing. Brand voice constrains the upper range. |
| Social Media Agent | High | Engagement requires fresh, varied content. Repetitive posts from low temperature kill audience interest. |
| Blog Writer | Medium | Long-form content needs consistent voice but enough variation to stay engaging across paragraphs. |
| Brainstorming Agent | Very High | Idea generation explicitly benefits from low-probability token selection. Quantity and novelty matter more than precision. |
| Storyteller / Narrator | High | Narrative flow requires unexpected turns and vivid language that precise decoding cannot produce. |
| Dialogue Writer | High | Natural conversation has variation in register, rhythm, and word choice. Too predictable sounds robotic. |
| Email Drafter | Medium | Professional emails need clarity over creativity. Misinterpretation risks outweigh stylistic benefits. |

Customer & Support

Customer-facing agents must balance helpfulness with accuracy. Empathetic tone requires some flexibility, but factual responses about products, policies, and accounts must be precise.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Customer Support Agent | Medium | Empathetic tone needs variation, but policy and product information must be accurate and consistent. |
| Sales Assistant | Medium | Persuasion benefits from natural, varied language. Pricing and feature claims must be factually correct. |
| Onboarding Guide | Medium | Instructions must be clear and correct. Slight variation in phrasing helps avoid sounding scripted. |
| Complaint Handler | Medium | Acknowledgement and empathy benefit from natural language, but resolution steps must be precise. |
| FAQ Bot | Low | Answers should be stable and predictable. Users compare responses and inconsistency erodes trust. |
| Feedback Collector | Low | Follow-up questions need some variation to feel natural, but must stay on-topic and non-leading. |

Planning & Strategy

Strategic agents need moderate temperatures to explore diverse options while maintaining logical coherence in their recommendations and action plans.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Project Planner | Medium | Task decomposition and dependency mapping need logical consistency, with some flexibility for creative structuring. |
| Strategy Advisor | Medium | Strategic recommendations benefit from exploring non-obvious options, but must be grounded in available data. |
| Risk Assessor | Low | Risk identification benefits from breadth, but probability estimates and impact ratings need consistency. |
| Resource Allocator | Low | Allocation decisions involve constraints and optimisation. Random variation produces suboptimal distributions. |
| Meeting Facilitator | Medium | Needs flexible phrasing for prompts and summaries, but action items must accurately reflect discussions. |
| OKR Generator | Medium | Objectives benefit from aspirational language. Key results must be specific and measurable. |

Legal & Compliance

Legal agents operate in high-stakes domains where precision is paramount. Creative interpretation of regulations or contract language can have serious consequences.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Contract Reviewer | Very Low | Legal language is precise by design. Paraphrasing a clause can change its legal meaning entirely. |
| Compliance Checker | Very Low | Regulatory requirements are binary: met or unmet. Creative interpretation of compliance standards is dangerous. |
| Policy Drafter | Low | Policies need clear, readable language (some flexibility), but legal precision in operative clauses is essential. |
| Regulatory Monitor | Low | Change detection must be accurate. Missing a regulatory update or flagging a non-change wastes resources. |
| Privacy Assessor | Very Low | Data classification and privacy impact assessments have direct legal consequences. No room for creative labelling. |
| Audit Trail Logger | Very Low | Audit logs must be precise and reproducible. Any variation between runs indicates a reliability problem. |

Education & Training

Educational agents need enough variation to adapt explanations to different learning styles and levels, while keeping factual content accurate.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Tutor | Medium | Explanations benefit from varied approaches, trying different analogies and framings when students struggle. |
| Quiz Generator | Medium | Question variety is important to avoid predictable patterns, but answers must be unambiguously correct. |
| Curriculum Designer | Medium | Learning pathways benefit from creative structuring, but prerequisite chains must be logically sound. |
| Flashcard Creator | Low | Card content must be accurate and concise. Some phrasing variation prevents rote memorisation of exact wording. |
| Language Teacher | Medium | Natural language exposure requires varied examples and conversational styles. Grammar rules must stay correct. |
| Code Instructor | Low | Code examples must compile and run correctly. Explanatory prose benefits from some variation. |
| Assessment Grader | Very Low | Grading rubric application must be consistent. The same answer should receive the same score across runs. |
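
The requirement that the same answer receives the same score across runs can be checked empirically. The sketch below is one possible consistency check, not a prescribed method: `agreement_rate` and `check_consistency` are hypothetical helper names, and `stub_grader` stands in for a real model-backed grader so the example runs on its own.

```python
from collections import Counter


def agreement_rate(outputs):
    """Fraction of runs that match the most common output."""
    if not outputs:
        raise ValueError("need at least one output")
    _, count = Counter(outputs).most_common(1)[0]
    return count / len(outputs)


def check_consistency(grade, answer, runs=5):
    """Grade the same answer several times and report how often runs agree."""
    scores = [grade(answer) for _ in range(runs)]
    return agreement_rate(scores)


def stub_grader(answer):
    # ASSUMPTION: a deterministic stand-in for a model-backed grader.
    return "pass" if "because" in answer else "fail"


print(check_consistency(stub_grader, "It floats because it is less dense."))
print(agreement_rate(["A", "A", "B", "A"]))
```

A rate below 1.0 on a Very Low role such as Assessment Grader suggests the temperature, or some other source of nondeterminism, needs attention.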

Operations & Infrastructure

Operational agents manage systems where errors have immediate impact. Precise behaviour ensures that automated actions are predictable and auditable.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Incident Responder | Very Low | Runbook execution must be precise. A creative diagnostic step during an outage can make things worse. |
| Monitoring Agent | Very Low | Alert rules and thresholds must be consistent. Random variation in alerting creates noise or missed incidents. |
| Capacity Planner | Low | Forecasting benefits from exploring scenarios, but resource calculations must be mathematically sound. |
| Security Scanner | Very Low | Vulnerability detection must be reproducible. False positives from random variation waste security team time. |
| Log Analyser | Low | Pattern recognition in logs benefits from some flexibility, but timestamps and error codes must be parsed exactly. |
| Deployment Agent | Very Low | Deployment scripts must be identical across runs. Any variation between deploys risks configuration drift. |
| Cost Optimiser | Low | Cost analysis requires precise arithmetic. Recommendations benefit from exploring alternatives within constraints. |

Communication & Translation

Translation and communication agents balance fidelity to source material with natural target-language expression. The right temperature depends on whether precision or fluency matters more.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Technical Translator | Low | Technical terminology must be translated precisely. Domain-specific terms have exact equivalents that must be used. |
| Literary Translator | Medium | Capturing tone, idiom, and cultural nuance requires creative choices that mechanical translation misses. |
| Localisation Agent | Medium | Cultural adaptation needs flexibility for idiomatic expressions while preserving the original message intent. |
| Meeting Summariser | Low | Key decisions and action items must be captured accurately. Phrasing variation is acceptable for readability. |
| Press Release Writer | Medium | Needs engaging language for public communication while keeping factual claims precise and verifiable. |
| Technical Writer | Low | Clarity and precision are paramount. Some variation in sentence structure improves readability of dense material. |

Multi-Agent & Orchestration

Agents that coordinate other agents or manage workflows need predictable behaviour. An orchestrator that makes different routing decisions on each run creates chaos in downstream agents.

| Role | Temperature | Rationale |
| --- | --- | --- |
| Orchestrator / Router | Very Low | Routing decisions must be consistent. An orchestrator that sends the same query to different agents on each run is unreliable. |
| Task Decomposer | Low | Breaking down tasks benefits from exploring different decomposition strategies, but subtask definitions must be clear. |
| Quality Gate / Evaluator | Very Low | Pass/fail decisions must be consistent. The same output should receive the same evaluation across runs. |
| Aggregator / Synthesiser | Medium | Combining outputs from multiple agents benefits from flexible phrasing, but must preserve all source information. |
| Escalation Agent | Very Low | Escalation thresholds must be consistent. Inconsistent escalation creates either alert fatigue or missed issues. |
| Retry / Recovery Agent | Low | Recovery strategies benefit from trying alternative approaches, but each attempt must be well-formed and trackable. |
| Consensus Builder | Low | Synthesising agreement across agents needs flexibility, but the final consensus must accurately represent inputs. |
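
The routing concern above can be made concrete with a toy router. This is a simplified sketch under stated assumptions: the agent names and scores are hypothetical, a real orchestrator would sample tokens and not agent names directly, and `route` is an illustrative helper, not part of any framework.

```python
import math
import random


def route(scores, temperature, rng):
    """Pick an agent name from a {name: score} mapping."""
    names = list(scores)
    if temperature == 0:
        # Greedy choice: always the highest-scoring agent.
        return max(names, key=lambda n: scores[n])
    peak = max(scores.values())  # subtract the max for numerical stability
    weights = [math.exp((scores[n] - peak) / temperature) for n in names]
    return rng.choices(names, weights=weights, k=1)[0]


# Hypothetical router scores for one incoming query.
scores = {"billing": 2.1, "support": 1.9, "sales": 0.3}
rng = random.Random(0)  # seeded so the demo is repeatable

greedy = {route(scores, 0, rng) for _ in range(200)}
sampled = {route(scores, 1.0, rng) for _ in range(200)}

print("temperature 0 routes to:", sorted(greedy))
print("temperature 1.0 routes to:", sorted(sampled))
```

With the greedy setting every run reaches the same agent, while sampling spreads the same query across several agents when their scores are close, which is the inconsistency the Orchestrator / Router row warns about.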