Skip to main content
Calibrate lets you simulate entire conversations with your agent using simulated users matching the personas of your target audience for specific scenarios defined by you. This guide walks you through setting up and running your first simulation.

Create an agent

From the sidebar, click AgentsNew agent. Configure the system prompt, select your STT, TTS, and LLM providers and add tools to the agent.
Learn more about agent configuration in Core Concepts: Agents
Agent Configuration

Create personas

From the sidebar under End-to-End Tests, click PersonasAdd persona.
Personas List
Define WHO the simulated user is - their personality, speaking style, and behavior.
Learn more about personas in Core Concepts - Personas
Add Persona Dialog

Create scenarios

Click ScenariosAdd scenario.
Scenarios List
Define WHAT the simulated user should accomplish during the conversation.
Learn more about scenarios in Core Concepts - Scenarios
Add Scenario Dialog

Create metrics

Click MetricsAdd metric.
Metrics List
Define how to evaluate whether the agent performed well.
Learn more about metrics in Core Concepts - Metrics
Add Metric Dialog

Create a simulation

Click SimulationsAdd simulation and enter a name.
Create Simulation
Select the agent, personas, scenarios, and metrics for your simulation.
Simulation Configuration
Click Create after filling in the details to save the simulation configuration. This creates the simulation but does not run it yet.

Run the simulation

After creating, you’ll see a Launch button. Click it to start a simulation run and choose between:
  • Text simulation — text-only conversations (faster)
  • Voice simulation — full STT/TTS pipeline with latency metrics
Launch Options
Each run creates one simulated conversation for each persona × scenario combination.

Text simulation

After launching a text simulation, the run page opens showing each conversation streaming in real-time.
Text Simulation Running
Once conversations complete, results show the metric evaluations for each row.
Text Simulation Results
Click any row to view the full transcript.
Transcript View
The Overall metrics show the aggregated values for each metric across all conversations.

Voice simulation

Voice simulations use the full STT/TTS pipeline for more realistic testing. After launching, conversations stream in real-time like text simulations.
Voice Simulation Running
Click any row to view the transcript with audio playback for each turn and the full conversation.
Voice Transcript with Audio
Once complete, voice simulations show results with the Performance tab displaying aggregated evaluation metrics across all conversations.
Voice Simulation Results
Switch to the Latency tab to view detailed timing metrics for each component:
  • stt/ttft and stt/processing_time — Speech-to-Text latency
  • llm/ttft and llm/processing_time — LLM response latency
  • tts/ttft and tts/processing_time — Text-to-Speech latency
Voice Simulation Latency Metrics
Voice simulations take longer than text simulations due to the full audio pipeline, but provide more realistic testing of your agent’s voice experience.

Next steps