Configuration
You need to create a config file that defines the following:- Instructions for your agent (system prompt)
- Tools available to your agent
- Personas that simulate different user types
- Scenarios that define conversation patterns
- Evaluation criteria to measure agent performance
- The STT, TTS, and LLM providers to use (for voice simulations only)
system_prompt
The system prompt that defines your agent’s behavior. This is the same prompt you use in production.
tools
A list of tools available to your agent. See the guide on Configuring Tools for how to set it up along with examples for different tool types.
personas
An array of user personas that simulate different types of users interacting with your agent. For a primer on personas, refer to Personas. Each persona has:
| Key | Type | Description |
|---|---|---|
label | string | A short name for the persona |
characteristics | string | Detailed description of who the persona represents and how they behave |
gender | string | Gender for voice simulations: male or female |
language | string | Language the persona speaks: english, hindi, or kannada (more coming soon) |
interruption_sensitivity | string | (Voice only) How likely the persona is to interrupt the agent mid-sentence: none (0%), low (25%), medium (50%), high (80%) |
scenarios
An array of scenarios that define different conversation patterns to test. For a primer on scenarios, refer to Scenarios. Each scenario has:
| Key | Type | Description |
|---|---|---|
name | string | A short name for the scenario |
description | string | Instructions for what the simulated user should do |
evaluation_criteria
An array of criteria used to evaluate the agent’s performance. Each criterion has:
| Key | Type | Description |
|---|---|---|
name | string | A short name for the criterion (used in results) |
description | string | What the criterion measures (used by the LLM judge) |
settings
Optional settings to control the simulation:
| Key | Type | Description |
|---|---|---|
agent_speaks_first | boolean | Whether the agent initiates the conversation (default: true) |
max_turns | number | Maximum number of agent messages after which the simulated conversation ends automatically (default: 10) |
stt, llm, and tts (voice simulations only)
For voice simulations, specify the STT, LLM, and TTS providers:
Full example
Refer to this sample for a text simulation and this sample for a voice simulation.Get started
- Simulation type — text (LLM-only) or voice (full STT → LLM → TTS pipeline)
- Config file — path to the config file you created in the previous section
- Provider — OpenRouter or OpenAI (text simulations only)
- Model — enter the model you want to use for the simulation (text simulations only)
- Parallel count — run multiple simulations simultaneously (default:
1) - Output directory — where results will be saved (defaults to
./out) - API keys — enter the API keys for the selected providers
Output
Once all simulated conversations complete, it displays the overall metrics aggregated across all simulations along with bar charts for visualization.


Resources
Personas
Learn how to create realistic user personas
Scenarios
Learn how to write effective scenarios