Create an agent
From the sidebar, click Agents → New agent. Configure the system prompt, select your STT, TTS, and LLM providers and add tools to the agent.
Create your first test case
Open the LLM Evaluation tab and click on Add test to create a new test.
Next Reply Test
Next Reply Test
These tests verify your agent responds appropriately to the last user
message given a conversation history defined by you by checking if the
agent’s response meets your criteria (for example, tone, content, or
accuracy).
Tool Invocation Test
Tool Invocation Test
These tests verify that your agent calls the correct tools with the right
parameters given a conversation history defined by you.
Create a next reply test
Next reply tests verify that your agent response adheres to your criteria given a conversation history defined by you.
Create a tool invocation test
Tool invocation tests verify that your agent calls the correct tools with the right parameters given a conversation history defined by you.
Run one test on one agent
Once the test is created, you can click on the play button to run that test.

Selecting the
Attach this test to the agent config checkbox will attach the
test to the list of all tests for the selected agent
Run all tests for one agent
Navigate to the Tests tab of the agent you want to test. You can add new tests by selecting the Add test button or run the existing tests by clicking the Run all tests button.


Find the best LLM for your agent
The tests above are run using the LLM configured for that agent. But it may not be the optimal model for your use case. You can compare the performance of different LLMs on your tests by clicking the Compare models button.


