Skip to main content
Calibrate lets you evaluate multiple STT providers simultaneously using your own dataset. This guide will walk you through creating an evaluation for your dataset.
Speech to Text Preview

Start a new evaluation

From the sidebar, click on Speech to Text to view all your evaluations. Click the New evaluation button to create a new evaluation.
Speech to Text Evaluations List

Configure settings

On the Settings tab, select the language and providers you want to compare:
Speech to Text Settings

Upload your dataset

Switch to the Dataset tab to add your audio samples along with the reference transcriptions.
Speech to Text Dataset Upload
You can also bulk upload your dataset as a ZIP file having the following structure:
your_dataset.zip
|-- audios/
|   |-- sample_1.wav
|   |-- sample_2.wav
|   |-- sample_3.wav
|-- data.csv
The data.csv should have two columns:
audio_filetext
sample_1.wavThis is the reference transcription for sample 1.
sample_2.wavThis is the reference transcription for sample 2.
sample_3.wavThis is the reference transcription for sample 3.
Click Download sample ZIP to get a template with the correct structure

Run evaluation

Click the Evaluate button to start the evaluation. You’ll be redirected to the results page where you can monitor progress in real-time. The Outputs tab streams the results for each file as it completes for each provider:
Speech to Text Outputs View

Leaderboard

Once all providers complete, the Leaderboard tab shows a comparison across all of them.
Speech to Text Leaderboard View

Next Steps