ASR Benchmark UI
Under the ASR Benchmark tab, you can view and manage all configured ASR benchmarks.
1. Package Configuration
1.1 Overview
On the main ASR Benchmark page, you’ll see all benchmark packages currently defined.
1.2 Create a New Benchmark Package
1. Click Create new ASR package.
2. Add Dataset – Each package can contain multiple datasets.
3. Configure each dataset with:
   - Dataset Identifier – A unique label.
   - Source & Model – Point to your audio dataset (e.g., from HuggingFace Datasets).
   - Dataset Configuration – Subset, split (train/test/validation), and the specific audio and transcription columns.
   - Normalizer – (Optional) Applies text normalization to the transcripts before they are compared.
Tip: For HuggingFace datasets, check the Viewer tab to find valid subset and split names (or query them programmatically, as shown below).
4. Save the package after adding the desired datasets.
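If you’d rather look these values up programmatically than through the Viewer tab, the HuggingFace datasets library can list them. A minimal sketch, using librispeech_asr purely as an example dataset:

```python
from datasets import (
    get_dataset_config_names,
    get_dataset_split_names,
    load_dataset,
)

# List the valid subsets (configs) of a dataset
print(get_dataset_config_names("librispeech_asr"))  # e.g. ['clean', 'other', 'all']

# List the valid splits for a given subset
print(get_dataset_split_names("librispeech_asr", "clean"))
# e.g. ['train.100', 'train.360', 'validation', 'test']

# Stream one example to find the audio and transcription column names
ds = load_dataset("librispeech_asr", "clean", split="test", streaming=True)
print(next(iter(ds)).keys())  # look for e.g. 'audio' and 'text'
```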
2. Running a Benchmark
1. Select an existing package.
2. Click New ASR Benchmark Run.
3. Specify the model, the required resources (CPU/GPU), and any additional settings.
4. Click Run Benchmark.
Note: You can discover pre-trained ASR models on HuggingFace with the automatic-speech-recognition tag.
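Outside the UI, a quick way to sanity-check a candidate model before committing CPU/GPU resources to a full run is the transformers pipeline for that same task tag. A minimal sketch; openai/whisper-tiny and sample.wav are example placeholders:

```python
from transformers import pipeline

# Any checkpoint tagged "automatic-speech-recognition" on HuggingFace works here;
# openai/whisper-tiny is just a small example model.
asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-tiny",
    device=0,  # GPU index; use device=-1 to run on CPU
)

# Transcribe one local audio file as a smoke test
print(asr("sample.wav")["text"])
```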
3. Checking the Results
After the benchmark finishes, you can view aggregated scores for each model in the run.
3.1 Detailed Results
Click on an individual run to view more in-depth metrics and performance details.
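For context on where these numbers come from: ASR benchmarks are typically scored with word error rate (WER), computed after normalizing both the reference and the predicted transcripts (the optional Normalizer step in the package configuration). A minimal sketch using the open-source jiwer library, which is a common choice; the Benchmark UI’s exact implementation may differ:

```python
import jiwer

# A simple normalizer: lowercase, strip punctuation, collapse whitespace.
# This mirrors what an optional Normalizer step typically does.
normalize = jiwer.Compose([
    jiwer.ToLowerCase(),
    jiwer.RemovePunctuation(),
    jiwer.RemoveMultipleSpaces(),
    jiwer.Strip(),
])

references = ["Hello, world!", "The quick brown fox."]  # ground-truth transcripts
hypotheses = ["hello world", "the quick brown box"]     # model outputs

# Aggregated word error rate across all samples in the run
wer = jiwer.wer(
    [normalize(r) for r in references],
    [normalize(h) for h in hypotheses],
)
print(f"WER: {wer:.2%}")  # lower is better
```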
Next Steps
- ASR Benchmark Overview – Learn the fundamentals of ASR benchmarking.
- Benchmarks UI – Explore how to manage additional benchmark types.
- Inference UI – Deploy and test your ASR models in real-time scenarios.