User Interface

The ASR Benchmark tab contains all the benchmarks configured for ASR models.

Package configuration

The main page contains all the benchmarks configured for ASR.

Home Page

To create a new benchmark package, select Create new ASR package.

New package

Each package is composed of one or more datasets. Click Add dataset to add a new dataset to the package.

New dataset

A dataset is defined by the following fields:

Dataset identifier: the identifier for this dataset
Source and Model: the audio dataset source
- The dataset should contain one column with the audio to transcribe and one column with the expected transcription
- You can find several datasets ready for use on HuggingFace Datasets
Dataset configuration: instructions on how to read the dataset
- Subset: the subset of the data to use
- Split: the split (train, test or validation)
- Audio and Transcription columns
Normalizer: an optional normalizer applied to the transcribed and expected texts before comparing them
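
To illustrate why a normalizer matters, here is a minimal sketch in Python. It is not the normalizer shipped with the tool: the normalize function (lowercase, strip ASCII punctuation, collapse whitespace) is a hypothetical example, and word error rate is assumed as the comparison metric only because it is a common choice for ASR. This guide does not state which metric the benchmark uses.

```python
import re
import string


def normalize(text: str) -> str:
    """Hypothetical normalizer: lowercase, drop ASCII punctuation, collapse whitespace."""
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return re.sub(r"\s+", " ", text).strip()


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumed metric for illustration; the benchmark may score differently.
    """
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


expected = "Hello, world! It's a test."
transcribed = "hello world its a test"

raw_score = word_error_rate(expected, transcribed)
normalized_score = word_error_rate(normalize(expected), normalize(transcribed))
print(raw_score, normalized_score)
```

Without normalization, differences in casing and punctuation count as word errors even though the spoken content matches. Normalizing both texts first keeps the score focused on the words themselves.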

If you are using a HuggingFace dataset, you can check the available Subset and Split values in the Viewer tab.
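
As a rough sketch of how these fields relate to a HuggingFace dataset, the Python snippet below maps them onto the arguments of load_dataset from the datasets library. The DatasetConfig class and the load_dataset_kwargs helper are hypothetical names introduced for illustration and do not reflect the tool's own schema. The example dataset and its column names are assumptions; check them in the Viewer tab before use.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DatasetConfig:
    """Hypothetical container mirroring the dataset form fields."""
    identifier: str
    source: str
    split: str
    audio_column: str
    transcription_column: str
    subset: Optional[str] = None


def load_dataset_kwargs(cfg: DatasetConfig) -> dict:
    """Map the form fields to keyword arguments of datasets.load_dataset."""
    kwargs = {"path": cfg.source, "split": cfg.split}
    if cfg.subset is not None:
        kwargs["name"] = cfg.subset  # load_dataset calls the subset "name"
    return kwargs


def load(cfg: DatasetConfig):
    """Download the dataset; needs the `datasets` package and network access."""
    from datasets import load_dataset
    return load_dataset(**load_dataset_kwargs(cfg))


# Example values are assumptions chosen for illustration.
example = DatasetConfig(
    identifier="librispeech-dummy",
    source="hf-internal-testing/librispeech_asr_dummy",
    subset="clean",
    split="validation",
    audio_column="audio",
    transcription_column="text",
)
print(load_dataset_kwargs(example))
```

Calling load(example) and inspecting one row is a quick way to confirm that the audio and transcription column names match what you entered in the form.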

Dataset configuration

After adding all the datasets you want to test on, save the package.

Complete package

To run a benchmark, select a package and then click New ASR Benchmark Run.

New Run

Fill in the model you want to use and the necessary resources (GPU, CPU, ...), then click Run benchmark.

You can find models to compare against on HuggingFace by searching for the tag automatic-speech-recognition.

The run page lists every model that has been run, along with its aggregated score.

Results

Click on an individual run to see the detailed results.

Result Details