Accuracy Benchmarks

Salad Transcription API delivers industry-leading accuracy across a wide range of languages and public benchmark datasets. Below is a breakdown of results by language and dataset.

Languages with Accuracy ≥ 90%

  • English
  • Portuguese
  • French
  • Spanish
  • German
  • Italian
  • Russian

Languages with Accuracy between 80%–89%

  • Hindi
  • Hebrew

Languages with Accuracy < 80%

  • Urdu
  • Kazakh
  • Thai (in progress)

English

DatasetSub-DatasetAccuracy (Full)WER (Full)Accuracy (Lite)WER (Lite)Source
TED-LIUMtedlium95.8%4.2%91.8%8.2%TED-LIUM on Hugging Face
MeanwhileMeanwhile95.7%4.3%83.3%16.7%Meanwhile on Hugging Face
CommonVoicecv-corpus-5.1-2020-06-2295.1%4.9%81.3%18.7%Common Voice
CommonVoicecv-corpus-20.0-delta-2024-12-0693.1%6.9%78.1%21.9%Common Voice

Portuguese

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-8.0-2022-01-1992.0%8.0%Common Voice

French

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-10.0-delta-2022-07-0492.0%8.0%Common Voice

Spanish

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-12.0-delta-2022-12-0794.0%6.0%Common Voice
CommonVoicecv-corpus-14.0-delta-2023-06-2396.8%3.2%Common Voice
CommonVoicecv-corpus-16.1-delta-2023-12-0696.8%4.3%Common Voice

German

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-13.0-delta-2023-03-0996.3%3.7%Common Voice

Hindi

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-20.0-2024-12-0684.0%16.0%Common Voice

Italian

DatasetSub-DatasetAccuracyWERSource
CommonVoice93.3%6.7%Common Voice

Russian

DatasetSub-DatasetAccuracyWERSource
CommonVoice96.4%3.6%Common Voice

Hebrew

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-17.0-2024-03-1584.2%15.8%Common Voice

Kazakh

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-19.0-2024-09-1351.0%49.0%Common Voice

Urdu

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-9.0-2022-04-2778.8%21.2%Common Voice

Thai (in progress)

DatasetSub-DatasetAccuracyWERSource
CommonVoicecv-corpus-10.0-delta-2022-07-0433.0%67.0%*Common Voice
Thai WER may need a recalculation due to formatting issues.

Methodology

To ensure fair and repeatable accuracy evaluation, we adopted a benchmarking methodology similar to AssemblyAI: This benchmark continues to expand as we test more languages and improve our models. Want to run your own benchmarks? Reach out to us at support@salad.com.