Transcription API Features Guide
Introduction
Salad Transcription API offers a suite of powerful features to help you get the most out of your audio and video content. This guide covers the key transcription parameters you can use to customize your transcription outputs:- Transcription Output Options:
return_as_file
sentence_level_timestamps
word_level_timestamps
- Speaker Identification:
diarization
sentence_diarization
- Language Specification:
language_code
Transcription Parameters
1. return_as_file
Description
Thereturn_as_file
parameter allows you to receive your transcription output as a downloadable file URL. This is
particularly useful when dealing with large responses, as it helps avoid issues with response size limitations.
- Default:
false
- Type:
boolean
Usage
Set"return_as_file": true
in your request to receive the transcription output as a file URL.
Example:
- If the response size exceeds 1 MB, the output will automatically be returned as a file URL, even if return_as_file is set to false.
- This helps ensure reliable delivery of large transcription outputs.
2. sentence_level_timestamps
Description
Include timestamps at the sentence level in your transcription output.- Default:
false
- Type:
boolean
Usage
Set"sentence_level_timestamps": true
to include sentence-level timestamps. Set to false if you do not need them.
Example:
3. word_level_timestamps
Description
Include timestamps for each word in your transcription output.- Default:
false
- Type:
boolean
Usage
Set"word_level_timestamps": true
to include word-level timestamps.
Example:
4. diarization
Description
Enable speaker separation and identification at the word level.- Default:
false
- Type:
boolean
Usage
Example:5. sentence_diarization
Description
Include speaker information at the sentence level.- Default:
false
- Type:
boolean
Usage
Example:6. multichannel
Description
Enable multichannel transcription for audio files with separate speaker channels. Requires eitherdiarization
or
sentence_diarization
to return speaker/channel information.
- Default:
false
- Type:
boolean
"multichannel": true
and pair it with either:
"diarization": true
→ word-level speaker and channel labeling"sentence_diarization": true
→ sentence-level speaker and channel labeling
Usage
7. language_code
Description
The language_code parameter allows you to specify the language of the transcription to improve the accuracy of transcription, diarization and timestamps. This parameter supports both full language names and their short versions.- Default:
en
(English) - Type:
string
Usage
Set thelanguage_code
to the ISO 639-1 code of the audio’s language.
Example:
Code | Language | Code | Language | Code | Language |
---|---|---|---|---|---|
en | English | zh | Chinese | de | German |
es | Spanish | ru | Russian | ko | Korean |
fr | French | ja | Japanese | pt | Portuguese |
tr | Turkish | pl | Polish | ca | Catalan |
nl | Dutch | ar | Arabic | sv | Swedish |
it | Italian | id | Indonesian | hi | Hindi |
fi | Finnish | vi | Vietnamese | he | Hebrew |
uk | Ukrainian | el | Greek | ms | Malay |
cs | Czech | ro | Romanian | da | Danish |
hu | Hungarian | ta | Tamil | no | Norwegian |
th | Thai | ur | Urdu | hr | Croatian |
bg | Bulgarian | lt | Lithuanian | la | Latin |
mi | Maori | ml | Malayalam | cy | Welsh |
sk | Slovak | te | Telugu | fa | Persian |
lv | Latvian | bn | Bengali | sr | Serbian |
az | Azerbaijani | sl | Slovenian | kn | Kannada |
et | Estonian | mk | Macedonian | br | Breton |
eu | Basque | is | Icelandic | hy | Armenian |
ne | Nepali | mn | Mongolian | bs | Bosnian |
kk | Kazakh | sq | Albanian | sw | Swahili |
gl | Galician | mr | Marathi | pa | Punjabi |
si | Sinhala | km | Khmer | sn | Shona |
yo | Yoruba | so | Somali | af | Afrikaans |
oc | Occitan | ka | Georgian | be | Belarusian |
tg | Tajik | sd | Sindhi | gu | Gujarati |
am | Amharic | yi | Yiddish | lo | Lao |
uz | Uzbek | fo | Faroese | ht | Haitian Creole |
ps | Pashto | tk | Turkmen | nn | Nynorsk |
mt | Maltese | sa | Sanskrit | lb | Luxembourgish |
my | Myanmar | bo | Tibetan | tl | Tagalog |
mg | Malagasy | as | Assamese | tt | Tatar |
haw | Hawaiian | ln | Lingala | ha | Hausa |
ba | Bashkir | jw | Javanese | su | Sundanese |
yue | Cantonese |
Note
- Mandatory for Diarization: When using diarization or sentence-level diarization, specifying
language_code
is required for optimal performance. - Automatic Detection: If no language is specified or the provided language is not supported, the system will automatically detect the language.
- Translation Option: If the wrong language is specified, the system will provide a transcription translated into the specified language.
- Multilingual Content: For multilingual audio, it is better not to specify a language to ensure the best results.
language_code
or leveraging the automatic detection feature, you ensure the best possible
transcription quality tailored to your audio content.