# Speech To Text

> This endpoint allows you to convert speech in audio files to text.

## Request

Make a `POST` request to the endpoint below and pass the required parameters in the request body.

```curl
curl --request POST 'https://modelslab.com/api/v1/enterprise/voice/speech_to_text'
```

## Body

```json
{
  "key": "enterprise_api_key",
  "init_audio": "https://assets.modelslab.ai/generations/5c3eef10-0eb4-4db8-8b12-fc4eedbf30b9.mp3",
  "language": "en",
  "timestamp_level": null,
  "webhook": null,
  "track_id": null
}
```

## Body Attributes

- `key`: The API key required to authorize the request.
- `init_audio`: The URL of the audio file to be transcribed. Supported formats: WAV, MP3, FLAC, OPUS. Duration limits: minimum 5 seconds, maximum 1 hour.
- `language`: The language code of the audio content in ISO 639-1 format. Examples: `en` (English), `es` (Spanish), `fr` (French).
- `timestamp_level`: The level of detail for timestamps in the transcription. Options: `word`, `sentence`, or `null` (no timestamps). Default: `null`.

  **Timestamp Level Accuracy:** Sentence-level timestamps give reliable results. Word-level timestamps may be less accurate.
- `webhook`: A URL to receive a POST request once the transcription is complete.
- `track_id`: An ID included in the webhook response to identify the request.

### Languages Supported

Whisper supports several languages, but transcription accuracy may vary with factors such as limited training data, script complexity, and regional dialects.
```
"Afrikaans": "af",
"Arabic": "ar",
"Belarusian": "be",
"Bengali": "bn",
"Bulgarian": "bg",
"Chinese": "zh",
"Czech": "cs",
"Danish": "da",
"Dutch": "nl",
"English": "en",
"Finnish": "fi",
"French": "fr",
"German": "de",
"Greek": "el",
"Hebrew": "he",
"Hindi": "hi",
"Hungarian": "hu",
"Indonesian": "id",
"Italian": "it",
"Japanese": "ja",
"Kannada": "kn",
"Korean": "ko",
"Malayalam": "ml",
"Marathi": "mr",
"Nepali": "ne",
"Panjabi": "pa",
"Persian": "fa",
"Polish": "pl",
"Portuguese": "pt",
"Romanian": "ro",
"Russian": "ru",
"Serbian": "sr",
"Spanish": "es",
"Swedish": "sv",
"Tagalog": "tl",
"Tamil": "ta",
"Telugu": "te",
"Thai": "th",
"Turkish": "tr",
"Ukrainian": "uk",
"Urdu": "ur",
"Vietnamese": "vi",
"Welsh": "cy"
```

**Performance may vary due to factors like script complexity and regional dialects, which may affect transcription accuracy.**
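
The request described on this page can be assembled in Python roughly as follows. This is a minimal sketch, not an official client: the helper names `build_payload` and `transcribe` are hypothetical, the `Content-Type: application/json` header is an assumption, and the response is assumed to be JSON because this page does not describe its schema. The local checks only mirror the language codes and `timestamp_level` options listed above.

```python
import json
import urllib.request

# Endpoint copied from the Request section of this page.
ENDPOINT = "https://modelslab.com/api/v1/enterprise/voice/speech_to_text"

# ISO 639-1 codes from the "Languages Supported" list.
SUPPORTED_LANGUAGES = {
    "af", "ar", "be", "bn", "bg", "zh", "cs", "da", "nl", "en", "fi",
    "fr", "de", "el", "he", "hi", "hu", "id", "it", "ja", "kn", "ko",
    "ml", "mr", "ne", "pa", "fa", "pl", "pt", "ro", "ru", "sr", "es",
    "sv", "tl", "ta", "te", "th", "tr", "uk", "ur", "vi", "cy",
}

# Documented options; None corresponds to JSON null (no timestamps).
TIMESTAMP_LEVELS = {None, "word", "sentence"}


def build_payload(key, init_audio, language, timestamp_level=None,
                  webhook=None, track_id=None):
    """Check the arguments locally and return the request body as a dict."""
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    if timestamp_level not in TIMESTAMP_LEVELS:
        raise ValueError(f"invalid timestamp_level: {timestamp_level!r}")
    return {
        "key": key,
        "init_audio": init_audio,
        "language": language,
        "timestamp_level": timestamp_level,
        "webhook": webhook,
        "track_id": track_id,
    }


def transcribe(payload, timeout=60):
    """POST the payload as JSON and return the decoded response unchanged.

    Assumes the server replies with JSON; the schema is not documented here.
    """
    request = urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},  # assumed header
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `transcribe(build_payload("enterprise_api_key", audio_url, "en", timestamp_level="sentence"))` would send the request. Validating before sending rejects an unsupported language code or timestamp option without a network round trip.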