Skip to main content
POST
/
voice
/
voice_design
Design and generate custom voices
curl --request POST \
  --url https://modelslab.com/api/v6/voice/voice_design \
  --header 'Content-Type: application/json' \
  --data '
{
  "prompt": "<string>",
  "voice_description": "<string>",
  "key": "<string>",
  "language": "chinese",
  "webhook": "<string>",
  "track_id": 123
}
'
{
  "status": "success",
  "generationTime": 123,
  "id": 123,
  "output": [
    "<string>"
  ],
  "proxy_links": [
    "<string>"
  ],
  "future_links": [
    "<string>"
  ],
  "links": [
    "<string>"
  ],
  "meta": {},
  "eta": 123,
  "message": "<string>",
  "tip": "<string>",
  "fetch_result": "<string>",
  "audio_time": 123
}

Request

Make a POST request to below endpoint and pass the required parameters as a request body.
curl
--request POST 'https://modelslab.com/api/v6/voice/voice_design' \

Body

json
{
    "key": "your_api_key",
    "prompt": "Greetings, my friend. What brings you to Mount Olympus today? Here to gaze at my beautiful muscles again?",
    "voice_description": "The friendly mythical God, Zeus, with a huge deep powerful voice. Charming, proud, strong and theatrical.",
    "language": "english",
    "webhook": null,
    "track_id": null
}
Both prompt and voice_description have a maximum limit of 1000 characters each.

Supported Languages

LanguageValue
Chinesechinese
Englishenglish
Japanesejapanese
Koreankorean
Germangerman
Frenchfrench
Russianrussian
Portugueseportuguese
Spanishspanish
Italianitalian

Body

application/json
prompt
string
required

The text content to be spoken by the designed voice

voice_description
string
required

A detailed description of the voice characteristics (tone, accent, gender, age, style)

key
string

Your API Key used for request authorization

language
enum<string>

Language for the voice

Available options:
chinese,
english,
japanese,
korean,
german,
french,
russian,
portuguese,
spanish,
italian
webhook
string<uri>

URL to receive POST notification upon completion

track_id
integer

ID for webhook identification

Response

Voice design response

status
enum<string>

Status of the voice generation

Available options:
success,
processing,
error
generationTime
number

Time taken to generate the audio in seconds

id
integer

Unique identifier for the voice generation

output
string<uri>[]

Array of generated audio URLs

Array of proxy audio URLs

Array of future audio URLs for queued requests

Array of audio URLs (voice cover response)

meta
object

Metadata about the audio generation including all parameters used

eta
integer

Estimated time for completion in seconds (processing status)

message
string

Status message or additional information

tip
string

Additional information or tips for the user

fetch_result
string<uri>

URL to fetch the result when processing

audio_time
number

Duration of the generated audio in seconds