Make sure you add your s3 details for voice_cloning server, so you can receive image generated in your bucket. Images generated without s3 details being added will be delete after 24 hours

Request

Make a POST request to below endpoint and pass the required parameters as a request body.
curl
--request POST 'https://modelslab.com/api/v1/enterprise/voice/voice_to_voice' \

Body

json
{    
  "key":"enterprise_api_key",    
  "init_audio":"https://pub-f3505056e06f40d6990886c8e14102b2.r2.dev/audio/tom_hanks_1.wav",    
  "target_audio":"https://assets.modelslab.com/tmp/gEJ3RgkrWumjCJpb8hHYD3OsID7ET9-metaSGlyZV92b2ljZV9hY3RvcnNfYW5kX3ZvaWNlX292ZXJfc2VydmljZXNfLV9CdW5ueV9TdHVkaW9fMiAoMSktW0F1ZGlvVHJpbW1lci5jb21dLm1wMw==-.mp3",    
  "temp":false,    
  "base64":false,    
  "webhook":null,    
  "track_id":null
}

Body Attributes

key
string
required
Your API Key used for request authorization.
init_audio
string
required
Source utterances. Must be a valid URL or base64 data for a WAV/MP3 file.
Maximum length: 30 seconds.
target_audio
string
required
Target voice that should say the original utterances. Must be a valid URL or base64 data for a WAV/MP3 file.
Maximum length: 30 seconds.
base64
boolean
default:"false"
Whether the input sound clips are provided in base64 format. Default: false.
temp
boolean
default:"false"
Whether you want temporary links (useful if your country blocks access to storage sites). Default: false.
webhook
string
Provide a URL to receive a POST API call once the voice cover generation is complete.
track_id
string
This ID is returned in the webhook API call and will be used to identify the request.