POST /video/text2video
Generate video from text prompt

Example request:
curl --request POST \
  --url https://modelslab.com/api/v6/video/text2video \
  --header 'Content-Type: application/json' \
  --data '
{
  "key": "<string>",
  "model_id": "cogvideox",
  "prompt": "<string>",
  "negative_prompt": "<string>",
  "seed": 123,
  "height": 512,
  "width": 512,
  "num_frames": 25,
  "num_inference_steps": 20,
  "guidance_scale": 7,
  "clip_skip": 1,
  "upscale_height": 640,
  "upscale_width": 1024,
  "upscale_strength": 0.6,
  "upscale_guidance_scale": 12,
  "upscale_num_inference_steps": 20,
  "use_improved_sampling": false,
  "improved_sampling_seed": 123,
  "fps": 15,
  "output_type": "gif",
  "instant_response": false,
  "temp": false,
  "webhook": "<string>",
  "track_id": "<string>"
}
'

Example response:

{
  "status": "success",
  "generationTime": 123,
  "id": 123,
  "output": [
    "<string>"
  ],
  "proxy_links": [
    "<string>"
  ],
  "future_links": [
    "<string>"
  ],
  "meta": {},
  "eta": 123,
  "message": "<string>",
  "tip": "<string>",
  "fetch_result": "<string>"
}
Text to Video Example

Generate videos from text descriptions using state-of-the-art video generation models like CogVideoX. Perfect for creating short-form content, animations, and visual storytelling.

Request

Make a POST request to the endpoint below with the required parameters.
POST https://modelslab.com/api/v6/video/text2video

Body

json
{
    "key": "your_api_key",
    "model_id": "cogvideox",
    "prompt": "A majestic space station orbiting Earth, with the sun rising behind it, cinematic, 4K",
    "negative_prompt": "low quality, blurry, static",
    "height": 512,
    "width": 512,
    "num_frames": 25,
    "num_inference_steps": 20,
    "guidance_scale": 7,
    "output_type": "mp4",
    "webhook": null,
    "track_id": null
}

Async Pattern

Since video generation takes time, use this pattern:
import requests
import time

def generate_video(prompt, api_key):
    # 1. Submit the request
    response = requests.post(
        "https://modelslab.com/api/v6/video/text2video",
        json={
            "key": api_key,
            "model_id": "cogvideox",
            "prompt": prompt,
            "num_frames": 25
        }
    )
    data = response.json()

    if data["status"] == "error":
        raise Exception(data["message"])

    request_id = data["id"]

    # 2. Poll for results
    while True:
        fetch = requests.post(
            f"https://modelslab.com/api/v6/video/fetch/{request_id}",
            json={"key": api_key}
        )
        result = fetch.json()

        if result["status"] == "success":
            return result["output"][0]
        elif result["status"] == "failed":
            raise Exception(result.get("message", "Generation failed"))

        # Still processing, wait and retry
        time.sleep(5)

# Usage
video_url = generate_video("A sunset over the ocean", "your_api_key")
print(f"Video ready: {video_url}")

Tips for Better Videos

Unlike images, videos need motion descriptions:
  • ❌ “A cat”
  • ✅ “A cat walking across a sunny room, tail swaying”

Video models work best with clear, focused prompts; avoid overly complex scenes.

Generate at 512x512, then use the upscale parameters for higher-resolution output (see the request sketch after this list).

Choose the output format based on your use case:
  • MP4: best for most uses, smaller file size
  • GIF: good for short loops, works everywhere
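
As an illustration of that generate-then-upscale workflow, here is a hypothetical request body that renders at 512x512 and asks for an upscale toward 1024x640; the prompt and values are examples only, kept within the documented parameter ranges.

json
{
    "key": "your_api_key",
    "model_id": "cogvideox",
    "prompt": "A paper boat drifting down a rain-soaked street, slow camera pan",
    "height": 512,
    "width": 512,
    "num_frames": 25,
    "num_inference_steps": 20,
    "upscale_width": 1024,
    "upscale_height": 640,
    "upscale_strength": 0.6,
    "upscale_num_inference_steps": 20,
    "output_type": "mp4"
}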

Body

application/json
key
string
required

Your API Key used for request authorization

model_id
enum<string>
required

The ID of the model to use

Available options:
cogvideox,
wanx
prompt
string
required

Text prompt describing the video content

negative_prompt
string

Items you don't want in the video

seed
integer | null

Seed for reproducible results. Same seed gives same result. Pass null for random

height
integer
default:512

Height of the video in pixels

Required range: x <= 512
width
integer
default:512

Width of the video in pixels

Required range: x <= 512
num_frames
integer
default:25

Number of frames in the video

Required range: x <= 25
num_inference_steps
integer
default:20

Number of denoising steps

Required range: x <= 50
guidance_scale
number
default:7

Scale for classifier-free guidance

Required range: 0 <= x <= 8
clip_skip
integer | null

Number of CLIP layers to skip. Skipping 2 layers often gives more aesthetic results

Required range: x <= 2
upscale_height
integer
default:640

The upscaled height for videos generated

Required range: x <= 1024
upscale_width
integer
default:1024

The upscaled width for videos generated

Required range: x <= 1024
upscale_strength
number
default:0.6

Strength of upscaling. Higher values result in more noticeable differences

Required range: 0 <= x <= 1
upscale_guidance_scale
number
default:12

Guidance scale for upscaling videos

Required range: 0 <= x <= 8
upscale_num_inference_steps
integer
default:20

Number of denoising steps for upscaling

Required range: x <= 50
use_improved_sampling
boolean
default:false

Whether to use improved sampling technique for better temporal consistency

improved_sampling_seed
integer

Seed for consistent video generation with improved sampling

fps
integer

Frames per second rate of the generated video

Required range: x <= 16
output_type
enum<string>
default:gif

Output format type

Available options:
mp4,
gif
instant_response
boolean
default:false

If true, returns future links for queued requests instantly instead of waiting

temp
boolean
default:false

If true, stores video in temporary storage (cleaned every 24 hours)

webhook
string<uri>

URL to receive a POST API call once video generation is complete

track_id
string

Unique ID used in webhook response to identify the request

Response

Video generation response

status
enum<string>

Status of the video generation

Available options:
success,
processing,
error
generationTime
number

Time taken to generate the video in seconds

id
integer

Unique identifier for the video generation

output
string<uri>[]

Array of generated video URLs

proxy_links
string<uri>[]

Array of proxy video URLs

future_links
string<uri>[]

Array of future video URLs for queued requests

meta
object

Metadata about the video generation including all parameters used

eta
integer

Estimated time for completion in seconds (processing status)

message
string

Status message or additional information

tip
string

Additional information or tips for the user

fetch_result
string<uri>

URL to fetch the result when processing