New Parameter in Song Generator API Endpoint: Song Generator API
Addedmodel_id parameter to select between diffrhythm-short and diffrhythm-long models for song generation.diffrhythm-short: Generates shorter with maximum duration of 1 minute 35 seconds.diffrhythm-long: Generates longer songs with maximum duration of 4 minute 45 seconds.
New parameter in Lyrics Generator API Endpoint: Lyrics Generator API
Addedlength parameter to specify desired length of generated lyrics.short: Generates shorter lyrics with maximum duration of 1 minute 35 seconds.long: Generates longer lyrics with maximum duration of 4 minute 45 seconds.
API Endpoints
- Standard API:
POST /api/v6/voice/song_generator - Standard API:
POST /api/v6/voice/lyrics_generator
Key Features
- Select between short and long models for song generation
- Specify desired length of generated lyrics
New Enterprise API Endpoint: Qwen Text to Image
Generate high-definition images from text using the Qwen model.API Endpoint
- Enterprise API:
POST /api/v1/enterprise/qwen/text2img
Key Features
- Generate high-definition images from text using Qwen model
- Supports various image styles and attributes
- Resolution up to 1024x1024 pixels.
New Video API Endpoint: Watermark Remover
Remove watermarks from SORA videos.API Endpoint
- Standard API:
POST /api/v6/video/watermark_remover
Key Features
- SORA watermark detection and removal
- Preserves video quality
New Image Editing Endpoint: Caption
Simple and powerful image captioning endpoint to generate descriptive text from images.API Endpoint
- Standard API:
POST /api/v6/image_editing/caption
Key Features
- Automatic image caption generation
- Customizable caption length (short, normal, long)
- Supports multiple image formats:
png,jpeg,jpg
Flux Kontext Dev Moved to Image Editing API
Flux Kontext Image to Image endpoint moved from Image Generation API to Image Editing API section for better organization.- New Location: Image Editing API → Flux Kontext Image to Image
- Endpoint:
POST /api/v6/images/img2img - Fixed OpenAPI playground display
New Image Editing Endpoint: Qwen Edit
Added Qwen Edit endpoint for AI-powered image editing using the Qwen model.API Endpoints
- Standard API:
POST /api/v6/image_editing/qwen_edit - Enterprise API:
POST /api/v1/enterprise/image_editing/qwen_edit
Key Features
- Prompt-based image editing and manipulation
- Support for single or multiple images (up to 4 images)
New Interior API Endpoints
Added two new endpoints to the Interior API for enhanced object manipulation capabilities:Object Removal
- Endpoint:
POST /api/v6/interior/object_removal - Remove unwanted objects from interior images using AI
- Parameters:
init_image,object_name,base64,webhook,track_id - Simple text-based object identification
Interior Mixer
- Endpoint:
POST /api/v6/interior/interior_mixer - Add objects from one image into another room image
- Parameters:
init_image,object_image,prompt,width,height,guidance_scale,num_inference_steps - Intelligent object placement with prompt-based positioning
- Configurable inference steps (default: 8) and guidance scale
Documentation Updates
- Added complete API reference documentation for both endpoints
- Updated OpenAPI specification with new schemas
- Added visual indicators for new endpoints in the overview
Rate Limits Documentation
Added comprehensive rate limits documentation with plan-specific queue limits:- Pay as you go plan: 5 queued API requests
- Standard plan: 10 queued API requests
- Unlimited Premium Plan: 15 queued API requests
Key Features
- Sequential Processing: Requests are processed one after another in queue order
- Queue Management: New requests are added to the queue and processed when previous ones complete
- Real-time Enforcement: Limits are enforced in real-time as requests come in
- FIFO Processing: Requests are processed in First-In-First-Out order
Enterprise API Updates
- Added Reset S3 endpoint to Enterprise API General section
- Updated S3 management capabilities for dedicated servers
New Model: Wan 2.5
Added Wan 2.5 to ModelsLab with enhanced video generation capabilities:- Text to Video: Generate videos from text prompts with audio support
- Image to Video: Transform static images into dynamic videos with sound
- Audio Integration: Built-in audio support for complete multimedia experiences
- Enhanced Quality: Improved motion smoothness and visual realism
Available Models
Added threshold parameter to NSFW Image Check
- Added
thresholdparameter toPOST /nsfw_image_check.- Type: number; range: 0–1; default: 0.5.
- Controls sensitivity for NSFW detection in images/videos.


