Create and manage AI generation tasks with a unified interface
The /v1/tasks endpoint provides a unified interface for all AI generation models (video, image, and audio).
POST /v1/tasks
Creates a new generation task for any supported model.
The model is identified in provider/model-name format. See Available Models below.
dry_run: if true, the request is validated and the cost is calculated without creating a task or deducting from your balance. Set dry_run to true to preview the price of a request before committing.
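
The dry-run behavior described above can be sketched as a request builder. This is a minimal sketch, not the official client: the host `https://api.example.com`, the Bearer authorization header, and the flat JSON body with a `model` key are assumptions made for illustration. Only the `/v1/tasks` path and the `dry_run` flag come from this page. The request is built but never sent.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # assumption: placeholder host, not from this page


def build_create_task_request(api_key, model, params, dry_run=False):
    """Build (but do not send) a POST /v1/tasks request."""
    body = {"model": model, **params}  # assumption: flat JSON body with a "model" key
    if dry_run:
        body["dry_run"] = True  # validate and price the request without creating a task
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/tasks",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumption: Bearer auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_create_task_request(
    "YOUR_API_KEY",
    "google/veo-3.1-fast",
    {"prompt": "A sunrise over mountains"},
    dry_run=True,
)
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing the resulting object to `urllib.request.urlopen` would send it. Dropping `dry_run=True` builds the same request as a real, billable task.
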
GET /v1/tasks/:task_id
Retrieves the status and output of a task.
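
Because a task is created first and read back later, clients typically poll this endpoint until the task finishes. The sketch below shows one way to do that. The `status` field and the terminal values `completed` and `failed` are hypothetical, since this page does not list the response schema. `fetch_task` is any callable you supply that performs the GET and returns the decoded JSON.

```python
import time


def wait_for_task(fetch_task, task_id, interval=1.0, timeout=300.0, sleep=time.sleep):
    """Poll fetch_task(task_id) until the task reaches a terminal status.

    fetch_task is a caller-supplied function that performs
    GET /v1/tasks/:task_id and returns the decoded JSON as a dict.
    """
    terminal = {"completed", "failed"}  # assumption: hypothetical status names
    waited = 0.0
    while True:
        task = fetch_task(task_id)
        if task.get("status") in terminal:  # assumption: a "status" field exists
            return task
        if waited >= timeout:
            raise TimeoutError(f"task {task_id} not finished after {timeout}s")
        sleep(interval)
        waited += interval


# Usage with a stand-in fetcher instead of a real HTTP call.
responses = iter([{"status": "processing"}, {"status": "completed"}])
result = wait_for_task(lambda _id: next(responses), "task_123", sleep=lambda s: None)
print(result["status"])
```

A one-second interval keeps a single poller well under the 10 req/sec polling limit listed in the rate limit table.
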
Available Models

Video models

| Model | Description |
|---|---|
| google/veo-3.1-fast | Google Veo 3.1 Fast (720p, 1080p, 4k) |
| google/veo-3.1-quality | Google Veo 3.1 Quality (720p, 1080p, 4k) |
| openai/sora-2 | OpenAI Sora 2 (5s, 10s, 15s, 20s) |
| openai/sora-2-pro | OpenAI Sora 2 Pro (10s, 15s, 20s, 25s) |
| hailuo/minimax-2.0 | Minimax Hailuo 2.0 |
| hailuo/minimax-2.3 | Minimax Hailuo 2.3 |
| hailuo/minimax-2.3-fast | Minimax Hailuo 2.3 Fast |
| higgsfield-ai/lite | Higgsfield Lite |
| higgsfield-ai/turbo | Higgsfield Turbo |
| higgsfield-ai/standard | Higgsfield Standard |
| higgsfield-ai/cinematic-studio-video | Cinematic Studio Video |
| kuaishou/kling-3.0-omni-video | Kling v3.0 Omni Video |
| kuaishou/kling-3.0-omni-video-edit | Kling v3.0 Omni Video Edit |
| kuaishou/kling-o1-video | Kling O1 Video |
| kuaishou/kling-2.6 | Kling v2.6 |
| kuaishou/kling-2.5 | Kling v2.5 |
| kuaishou/kling-2.1 | Kling v2.1 |
| kuaishou/kling-2.1-master | Kling v2.1 Master |
| kuaishou/kling-2.5-turbo-pro | Kling v2.5 Turbo Pro |
| kuaishou/kling-2.6-motion-control | Kling v2.6 Motion Control |
| kuaishou/kling-3.0-motion-control | Kling v3.0 Motion Control |
| topaz-labs/video-upscale | Topaz Video Upscale |

Image models

| Model | Description |
|---|---|
| google/nano-banana | Nano Banana |
| google/nano-banana-pro | Nano Banana Pro |
| openai/gpt-image-1 | GPT Image 1 (low, medium, high quality) |
| openai/gpt-image-1.5 | GPT Image 1.5 (low, medium, high quality) |
| higgsfield-ai/cinematic-studio-image | Cinematic Studio Image |
| black-forest-labs/flux.2-pro | Flux.2 Pro |
| black-forest-labs/flux.2-flex | Flux.2 Flex |
| black-forest-labs/flux.2-max | Flux.2 Max |
| kuaishou/kling-o1-image | Kling O1 Image |
| kuaishou/kling-3.0-omni-image | Kling v3.0 Omni Image |
| topaz-labs/image-upscale | Topaz Image Upscale |
| topaz-labs/image-generative | Topaz Image Generative |

Audio models

| Model | Description |
|---|---|
| suno-ai/music | Suno Music Generation |
| suno-ai/add-vocals | Add Vocals to Track |
| suno-ai/add-instrumental | Add Instrumental |
| suno-ai/extend | Extend Audio |
| suno-ai/cover | Create Cover |
| suno-ai/stems | Extract Stems |
| suno-ai/stems-all | Extract All Stems |
| suno-ai/lyrics | Generate Lyrics |
| suno-ai/wav | WAV Export |
| higgsfield-ai/speak | Speak (standard, high quality) |
| higgsfield-ai/tts | Text-to-Speech |
| elevenlabs/text-to-speech | ElevenLabs Text-to-Speech |
| elevenlabs/text-to-dialogue | ElevenLabs Multi-Voice Dialogue |
| elevenlabs/sound-effect | ElevenLabs Sound Effects |
| elevenlabs/voice-isolation | ElevenLabs Voice Isolation |
| elevenlabs/speech-to-text | ElevenLabs Speech-to-Text |

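
Every identifier in the tables above follows the provider/model-name format. A small helper such as the following can catch malformed identifiers before a request is sent. The function name is hypothetical, and the check covers only the shape of the string, not whether the model exists.

```python
def split_model_id(model_id):
    """Split "provider/model-name" into (provider, model_name)."""
    provider, sep, name = model_id.partition("/")
    # Reject a missing slash, an empty half, or more than one slash.
    if not sep or not provider or not name or "/" in name:
        raise ValueError(f"expected provider/model-name, got {model_id!r}")
    return provider, name


print(split_model_id("black-forest-labs/flux.2-pro"))
# ('black-forest-labs', 'flux.2-pro')
```
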
google/veo-3.1-fast, google/veo-3.1-quality

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description of the video |
| aspect_ratio | string | No | 16:9 or 9:16 |
| start_image_url | string | No | Starting frame image URL |
| end_image_url | string | No | Ending frame image URL |
| resolution | string | No | 720p (default), 1080p, 4k |
| seed | integer | No | Seed for reproducibility |

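
The allowed values in the Veo table can be checked on the client before a task is submitted. The sketch below is one way to do that. The function name is hypothetical, and the returned dict holds only the model-specific parameters; how they are combined with the model identifier in the request body is not specified on this page.

```python
VEO_ASPECT_RATIOS = {"16:9", "9:16"}
VEO_RESOLUTIONS = {"720p", "1080p", "4k"}


def build_veo_params(prompt, aspect_ratio=None, resolution=None, seed=None,
                     start_image_url=None, end_image_url=None):
    """Return a parameter dict for the Veo models, rejecting values not in the table."""
    if not prompt:
        raise ValueError("prompt is required")
    if aspect_ratio is not None and aspect_ratio not in VEO_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {sorted(VEO_ASPECT_RATIOS)}")
    if resolution is not None and resolution not in VEO_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(VEO_RESOLUTIONS)}")
    optional = {
        "aspect_ratio": aspect_ratio,
        "resolution": resolution,  # when omitted, the documented default is 720p
        "seed": seed,
        "start_image_url": start_image_url,
        "end_image_url": end_image_url,
    }
    # Send only the options that were actually set.
    return {"prompt": prompt, **{k: v for k, v in optional.items() if v is not None}}


print(build_veo_params("A drone shot over a coastline", aspect_ratio="16:9", seed=42))
```
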
openai/sora-2, openai/sora-2-pro

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description of the video |
| duration | integer | No | Sora 2: 5, 10, 15, 20s / Pro: 10, 15, 20, 25s |
| aspect_ratio | string | No | 16:9 or 9:16 |
| image_url | string | No | Reference image URL |
| cameo_video_url | string | No | Cameo character video (max 4 seconds) |
| character_prompt | string | No | Description of cameo character |
| safety_instruction | string | No | Safety guidance for content |

hailuo/minimax-2.0, hailuo/minimax-2.3, hailuo/minimax-2.3-fast

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes* | Max 2000 chars. *Required if no start_image_url |
| start_image_url | string | Yes* | Image URL (auto-uploaded). *Required if no prompt (required for 2.3-fast) |
| end_image_url | string | No | End frame image URL (minimax-2.0 only, 768p/1080p) |
| duration | integer | No | 6 or 10 seconds. 1080p only supports 6 |
| resolution | string | No | "512p" (2.0 I2V only), "768p" (default), "1080p" |
| prompt_optimization | boolean | No | Let MiniMax optimize prompt |

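
The Hailuo parameters depend on each other in several ways, so it can help to spell the conditions out. The sketch below encodes the rules exactly as the table states them. The function name and error strings are hypothetical, and the server may enforce additional rules not listed here.

```python
def check_hailuo_params(model, params):
    """Return a list of problems found; an empty list means the table's rules are met."""
    errors = []
    prompt = params.get("prompt")
    start = params.get("start_image_url")
    resolution = params.get("resolution", "768p")  # documented default
    duration = params.get("duration")

    if not prompt and not start:
        errors.append("either prompt or start_image_url is required")
    if prompt and len(prompt) > 2000:
        errors.append("prompt exceeds 2000 characters")
    if model == "hailuo/minimax-2.3-fast" and not start:
        errors.append("start_image_url is required for minimax-2.3-fast")
    if resolution not in ("512p", "768p", "1080p"):
        errors.append("resolution must be 512p, 768p, or 1080p")
    if resolution == "512p" and not (model == "hailuo/minimax-2.0" and start):
        errors.append("512p is only available for minimax-2.0 image-to-video")
    if duration is not None and duration not in (6, 10):
        errors.append("duration must be 6 or 10")
    if resolution == "1080p" and duration == 10:
        errors.append("1080p only supports a 6 second duration")
    if params.get("end_image_url"):
        if model != "hailuo/minimax-2.0":
            errors.append("end_image_url is minimax-2.0 only")
        elif resolution not in ("768p", "1080p"):
            errors.append("end_image_url requires 768p or 1080p")
    return errors


print(check_hailuo_params("hailuo/minimax-2.3-fast", {"prompt": "A cat runs"}))
# ['start_image_url is required for minimax-2.3-fast']
```
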
higgsfield-ai/lite, higgsfield-ai/turbo, higgsfield-ai/standard

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description |
| start_image_url | string | Yes | Starting frame image URL |
| motion_id | string | Yes | Motion preset ID (UUID) |
| end_image_url | string | No | Ending frame image URL |
| enhance_prompt | boolean | No | Enhance prompt with AI |
| seed | integer | No | Seed for reproducibility |

higgsfield-ai/cinematic-studio-video

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | No | Text description |
| start_image_url | string | Yes | Starting frame image URL |
| end_image_url | string | No | Ending frame (disables preset_id, sound, slow_motion) |
| duration | integer | No | 5 or 10 seconds |
| aspect_ratio | string | No | 1:1, 3:4, 2:3, 9:16, 3:2, 4:3, 16:9, 21:9 |
| preset_id | string | No | Camera movement preset ID |
| sound | boolean | No | Enable AI-generated sound |
| slow_motion | boolean | No | Enable slow motion effect |
| seed | integer | No | 0-999999 for reproducibility |

kuaishou/kling-2.6

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes* | *Required for text-to-video, optional for image-to-video |
| start_image_url | string | No | Start frame URL (enables image-to-video) |
| end_image_url | string | No | End frame URL |
| duration | integer | No | 5 or 10 seconds |
| aspect_ratio | string | No | "16:9", "9:16", "1:1". Text-to-video only |
| mode | string | No | "std" or "pro" |
| enable_audio | boolean | No | Enable audio generation |

kuaishou/kling-2.5
Pro-only model with 1080p output. Supports end frame.

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes* | *Required for text-to-video, optional for image-to-video |
| start_image_url | string | No | Start frame URL (enables image-to-video) |
| end_image_url | string | No | End frame URL |
| duration | integer | No | 5 or 10 seconds |
| aspect_ratio | string | No | "16:9", "9:16", "1:1". Text-to-video only |
| mode | string | No | Pro only |
| enable_audio | boolean | No | Enable audio generation |

kuaishou/kling-2.1
Image-to-video only. Supports end frame.

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | No | Text description (optional) |
| start_image_url | string | Yes | Starting image URL |
| end_image_url | string | No | End frame URL |
| duration | integer | No | 5 or 10 seconds |
| mode | string | No | "std" or "pro" |
| enable_audio | boolean | No | Enable audio generation |

kuaishou/kling-2.1-master
Pro-only model with 1080p output. No end image support.

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes* | *Required for text-to-video, optional for image-to-video |
| start_image_url | string | No | Starting image URL (enables image-to-video) |
| duration | integer | No | 5 or 10 seconds |
| aspect_ratio | string | No | "16:9", "9:16", "1:1". Text-to-video only |
| mode | string | No | Pro only |
| enable_audio | boolean | No | Enable audio generation |

kuaishou/kling-o1-video

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description |
| mode | string | No | "std" or "pro" |
| duration | integer | No | 3–10 seconds |
| aspect_ratio | string | No | "16:9", "9:16", "1:1" |
| count | integer | No | 1–4 |
| image_urls | string[] | No | Up to 7 image URLs, ref as @image_1 in prompt |
| elements | ElementInput[] | No | Up to 7, IMAGE only. Ref as @element_1 in prompt |
| start_image_url | string | No | Start frame URL |
| end_image_url | string | No | End frame URL |
| video_url | string | No | Reference video URL, ref as @video_1 |
| video_mode | string | No | "reference" or "transform" |
| keep_audio | boolean | No | Keep audio from reference video |

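
Prompts for this model refer to inputs with markers such as @image_1, @element_1, and @video_1. A quick local check can catch a marker that points at an input that was never supplied. The sketch assumes markers are numbered from 1 in the order of the corresponding array, which the table implies but does not state outright. The function name is hypothetical.

```python
import re

MAX_IMAGES = 7  # "Up to 7 image URLs" from the table


def check_prompt_references(prompt, image_urls=(), elements=(), video_url=None):
    """Return the markers in the prompt that have no matching input."""
    if len(image_urls) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} image URLs are allowed")
    available = {
        "image": len(image_urls),
        "element": len(elements),
        "video": 1 if video_url else 0,
    }
    missing = []
    # Assumption: markers are 1-indexed and follow the array order.
    for kind, index in re.findall(r"@(image|element|video)_(\d+)", prompt):
        if not 1 <= int(index) <= available[kind]:
            missing.append(f"@{kind}_{index}")
    return missing


prompt = "Place @image_1 beside @image_3, matching the motion in @video_1"
print(check_prompt_references(prompt, image_urls=["https://example.com/a.png", "https://example.com/b.png"]))
# ['@image_3', '@video_1']
```
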
kuaishou/kling-3.0-omni-video
Accepts the same parameters as kuaishou/kling-o1-video, with these differences: duration is 3–15 seconds, aspect_ratio also accepts "auto", elements may be IMAGE or VIDEO, and multi_shot is supported (2–6 shots, 3–15 seconds in total).
Pricing: 5s 720p 0.16 (sound off/on), 1080p 0.20; 10s 720p 0.30, 1080p 0.40
kuaishou/kling-3.0-omni-video-edit

| Parameter | Type | Required | Description |
|---|---|---|---|
| video_url | string | Yes | Video to reference or transform |
| prompt | string | Yes | Text description |
| video_mode | string | No | "reference" or "transform" |
| keep_audio | boolean | No | Keep original video audio |
| mode | string | No | "std" or "pro" |
| aspect_ratio | string | No | "16:9", "9:16", "1:1" |
| image_urls | string[] | No | Up to 4 reference image URLs |
| elements | ElementInput[] | No | Up to 4, IMAGE + VIDEO |

kuaishou/kling-2.6-motion-control

| Parameter | Type | Required | Description |
|---|---|---|---|
| image_url | string | Yes | Image with person (auto-uploaded to Kling) |
| video_url | string | Yes | Motion reference video (auto-uploaded to Kling) |
| prompt | string | No | Text guidance |
| keep_audio | boolean | No | Preserve audio from motion video |
| character_orientation | string | No | "video" or "image" |
| mode | string | No | "std" or "pro" |

openai/gpt-image-1, openai/gpt-image-1.5

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description |
| image_urls | array | No | Reference images |
| aspect_ratio | string | No | 1:1, 2:3, 3:2 |
| quality | string | No | low, medium, high |

google/nano-banana, google/nano-banana-pro

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description |
| aspect_ratio | string | Yes | 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9 |
| image_urls | array | No | Reference images |
| resolution | string | No | Pro only: 1k, 2k, 4k |

black-forest-labs/flux.2-pro, black-forest-labs/flux.2-flex, black-forest-labs/flux.2-max

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description |
| image_urls | array | No | Reference images (Pro/Max: 8, Flex: 10) |
| aspect_ratio | string | No | auto, 1:1, 4:3, 16:9, 3:2, 2:3, 9:16, 3:4 (Max also: 5:4, 21:9) |
| quality | string | No | 1K or 2K |
| steps | integer | No | Flex only: 1-50 (more = higher quality) |
| cfg | number | No | Flex only: 1.5-10 (higher = follows prompt more strictly) |

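
The three Flux.2 variants share one parameter table but differ in their limits, which lends itself to a small lookup table. The sketch below encodes only the differences stated in the table. The names `FLUX_LIMITS` and `check_flux_params` are hypothetical.

```python
BASE_RATIOS = {"auto", "1:1", "4:3", "16:9", "3:2", "2:3", "9:16", "3:4"}

# Per-variant limits copied from the parameter table.
FLUX_LIMITS = {
    "black-forest-labs/flux.2-pro": {"max_images": 8, "extra_ratios": set()},
    "black-forest-labs/flux.2-flex": {"max_images": 10, "extra_ratios": set()},
    "black-forest-labs/flux.2-max": {"max_images": 8, "extra_ratios": {"5:4", "21:9"}},
}


def check_flux_params(model, params):
    """Return a list of rule violations for the given Flux.2 variant."""
    limits = FLUX_LIMITS[model]
    errors = []
    if len(params.get("image_urls", [])) > limits["max_images"]:
        errors.append(f"at most {limits['max_images']} reference images")
    ratio = params.get("aspect_ratio")
    if ratio is not None and ratio not in BASE_RATIOS | limits["extra_ratios"]:
        errors.append(f"aspect_ratio {ratio} is not supported by this variant")
    is_flex = model.endswith("flux.2-flex")
    for name, low, high in (("steps", 1, 50), ("cfg", 1.5, 10)):
        value = params.get(name)
        if value is None:
            continue
        if not is_flex:
            errors.append(f"{name} is Flex only")
        elif not low <= value <= high:
            errors.append(f"{name} must be between {low} and {high}")
    return errors


print(check_flux_params("black-forest-labs/flux.2-pro", {"prompt": "A red bicycle", "steps": 20}))
# ['steps is Flex only']
```
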
higgsfield-ai/cinematic-studio-image

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description |
| resolution | string | No | 1k, 2k, 4k (default: 2k) |
| image_urls | array | No | Reference images (max 4) |
| camera_model_id | string | No | Camera model setting ID |
| camera_lens_id | string | No | Lens type setting ID |
| camera_aperture_id | string | No | Aperture setting ID |
| camera_focal_length_id | string | No | Focal length setting ID |

suno-ai/music

| Parameter | Type | Required | Description |
|---|---|---|---|
| mv | string | Yes | Model version: chirp-v3-5, chirp-v4, chirp-auk, chirp-bluejay, chirp-crow |
| custom | boolean | Yes | false for simple mode, true for custom mode |
| gpt_description_prompt | string | No | Simple mode: song description with lyrics |
| prompt | string | No | Custom mode: detailed lyrics/prompt |
| tags | string | No | Custom mode: genre/style tags |
| title | string | No | Song title |
| make_instrumental | boolean | No | Generate instrumental only |
| negative_tags | string | No | Custom mode: styles to avoid |

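
The custom flag decides which text field the model reads: gpt_description_prompt in simple mode, prompt (with optional tags) in custom mode. The sketch below derives the flag from whichever text is supplied. Requiring exactly one of the two texts is a simplification chosen for this example, not a documented rule, and the function and argument names are hypothetical.

```python
SUNO_VERSIONS = {"chirp-v3-5", "chirp-v4", "chirp-auk", "chirp-bluejay", "chirp-crow"}


def build_suno_music_params(mv, *, description=None, lyrics=None, tags=None,
                            title=None, make_instrumental=False):
    """Build suno-ai/music params in simple mode (description) or custom mode (lyrics)."""
    if mv not in SUNO_VERSIONS:
        raise ValueError(f"unknown model version: {mv}")
    # Simplification for this sketch: exactly one text source selects the mode.
    if (description is None) == (lyrics is None):
        raise ValueError("pass exactly one of description (simple) or lyrics (custom)")
    params = {"mv": mv, "custom": lyrics is not None}
    if lyrics is not None:
        params["prompt"] = lyrics
        if tags:
            params["tags"] = tags
    else:
        if tags:
            raise ValueError("tags apply to custom mode only")
        params["gpt_description_prompt"] = description
    if title:
        params["title"] = title
    if make_instrumental:
        params["make_instrumental"] = True
    return params


print(build_suno_music_params("chirp-v4", description="An upbeat synth-pop song about summer"))
```
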
suno-ai/add-vocals, suno-ai/add-instrumental, suno-ai/extend, suno-ai/cover

| Parameter | Type | Required | Description |
|---|---|---|---|
| mv | string | Yes | Model version |
| clip_id | string | Yes* | Existing clip ID. *Required if no audio_url |
| audio_url | string | Yes* | Audio file URL (alternative to clip_id). *Required if no clip_id |
| custom | boolean | Yes | Simple or custom mode |
| gpt_description_prompt | string | No | Simple mode description |
| prompt | string | No | Custom mode prompt |
| continue_at | number | No | Extend: time in seconds to continue from |
| start_s | number | No | Start time for overlay |
| end_s | number | No | End time for overlay |

suno-ai/stems, suno-ai/stems-all

| Parameter | Type | Required | Description |
|---|---|---|---|
| clip_id | string | Yes | Clip ID to extract stems from |
| title | string | No | Title for extraction |

suno-ai/lyrics

| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Description of lyrics to generate |
| mv | string | Yes | Lyrics model: remi-v1 or default |

higgsfield-ai/tts

| Parameter | Type | Required | Description |
|---|---|---|---|
| voice_id | string | Yes | Voice ID |
| prompt | string | Yes | Text to convert to speech |
| sound_id | string | No | Background sound ID |
| similarity_boost | integer | No | 0-100 (default 90) |
| style | integer | No | 0-100 (default 60) |
| speed | number | No | 0-1.2 (default 1.1) |
| stability | integer | No | 0-100 (default 30) |

higgsfield-ai/speak

| Parameter | Type | Required | Description |
|---|---|---|---|
| image_url | string | Yes | Person/avatar image URL |
| audio_url | string | Yes* | Audio file URL. *Required if no audio_task_id |
| audio_task_id | string | Yes* | TTS task ID (alternative to audio_url). *Required if no audio_url |
| prompt | string | Yes | Text description for video |
| quality | string | No | standard or high |
| enhance_prompt | boolean | No | Enhance prompt (default true) |
| seed | integer | No | Seed for reproducibility |

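
Since audio_task_id accepts a TTS task ID, the output of a higgsfield-ai/tts task can feed a higgsfield-ai/speak task without an intermediate download. The sketch below builds the speak parameters from either audio source. Rejecting the case where both sources are given is an assumption of this example, because the page does not say how the API resolves that. The function name is hypothetical.

```python
def build_speak_params(image_url, prompt, *, audio_url=None, audio_task_id=None,
                       quality="standard", enhance_prompt=True, seed=None):
    """Build higgsfield-ai/speak params from an image and exactly one audio source."""
    # Assumption: treat "both given" as an error rather than guessing precedence.
    if (audio_url is None) == (audio_task_id is None):
        raise ValueError("provide exactly one of audio_url or audio_task_id")
    if quality not in ("standard", "high"):
        raise ValueError("quality must be 'standard' or 'high'")
    params = {
        "image_url": image_url,
        "prompt": prompt,
        "quality": quality,
        "enhance_prompt": enhance_prompt,
    }
    if audio_url is not None:
        params["audio_url"] = audio_url
    else:
        params["audio_task_id"] = audio_task_id  # ID of a finished TTS task
    if seed is not None:
        params["seed"] = seed
    return params


print(build_speak_params(
    "https://example.com/avatar.png",
    "A presenter speaking to the camera",
    audio_task_id="tts_task_123",  # hypothetical task ID
))
```
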
Rate limits

| Endpoint Type | Limit | Ban Duration |
|---|---|---|
| Generation (POST) | 10 req/sec | 60 seconds |
| Polling (GET) | 10 req/sec | 60 seconds |
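
Exceeding either limit triggers a 60 second ban, so it is usually cheaper to pace requests on the client. The sketch below spaces calls at a fixed minimum interval. It is a simple pacing scheme rather than a description of how the server counts requests, and the class name is hypothetical. A value somewhat below 10 per second leaves headroom for clock jitter.

```python
import time


class MinIntervalThrottle:
    """Space out calls so that consecutive requests are at least 1/max_per_second apart."""

    def __init__(self, max_per_second=8, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second
        self.clock = clock
        self.sleep = sleep
        self.next_allowed = 0.0

    def wait(self):
        """Block until the next request may be sent; return the seconds slept."""
        now = self.clock()
        delay = self.next_allowed - now
        if delay > 0:
            self.sleep(delay)
            now += delay
        # Reserve the next slot relative to when this request goes out.
        self.next_allowed = now + self.min_interval
        return max(delay, 0.0)


throttle = MinIntervalThrottle(max_per_second=8)
for task_id in ("task_1", "task_2", "task_3"):  # hypothetical task IDs
    throttle.wait()
    print("would poll", task_id)
```

Use one throttle instance per endpoint type, since the table lists the POST and GET limits separately.
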

Error codes

| Code | Description |
|---|---|
| 400 | Bad Request - Invalid parameters |
| 401 | Unauthorized - Invalid or missing API key |
| 402 | Payment Required - Insufficient balance |
| 404 | Not Found - Task or model not found |
| 429 | Too Many Requests - Rate limited |
| 500 | Internal Server Error |
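
A client usually needs to decide which of these codes are worth retrying. The sketch below shows one possible policy: wait out the 60 second ban after a 429, back off exponentially after a 500, and surface every other code to the caller. The policy itself is an assumption made for illustration. The page lists the codes but does not prescribe retry behavior.

```python
ERROR_DESCRIPTIONS = {
    400: "Bad Request: invalid parameters",
    401: "Unauthorized: invalid or missing API key",
    402: "Payment Required: insufficient balance",
    404: "Not Found: task or model not found",
    429: "Too Many Requests: rate limited",
    500: "Internal Server Error",
}

RATE_LIMIT_BAN_SECONDS = 60.0  # ban duration from the rate limit table


def retry_delay(status_code, attempt):
    """Return seconds to wait before retrying, or None if the error should not be retried.

    attempt is the zero-based count of retries already made.
    """
    if status_code == 429:
        return RATE_LIMIT_BAN_SECONDS
    if status_code == 500:
        return min(2.0 ** attempt, 30.0)  # assumption: capped exponential backoff
    return None  # 400, 401, 402, 404: fix the request instead of retrying


for code in (402, 429, 500):
    print(code, ERROR_DESCRIPTIONS[code], retry_delay(code, attempt=2))
```
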