Skip to main content
POST
/
v1
/
tasks
Unified Tasks API
curl --request POST \
  --url https://api.unifically.com/v1/tasks \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "<string>",
  "input": {},
  "callback_url": "<string>",
  "dry_run": true
}
'
{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "pending"
  }
}
The /v1/tasks endpoint provides a unified interface for all AI generation models (video, image, audio).
This is the unified API for all AI generation models. Use this endpoint for all integrations.

Create Task

POST /v1/tasks Creates a new generation task for any supported model.

Request

curl -X POST https://api.unifically.com/v1/tasks \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/veo-3.1-fast",
    "input": {
      "prompt": "A cat walking on the beach at sunset"
    },
    "callback_url": "https://your-server.com/webhook"
  }'

Request Parameters

model
string
required
Model identifier in provider/model-name format. See Available Models below.
input
object
required
Model-specific input parameters. See Input Parameters for details.
callback_url
string
Optional webhook URL. When provided, the API sends a POST request to this URL when the task completes or fails. See Webhooks & Callbacks for payload formats and details.
dry_run
boolean
Optional. When set to true, the request is validated and the cost is calculated without actually creating a task or deducting from your balance. Useful for previewing the price of a request before committing.

Response

{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "pending"
  }
}

Dry Run

To check the cost of a request without creating a task or deducting from your balance, set dry_run to true:
curl -X POST https://api.unifically.com/v1/tasks \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/veo-3.1-fast",
    "input": {
      "prompt": "A cat walking on the beach at sunset"
    },
    "dry_run": true
  }'
{
  "code": 200,
  "success": true,
  "data": {
    "cost": 0.40
  }
}

Get Task Status

GET /v1/tasks/:task_id Retrieves the status and output of a task.

Request

curl https://api.unifically.com/v1/tasks/abc123def456 \
  -H "Authorization: Bearer YOUR_API_KEY"

Path Parameters

task_id
string
required
The unique task ID returned from the create task endpoint

Response (Processing)

{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "processing"
  }
}

Response (Completed)

{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "completed",
    "output": {
      "video_url": "https://cdn.unifically.com/outputs/abc123.mp4"
    }
  }
}

Available Models

Video Generation

ModelDescription
google/veo-3.1-fastGoogle Veo 3.1 Fast
google/veo-3.1-qualityGoogle Veo 3.1 Quality
google/veo-3.1-liteGoogle Veo 3.1 Lite
google/veo-3.1-lite-relaxedGoogle Veo 3.1 Lite Relaxed
google/veo-3.1-extendGoogle Veo 3.1 Extend
google/veo-3.1-upscaleGoogle Veo 3.1 Upscale
google/gemini-omni-flash-videoGoogle Gemini Omni Flash Video
google/gemini-omni-flash-video-editGoogle Gemini Omni Flash Video Edit
hailuo/minimax-2.0Minimax Hailuo 2.0
hailuo/minimax-2.3Minimax Hailuo 2.3
hailuo/minimax-2.3-fastMinimax Hailuo 2.3 Fast
kuaishou/kling-3.0-omni-videoKling v3.0 Omni Video
kuaishou/kling-3.0-omni-video-editKling v3.0 Omni Video Edit
kuaishou/kling-o1-videoKling O1 Video
kuaishou/kling-o1-video-editKling O1 Video Edit
kuaishou/kling-3.0-videoKling v3.0 Video
kuaishou/kling-3.0-turbo-videoKling v3.0 Turbo Video
kuaishou/kling-2.6-videoKling v2.6 Video
kuaishou/kling-2.5-turbo-videoKling v2.5 Turbo Video
kuaishou/kling-2.1-videoKling v2.1 Video
kuaishou/kling-2.1-master-videoKling v2.1 Master Video
kuaishou/kling-2.6-motion-controlKling v2.6 Motion Control
kuaishou/kling-3.0-motion-controlKling v3.0 Motion Control
xai/grok-imagine-video-extendGrok Imagine Video Extend
topaz-labs/video-upscaleTopaz Video Upscale

Image Generation

ModelDescription
google/nano-bananaNano Banana
google/nano-banana-proNano Banana Pro
openai/gpt-image-2GPT Image 2 (1K/2K/4K, multiple aspect ratios)
black-forest-labs/flux.2-proFlux.2 Pro
black-forest-labs/flux.2-flexFlux.2 Flex
black-forest-labs/flux.2-maxFlux.2 Max
kuaishou/kling-o1-imageKling O1 Image
kuaishou/kling-3.0-omni-imageKling v3.0 Omni Image
kuaishou/kling-3.0-imageKling v3.0 Image
kuaishou/kling-2.1-imageKling v2.1 Image
topaz-labs/image-upscaleTopaz Image Upscale
topaz-labs/image-generativeTopaz Image Generative
alibaba/qwen-image-2.0-proQwen Image 2.0 Pro (T2I + editing)
alibaba/qwen-image-2.0Qwen Image 2.0 (T2I + editing)
alibaba/qwen-image-maxQwen Image Max (T2I + editing)
alibaba/qwen-image-plusQwen Image Plus (T2I + editing)
alibaba/qwen-imageQwen Image (T2I + editing)
alibaba/z-image-turboZ-Image Turbo (T2I only)
alibaba/wan-2.7-pro-imageWan 2.7 Pro Image (T2I + editing, up to 4K)
alibaba/wan-2.7-imageWan 2.7 Image (T2I + editing)
alibaba/wan-2.6-imageWan 2.6 Image (T2I + editing)
alibaba/wan-2.5-imageWan 2.5 Image (T2I + editing)
alibaba/wan-2.2-imageWan 2.2 Image (T2I only)
alibaba/wan-2.2-flash-imageWan 2.2 Flash Image (T2I only)
xai/grok-imagine-imageGrok Imagine Image (T2I + editing)

Audio Generation

ModelDescription
suno-ai/musicSuno Music Generation
suno-ai/add-vocalsAdd Vocals to Track
suno-ai/add-instrumentalAdd Instrumental
suno-ai/extendExtend Audio
suno-ai/coverCreate Cover
suno-ai/stemsExtract Stems
suno-ai/stems-allExtract All Stems
suno-ai/lyricsGenerate Lyrics
suno-ai/wavWAV Export
elevenlabs/text-to-speechElevenLabs Text-to-Speech
elevenlabs/text-to-dialogueElevenLabs Multi-Voice Dialogue
elevenlabs/sound-effectElevenLabs Sound Effects
elevenlabs/voice-isolationElevenLabs Voice Isolation
elevenlabs/speech-to-textElevenLabs Speech-to-Text

Model Parameters

All tasks support callback_url (outside the input object) for webhook notifications. See Webhooks & Callbacks for full payload formats and best practices.

Google Veo 3.1

Generate

Models: google/veo-3.1-fast, google/veo-3.1-quality, google/veo-3.1-lite, google/veo-3.1-lite-relaxed Veo supports text-to-video, first-frame, first-and-last-frame, and reference-to-video workflows. Frame mode and reference mode are mutually exclusive.
ModeFieldsAvailability
Frame modestart_image_url [+ end_image_url]All models
Reference modereference_image_urls, reference_characters [+ voice]Image references are available on all modes. Character references are available on Fast, Lite, and Lite Relaxed only.
ParameterTypeRequiredDescription
promptstringYesText prompt for video generation. Use @ImageN or @CharacterN to point at specific references.
aspect_ratiostringNo"16:9" (default) or "9:16"
durationintegerNo4, 6, or 8 seconds. Default 4. Must be 8 when any image or character reference is set.
seedintegerNoReproducibility seed
start_image_urlstringNoPublic image URL used as the first frame. Cannot be combined with reference_image_urls or reference_characters.
end_image_urlstringNoPublic image URL used as the final frame. Requires start_image_url; cannot be used by itself.
reference_image_urlsstring[]NoImage references for reference-to-video. Max 3 total expanded image URLs across reference_image_urls and character images. Cannot be combined with start/end frame fields.
reference_charactersarrayNoCharacter references. Max 3 total expanded image URLs across images and character image_urls. Not available on Quality.
voicestringNoVoice preset ID. Requires at least 1 image or character reference. See voices endpoint.
Character items must be objects with image_urls, plus optional name and description. image_url and plain string character entries are not supported. Rejected combinations: end_image_url without start_image_url; frame fields with reference fields; reference_characters on google/veo-3.1-quality; any image or character reference with duration other than 8; more than 3 total expanded image URLs; empty character image_urls; character image_url; plain string character entries.

Extend

Model: google/veo-3.1-extend Extend a previously generated video. Aspect ratio is inherited from the source task.
ParameterTypeRequiredDescription
promptstringYesText prompt for the extended content
task_idstringYesTask ID of a completed generation
modelstringYesOne of: lite, fast, quality, lite-relaxed
durationintegerNoMust be 8 (only supported value for extend). Default 8.
seedintegerNoReproducibility seed

Upscale

Model: google/veo-3.1-upscale Upscale a completed video to a higher resolution.
ParameterTypeRequiredDescription
task_idstringYesTask ID of a completed generation
resolutionstringYes"1080p" or "4k"

Google Gemini Omni Flash Video

Model: google/gemini-omni-flash-video Generate 4, 6, 8, or 10 second clips in text-to-video, start-frame, or reference-to-video mode.
ParameterTypeRequiredDescription
promptstringYesMain video prompt. Use @ImageN or @CharacterN to point at specific references.
seedintegerNoReproducibility seed. If omitted, one is generated automatically.
aspect_ratiostringNo"16:9" (default) or "9:16"
durationintegerNo4, 6, 8, or 10 seconds. Default 4.
start_image_urlstringNoPublic image URL used as the first frame. Cannot be combined with reference_image_urls or reference_characters.
reference_image_urlsstring[]NoPublic image reference URLs. Max 7 total expanded image + character image URLs. Cannot be combined with start/end frame fields.
reference_charactersarrayNoCharacter references. Max 3 character items; expanded image URLs count toward the total 7 reference limit.
voicestringNoRequest-level voice preset ID. Requires at least one image or character reference. See voices endpoint.
Start/end frame support is now available. For now, only start frame control (start_image_url) is available — end frame support is not yet available on Google’s end, but it’s coming in an upcoming Google update.

Google Gemini Omni Flash Video Edit

Model: google/gemini-omni-flash-video-edit Edit an existing uploaded video. Provide exactly one source video URL in reference_video_urls.
ParameterTypeRequiredDescription
reference_video_urlsstring[]YesPublic source video URL. Must contain exactly one URL.
promptstringYesEdit instruction. Use @Video1 to refer to the source video and @ImageN/@CharacterN for extra references.
reference_image_urlsstring[]NoPublic image URLs used as edit references.
reference_charactersarrayNoCharacter references for the edit. Max 3 character items. Supports per-character voice or custom_voice. See voices endpoint.
seedintegerNoReproducibility seed. If omitted, one is generated automatically.
start_frameintegerNoFirst source frame index included in the edit range. Default 0.
end_frameintegerNoLast source frame index included in the edit range. Defaults to the detected final frame when available.
Limits: one video reference, up to 3 character references, up to 7 total video + image + character references, uploaded source video up to 1 GB and up to 30 seconds. Rejected combinations: task_id; missing, empty, or multiple reference_video_urls; end_frame lower than start_frame; more than 7 total references; empty character image_urls; character image_url; plain string character entries.

Minimax Hailuo

Models: hailuo/minimax-2.0, hailuo/minimax-2.3, hailuo/minimax-2.3-fast
ParameterTypeRequiredDescription
promptstringYes*Max 2000 chars. *Required if no start_image_url
start_image_urlstringYes*Image URL (auto-uploaded). *Required if no prompt (required for 2.3-fast)
end_image_urlstringNoEnd frame image URL (minimax-2.0 only, 768p/1080p)
durationintegerNo6 or 10 seconds. 1080p only supports 6
resolutionstringNo"768p" (default), "1080p"
prompt_optimizationbooleanNoLet MiniMax optimize prompt

Kling v3.0 Omni Video

Model: kuaishou/kling-3.0-omni-video
ParameterTypeRequiredDescription
video_modestringNo"elements" (default), "start_end_frame", "transform", "video_reference"
promptstringConditionalText prompt. Mutually exclusive with multi_shots
modestringNo"pro" (default). "std" (720p) or "pro" (1080p)
durationintegerNo3–15 seconds (default 5)
aspect_ratiostringNo"16:9" (default), "9:16", "1:1", "auto" (start_end_frame only)
native_audiobooleanNoGenerate AI audio (default false)
keep_audiobooleanNoPreserve audio from source video (default true)
image_urlsstring[]NoUp to 7 reference image URLs. Use @Image1, @Image2 in prompt
start_frame_urlstringNoFirst frame image URL (start_end_frame mode)
end_frame_urlstringNoLast frame image URL (start_end_frame mode)
video_urlstringNoSource video URL (transform/video_reference modes)
multi_shotsarrayNo2–6 shots, each { "prompt": string, "duration": int }. Mutually exclusive with prompt
elementsarrayNoCharacter/object elements (IMAGE + VIDEO)

Kling O1 Video

Model: kuaishou/kling-o1-video Same parameters as Omni 3.0 but does not support multi_shots or native_audio. Max duration 10s.
ParameterTypeRequiredDescription
video_modestringNo"elements" (default), "start_end_frame", "transform", "video_reference"
promptstringYesText prompt
modestringNo"pro" (default). "std" (720p) or "pro" (1080p)
durationintegerNo3–10 seconds (default 5)
aspect_ratiostringNo"16:9" (default), "9:16", "1:1", "auto" (start_end_frame only)
keep_audiobooleanNoPreserve audio from source video (default true)
image_urlsstring[]NoUp to 7 reference image URLs. Use @Image1, @Image2 in prompt
start_frame_urlstringNoFirst frame image URL (start_end_frame mode)
end_frame_urlstringNoLast frame image URL (start_end_frame mode)
video_urlstringNoSource video URL (transform/video_reference modes)

Kling v3.0 Omni Video Edit

Model: kuaishou/kling-3.0-omni-video-edit
ParameterTypeRequiredDescription
video_urlstringYesSource video URL to edit
promptstringYesText prompt describing the edit
video_modestringNo"reference" (default) or "transform"
keep_audiobooleanNoPreserve original audio (default false)
modestringNo"std" (default) or "pro"
aspect_ratiostringNo"16:9" (default), "9:16", "1:1"
image_urlsstring[]NoUp to 4 reference image URLs. Use @Image1, @Image2 in prompt
elementsarrayNoUp to 4 character/object elements
Duration is locked to the input video length.

Kling O1 Video Edit

Model: kuaishou/kling-o1-video-edit Same parameters as Omni 3.0 video edit but does not support elements.
ParameterTypeRequiredDescription
video_urlstringYesSource video URL to edit
promptstringYesText prompt describing the edit
video_modestringNo"reference" (default) or "transform"
keep_audiobooleanNoPreserve original audio (default false)
modestringNo"std" (default) or "pro"
aspect_ratiostringNo"16:9" (default), "9:16", "1:1"
image_urlsstring[]NoUp to 4 reference image URLs. Use @Image1, @Image2 in prompt

Kling v3.0 Video

Model: kuaishou/kling-3.0-video
ParameterTypeRequiredDescription
promptstringConditionalText prompt. Mutually exclusive with multi_shots
modestringNo"pro" (default). "std" (720p) or "pro" (1080p)
durationintegerNo3–15 seconds (default 5)
aspect_ratiostringNo"16:9" (default), "9:16", "1:1"
native_audiobooleanNoGenerate AI audio (default true)
start_frame_urlstringYesFirst frame image URL
end_frame_urlstringNoLast frame image URL
elementsarrayNoCharacter/object elements
multi_shotsarrayNo2–6 shots, each { "prompt": string, "duration": int }. Mutually exclusive with prompt

Kling v3.0 Turbo Video

Model: kuaishou/kling-3.0-turbo-video Faster variant of Kling v3.0. Text-to-video or optional start-frame image-to-video only. No native audio, multi-shot, end frame, or 4K.
ParameterTypeRequiredDescription
promptstringYesText prompt
modestringNo"pro" (default). "std" (720p) or "pro" (1080p)
durationintegerNo3–15 seconds (default 5)
aspect_ratiostringNo"16:9" (default), "9:16", "1:1"
start_frame_urlstringNoFirst frame image URL. Omit for text-to-video

Kling v2.6 Video

Model: kuaishou/kling-2.6-video
ParameterTypeRequiredDescription
promptstringYesText prompt
modestringNo"pro" (default). "std" (720p) or "pro" (1080p)
durationintegerNo5 or 10 seconds
native_audiobooleanNoEnable AI audio generation (default false). Requires pro mode
start_frame_urlstringYesFirst frame image URL
end_frame_urlstringNoLast frame image URL (not available with native_audio)
voicesarrayNoVoice references (max 5, requires native_audio). Each: { "voice_id": int } or { "voice_url": string }

Kling v2.5 Turbo Video

Model: kuaishou/kling-2.5-turbo-video
ParameterTypeRequiredDescription
promptstringYesText prompt
modestringNo"pro" (default). "std" (720p) or "pro" (1080p)
durationintegerNo5 or 10 seconds
aspect_ratiostringNo"16:9" (default), "9:16", "1:1". Ignored when start_frame_url is set
start_frame_urlstringNoFirst frame image URL
end_frame_urlstringNoLast frame image URL
sound_effectsobjectNo{ "sound": string, "music": string, "asmr_mode": boolean }. Omit to disable audio

Kling v2.1 Video

Model: kuaishou/kling-2.1-video Image-to-video only.
ParameterTypeRequiredDescription
promptstringYesText prompt
start_frame_urlstringYesFirst frame image URL
end_frame_urlstringNoLast frame image URL
durationintegerNo5 or 10 seconds
modestringNo"pro" (default). "std" or "pro"
sound_effectsobjectNo{ "sound": string, "music": string, "asmr_mode": boolean }. Omit to disable audio

Kling v2.1 Master Video

Model: kuaishou/kling-2.1-master-video Pro-only. No end frame support.
ParameterTypeRequiredDescription
promptstringYesText prompt
durationintegerNo5 or 10 seconds
start_frame_urlstringNoFirst frame image URL (optional)
sound_effectsobjectNo{ "sound": string, "music": string, "asmr_mode": boolean }. Omit to disable audio

Kling v3.0 Motion Control

Model: kuaishou/kling-3.0-motion-control
ParameterTypeRequiredDescription
promptstringYesText prompt describing the motion
image_urlstringYesCharacter/subject image URL
video_urlstringYesMotion reference video URL
modestringNo"std" (default) or "pro"
keep_audiobooleanNoPreserve audio from motion video (default true)
character_orientationstringNo"video" (default) or "image"
elementsarrayNoAdditional character/object elements

Kling v2.6 Motion Control

Model: kuaishou/kling-2.6-motion-control
ParameterTypeRequiredDescription
promptstringYesText prompt describing the motion
image_urlstringYesCharacter/subject image URL
video_urlstringYesMotion reference video URL
modestringNo"std" (default) or "pro"
keep_audiobooleanNoPreserve audio from motion video (default true)
character_orientationstringNo"video" (default) or "image"

Grok Imagine Video Extend

Model: xai/grok-imagine-video-extend Extend a previously generated video via HTTP streaming. Two mutually exclusive modes:
ModeHow to activateBehaviour
PresetProvide video_presetThe preset controls the video style; prompt, extend_at, extend_duration are ignored
CustomOmit video_presetYou control timing and prompt; prompt, extend_at, extend_duration are required
ParameterTypeRequiredDescription
task_idstringYesTask ID of a completed video generation
video_presetstringNo"spicy" or "normal". Enables preset mode
promptstringNoText prompt to guide the extension. Required in custom mode
extend_atfloatNoSecond to start the extension from. Required in custom mode
extend_durationintNo6 or 10 seconds. Required in custom mode

GPT Image

Models: openai/gpt-image-2
ParameterTypeRequiredDescription
promptstringYesText description
image_urlsarrayNoReference image URLs for image editing mode
aspect_ratiostringNo1:1, 3:2, 2:3, 16:9. Default: 1:1
resolutionstringNo1K, 2K, 4K. Default: 1K

Nano Banana

Models: google/nano-banana, google/nano-banana-pro
ParameterTypeRequiredDescription
promptstringYesText description
aspect_ratiostringYes1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
image_urlsarrayNoReference images
resolutionstringNoPro only: 1k, 2k, 4k

Flux.2

Models: black-forest-labs/flux.2-pro, black-forest-labs/flux.2-flex, black-forest-labs/flux.2-max
ParameterTypeRequiredDescription
promptstringYesText description
image_urlsarrayNoReference images (Pro/Max: 8, Flex: 10)
aspect_ratiostringNoauto, 1:1, 4:3, 16:9, 3:2, 2:3, 9:16, 3:4 (Max also: 5:4, 21:9)
qualitystringNo1K or 2K
stepsintegerNoFlex only: 1-50 (more = higher quality)
cfgnumberNoFlex only: 1.5-10 (higher = follows prompt more strictly)

Qwen Image 2.0 Pro

Model: alibaba/qwen-image-2.0-pro$0.0525/image Best quality. Text rendering, realistic textures. Automatically switches between T2I and editing based on whether image_urls is provided.
ParameterTypeRequiredDescription
promptstringYesText prompt (max 800 chars)
aspect_ratiostringNo1:1 (default), 16:9, 9:16, 4:3, 3:4
image_urlsstring[]NoOmit for T2I. Provide image URLs for editing
negative_promptstringNoWhat to avoid (max 500 chars)
prompt_extendbooleanNoSmart prompt rewriting (default true)
seedintegerNoSeed for reproducibility

Qwen Image 2.0

Model: alibaba/qwen-image-2.0$0.0245/image Faster version of 2.0 Pro. Same capabilities and parameters.

Qwen Image Max

Model: alibaba/qwen-image-max — T2I 0.0525/image/Edit0.0525/image** / Edit **0.0525/image Highest realism, fewest AI artifacts. Editing uses a specialized edit model under the hood (industrial design, geometric reasoning, character consistency). Same parameters as Qwen Image 2.0 Pro.

Qwen Image Plus

Model: alibaba/qwen-image-plus — T2I 0.021/image/Edit0.021/image** / Edit **0.021/image Diverse artistic styles, fast. Editing uses a specialized edit model under the hood. Same parameters as Qwen Image 2.0 Pro.

Qwen Image

Model: alibaba/qwen-image — T2I 0.0245/image/Edit0.0245/image** / Edit **0.0315/image Older base model. Editing uses a specialized edit model under the hood. Same parameters as Qwen Image 2.0 Pro.

Z-Image Turbo

Model: alibaba/z-image-turbo — **0.0105/image(or0.0105/image** (or 0.021 with prompt rewriting) Lightweight fast T2I only. Chinese and English text rendering.
ParameterTypeRequiredDescription
promptstringYesText prompt (max 800 chars)
aspect_ratiostringNo1:1 (default), 2:3, 3:2, 3:4, 4:3, 9:16, 16:9
prompt_extendbooleanNoPrompt rewriting (default false, doubles cost)
seedintegerNoSeed for reproducibility

Wan 2.7 Pro Image

Model: alibaba/wan-2.7-pro-image$0.0525/image Highest quality. Thinking mode for T2I. Supports editing with up to 9 images. Up to 4K resolution for T2I.
ParameterTypeRequiredDescription
promptstringYesText prompt (max 5000 chars)
aspect_ratiostringNo1:1 (default), 16:9, 9:16, 4:3, 3:4, 3:2, 2:3. Editing preserves input ratio
image_urlsstring[]NoOmit for T2I. Up to 9 images for editing
thinking_modebooleanNoBetter quality, slower (default true). T2I only
seedintegerNoSeed for reproducibility

Wan 2.7 Image

Model: alibaba/wan-2.7-image$0.021/image Faster variant of 2.7 Pro. Same capabilities, max 2K resolution. Same parameters as Wan 2.7 Pro Image.

Wan 2.6 Image

Model: alibaba/wan-2.6-image$0.021/image Automatically selects T2I or editing mode based on image_urls. Supports style transfer with 1–4 reference images.
ParameterTypeRequiredDescription
promptstringYesText prompt (max 2000 chars)
aspect_ratiostringNo1:1 (default), 2:3, 3:2, 3:4, 4:3, 9:16, 16:9
image_urlsstring[]NoOmit for T2I. 1–4 images for editing/style transfer
negative_promptstringNoWhat to avoid (max 500 chars)
prompt_extendbooleanNoSmart prompt rewriting (default true)
seedintegerNoSeed for reproducibility

Wan 2.5 Image

Model: alibaba/wan-2.5-image$0.021/image Automatically selects T2I or editing mode based on image_urls. Supports 1–3 reference images. Same parameters as Wan 2.6 Image.

Wan 2.2 Image

Model: alibaba/wan-2.2-image$0.035/image T2I only. Does not accept image_urls.
ParameterTypeRequiredDescription
promptstringYesText prompt (max 500 chars)
aspect_ratiostringNo1:1 (default), 3:4, 4:3, 9:16, 16:9
negative_promptstringNoWhat to avoid
seedintegerNoSeed for reproducibility

Wan 2.2 Flash Image

Model: alibaba/wan-2.2-flash-image$0.0175/image Fast T2I only. Cheapest Wan image model. Same parameters as Wan 2.2 Image.

Grok Imagine Image

Model: xai/grok-imagine-image — Pro mode: $0.025/image Generate and edit images using xAI’s Grok Imagine model. When image_urls is provided, the model runs in edit mode.
ParameterTypeRequiredDescription
promptstringYesText description or edit instruction
aspect_ratiostringNo"1:1" (default), "2:3", "3:2", "9:16", "16:9"
image_urlsstring[]No1–5 reference image URLs (triggers edit mode)
enable_probooleanNoEnable pro mode for higher quality results
upsample_promptbooleanNoLet AI enhance your prompt for better results
enable_nsfwbooleanNoEnable NSFW content generation

Suno Music

Model: suno-ai/music
ParameterTypeRequiredDescription
mvstringYesModel version: chirp-v3-5, chirp-v4, chirp-auk, chirp-bluejay, chirp-crow
custombooleanYesfalse for simple mode, true for custom mode
gpt_description_promptstringNoSimple mode: song description with lyrics
promptstringNoCustom mode: detailed lyrics/prompt
tagsstringNoCustom mode: genre/style tags
titlestringNoSong title
make_instrumentalbooleanNoGenerate instrumental only
negative_tagsstringNoCustom mode: styles to avoid
persona_idstringNoCustom voice ID from Suno voice creation; music uses that voice for vocals

Suno Audio Operations

Models: suno-ai/add-vocals, suno-ai/add-instrumental, suno-ai/extend, suno-ai/cover
ParameterTypeRequiredDescription
mvstringYesModel version
clip_idstringYes*Existing clip ID
audio_urlstringYes*Audio file URL (alternative to clip_id)
custombooleanYesSimple or custom mode
gpt_description_promptstringNoSimple mode description
promptstringNoCustom mode prompt
continue_atnumberNoExtend: time in seconds to continue from
start_snumberNoStart time for overlay
end_snumberNoEnd time for overlay

Suno Stems

Models: suno-ai/stems, suno-ai/stems-all
ParameterTypeRequiredDescription
clip_idstringYesClip ID to extract stems from
titlestringNoTitle for extraction

Suno Lyrics

Model: suno-ai/lyrics
ParameterTypeRequiredDescription
promptstringYesDescription of lyrics to generate
mvstringYesLyrics model: remi-v1 or default

Rate Limit Error Response

{
  "success": false,
  "code": 429,
  "data": {
    "message": "You have been ratelimited, this temporary restriction will be lifted in: 45 seconds"
  }
}

Error Responses

CodeDescription
400Bad Request - Invalid parameters
401Unauthorized - Invalid or missing API key
402Payment Required - Insufficient balance
404Not Found - Task or model not found
429Too Many Requests - Rate limited
500Internal Server Error

Error Response Format

{
  "success": false,
  "code": 400,
  "data": {
    "message": "Description of the error",
    "request_id": "abc123"
  }
}