Unified Tasks API


The /v1/tasks endpoint provides a single, unified interface to every supported AI generation model (video, image, and audio). Use it for all integrations.

Create Task

POST /v1/tasks

Creates a new generation task for any supported model.

Request

curl -X POST https://api.unifically.com/v1/tasks \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/veo-3.1-fast",
    "input": {
      "prompt": "A cat walking on the beach at sunset"
    },
    "callback_url": "https://your-server.com/webhook"
  }'

Request Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| model | string | Yes | Model identifier in provider/model-name format. See Available Models below. |
| input | object | Yes | Model-specific input parameters. See Input Parameters for details. |
| callback_url | string | No | Webhook URL. When provided, the API sends a POST request to this URL when the task completes or fails. See Webhooks & Callbacks for payload formats and details. |
| dry_run | boolean | No | When set to true, the request is validated and its cost is calculated without creating a task or deducting from your balance. Useful for previewing the price of a request before committing. |

Response

{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "pending"
  }
}
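
For integrations written in Python, the same request can be assembled with the standard library alone. The following is a minimal sketch, not an official client: the helper names build_create_task_request and send are illustrative, and only the URL, headers, and body fields come from this page.

```python
import json
import urllib.request

API_URL = "https://api.unifically.com/v1/tasks"


def build_create_task_request(api_key, model, input_params,
                              callback_url=None, dry_run=False):
    """Build (without sending) a POST /v1/tasks request."""
    body = {"model": model, "input": input_params}
    # callback_url and dry_run sit at the top level, outside "input".
    if callback_url is not None:
        body["callback_url"] = callback_url
    if dry_run:
        body["dry_run"] = True
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request):
    """Send a prepared request and decode the JSON response body."""
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Keeping construction separate from sending makes the payload easy to inspect or log before any balance is spent.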

Dry Run

To check the cost of a request without creating a task or deducting from your balance, set dry_run to true:
curl -X POST https://api.unifically.com/v1/tasks \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/veo-3.1-fast",
    "input": {
      "prompt": "A cat walking on the beach at sunset"
    },
    "dry_run": true
  }'

The response reports the calculated cost:

{
  "code": 200,
  "success": true,
  "data": {
    "cost": 0.40
  }
}
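
One way to use a dry run is as a budget gate before submitting the real task. The sketch below assumes the response envelope shown above; the function names and the idea of a caller-supplied budget are illustrative, and the cost is compared as a plain number because this page does not state its unit.

```python
def dry_run_cost(envelope):
    """Return the quoted cost from a dry-run response envelope."""
    if not envelope.get("success"):
        message = (envelope.get("data") or {}).get("message", "unknown error")
        raise ValueError(f"dry run failed: {message}")
    return envelope["data"]["cost"]


def within_budget(envelope, budget):
    """True when the quoted cost does not exceed the caller's budget."""
    return dry_run_cost(envelope) <= budget
```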

Get Task Status

GET /v1/tasks/:task_id

Retrieves the status and output of a task.

Request

curl https://api.unifically.com/v1/tasks/abc123def456 \
  -H "Authorization: Bearer YOUR_API_KEY"

Path Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| task_id | string | Yes | The unique task ID returned by the create task endpoint |

Response (Processing)

{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "processing"
  }
}

Response (Completed)

{
  "code": 200,
  "success": true,
  "data": {
    "task_id": "abc123def456",
    "status": "completed",
    "output": {
      "video_url": "https://cdn.unifically.com/outputs/abc123.mp4"
    }
  }
}
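
When no callback_url is set, a client has to poll this endpoint. The loop below is a hedged sketch: fetch_task is a hypothetical callable that performs the GET and returns the decoded envelope, the interval and attempt limit are arbitrary, and "failed" is an assumed name for the failure status, which this page does not spell out.

```python
import time

# "completed" appears in the responses above; "failed" is an assumption.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_task(fetch_task, task_id, interval=5.0, max_attempts=60,
                  sleep=time.sleep):
    """Poll until the task reaches a terminal status and return its data."""
    for attempt in range(max_attempts):
        data = fetch_task(task_id)["data"]
        if data["status"] in TERMINAL_STATUSES:
            return data
        # Do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError(f"task {task_id} still running after {max_attempts} polls")
```

Passing sleep as a parameter keeps the loop testable without real delays.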

Available Models

Video Generation

| Model | Description |
| --- | --- |
| google/veo-3.1-fast | Google Veo 3.1 Fast |
| google/veo-3.1-fast-relaxed | Google Veo 3.1 Fast Relaxed |
| google/veo-3.1-quality | Google Veo 3.1 Quality |
| google/veo-3.1-lite | Google Veo 3.1 Lite |
| google/veo-3.1-lite-relaxed | Google Veo 3.1 Lite Relaxed |
| google/veo-3.1-extend | Google Veo 3.1 Extend |
| google/veo-3.1-upscale | Google Veo 3.1 Upscale |
| hailuo/minimax-2.0 | Minimax Hailuo 2.0 |
| hailuo/minimax-2.3 | Minimax Hailuo 2.3 |
| hailuo/minimax-2.3-fast | Minimax Hailuo 2.3 Fast |
| higgsfield-ai/lite | Higgsfield Lite |
| higgsfield-ai/turbo | Higgsfield Turbo |
| higgsfield-ai/standard | Higgsfield Standard |
| higgsfield-ai/cinematic-studio-video-3.5 | Cinematic Studio Video v3.5 |
| higgsfield-ai/cinematic-studio-video-3.0 | Cinematic Studio Video v3.0 |
| higgsfield-ai/cinematic-studio-video-2.5 | Cinematic Studio Video v2.5 |
| kuaishou/kling-3.0-omni-video | Kling v3.0 Omni Video |
| kuaishou/kling-3.0-omni-video-edit | Kling v3.0 Omni Video Edit |
| kuaishou/kling-o1-video | Kling O1 Video |
| kuaishou/kling-o1-video-edit | Kling O1 Video Edit |
| kuaishou/kling-3.0-video | Kling v3.0 Video |
| kuaishou/kling-2.6-video | Kling v2.6 Video |
| kuaishou/kling-2.5-turbo-video | Kling v2.5 Turbo Video |
| kuaishou/kling-2.1-video | Kling v2.1 Video |
| kuaishou/kling-2.1-master-video | Kling v2.1 Master Video |
| kuaishou/kling-2.6-motion-control | Kling v2.6 Motion Control |
| kuaishou/kling-3.0-motion-control | Kling v3.0 Motion Control |
| xai/grok-imagine-video-extend | Grok Imagine Video Extend |
| topaz-labs/video-upscale | Topaz Video Upscale |

Image Generation

| Model | Description |
| --- | --- |
| google/nano-banana | Nano Banana |
| google/nano-banana-pro | Nano Banana Pro |
| openai/gpt-image-2 | GPT Image 2 (1K/2K/4K, multiple aspect ratios) |
| higgsfield-ai/cinematic-studio-image | Cinematic Studio Image |
| black-forest-labs/flux.2-pro | Flux.2 Pro |
| black-forest-labs/flux.2-flex | Flux.2 Flex |
| black-forest-labs/flux.2-max | Flux.2 Max |
| kuaishou/kling-o1-image | Kling O1 Image |
| kuaishou/kling-3.0-omni-image | Kling v3.0 Omni Image |
| kuaishou/kling-3.0-image | Kling v3.0 Image |
| kuaishou/kling-2.1-image | Kling v2.1 Image |
| topaz-labs/image-upscale | Topaz Image Upscale |
| topaz-labs/image-generative | Topaz Image Generative |
| alibaba/qwen-image-2.0-pro | Qwen Image 2.0 Pro (T2I + editing) |
| alibaba/qwen-image-2.0 | Qwen Image 2.0 (T2I + editing) |
| alibaba/qwen-image-max | Qwen Image Max (T2I + editing) |
| alibaba/qwen-image-plus | Qwen Image Plus (T2I + editing) |
| alibaba/qwen-image | Qwen Image (T2I + editing) |
| alibaba/z-image-turbo | Z-Image Turbo (T2I only) |
| alibaba/wan-2.7-pro-image | Wan 2.7 Pro Image (T2I + editing, up to 4K) |
| alibaba/wan-2.7-image | Wan 2.7 Image (T2I + editing) |
| alibaba/wan-2.6-image | Wan 2.6 Image (T2I + editing) |
| alibaba/wan-2.5-image | Wan 2.5 Image (T2I + editing) |
| alibaba/wan-2.2-image | Wan 2.2 Image (T2I only) |
| alibaba/wan-2.2-flash-image | Wan 2.2 Flash Image (T2I only) |
| xai/grok-imagine-image | Grok Imagine Image (T2I + editing) |

Audio Generation

| Model | Description |
| --- | --- |
| suno-ai/music | Suno Music Generation |
| suno-ai/add-vocals | Add Vocals to Track |
| suno-ai/add-instrumental | Add Instrumental |
| suno-ai/extend | Extend Audio |
| suno-ai/cover | Create Cover |
| suno-ai/stems | Extract Stems |
| suno-ai/stems-all | Extract All Stems |
| suno-ai/lyrics | Generate Lyrics |
| suno-ai/wav | WAV Export |
| higgsfield-ai/text-to-speech | Text-to-Speech |
| elevenlabs/text-to-speech | ElevenLabs Text-to-Speech |
| elevenlabs/text-to-dialogue | ElevenLabs Multi-Voice Dialogue |
| elevenlabs/sound-effect | ElevenLabs Sound Effects |
| elevenlabs/voice-isolation | ElevenLabs Voice Isolation |
| elevenlabs/speech-to-text | ElevenLabs Speech-to-Text |

Model Parameters

All tasks support callback_url (outside the input object) for webhook notifications. See Webhooks & Callbacks for full payload formats and best practices.

Google Veo 3.1

Generate

Models: google/veo-3.1-fast, google/veo-3.1-fast-relaxed, google/veo-3.1-quality, google/veo-3.1-lite, google/veo-3.1-lite-relaxed

There are two mutually exclusive image modes; the API rejects requests that mix them:

| Mode | Fields | Availability |
| --- | --- | --- |
| Frame mode | start_image_url [+ end_image_url] | All models |
| Reference mode | reference_image_urls [+ voice] | Fast, Fast-relaxed, Lite, and Lite-relaxed |

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt for video generation |
| aspect_ratio | string | No | "16:9" (default) or "9:16" |
| duration | integer | No | 4, 6, or 8 seconds. Default 4. Must be 8 when reference_image_urls is set. |
| seed | integer | No | Reproducibility seed |
| start_image_url | string | No | Start frame image URL. Cannot be combined with reference_image_urls |
| end_image_url | string | No | End frame image URL. Requires start_image_url. Cannot be combined with reference_image_urls |
| reference_image_urls | string[] | No | 1–3 reference image URLs. Fast, Fast-relaxed, Lite, and Lite-relaxed only. Cannot be combined with start_image_url/end_image_url |
| voice | string | No | Voice preset ID. Requires at least 1 reference image. Fast, Fast-relaxed, Lite, and Lite-relaxed only. See voices endpoint |
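
The mode rules for Veo 3.1 generation can be checked client-side before a request is sent. This is an illustrative sketch, not the API's own validation: validate_veo_input is a hypothetical helper, and it treats an omitted duration as invalid in reference mode because the documented default is 4.

```python
# Models listed above as supporting reference mode.
REFERENCE_MODE_MODELS = {
    "google/veo-3.1-fast",
    "google/veo-3.1-fast-relaxed",
    "google/veo-3.1-lite",
    "google/veo-3.1-lite-relaxed",
}


def validate_veo_input(model, params):
    """Return a list of problems found in a Veo 3.1 generate input."""
    errors = []
    refs = params.get("reference_image_urls")
    uses_frames = "start_image_url" in params or "end_image_url" in params
    uses_refs = refs is not None

    if not params.get("prompt"):
        errors.append("prompt is required")
    if params.get("duration", 4) not in (4, 6, 8):
        errors.append("duration must be 4, 6, or 8")
    if params.get("aspect_ratio", "16:9") not in ("16:9", "9:16"):
        errors.append('aspect_ratio must be "16:9" or "9:16"')
    if uses_frames and uses_refs:
        errors.append("frame mode and reference mode cannot be combined")
    if "end_image_url" in params and "start_image_url" not in params:
        errors.append("end_image_url requires start_image_url")
    if uses_refs:
        if model not in REFERENCE_MODE_MODELS:
            errors.append(f"{model} does not support reference mode")
        if not 1 <= len(refs) <= 3:
            errors.append("reference_image_urls needs 1 to 3 URLs")
        # The default duration is 4, so this sketch insists on an explicit 8.
        if params.get("duration") != 8:
            errors.append("duration must be 8 with reference_image_urls")
    if "voice" in params and not refs:
        errors.append("voice requires at least 1 reference image")
    return errors
```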

Extend

Model: google/veo-3.1-extend

Extend a previously generated video. Aspect ratio is inherited from the source task.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt for the extended content |
| task_id | string | Yes | Task ID of a completed generation |
| model | string | Yes | One of: lite, fast, quality, lite-relaxed, fast-relaxed |
| duration | integer | No | Must be 8 (only supported value for extend). Default 8. |
| seed | integer | No | Reproducibility seed |

Upscale

Model: google/veo-3.1-upscale

Upscale a completed video to a higher resolution.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| task_id | string | Yes | Task ID of a completed generation |
| resolution | string | Yes | "1080p" or "4k" |

Minimax Hailuo

Models: hailuo/minimax-2.0, hailuo/minimax-2.3, hailuo/minimax-2.3-fast

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes* | Max 2000 chars. *Required if no start_image_url |
| start_image_url | string | Yes* | Image URL (auto-uploaded). *Required if no prompt (required for 2.3-fast) |
| end_image_url | string | No | End frame image URL (minimax-2.0 only, 768p/1080p) |
| duration | integer | No | 6 or 10 seconds. 1080p only supports 6 |
| resolution | string | No | "768p" (default), "1080p" |
| prompt_optimization | boolean | No | Let MiniMax optimize prompt |

Higgsfield Video

Models: higgsfield-ai/lite, higgsfield-ai/turbo, higgsfield-ai/standard

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text description of the desired video motion/action |
| start_image_url | string | Yes | Starting frame image URL |
| end_image_url | string | No | Ending frame image URL (for guided transitions) |
| enhance_prompt | boolean | No | Let AI enhance your prompt for better results (default false) |
| seed | integer | No | 0-999999 for reproducibility |
| motion_id | string | No | Motion preset ID (UUID) |

Higgsfield Cinematic Studio Video v2.5

Model: higgsfield-ai/cinematic-studio-video-2.5

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Conditional | Required for single and multi_shot_auto shot modes. Supports image refs <<<image_1>>> etc. |
| shot_mode | string | No | single (default), multi_shot_auto, multi_shot_manual |
| multi_shots | object[] | Conditional | Required for multi_shot_manual. Each: prompt, duration, optional camera_movement_id |
| image_urls | string[] | No | Reference image URLs (max 3) |
| start_image_url | string | No | Starting frame image URL |
| end_image_url | string | No | Ending frame URL. Forces generate_audio to false |
| duration | integer | No | 3-12 seconds (default 5) |
| aspect_ratio | string | No | 1:1, 3:4, 2:3, 9:16, 3:2, 4:3, 16:9, 21:9 |
| resolution | string | No | 720p (default) or 1080p |
| genre | string | No | auto, action, horror, comedy, western, suspense, intimate, spectacle |
| camera_movement_id | string | No | Camera movement preset ID |
| generate_audio | boolean | No | Enable AI sound effects (default false) |
| seed | integer | No | 0-999999 for reproducibility |

Higgsfield Cinematic Studio Video v3.0

Model: higgsfield-ai/cinematic-studio-video-3.0

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Conditional | Required when shot_mode is single or multi_shot_auto |
| shot_mode | string | No | single (default), multi_shot_auto, multi_shot_manual |
| multi_shots | object[] | Conditional | Required when shot_mode='multi_shot_manual'. Each item carries its own prompt, duration, camera_motion_id, and speedramp (ramp_up, flash_in, flash_out, hero_moment) |
| image_urls | string[] | No | Up to 3 reference image URLs |
| start_image_url | string | No | Optional first-frame image |
| end_image_url | string | No | Optional last-frame image |
| duration | integer | No | Total video duration in seconds (4-15) |
| aspect_ratio | string | No | auto (default), 1:1, 3:4, 9:16, 4:3, 16:9, 21:9 |
| resolution | string | No | 480p, 720p, 1080p |
| genre | string | No | general (default), action, horror, comedy, noir, epic |
| camera_motion_id | string | No | Top-level camera movement preset id (sent as preset_id) |
| generate_audio | boolean | No | Generate accompanying audio |
| seed | integer | No | Random seed; auto-generated if omitted |

Higgsfield Cinematic Studio Video v3.5

Model: higgsfield-ai/cinematic-studio-video-3.5

Single-shot only. Provide either a free-form style_prompt (which fully overrides the structured style fields) or the structured trio color_palette / lighting / camera_moveset_style.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Required text prompt |
| image_urls | string[] | No | Up to 3 reference image URLs |
| start_image_url | string | No | Optional first-frame image |
| end_image_url | string | No | Optional last-frame image |
| duration | integer | No | Video duration in seconds (4-15) |
| aspect_ratio | string | No | auto (default), 1:1, 3:4, 9:16, 4:3, 16:9, 21:9 |
| resolution | string | No | 480p, 720p, 1080p |
| genre | string | No | general (default), action, horror, comedy, noir, epic, drama |
| generate_audio | boolean | No | Generate accompanying audio |
| seed | integer | No | Random seed; auto-generated if omitted |
| style_prompt | string | No | Free-form style description. Fully overrides color_palette, lighting, and camera_moveset_style when provided |
| color_palette | string | No | Color grading preset (sent as color_grading). Values: auto, naturalistic_clean, bleached_warm, hyper_neon, teal_orange_epic, sodium_decay, cold_steel, bleach_bypass, classic_bw |
| lighting | string | No | Lighting scheme (sent as light_scheme). Values: auto, soft_cross, contre_jour, overhead_fall, window, practicals, silhouette |
| camera_moveset_style | string | No | Camera moveset style (sent as camera_style). Values: auto, classic_static, silent_machine, one_take, epic_scale, … |
| camera_motion_id | string | No | Camera movement preset id (sent as preset_id) |
| camera_model_id | string | No | Camera body preset id (from GET /higgsfield/camera-settings) |
| camera_lens_id | string | No | Camera lens preset id |
| camera_focal_length_id | string | No | Camera focal length preset id |
| camera_aperture_id | string | No | Camera aperture preset id |

Kling v3.0 Omni Video

Model: kuaishou/kling-3.0-omni-video

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| video_mode | string | No | "elements" (default), "start_end_frame", "transform", "video_reference" |
| prompt | string | Conditional | Text prompt. Mutually exclusive with multi_shots |
| mode | string | No | "pro" (default). "std" (720p) or "pro" (1080p) |
| duration | integer | No | 3–15 seconds (default 5) |
| aspect_ratio | string | No | "16:9" (default), "9:16", "1:1", "auto" (start_end_frame only) |
| native_audio | boolean | No | Generate AI audio (default false) |
| keep_audio | boolean | No | Preserve audio from source video (default true) |
| image_urls | string[] | No | Up to 7 reference image URLs. Use @Image1, @Image2 in prompt |
| start_frame_url | string | No | First frame image URL (start_end_frame mode) |
| end_frame_url | string | No | Last frame image URL (start_end_frame mode) |
| video_url | string | No | Source video URL (transform/video_reference modes) |
| multi_shots | array | No | 2–6 shots, each { "prompt": string, "duration": int }. Mutually exclusive with prompt |
| elements | array | No | Character/object elements (IMAGE + VIDEO) |
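
Because prompt and multi_shots are mutually exclusive, it can help to funnel both through one helper that emits exactly one of them. The sketch below is illustrative: kling_prompt_fields is a hypothetical name, and it assumes that exactly one of the two fields must be present, which the "Conditional" marker suggests but this page does not state outright.

```python
def kling_prompt_fields(prompt=None, multi_shots=None):
    """Return either {"prompt": ...} or {"multi_shots": [...]}, never both."""
    if (prompt is None) == (multi_shots is None):
        raise ValueError("provide exactly one of prompt or multi_shots")
    if prompt is not None:
        return {"prompt": prompt}
    if not 2 <= len(multi_shots) <= 6:
        raise ValueError("multi_shots needs 2 to 6 shots")
    shots = []
    for shot in multi_shots:
        text, seconds = shot.get("prompt"), shot.get("duration")
        if not isinstance(text, str) or not isinstance(seconds, int):
            raise ValueError("each shot needs a string prompt and an int duration")
        # Copy only the two documented keys into the outgoing payload.
        shots.append({"prompt": text, "duration": seconds})
    return {"multi_shots": shots}
```

The returned dictionary can be merged into the rest of the input object, for example `{"video_mode": "elements", **kling_prompt_fields(prompt="...")}`.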

Kling O1 Video

Model: kuaishou/kling-o1-video

Same parameters as Omni 3.0 but does not support multi_shots or native_audio. Max duration 10s.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| video_mode | string | No | "elements" (default), "start_end_frame", "transform", "video_reference" |
| prompt | string | Yes | Text prompt |
| mode | string | No | "pro" (default). "std" (720p) or "pro" (1080p) |
| duration | integer | No | 3–10 seconds (default 5) |
| aspect_ratio | string | No | "16:9" (default), "9:16", "1:1", "auto" (start_end_frame only) |
| keep_audio | boolean | No | Preserve audio from source video (default true) |
| image_urls | string[] | No | Up to 7 reference image URLs. Use @Image1, @Image2 in prompt |
| start_frame_url | string | No | First frame image URL (start_end_frame mode) |
| end_frame_url | string | No | Last frame image URL (start_end_frame mode) |
| video_url | string | No | Source video URL (transform/video_reference modes) |

Kling v3.0 Omni Video Edit

Model: kuaishou/kling-3.0-omni-video-edit

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| video_url | string | Yes | Source video URL to edit |
| prompt | string | Yes | Text prompt describing the edit |
| video_mode | string | No | "reference" (default) or "transform" |
| keep_audio | boolean | No | Preserve original audio (default false) |
| mode | string | No | "std" (default) or "pro" |
| aspect_ratio | string | No | "16:9" (default), "9:16", "1:1" |
| image_urls | string[] | No | Up to 4 reference image URLs. Use @Image1, @Image2 in prompt |
| elements | array | No | Up to 4 character/object elements |

Duration is locked to the input video length.

Kling O1 Video Edit

Model: kuaishou/kling-o1-video-edit

Same parameters as Omni 3.0 video edit but does not support elements.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| video_url | string | Yes | Source video URL to edit |
| prompt | string | Yes | Text prompt describing the edit |
| video_mode | string | No | "reference" (default) or "transform" |
| keep_audio | boolean | No | Preserve original audio (default false) |
| mode | string | No | "std" (default) or "pro" |
| aspect_ratio | string | No | "16:9" (default), "9:16", "1:1" |
| image_urls | string[] | No | Up to 4 reference image URLs. Use @Image1, @Image2 in prompt |

Kling v3.0 Video

Model: kuaishou/kling-3.0-video

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Conditional | Text prompt. Mutually exclusive with multi_shots |
| mode | string | No | "pro" (default). "std" (720p) or "pro" (1080p) |
| duration | integer | No | 3–15 seconds (default 5) |
| aspect_ratio | string | No | "16:9" (default), "9:16", "1:1" |
| native_audio | boolean | No | Generate AI audio (default true) |
| start_frame_url | string | Yes | First frame image URL |
| end_frame_url | string | No | Last frame image URL |
| elements | array | No | Character/object elements |
| multi_shots | array | No | 2–6 shots, each { "prompt": string, "duration": int }. Mutually exclusive with prompt |

Kling v2.6 Video

Model: kuaishou/kling-2.6-video

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt |
| mode | string | No | "pro" (default). "std" (720p) or "pro" (1080p) |
| duration | integer | No | 5 or 10 seconds |
| native_audio | boolean | No | Enable AI audio generation (default false). Requires pro mode |
| start_frame_url | string | Yes | First frame image URL |
| end_frame_url | string | No | Last frame image URL (not available with native_audio) |
| voices | array | No | Voice references (max 5, requires native_audio). Each: { "voice_id": int } or { "voice_url": string } |

Kling v2.5 Turbo Video

Model: kuaishou/kling-2.5-turbo-video

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt |
| mode | string | No | "pro" (default). "std" (720p) or "pro" (1080p) |
| duration | integer | No | 5 or 10 seconds |
| aspect_ratio | string | No | "16:9" (default), "9:16", "1:1". Ignored when start_frame_url is set |
| start_frame_url | string | No | First frame image URL |
| end_frame_url | string | No | Last frame image URL |
| sound_effects | object | No | { "sound": string, "music": string, "asmr_mode": boolean }. Omit to disable audio |

Kling v2.1 Video

Model: kuaishou/kling-2.1-video

Image-to-video only.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt |
| start_frame_url | string | Yes | First frame image URL |
| end_frame_url | string | No | Last frame image URL |
| duration | integer | No | 5 or 10 seconds |
| mode | string | No | "pro" (default). "std" or "pro" |
| sound_effects | object | No | { "sound": string, "music": string, "asmr_mode": boolean }. Omit to disable audio |

Kling v2.1 Master Video

Model: kuaishou/kling-2.1-master-video

Pro-only. No end frame support.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt |
| duration | integer | No | 5 or 10 seconds |
| start_frame_url | string | No | First frame image URL (optional) |
| sound_effects | object | No | { "sound": string, "music": string, "asmr_mode": boolean }. Omit to disable audio |

Kling v3.0 Motion Control

Model: kuaishou/kling-3.0-motion-control

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt describing the motion |
| image_url | string | Yes | Character/subject image URL |
| video_url | string | Yes | Motion reference video URL |
| mode | string | No | "std" (default) or "pro" |
| keep_audio | boolean | No | Preserve audio from motion video (default true) |
| character_orientation | string | No | "video" (default) or "image" |
| elements | array | No | Additional character/object elements |

Kling v2.6 Motion Control

Model: kuaishou/kling-2.6-motion-control

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt describing the motion |
| image_url | string | Yes | Character/subject image URL |
| video_url | string | Yes | Motion reference video URL |
| mode | string | No | "std" (default) or "pro" |
| keep_audio | boolean | No | Preserve audio from motion video (default true) |
| character_orientation | string | No | "video" (default) or "image" |

Grok Imagine Video Extend

Model: xai/grok-imagine-video-extend

Extend a previously generated video via HTTP streaming. Two mutually exclusive modes:

| Mode | How to activate | Behaviour |
| --- | --- | --- |
| Preset | Provide video_preset | The preset controls the video style; prompt, extend_at, extend_duration are ignored |
| Custom | Omit video_preset | You control timing and prompt; prompt, extend_at, extend_duration are required |

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| task_id | string | Yes | Task ID of a completed video generation |
| video_preset | string | No | "spicy" or "normal". Enables preset mode |
| prompt | string | No | Text prompt to guide the extension. Required in custom mode |
| extend_at | float | No | Second to start the extension from. Required in custom mode |
| extend_duration | int | No | 6 or 10 seconds. Required in custom mode |
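
The two modes can be encoded in a small builder so that preset requests never carry ignored fields and custom requests never omit required ones. This is a sketch under the rules in the tables above; build_grok_extend_input is a hypothetical helper, not part of the API.

```python
def build_grok_extend_input(task_id, video_preset=None, prompt=None,
                            extend_at=None, extend_duration=None):
    """Build the input object for preset mode or custom mode."""
    if video_preset is not None:
        if video_preset not in ("spicy", "normal"):
            raise ValueError('video_preset must be "spicy" or "normal"')
        # Preset mode ignores prompt, extend_at, and extend_duration, so omit them.
        return {"task_id": task_id, "video_preset": video_preset}

    required = {"prompt": prompt, "extend_at": extend_at,
                "extend_duration": extend_duration}
    missing = [name for name, value in required.items() if value is None]
    if missing:
        raise ValueError(f"custom mode requires: {', '.join(missing)}")
    if extend_duration not in (6, 10):
        raise ValueError("extend_duration must be 6 or 10")
    return {"task_id": task_id, **required}
```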

GPT Image

Models: openai/gpt-image-2

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text description |
| image_urls | array | No | Reference image URLs for image editing mode |
| aspect_ratio | string | No | 1:1, 3:2, 2:3, 16:9. Default: 1:1 |
| resolution | string | No | 1K, 2K, 4K. Default: 1K |

Nano Banana

Models: google/nano-banana, google/nano-banana-pro

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text description |
| aspect_ratio | string | Yes | 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9 |
| image_urls | array | No | Reference images |
| resolution | string | No | Pro only: 1k, 2k, 4k |

Flux.2

Models: black-forest-labs/flux.2-pro, black-forest-labs/flux.2-flex, black-forest-labs/flux.2-max

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text description |
| image_urls | array | No | Reference images (Pro/Max: 8, Flex: 10) |
| aspect_ratio | string | No | auto, 1:1, 4:3, 16:9, 3:2, 2:3, 9:16, 3:4 (Max also: 5:4, 21:9) |
| quality | string | No | 1K or 2K |
| steps | integer | No | Flex only: 1-50 (more = higher quality) |
| cfg | number | No | Flex only: 1.5-10 (higher = follows prompt more strictly) |

Higgsfield Cinematic Studio Image

Model: higgsfield-ai/cinematic-studio-image

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text description |
| image_urls | string[] | No | Reference image URLs (max 4) |
| aspect_ratio | string | No | 1:1, 3:4, 2:3, 9:16, 3:2, 4:3, 16:9, 21:9 (default: 16:9) |
| resolution | string | No | 1k, 2k, 4k (default: 1k) |
| seed | integer | No | 0-999999 for reproducibility |
| camera_model_id | string | No | Camera body ID |
| camera_lens_id | string | No | Lens ID |
| camera_aperture_id | string | No | Aperture ID |
| camera_focal_length_id | string | No | Focal length ID |

Qwen Image 2.0 Pro

Model: alibaba/qwen-image-2.0-pro ($0.0525/image)

Best quality. Text rendering, realistic textures. Automatically switches between T2I and editing based on whether image_urls is provided.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt (max 800 chars) |
| aspect_ratio | string | No | 1:1 (default), 16:9, 9:16, 4:3, 3:4 |
| image_urls | string[] | No | Omit for T2I. Provide image URLs for editing |
| negative_prompt | string | No | What to avoid (max 500 chars) |
| prompt_extend | boolean | No | Smart prompt rewriting (default true) |
| seed | integer | No | Seed for reproducibility |

Qwen Image 2.0

Model: alibaba/qwen-image-2.0 ($0.0245/image)

Faster version of 2.0 Pro. Same capabilities and parameters.

Qwen Image Max

Model: alibaba/qwen-image-max (T2I $0.0525/image, Edit $0.0525/image)

Highest realism, fewest AI artifacts. Editing uses a specialized edit model under the hood (industrial design, geometric reasoning, character consistency). Same parameters as Qwen Image 2.0 Pro.

Qwen Image Plus

Model: alibaba/qwen-image-plus (T2I $0.021/image, Edit $0.021/image)

Diverse artistic styles, fast. Editing uses a specialized edit model under the hood. Same parameters as Qwen Image 2.0 Pro.

Qwen Image

Model: alibaba/qwen-image (T2I $0.0245/image, Edit $0.0315/image)

Older base model. Editing uses a specialized edit model under the hood. Same parameters as Qwen Image 2.0 Pro.

Z-Image Turbo

Model: alibaba/z-image-turbo ($0.0105/image, or $0.021 with prompt rewriting)

Lightweight, fast, T2I only. Chinese and English text rendering.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt (max 800 chars) |
| aspect_ratio | string | No | 1:1 (default), 2:3, 3:2, 3:4, 4:3, 9:16, 16:9 |
| prompt_extend | boolean | No | Prompt rewriting (default false, doubles cost) |
| seed | integer | No | Seed for reproducibility |
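
Since prompt_extend doubles the per-image price, a rough local estimate is simple arithmetic. The sketch below reads the listed base price as $0.0105 per image and uses Decimal to avoid floating-point drift; z_image_turbo_cost is a hypothetical helper, and a dry_run request remains the authoritative way to price a task.

```python
from decimal import Decimal

# Base price per image as listed above (assumed to be in US dollars).
BASE_PRICE = Decimal("0.0105")


def z_image_turbo_cost(image_count, prompt_extend=False):
    """Estimate the cost of image_count Z-Image Turbo generations."""
    # Prompt rewriting doubles the per-image price.
    per_image = BASE_PRICE * 2 if prompt_extend else BASE_PRICE
    return per_image * image_count
```

For example, 10 images cost 10 × 0.0105 = 0.105 without prompt rewriting and 10 × 0.021 = 0.21 with it.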

Wan 2.7 Pro Image

Model: alibaba/wan-2.7-pro-image ($0.0525/image)

Highest quality. Thinking mode for T2I. Supports editing with up to 9 images. Up to 4K resolution for T2I.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt (max 5000 chars) |
| aspect_ratio | string | No | 1:1 (default), 16:9, 9:16, 4:3, 3:4, 3:2, 2:3. Editing preserves input ratio |
| image_urls | string[] | No | Omit for T2I. Up to 9 images for editing |
| thinking_mode | boolean | No | Better quality, slower (default true). T2I only |
| seed | integer | No | Seed for reproducibility |

Wan 2.7 Image

Model: alibaba/wan-2.7-image ($0.021/image)

Faster variant of 2.7 Pro. Same capabilities, max 2K resolution. Same parameters as Wan 2.7 Pro Image.

Wan 2.6 Image

Model: alibaba/wan-2.6-image ($0.021/image)

Automatically selects T2I or editing mode based on image_urls. Supports style transfer with 1–4 reference images.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt (max 2000 chars) |
| aspect_ratio | string | No | 1:1 (default), 2:3, 3:2, 3:4, 4:3, 9:16, 16:9 |
| image_urls | string[] | No | Omit for T2I. 1–4 images for editing/style transfer |
| negative_prompt | string | No | What to avoid (max 500 chars) |
| prompt_extend | boolean | No | Smart prompt rewriting (default true) |
| seed | integer | No | Seed for reproducibility |

Wan 2.5 Image

Model: alibaba/wan-2.5-image ($0.021/image)

Automatically selects T2I or editing mode based on image_urls. Supports 1–3 reference images. Same parameters as Wan 2.6 Image.

Wan 2.2 Image

Model: alibaba/wan-2.2-image ($0.035/image)

T2I only. Does not accept image_urls.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text prompt (max 500 chars) |
| aspect_ratio | string | No | 1:1 (default), 3:4, 4:3, 9:16, 16:9 |
| negative_prompt | string | No | What to avoid |
| seed | integer | No | Seed for reproducibility |

Wan 2.2 Flash Image

Model: alibaba/wan-2.2-flash-image ($0.0175/image)

Fast T2I only. Cheapest Wan image model. Same parameters as Wan 2.2 Image.

Grok Imagine Image

Model: xai/grok-imagine-image (Pro mode: $0.025/image)

Generate and edit images using xAI’s Grok Imagine model. When image_urls is provided, the model runs in edit mode.

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Text description or edit instruction |
| aspect_ratio | string | No | "1:1" (default), "2:3", "3:2", "9:16", "16:9" |
| image_urls | string[] | No | 1–5 reference image URLs (triggers edit mode) |
| enable_pro | boolean | No | Enable pro mode for higher quality results |
| upsample_prompt | boolean | No | Let AI enhance your prompt for better results |
| enable_nsfw | boolean | No | Enable NSFW content generation |

Suno Music

Model: suno-ai/music

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| mv | string | Yes | Model version: chirp-v3-5, chirp-v4, chirp-auk, chirp-bluejay, chirp-crow |
| custom | boolean | Yes | false for simple mode, true for custom mode |
| gpt_description_prompt | string | No | Simple mode: song description with lyrics |
| prompt | string | No | Custom mode: detailed lyrics/prompt |
| tags | string | No | Custom mode: genre/style tags |
| title | string | No | Song title |
| make_instrumental | boolean | No | Generate instrumental only |
| negative_tags | string | No | Custom mode: styles to avoid |
| persona_id | string | No | Custom voice ID from Suno voice creation; music uses that voice for vocals |
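
The custom flag decides which text field the model reads, so it can be clearer to build the two modes with separate helpers. The following sketch is illustrative only: both function names are hypothetical, and it assumes simple mode is driven by gpt_description_prompt and custom mode by prompt, as the descriptions above suggest.

```python
def suno_simple_input(mv, description, make_instrumental=False):
    """Simple mode: custom=false with a free-text song description."""
    return {
        "mv": mv,
        "custom": False,
        "gpt_description_prompt": description,
        "make_instrumental": make_instrumental,
    }


def suno_custom_input(mv, lyrics, tags=None, title=None, negative_tags=None):
    """Custom mode: custom=true with explicit lyrics and optional style tags."""
    params = {"mv": mv, "custom": True, "prompt": lyrics}
    optional = {"tags": tags, "title": title, "negative_tags": negative_tags}
    # Leave out optional fields that were not supplied.
    params.update({k: v for k, v in optional.items() if v is not None})
    return params
```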

Suno Audio Operations

Models: suno-ai/add-vocals, suno-ai/add-instrumental, suno-ai/extend, suno-ai/cover

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| mv | string | Yes | Model version |
| clip_id | string | Yes* | Existing clip ID |
| audio_url | string | Yes* | Audio file URL (alternative to clip_id) |
| custom | boolean | Yes | Simple or custom mode |
| gpt_description_prompt | string | No | Simple mode description |
| prompt | string | No | Custom mode prompt |
| continue_at | number | No | Extend: time in seconds to continue from |
| start_s | number | No | Start time for overlay |
| end_s | number | No | End time for overlay |

*Provide either clip_id or audio_url.

Suno Stems

Models: suno-ai/stems, suno-ai/stems-all

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| clip_id | string | Yes | Clip ID to extract stems from |
| title | string | No | Title for extraction |

Suno Lyrics

Model: suno-ai/lyrics

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| prompt | string | Yes | Description of lyrics to generate |
| mv | string | Yes | Lyrics model: remi-v1 or default |

Higgsfield TTS

Model: higgsfield-ai/text-to-speech

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| voice_id | string | Yes | Voice ID |
| prompt | string | Yes | Text to convert to speech |
| sound_id | string | No | Background sound ID |
| similarity_boost | integer | No | 0-100 (default 90) |
| style | integer | No | 0-100 (default 60) |
| speed | number | No | 0-1.2 (default 1.1) |
| stability | integer | No | 0-100 (default 30) |

Rate Limit Error Response

{
  "success": false,
  "code": 429,
  "data": {
    "message": "You have been ratelimited, this temporary restriction will be lifted in: 45 seconds"
  }
}
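
A client can back off for the duration named in the message. The sketch below parses the wait time out of the text shown above; it assumes the wording stays as in this example, so it falls back to a caller-chosen default when the pattern does not match. retry_after_seconds is a hypothetical helper.

```python
import re

# Matches the "... lifted in: 45 seconds" wording from the example above.
_RETRY_PATTERN = re.compile(r"lifted in:\s*(\d+)\s*seconds")


def retry_after_seconds(envelope, default=60):
    """Return how long to wait after a 429, or None if not rate limited."""
    if envelope.get("code") != 429:
        return None
    message = (envelope.get("data") or {}).get("message", "")
    match = _RETRY_PATTERN.search(message)
    # Fall back to the default if the message format differs.
    return int(match.group(1)) if match else default
```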

Error Responses

| Code | Description |
| --- | --- |
| 400 | Bad Request - Invalid parameters |
| 401 | Unauthorized - Invalid or missing API key |
| 402 | Payment Required - Insufficient balance |
| 404 | Not Found - Task or model not found |
| 429 | Too Many Requests - Rate limited |
| 500 | Internal Server Error |

Error Response Format

{
  "success": false,
  "code": 400,
  "data": {
    "message": "Description of the error",
    "request_id": "abc123"
  }
}
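
Because success and error responses share one envelope, a single helper can unwrap both. This is a minimal sketch assuming the format shown above; TaskAPIError and unwrap are illustrative names rather than part of any official SDK.

```python
class TaskAPIError(Exception):
    """Raised when the response envelope reports success=false."""

    def __init__(self, code, message, request_id=None):
        super().__init__(f"[{code}] {message}")
        self.code = code
        self.message = message
        self.request_id = request_id


def unwrap(envelope):
    """Return envelope["data"] on success, otherwise raise TaskAPIError."""
    if envelope.get("success"):
        return envelope["data"]
    data = envelope.get("data") or {}
    raise TaskAPIError(
        envelope.get("code"),
        data.get("message", "unknown error"),
        data.get("request_id"),
    )
```

Keeping request_id on the exception makes it easy to quote when reporting a failed request.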