AI models

Models & pricing

BuilderStudio gives every workspace one bill for the best generative models across text, image, video, and audio. Pay only for what you use — no per-model subscriptions.

Vendors at a glance

Every model family available in BuilderStudio, and what it covers.

AI vendors available in BuilderStudio, with capabilities and model counts

| Vendor | Capabilities | Models |
| --- | --- | --- |
| Gemini | Text generation, Image generation | 5 |
| Grok | Text generation, Image generation, Video generation | 5 |
| Anthropic Claude (via AI Gateway) | Text generation | 2 |
| OpenAI (via AI Gateway) | Text generation | 2 |
| FLUX (via Fal.ai) | Image generation | 2 |
| Seedream (via Fal.ai) | Image generation | 1 |
| Recraft (via Fal.ai) | Image generation | 1 |
| Veo | Video generation | 2 |
| Kling (via Fal.ai) | Video generation | 1 |
| Seedance (via Fal.ai) | Video generation | 5 |
| Wan (via Fal.ai) | Video generation | 3 |
| ElevenLabs | Text to speech, Voice changer, Sound effects, Voice design | 7 |

Text & reasoning

Language models available in text-generation and agent nodes, billed per token.

Text & reasoning models offered in BuilderStudio, with vendor, adjustable parameters, billing unit, and pricing

| Model | Description | Model ID | Vendor | Parameters | Billed per | Pricing |
| --- | --- | --- | --- | --- | --- | --- |
| Gemini 3.5 Flash | Stable frontier Gemini Flash model for agentic text work | gemini-3.5-flash | Gemini | Temperature: 0–2; Max output tokens: up to 8,192; Reasoning effort: minimal, low, medium, high | tokens | Usage-based, metered per token |
| Gemini 3.1 Pro | Latest high-capability Gemini model | gemini-3.1-pro-preview | Gemini | Temperature: 0–2; Max output tokens: up to 8,192; Reasoning effort: low, medium, high | tokens | Usage-based, metered per token |
| Gemini 3.1 Pro Custom Tools | Gemini 3.1 Pro variant for complex tool use | gemini-3.1-pro-preview-customtools | Gemini | Temperature: 0–2; Max output tokens: up to 8,192; Reasoning effort: low, medium, high | tokens | Usage-based, metered per token |
| Grok 4.3 | xAI flagship model for agentic text and reasoning | grok-4.3 | Grok | Temperature: 0–2; Max output tokens: up to 128,000; Reasoning effort: none, low, medium, high | tokens | Input: $1.25 / 1M tokens; Cached input: $0.20 / 1M tokens; Output: $2.50 / 1M tokens |
| Grok Build 0.1 | xAI coding model trained for agentic build workflows | grok-build-0.1 | Grok | Temperature: 0–2; Max output tokens: up to 128,000 | tokens | Usage-based, metered per token |
| Claude Opus 4.8 | Anthropic Claude Opus, best for complex workflows, via Vercel AI Gateway | anthropic/claude-opus-4.8 | Anthropic Claude (via AI Gateway) | Temperature: 0–2; Max output tokens: up to 32,768 | tokens | Usage-based, billed at the provider's metered rate |
| Claude Sonnet 4.6 | Anthropic Claude Sonnet, fast and capable, via Vercel AI Gateway | anthropic/claude-sonnet-4.6 | Anthropic Claude (via AI Gateway) | Temperature: 0–2; Max output tokens: up to 32,768 | tokens | Usage-based, billed at the provider's metered rate |
| GPT-5.5 | OpenAI model via Vercel AI Gateway | openai/gpt-5.5 | OpenAI (via AI Gateway) | Temperature: 0–2; Max output tokens: up to 128,000; Reasoning effort: none, low, medium, high, xhigh; Verbosity: low, medium, high | tokens | Usage-based, billed at the provider's metered rate |
| GPT-5.5 Pro | OpenAI model via Vercel AI Gateway | openai/gpt-5.5-pro | OpenAI (via AI Gateway) | Temperature: 0–2; Max output tokens: up to 128,000; Reasoning effort: none, low, medium, high, xhigh; Verbosity: low, medium, high | tokens | Usage-based, billed at the provider's metered rate |
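
As a rough illustration of per-token billing, the sketch below estimates the cost of a single Grok 4.3 request from the fixed list prices shown for that model. The function name, the rate dictionary, and the assumption that cached input tokens are counted separately from regular input tokens are all illustrative; none of them is part of a BuilderStudio API, and the actual invoice may be computed differently.

```python
from decimal import Decimal

# Grok 4.3 list prices from the pricing table, in USD per 1M tokens.
GROK_4_3_RATES = {
    "input": Decimal("1.25"),
    "cached_input": Decimal("0.20"),
    "output": Decimal("2.50"),
}

TOKENS_PER_PRICING_UNIT = Decimal(1_000_000)


def estimate_token_cost(input_tokens, cached_input_tokens, output_tokens,
                        rates=GROK_4_3_RATES):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_input_tokens are NOT also included in input_tokens.
    """
    weighted_tokens = (
        Decimal(input_tokens) * rates["input"]
        + Decimal(cached_input_tokens) * rates["cached_input"]
        + Decimal(output_tokens) * rates["output"]
    )
    return weighted_tokens / TOKENS_PER_PRICING_UNIT


# 200K fresh input tokens, 800K cached input tokens, 50K output tokens.
print(estimate_token_cost(200_000, 800_000, 50_000))
```

Under these assumptions the example request costs 0.25 + 0.16 + 0.125 = 0.535 USD. `Decimal` is used instead of floats so that small per-token amounts do not accumulate rounding error.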

Image generation

Text-to-image and image-editing models, billed per generated image.

Image generation models offered in BuilderStudio, with vendor, adjustable parameters, billing unit, and pricing

| Model | Description | Model ID | Vendor | Parameters | Billed per | Pricing |
| --- | --- | --- | --- | --- | --- | --- |
| Nano Banana Pro | Highest-quality image generation (Gemini 3 Pro) | gemini-3-pro-image | Gemini | Aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1; Resolution: 1K, 2K, 4K | images | Usage-based, metered per image |
| Nano Banana | Speed-optimized image generation (Gemini 2.5 Flash) | gemini-2.5-flash-image | Gemini | Aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1 | images | Usage-based, metered per image |
| Grok Imagine Quality | xAI Grok Imagine image generation and editing | grok-imagine-image-quality | Grok | Aspect ratio: 1:1, 3:4, 4:3, 9:16, 16:9, 2:3, 3:2, 9:19.5, 19.5:9, 9:20, 20:9, 1:2, 2:1, auto; Resolution: 1K, 2K | images | $0.05 / image |
| FLUX.2 Dev | High-quality text-to-image | fal-ai/flux-2 | FLUX (via Fal.ai) | Aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1; Guidance scale: 1–20; Steps: 1–50; Seed: -1 to 2,147,483,647 | images | Usage-based, metered at Fal.ai's live per-request rate |
| FLUX.2 Pro | High-quality generation, balanced speed | fal-ai/flux-2-pro | FLUX (via Fal.ai) | Aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1; Seed: -1 to 2,147,483,647 | images | Usage-based, metered at Fal.ai's live per-request rate |
| Seedream 4.5 | Unified high-quality image generation | fal-ai/bytedance/seedream/v4.5/text-to-image | Seedream (via Fal.ai) | Aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1; Seed: -1 to 2,147,483,647 | images | Usage-based, metered at Fal.ai's live per-request rate |
| Recraft V4.1 | Design-focused image generation | fal-ai/recraft/v4.1/text-to-image | Recraft (via Fal.ai) | Aspect ratio: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1 | images | Usage-based, metered at Fal.ai's live per-request rate |

Video generation

Text-, image-, and reference-to-video models, billed per second of output.

Video generation models offered in BuilderStudio, with vendor, adjustable parameters, billing unit, and pricing

| Model | Description | Model ID | Vendor | Parameters | Billed per | Pricing |
| --- | --- | --- | --- | --- | --- | --- |
| Veo 3.1 | Highest-quality Veo 3.1 video generation | veo-3.1-generate-preview | Veo | Aspect ratio: 16:9, 9:16; Duration: 4, 6, 8; Resolution: 720p, 1080p, 4k | seconds | Usage-based, metered per second |
| Veo 3.1 Fast | Lower-latency Veo 3.1 video generation | veo-3.1-fast-generate-preview | Veo | Aspect ratio: 16:9, 9:16; Duration: 4, 6, 8; Resolution: 720p, 1080p, 4k | seconds | Usage-based, metered per second |
| Grok Imagine Video | xAI text-to-video and image-to-video generation | grok-imagine-video | Grok | Aspect ratio: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3; Duration: 1–15; Resolution: 480p, 720p, 1080p | seconds | Output: $0.05 / second |
| Grok Imagine Video 1.5 Preview | xAI image-to-video preview model | grok-imagine-video-1.5-preview | Grok | Aspect ratio: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3; Duration: 1–15; Resolution: 480p, 720p, 1080p | seconds | Output 480p: $0.08 / second; Output 720p: $0.14 / second; Input: $0.01 / image |
| Kling 3.0 Pro | Premium image-to-video generation | fal-ai/kling-video/v3/pro/image-to-video | Kling (via Fal.ai) | Duration: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Generate audio: on / off; Negative prompt; CFG scale | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Seedance 2.0 Image to Video | Seedance 2.0 image-to-video generation | bytedance/seedance-2.0/image-to-video | Seedance (via Fal.ai) | Aspect ratio: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16; Resolution: 480p, 720p, 1080p; Duration: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Generate audio: on / off; Seed | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Seedance 2.0 Reference to Video | Seedance 2.0 multimodal reference-to-video generation | bytedance/seedance-2.0/reference-to-video | Seedance (via Fal.ai) | Aspect ratio: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16; Resolution: 480p, 720p, 1080p; Duration: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Generate audio: on / off; Seed | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Seedance 2.0 Fast Text to Video | Lower-latency Seedance 2.0 text-to-video generation | bytedance/seedance-2.0/fast/text-to-video | Seedance (via Fal.ai) | Aspect ratio: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16; Resolution: 480p, 720p; Duration: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Generate audio: on / off; Seed | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Seedance 2.0 Fast Image to Video | Lower-latency Seedance 2.0 image-to-video generation | bytedance/seedance-2.0/fast/image-to-video | Seedance (via Fal.ai) | Aspect ratio: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16; Resolution: 480p, 720p; Duration: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Generate audio: on / off; Seed | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Seedance 2.0 Fast Reference to Video | Lower-latency Seedance 2.0 multimodal reference-to-video generation | bytedance/seedance-2.0/fast/reference-to-video | Seedance (via Fal.ai) | Aspect ratio: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16; Resolution: 480p, 720p; Duration: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Generate audio: on / off; Seed | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Wan 2.7 Text to Video | Wan 2.7 text-to-video generation | fal-ai/wan/v2.7/text-to-video | Wan (via Fal.ai) | Aspect ratio: 16:9, 9:16, 1:1, 4:3, 3:4; Duration: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Negative prompt; Seed; Resolution: 720p, 1080p; Safety checker: on / off; Prompt expansion: on / off | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Wan 2.7 Image to Video | Wan 2.7 image-to-video generation | fal-ai/wan/v2.7/image-to-video | Wan (via Fal.ai) | Duration: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15; Negative prompt; Seed; Resolution: 720p, 1080p; Safety checker: on / off; Prompt expansion: on / off | seconds | Usage-based, metered at Fal.ai's live per-request rate |
| Wan 2.7 Reference to Video | Wan 2.7 image and video reference-to-video generation | fal-ai/wan/v2.7/reference-to-video | Wan (via Fal.ai) | Aspect ratio: 16:9, 9:16, 1:1, 4:3, 3:4; Duration: 2, 3, 4, 5, 6, 7, 8, 9, 10; Negative prompt; Multi-shot segmentation: on / off; Seed; Resolution: 720p, 1080p; Safety checker: on / off | seconds | Usage-based, metered at Fal.ai's live per-request rate |
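
To make per-second billing concrete, here is a minimal sketch that estimates the cost of one Grok Imagine Video 1.5 Preview clip from the fixed rates listed for that model: a per-second output rate that depends on resolution, plus a per-image input charge. The helper name and its signature are hypothetical. No 1080p rate is listed for this model, so the sketch refuses to guess one, and it assumes durations are whole seconds within the listed 1–15 range.

```python
from decimal import Decimal

# Grok Imagine Video 1.5 Preview list prices from the pricing table (USD).
PREVIEW_OUTPUT_RATE_PER_SECOND = {
    "480p": Decimal("0.08"),
    "720p": Decimal("0.14"),
}
PREVIEW_INPUT_RATE_PER_IMAGE = Decimal("0.01")


def estimate_preview_video_cost(duration_seconds, resolution, input_images=1):
    """Estimate the USD cost of one clip (hypothetical helper)."""
    # The table lists a duration range of 1-15 for this model.
    if not 1 <= duration_seconds <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    # Only resolutions with a published fixed rate can be estimated.
    if resolution not in PREVIEW_OUTPUT_RATE_PER_SECOND:
        raise ValueError(f"no listed per-second rate for {resolution}")
    output_cost = duration_seconds * PREVIEW_OUTPUT_RATE_PER_SECOND[resolution]
    input_cost = input_images * PREVIEW_INPUT_RATE_PER_IMAGE
    return output_cost + input_cost


# A 10-second 720p clip generated from one input image.
print(estimate_preview_video_cost(10, "720p"))
```

For the example call, the output charge is 10 × 0.14 = 1.40 USD and the input charge is 0.01 USD, for 1.41 USD in total. Models marked as usage-based have no fixed rate on this page, so they cannot be estimated this way.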

Audio & voice

Text-to-speech, voice conversion, sound effects, and voice design, powered by ElevenLabs.

Audio & voice models offered in BuilderStudio, with vendor, adjustable parameters, billing unit, and pricing

| Model | Description | Model ID | Vendor | Capability | Parameters | Billed per | Pricing |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Multilingual V2 | Best quality, 29 languages | eleven_multilingual_v2 | ElevenLabs | Text to speech | Voice: any workspace ElevenLabs voice; Stability: 0–1; Similarity boost: 0–1; Style: 0–1; Speed: 0.7–1.2; Speaker boost: on / off; Seed: 0–4,294,967,295 | characters | $0.10 / 1K characters |
| Flash V2.5 | Ultra-low latency (~75 ms) | eleven_flash_v2_5 | ElevenLabs | Text to speech | Voice: any workspace ElevenLabs voice; Stability: 0–1; Similarity boost: 0–1; Style: 0–1; Speed: 0.7–1.2; Speaker boost: on / off; Seed: 0–4,294,967,295 | characters | $0.05 / 1K characters |
| Eleven V3 | Most expressive, 70+ languages | eleven_v3 | ElevenLabs | Text to speech | Voice: any workspace ElevenLabs voice; Stability: 0–1; Similarity boost: 0–1; Style: 0–1; Speed: 0.7–1.2; Speaker boost: on / off; Seed: 0–4,294,967,295 | characters | $0.10 / 1K characters |
| Multilingual STS V2 | Speech-to-speech voice conversion | eleven_multilingual_sts_v2 | ElevenLabs | Voice changer | Voice: any workspace ElevenLabs voice; Stability: 0–1; Similarity boost: 0–1; Style: 0–1; Speed: 0.7–1.2; Speaker boost: on / off; Seed: 0–4,294,967,295; Remove background noise: on / off | characters | Usage-based, metered per character |
| Sound Effects V2 | Text-to-sound-effects generation | eleven_text_to_sound_v2 | ElevenLabs | Sound effects | Duration: 0.5–30 seconds; Loop: on / off; Prompt influence: 0–1 | generations | Usage-based, metered per generation |
| Voice Design Multilingual V2 | Create custom voices from a text prompt | eleven_multilingual_ttv_v2 | ElevenLabs | Voice design | Voice description: 20–1,000 characters | generations | Usage-based, metered per generation |
| Voice Design V3 | Most expressive voice design model | eleven_ttv_v3 | ElevenLabs | Voice design | Voice description: 20–1,000 characters | generations | Usage-based, metered per generation |

ElevenLabs usage is metered in characters or generations; the effective rate can vary with the connected ElevenLabs plan.
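
For the three text-to-speech models with a fixed per-character price, a rough estimate can be sketched as below. The helper name is hypothetical, and the sketch assumes a "character" is one Unicode code point as counted by Python's `len()`; ElevenLabs may count characters differently, and the effective rate depends on the connected plan, so treat the result as an approximation only.

```python
from decimal import Decimal

# Fixed text-to-speech list prices from the pricing table, USD per 1K characters.
TTS_RATE_PER_1K_CHARACTERS = {
    "eleven_multilingual_v2": Decimal("0.10"),
    "eleven_flash_v2_5": Decimal("0.05"),
    "eleven_v3": Decimal("0.10"),
}


def estimate_tts_cost(text, model_id):
    """Estimate the USD cost of synthesizing `text` (hypothetical helper).

    Assumption: one billed character equals one code point in `text`.
    """
    if model_id not in TTS_RATE_PER_1K_CHARACTERS:
        raise ValueError(f"no fixed per-character rate listed for {model_id}")
    characters = Decimal(len(text))
    return characters * TTS_RATE_PER_1K_CHARACTERS[model_id] / Decimal(1000)


# A 2,500-character script on the low-latency Flash V2.5 model.
script = "a" * 2500
print(estimate_tts_cost(script, "eleven_flash_v2_5"))
```

With these assumptions, 2,500 characters on Flash V2.5 come to 2.5 × 0.05 = 0.125 USD, and the same script on Multilingual V2 or Eleven V3 would come to 0.25 USD.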

Fixed rates are vendor list prices last verified on 2026-06-10. Usage-based models are metered at the vendor's current rate at request time. Model availability and pricing may change as vendors update their offerings.