TL;DR

Veo 3.1 developer guide: per-second pricing for all three model variants, timestamp prompting, First/Last Frame, and Ingredients to Video with working Python code.

Veo 3.1 shipped with six features that Veo 3 couldn’t do, and timestamp prompting — the ability to direct multiple shots inside a single 8-second generation call — is the one that changes how you structure every production pipeline. The model reached Vertex AI general availability in late May 2026 and hit the Gemini API developer tier in early June. If you’re still generating clips the same way you did with Veo 3, you’re leaving cost efficiency and creative control on the table.

This is the complete developer guide: three model variants with exact pricing, working Python code for every major API pattern, and an honest rundown of what still doesn’t work.

Three Models, Three Price Points

Veo 3.1 ships as three distinct models. The quality difference between Standard and Fast is real but narrow for most social and web content; the difference between Fast and Lite is more significant. All three generate synchronized native audio by default.

Model	API ID	Price/sec	8s clip cost	Best for
Veo 3.1 Standard	veo-3.1-generate-preview	$0.40	$3.20	Hero shots, broadcast deliverables, client finals
Veo 3.1 Fast	veo-3.1-fast-generate-preview	$0.15	$1.20	Production pipelines, high-volume social content
Veo 3.1 Lite	veo-3.1-lite-generate-preview	$0.05	$0.40	Iteration, concept validation, draft review

Standard delivers the cinematic quality Google demonstrated at I/O 2026. Fast is roughly 70% of Standard quality at 37% of the cost — indistinguishable to most viewers on a phone screen, but visible under technical review in fine detail and hair. Lite is the iteration tier: good enough to verify shot composition, timing, and prompt intent before you commit Fast or Standard budget to the final version.

All three support 1080p output at 16:9 and 9:16 aspect ratios. Duration options are 4, 6, or 8 seconds per generation call. Disabling audio saves roughly 33% off the per-second rate on any tier — useful for clips where you’re adding a post-production soundtrack anyway.

One pricing gotcha: the Gemini API documentation for Veo 3.1 Lite shows a “$0.05 per video” number that multiple developers in the Google AI forum have flagged as misleading. The billing is per-second, not per-video, so an 8-second Lite clip costs $0.40, not $0.05. Benchmark your actual token usage before committing to volume.

API Setup: Your First Generation

Veo 3.1 requires Python SDK version 1.52+ and a Gemini API key with Paid Tier access. It is not available on the free tier. Video generation is asynchronous — unlike the synchronous image API, calls return an operation object that you poll until completion:

import time
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_GEMINI_API_KEY")

operation = client.models.generate_videos(
    model="veo-3.1-fast-generate-preview",
    prompt=(
        "A wide establishing shot of a neon-lit Tokyo street at 2am, rain falling, "
        "reflections shimmering in puddles. Slow pan right. "
        "SFX: Rain on pavement, distant traffic, a single bicycle bell."
    ),
    config=types.GenerateVideosConfig(
        aspect_ratio="16:9",
        resolution="1080p",
        duration_seconds=8,
        enhance_prompt=True,
    )
)

# Fast tier typically completes in 60-90 seconds
while not operation.done:
    time.sleep(15)
    operation = client.operations.get(operation)

video = operation.result.generated_videos[0]
with open("output.mp4", "wb") as f:
    f.write(video.video.video_bytes)

The enhance_prompt flag rewrites your prompt with additional cinematography detail before sending it to the model. It improves output quality on vague prompts but reduces precision on highly crafted ones. Set it to False if you’ve spent time engineering a specific prompt and want the model to interpret it literally. For quick exploration where quality matters more than control, leave it at True.

Standard tier generation takes 3–5 minutes per clip on average. Budget your timeout accordingly — a 10-second sleep interval is too short for Standard; 30 seconds is more appropriate.

Three Models, Three Price Points

API Setup: Your First Generation

Try Our Free Tools

JSON Formatter & Validator

cURL to Code Converter

More from AI Tools & Tutorials

OpenAI Codex Goal Mode Is Now GA — Multi-Hour Autonomous Coding Sessions

Timestamp Prompting: One Call, Multiple Shots

First and Last Frame: Controlled Transitions

Ingredients to Video: Character Consistency Across Clips

Audio Prompting Syntax

Pricing in Practice

What Veo 3.1 Still Can’t Do

Where Veo 3.1 Fits Right Now

Ready to ship faster?

One insight, every Monday. 7am IST. Zero fluff.

Comments · 0

Key takeaways · 5

Topics

Article stats

Regex Playground

Base64 Encoder / Decoder

UUID Generator

GitHub Copilot Token Billing Week 1: What Developers Are Actually Paying

Claude Sonnet 4.8 Evidence Found in Anthropic Source Maps — What We Know

xAI Launched Grok Build — A Terminal Coding Agent to Fight Claude Code and Codex

OpenAI Dreaming V3: ChatGPT Now Learns While You Sleep

DeepSeek V4 Pro Just Got 75% Cheaper — What It Means for Your AI Stack