我们隆重推出首款完全多模态嵌入模型 Gemini Embedding 2。

在 Gemini API 中使用 Veo 3.1 生成视频

如需了解视频理解，请参阅视频理解指南。

Veo 3.1 是 Google 最先进的模型，可生成高保真 8 秒 720p、1080p 或 4k 视频，具有惊人的逼真效果和原生生成的音频。您可以使用 Gemini API 以编程方式访问此模型。如需详细了解可用的 Veo 模型变体，请参阅模型版本部分。

Veo 3.1 擅长各种视觉和电影风格，并引入了多项新功能：

竖屏视频：选择横屏 (16:9) 视频或竖屏 (9:16) 视频。
视频扩展：扩展之前使用 Veo 生成的视频。
指定帧生成：通过指定第一帧和/或最后一帧来生成视频。
基于图片的指导：使用最多三张参考图片来指导生成的视频的内容。

如需详细了解如何编写有效的文本提示来生成视频，请参阅 Veo 提示指南

文生视频生成

选择一个示例，了解如何生成视频，其中包含对话、电影级真实感或创意动画：

Python

import time
from google import genai
from google.genai import types

client = genai.Client()

prompt = """A close up of two people staring at a cryptic drawing on a wall, torchlight flickering.
A man murmurs, 'This must be it. That's the secret code.' The woman looks at him and whispering excitedly, 'What did you find?'"""

operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt=prompt,
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the generated video.
generated_video = operation.response.generated_videos[0]
client.files.download(file=generated_video.video)
generated_video.video.save("dialogue_example.mp4")
print("Generated video saved to dialogue_example.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = `A close up of two people staring at a cryptic drawing on a wall, torchlight flickering.
A man murmurs, 'This must be it. That's the secret code.' The woman looks at him and whispering excitedly, 'What did you find?'`;

let operation = await ai.models.generateVideos({
    model: "veo-3.1-generate-preview",
    prompt: prompt,
});

// Poll the operation status until the video is ready.
while (!operation.done) {
    console.log("Waiting for video generation to complete...")
    await new Promise((resolve) => setTimeout(resolve, 10000));
    operation = await ai.operations.getVideosOperation({
        operation: operation,
    });
}

// Download the generated video.
ai.files.download({
    file: operation.response.generatedVideos[0].video,
    downloadPath: "dialogue_example.mp4",
});
console.log(`Generated video saved to dialogue_example.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

    prompt := `A close up of two people staring at a cryptic drawing on a wall, torchlight flickering.
    A man murmurs, 'This must be it. That's the secret code.' The woman looks at him and whispering excitedly, 'What did you find?'`

    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
        nil,
        nil,
    )

    // Poll the operation status until the video is ready.
    for !operation.Done {
    log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the generated video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "dialogue_example.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

Java

import com.google.genai.Client;
import com.google.genai.types.GenerateVideosOperation;
import com.google.genai.types.Video;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

class GenerateVideoFromText {
  public static void main(String[] args) throws Exception {
    Client client = new Client();

    String prompt = "A close up of two people staring at a cryptic drawing on a wall, torchlight flickering.\n" +
"A man murmurs, 'This must be it. That's the secret code.' The woman looks at him and whispering excitedly, 'What did you find?'";

    GenerateVideosOperation operation =
        client.models.generateVideos("veo-3.1-generate-preview", prompt, null, null);

    // Poll the operation status until the video is ready.
    while (!operation.done().isPresent() || !operation.done().get()) {
      System.out.println("Waiting for video generation to complete...");
      Thread.sleep(10000);
      operation = client.operations.getVideosOperation(operation, null);
    }

    // Download the generated video.
    Video video = operation.response().get().generatedVideos().get().get(0).video().get();
    Path path = Paths.get("dialogue_example.mp4");
    client.files.download(video, path.toString(), null);
    if (video.videoBytes().isPresent()) {
      Files.write(path, video.videoBytes().get());
      System.out.println("Generated video saved to dialogue_example.mp4");
    }
  }
}

REST

# Note: This script uses jq to parse the JSON response.
# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
        "prompt": "A close up of two people staring at a cryptic drawing on a wall, torchlight flickering. A man murmurs, \"This must be it. That'\''s the secret code.\" The woman looks at him and whispering excitedly, \"What did you find?\""
      }
    ]
  }' | jq -r .name)

# Poll the operation status until the video is ready
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Extract the download URI from the final response.
    video_uri=$(echo "${status_response}" | jq -r '.response.generateVideoResponse.generatedSamples[0].video.uri')
    echo "Downloading video from: ${video_uri}"

    # Download the video using the URI and API key and follow redirects.
    curl -L -o dialogue_example.mp4 -H "x-goog-api-key: $GEMINI_API_KEY" "${video_uri}"
    break
  fi
  # Wait for 5 seconds before checking again.
  sleep 10
done

控制宽高比

借助 Veo 3.1，您可以创建横屏视频（16:9，默认设置）或竖屏视频 (9:16)。您可以使用 aspect_ratio 参数告知模型您想要哪个：

Python

import time
from google import genai
from google.genai import types

client = genai.Client()

prompt = """A montage of pizza making: a chef tossing and flattening the floury dough, ladling rich red tomato sauce in a spiral, sprinkling mozzarella cheese and pepperoni, and a final shot of the bubbling golden-brown pizza, upbeat electronic music with a rhythmical beat is playing, high energy professional video."""

operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt=prompt,
    config=types.GenerateVideosConfig(
      aspect_ratio="9:16",
    ),
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the generated video.
generated_video = operation.response.generated_videos[0]
client.files.download(file=generated_video.video)
generated_video.video.save("pizza_making.mp4")
print("Generated video saved to pizza_making.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = `A montage of pizza making: a chef tossing and flattening the floury dough, ladling rich red tomato sauce in a spiral, sprinkling mozzarella cheese and pepperoni, and a final shot of the bubbling golden-brown pizza, upbeat electronic music with a rhythmical beat is playing, high energy professional video.`;

let operation = await ai.models.generateVideos({
    model: "veo-3.1-generate-preview",
    prompt: prompt,
    config: {
      aspectRatio: "9:16",
    },
});

// Poll the operation status until the video is ready.
while (!operation.done) {
    console.log("Waiting for video generation to complete...")
    await new Promise((resolve) => setTimeout(resolve, 10000));
    operation = await ai.operations.getVideosOperation({
        operation: operation,
    });
}

// Download the generated video.
ai.files.download({
    file: operation.response.generatedVideos[0].video,
    downloadPath: "pizza_making.mp4",
});
console.log(`Generated video saved to pizza_making.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

    prompt := `A montage of pizza making: a chef tossing and flattening the floury dough, ladling rich red tomato sauce in a spiral, sprinkling mozzarella cheese and pepperoni, and a final shot of the bubbling golden-brown pizza, upbeat electronic music with a rhythmical beat is playing, high energy professional video.`

  videoConfig := &genai.GenerateVideosConfig{
      AspectRatio: "9:16",
  }

    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
        nil,
        videoConfig,
    )

    // Poll the operation status until the video is ready.
    for !operation.Done {
    log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the generated video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "pizza_making.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

REST

# Note: This script uses jq to parse the JSON response.
# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
        "prompt": "A montage of pizza making: a chef tossing and flattening the floury dough, ladling rich red tomato sauce in a spiral, sprinkling mozzarella cheese and pepperoni, and a final shot of the bubbling golden-brown pizza, upbeat electronic music with a rhythmical beat is playing, high energy professional video."
      }
    ],
    "parameters": {
      "aspectRatio": "9:16"
    }
  }' | jq -r .name)

# Poll the operation status until the video is ready
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Extract the download URI from the final response.
    video_uri=$(echo "${status_response}" | jq -r '.response.generateVideoResponse.generatedSamples[0].video.uri')
    echo "Downloading video from: ${video_uri}"

    # Download the video using the URI and API key and follow redirects.
    curl -L -o pizza_making.mp4 -H "x-goog-api-key: $GEMINI_API_KEY" "${video_uri}"
    break
  fi
  # Wait for 5 seconds before checking again.
  sleep 10
done

控制分辨率

Veo 3.1 还可以直接生成 720p、1080p 或 4k 视频。

请注意，分辨率越高，延迟时间就越长。4K 视频的价格也更高（请参阅价格）。

视频扩展广告也仅限于 720p 视频。

Python

import time
from google import genai
from google.genai import types

client = genai.Client()

prompt = """A stunning drone view of the Grand Canyon during a flamboyant sunset that highlights the canyon's colors. The drone slowly flies towards the sun then accelerates, dives and flies inside the canyon."""

operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt=prompt,
    config=types.GenerateVideosConfig(
      resolution="4k",
    ),
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the generated video.
generated_video = operation.response.generated_videos[0]
client.files.download(file=generated_video.video)
generated_video.video.save("4k_grand_canyon.mp4")
print("Generated video saved to 4k_grand_canyon.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = `A stunning drone view of the Grand Canyon during a flamboyant sunset that highlights the canyon's colors. The drone slowly flies towards the sun then accelerates, dives and flies inside the canyon.`;

let operation = await ai.models.generateVideos({
    model: "veo-3.1-generate-preview",
    prompt: prompt,
    config: {
      resolution: "4k",
    },
});

// Poll the operation status until the video is ready.
while (!operation.done) {
    console.log("Waiting for video generation to complete...")
    await new Promise((resolve) => setTimeout(resolve, 10000));
    operation = await ai.operations.getVideosOperation({
        operation: operation,
    });
}

// Download the generated video.
ai.files.download({
    file: operation.response.generatedVideos[0].video,
    downloadPath: "4k_grand_canyon.mp4",
});
console.log(`Generated video saved to 4k_grand_canyon.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

    prompt := `A stunning drone view of the Grand Canyon during a flamboyant sunset that highlights the canyon's colors. The drone slowly flies towards the sun then accelerates, dives and flies inside the canyon.`

  videoConfig := &genai.GenerateVideosConfig{
      Resolution: "4k",
  }

    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
        nil,
        videoConfig,
    )

    // Poll the operation status until the video is ready.
    for !operation.Done {
    log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the generated video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "4k_grand_canyon.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

REST

# Note: This script uses jq to parse the JSON response.
# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
        "prompt": "A stunning drone view of the Grand Canyon during a flamboyant sunset that highlights the canyon'\''s colors. The drone slowly flies towards the sun then accelerates, dives and flies inside the canyon."
      }
    ],
    "parameters": {
      "resolution": "4k"
    }
  }' | jq -r .name)

# Poll the operation status until the video is ready
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Extract the download URI from the final response.
    video_uri=$(echo "${status_response}" | jq -r '.response.generateVideoResponse.generatedSamples[0].video.uri')
    echo "Downloading video from: ${video_uri}"

    # Download the video using the URI and API key and follow redirects.
    curl -L -o 4k_grand_canyon.mp4 -H "x-goog-api-key: $GEMINI_API_KEY" "${video_uri}"
    break
  fi
  # Wait for 5 seconds before checking again.
  sleep 10
done

图片转视频生成

以下代码演示了如何使用 Gemini 2.5 Flash Image（又称 Nano Banana）生成图片，然后将该图片用作起始帧，以使用 Veo 3.1 生成视频。

Python

import time
from google import genai

client = genai.Client()

prompt = "Panning wide shot of a calico kitten sleeping in the sunshine"

# Step 1: Generate an image with Nano Banana.
image = client.models.generate_content(
    model="gemini-2.5-flash-image",
    contents=prompt,
    config={"response_modalities":['IMAGE']}
)

# Step 2: Generate video with Veo 3.1 using the image.
operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt=prompt,
    image=image.parts[0].as_image(),
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the video.
video = operation.response.generated_videos[0]
client.files.download(file=video.video)
video.video.save("veo3_with_image_input.mp4")
print("Generated video saved to veo3_with_image_input.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = "Panning wide shot of a calico kitten sleeping in the sunshine";

// Step 1: Generate an image with Nano Banana.
const imageResponse = await ai.models.generateContent({
  model: "gemini-2.5-flash-image",
  prompt: prompt,
});

// Step 2: Generate video with Veo 3.1 using the image.
let operation = await ai.models.generateVideos({
  model: "veo-3.1-generate-preview",
  prompt: prompt,
  image: {
    imageBytes: imageResponse.generatedImages[0].image.imageBytes,
    mimeType: "image/png",
  },
});

// Poll the operation status until the video is ready.
while (!operation.done) {
  console.log("Waiting for video generation to complete...")
  await new Promise((resolve) => setTimeout(resolve, 10000));
  operation = await ai.operations.getVideosOperation({
    operation: operation,
  });
}

// Download the video.
ai.files.download({
    file: operation.response.generatedVideos[0].video,
    downloadPath: "veo3_with_image_input.mp4",
});
console.log(`Generated video saved to veo3_with_image_input.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

    prompt := "Panning wide shot of a calico kitten sleeping in the sunshine"

    // Step 1: Generate an image with Nano Banana.
    imageResponse, err := client.Models.GenerateContent(
        ctx,
        "gemini-2.5-flash-image",
        prompt,
        nil, // GenerateImagesConfig
    )
    if err != nil {
        log.Fatal(err)
    }

    // Step 2: Generate video with Veo 3.1 using the image.
    operation, err := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
        imageResponse.GeneratedImages[0].Image,
        nil, // GenerateVideosConfig
    )
    if err != nil {
        log.Fatal(err)
    }

    // Poll the operation status until the video is ready.
    for !operation.Done {
        log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "veo3_with_image_input.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

Java

import com.google.genai.Client;
import com.google.genai.types.GenerateVideosOperation;
import com.google.genai.types.Image;
import com.google.genai.types.Video;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

class GenerateVideoFromImage {
  public static void main(String[] args) throws Exception {
    Client client = new Client();

    String prompt = "Panning wide shot of a calico kitten sleeping in the sunshine";

    // Step 1: Generate an image with Nano Banana:
    // ...
    // We assume 'image' contains the generated image from step 1,
    // or is loaded from a file:
    Image image = Image.fromFile("path/to/your/image.png");

    // Step 2: Generate video with Veo 3.1 using the image.
    GenerateVideosOperation operation =
        client.models.generateVideos("veo-3.1-generate-preview", prompt, image, null);

    // Poll the operation status until the video is ready.
    while (!operation.done().isPresent() || !operation.done().get()) {
      System.out.println("Waiting for video generation to complete...");
      Thread.sleep(10000);
      operation = client.operations.getVideosOperation(operation, null);
    }

    // Download the video.
    Video video = operation.response().get().generatedVideos().get().get(0).video().get();
    Path path = Paths.get("veo3_with_image_input.mp4");
    client.files.download(video, path.toString(), null);
    if (video.videoBytes().isPresent()) {
      Files.write(path, video.videoBytes().get());
      System.out.println("Generated video saved to veo3_with_image_input.mp4");
    }
  }
}

使用参考图片

Veo 3.1 现在最多可接受 3 张参考图片，以指导生成的视频的内容。提供人物、角色或产品的图片，以便在输出视频中保留主题的外观。

例如，使用 Nano Banana 生成的以下三张图片作为参考，并搭配精心撰写的提示，即可生成以下视频：

`dress_image`	`woman_image`	`glasses_image`

Python

import time
from google import genai

client = genai.Client()

prompt = "The video opens with a medium, eye-level shot of a beautiful woman with dark hair and warm brown eyes. She wears a magnificent, high-fashion flamingo dress with layers of pink and fuchsia feathers, complemented by whimsical pink, heart-shaped sunglasses. She walks with serene confidence through the crystal-clear, shallow turquoise water of a sun-drenched lagoon. The camera slowly pulls back to a medium-wide shot, revealing the breathtaking scene as the dress's long train glides and floats gracefully on the water's surface behind her. The cinematic, dreamlike atmosphere is enhanced by the vibrant colors of the dress against the serene, minimalist landscape, capturing a moment of pure elegance and high-fashion fantasy."

dress_reference = types.VideoGenerationReferenceImage(
  image=dress_image, # Generated separately with Nano Banana
  reference_type="asset"
)

sunglasses_reference = types.VideoGenerationReferenceImage(
  image=glasses_image, # Generated separately with Nano Banana
  reference_type="asset"
)

woman_reference = types.VideoGenerationReferenceImage(
  image=woman_image, # Generated separately with Nano Banana
  reference_type="asset"
)

operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt=prompt,
    config=types.GenerateVideosConfig(
      reference_images=[dress_reference, glasses_reference, woman_reference],
    ),
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the video.
video = operation.response.generated_videos[0]
client.files.download(file=video.video)
video.video.save("veo3.1_with_reference_images.mp4")
print("Generated video saved to veo3.1_with_reference_images.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = "The video opens with a medium, eye-level shot of a beautiful woman with dark hair and warm brown eyes. She wears a magnificent, high-fashion flamingo dress with layers of pink and fuchsia feathers, complemented by whimsical pink, heart-shaped sunglasses. She walks with serene confidence through the crystal-clear, shallow turquoise water of a sun-drenched lagoon. The camera slowly pulls back to a medium-wide shot, revealing the breathtaking scene as the dress's long train glides and floats gracefully on the water's surface behind her. The cinematic, dreamlike atmosphere is enhanced by the vibrant colors of the dress against the serene, minimalist landscape, capturing a moment of pure elegance and high-fashion fantasy.";

// dressImage, glassesImage, womanImage generated separately with Nano Banana
// and available as objects like { imageBytes: "...", mimeType: "image/png" }
const dressReference = {
  image: dressImage,
  referenceType: "asset",
};
const sunglassesReference = {
  image: glassesImage,
  referenceType: "asset",
};
const womanReference = {
  image: womanImage,
  referenceType: "asset",
};

let operation = await ai.models.generateVideos({
  model: "veo-3.1-generate-preview",
  prompt: prompt,
  config: {
    referenceImages: [
      dressReference,
      sunglassesReference,
      womanReference,
    ],
  },
});

// Poll the operation status until the video is ready.
while (!operation.done) {
  console.log("Waiting for video generation to complete...");
  await new Promise((resolve) => setTimeout(resolve, 10000));
  operation = await ai.operations.getVideosOperation({
    operation: operation,
  });
}

// Download the video.
ai.files.download({
  file: operation.response.generatedVideos[0].video,
  downloadPath: "veo3.1_with_reference_images.mp4",
});
console.log(`Generated video saved to veo3.1_with_reference_images.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

  prompt := `The video opens with a medium, eye-level shot of a beautiful woman with dark hair and warm brown eyes. She wears a magnificent, high-fashion flamingo dress with layers of pink and fuchsia feathers, complemented by whimsical pink, heart-shaped sunglasses. She walks with serene confidence through the crystal-clear, shallow turquoise water of a sun-drenched lagoon. The camera slowly pulls back to a medium-wide shot, revealing the breathtaking scene as the dress's long train glides and floats gracefully on the water's surface behind her. The cinematic, dreamlike atmosphere is enhanced by the vibrant colors of the dress against the serene, minimalist landscape, capturing a moment of pure elegance and high-fashion fantasy.`

  // dressImage, glassesImage, womanImage generated separately with Nano Banana
  // and available as *genai.Image objects.
  var dressImage, glassesImage, womanImage *genai.Image

  dressReference := &genai.VideoGenerationReferenceImage{
    Image: dressImage,
    ReferenceType: "asset",
  }
  sunglassesReference := &genai.VideoGenerationReferenceImage{
    Image: glassesImage,
    ReferenceType: "asset",
  }
  womanReference := &genai.VideoGenerationReferenceImage{
    Image: womanImage,
    ReferenceType: "asset",
  }

    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
    nil, // image
        &genai.GenerateVideosConfig{
      ReferenceImages: []*genai.VideoGenerationReferenceImage{
        dressReference,
        sunglassesReference,
        womanReference,
      },
    },
    )

    // Poll the operation status until the video is ready.
    for !operation.Done {
        log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "veo3.1_with_reference_images.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

REST

# Note: This script uses jq to parse the JSON response.
# It assumes dress_image_base64, glasses_image_base64, and woman_image_base64
# contain base64-encoded image data.

# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
      "prompt": "The video opens with a medium, eye-level shot of a beautiful woman with dark hair and warm brown eyes. She wears a magnificent, high-fashion flamingo dress with layers of pink and fuchsia feathers, complemented by whimsical pink, heart-shaped sunglasses. She walks with serene confidence through the crystal-clear, shallow turquoise water of a sun-drenched lagoon. The camera slowly pulls back to a medium-wide shot, revealing the breathtaking scene as the dress'\''s long train glides and floats gracefully on the water'\''s surface behind her. The cinematic, dreamlike atmosphere is enhanced by the vibrant colors of the dress against the serene, minimalist landscape, capturing a moment of pure elegance and high-fashion fantasy.",
      "referenceImages": [
        {
          "image": {"inlineData": {"mimeType": "image/png", "data": "'"$dress_image_base64"'"}},
          "referenceType": "asset"
        },
        {
          "image": {"inlineData": {"mimeType": "image/png", "data": "'"$glasses_image_base64"'"}},
          "referenceType": "asset"
        },
        {
          "image": {"inlineData": {"mimeType": "image/png", "data": "'"$woman_image_base64"'"}},
          "referenceType": "asset"
        }
      ]
    }],
  }' | jq -r .name)

# Poll the operation status until the video is ready
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Extract the download URI from the final response.
    video_uri=$(echo "${status_response}" | jq -r '.response.generateVideoResponse.generatedSamples[0].video.uri')
    echo "Downloading video from: ${video_uri}"

    # Download the video using the URI and API key and follow redirects.
    curl -L -o veo3.1_with_reference_images.mp4 -H "x-goog-api-key: $GEMINI_API_KEY" "${video_uri}"
    break
  fi
  # Wait for 10 seconds before checking again.
  sleep 10
done

使用第一帧和最后一帧

借助 Veo 3.1，您可以使用插值或指定视频的第一帧和最后一帧来创作视频。如需了解如何编写有效的文本提示来生成视频，请参阅 Veo 提示指南。

Python

import time
from google import genai

client = genai.Client()

prompt = "A cinematic, haunting video. A ghostly woman with long white hair and a flowing dress swings gently on a rope swing beneath a massive, gnarled tree in a foggy, moonlit clearing. The fog thickens and swirls around her, and she slowly fades away, vanishing completely. The empty swing is left swaying rhythmically on its own in the eerie silence."

operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt=prompt,
    image=first_image, # The starting frame is passed as a primary input
    config=types.GenerateVideosConfig(
      last_frame=last_image # The ending frame is passed as a generation constraint in the config
    ),
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the video.
video = operation.response.generated_videos[0]
client.files.download(file=video.video)
video.video.save("veo3.1_with_interpolation.mp4")
print("Generated video saved to veo3.1_with_interpolation.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = "A cinematic, haunting video. A ghostly woman with long white hair and a flowing dress swings gently on a rope swing beneath a massive, gnarled tree in a foggy, moonlit clearing. The fog thickens and swirls around her, and she slowly fades away, vanishing completely. The empty swing is left swaying rhythmically on its own in the eerie silence.";

// firstImage and lastImage generated separately with Nano Banana
// and available as objects like { imageBytes: "...", mimeType: "image/png" }
let operation = await ai.models.generateVideos({
    model: "veo-3.1-generate-preview",
    prompt: prompt,
    image: firstImage, // The starting frame is passed as a primary input
    config: {
      lastFrame: lastImage, // The ending frame is passed as a generation constraint in the config
    },
});

// Poll the operation status until the video is ready.
while (!operation.done) {
    console.log("Waiting for video generation to complete...")
    await new Promise((resolve) => setTimeout(resolve, 10000));
    operation = await ai.operations.getVideosOperation({
        operation: operation,
    });
}

// Download the video.
ai.files.download({
    file: operation.response.generatedVideos[0].video,
    downloadPath: "veo3.1_with_interpolation.mp4",
});
console.log(`Generated video saved to veo3.1_with_interpolation.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

  prompt := `A cinematic, haunting video. A ghostly woman with long white hair and a flowing dress swings gently on a rope swing beneath a massive, gnarled tree in a foggy, moonlit clearing. The fog thickens and swirls around her, and she slowly fades away, vanishing completely. The empty swing is left swaying rhythmically on its own in the eerie silence.`

  // firstImage and lastImage generated separately with Nano Banana
  // and available as *genai.Image objects.
  var firstImage, lastImage *genai.Image

    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
    firstImage, // The starting frame is passed as a primary input
        &genai.GenerateVideosConfig{
      LastFrame: lastImage, // The ending frame is passed as a generation constraint in the config
    },
    )

    // Poll the operation status until the video is ready.
    for !operation.Done {
        log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "veo3.1_with_interpolation.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

REST

# Note: This script uses jq to parse the JSON response.
# It assumes first_image_base64 and last_image_base64
# contain base64-encoded image data.

# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
# The starting frame is passed as a primary input
# The ending frame is passed as a generation constraint in the config
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
      "prompt": "A cinematic, haunting video. A ghostly woman with long white hair and a flowing dress swings gently on a rope swing beneath a massive, gnarled tree in a foggy, moonlit clearing. The fog thickens and swirls around her, and she slowly fades away, vanishing completely. The empty swing is left swaying rhythmically on its own in the eerie silence.",
      "image": {"inlineData": {"mimeType": "image/png", "data": "'"$first_image_base64"'"}},
      "lastFrame": {"inlineData": {"mimeType": "image/png", "data": "'"$last_image_base64"'"}}
    }],
  }' | jq -r .name)

# Poll the operation status until the video is ready
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Extract the download URI from the final response.
    video_uri=$(echo "${status_response}" | jq -r '.response.generateVideoResponse.generatedSamples[0].video.uri')
    echo "Downloading video from: ${video_uri}"

    # Download the video using the URI and API key and follow redirects.
    curl -L -o veo3.1_with_interpolation.mp4 -H "x-goog-api-key: $GEMINI_API_KEY" "${video_uri}"
    break
  fi
  # Wait for 10 seconds before checking again.
  sleep 10
done

`first_image`	`last_image`	veo3.1_with_interpolation.mp4

延长 Veo 视频

使用 Veo 3.1 可将之前使用 Veo 生成的视频延长 7 秒，最多可延长 20 次。

输入视频限制：

Veo 生成的视频时长上限为 141 秒。
Gemini API 仅支持 Veo 生成的视频的视频扩展功能。
视频应来自上一代设备，例如 operation.response.generated_videos[0].video
视频的存储期限为 2 天，但如果视频被引用以用于扩展，其 2 天的存储期限计时器会重置。您只能延长过去两天内生成或引用的视频。
输入视频应具有一定的时长、宽高比和尺寸：
- 宽高比：9:16 或 16:9
- 分辨率：720p
- 视频时长：不超过 141 秒

该扩展程序的输出是一个视频，其中包含用户输入的视频和生成的扩展视频，总时长最长为 148 秒。

此示例采用 Veo 生成的视频（此处显示了其原始提示），并使用 video 参数和新提示对其进行扩展：

提示	输出：`butterfly_video`
一只折纸蝴蝶拍打着翅膀，从法式落地门飞到花园里。

Python

import time
from google import genai

client = genai.Client()

prompt = "Track the butterfly into the garden as it lands on an orange origami flower. A fluffy white puppy runs up and gently pats the flower."

operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    video=operation.response.generated_videos[0].video, # This must be a video from a previous generation
    prompt=prompt,
    config=types.GenerateVideosConfig(
        number_of_videos=1,
        resolution="720p"
    ),
)

# Poll the operation status until the video is ready.
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the video.
video = operation.response.generated_videos[0]
client.files.download(file=video.video)
video.video.save("veo3.1_extension.mp4")
print("Generated video saved to veo3.1_extension.mp4")

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

const prompt = "Track the butterfly into the garden as it lands on an orange origami flower. A fluffy white puppy runs up and gently pats the flower.";

// butterflyVideo must be a video from a previous generation
// available as an object like { videoBytes: "...", mimeType: "video/mp4" }
let operation = await ai.models.generateVideos({
    model: "veo-3.1-generate-preview",
    video: butterflyVideo,
    prompt: prompt,
    config: {
        numberOfVideos: 1,
        resolution: "720p",
    },
});

// Poll the operation status until the video is ready.
while (!operation.done) {
    console.log("Waiting for video generation to complete...")
    await new Promise((resolve) => setTimeout(resolve, 10000));
    operation = await ai.operations.getVideosOperation({
        operation: operation,
    });
}

// Download the video.
ai.files.download({
    file: operation.response.generatedVideos[0].video,
    downloadPath: "veo3.1_extension.mp4",
});
console.log(`Generated video saved to veo3.1_extension.mp4`);

Go

package main

import (
    "context"
    "log"
    "os"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

  prompt := `Track the butterfly into the garden as it lands on an orange origami flower. A fluffy white puppy runs up and gently pats the flower.`

  // butterflyVideo must be a video from a previous generation
  // available as a *genai.Video object.
  var butterflyVideo *genai.Video

    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        prompt,
    nil, // image
    butterflyVideo,
        &genai.GenerateVideosConfig{
      NumberOfVideos: 1,
      Resolution: "720p",
    },
    )

    // Poll the operation status until the video is ready.
    for !operation.Done {
        log.Println("Waiting for video generation to complete...")
        time.Sleep(10 * time.Second)
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Download the video.
    video := operation.Response.GeneratedVideos[0]
    client.Files.Download(ctx, video.Video, nil)
    fname := "veo3.1_extension.mp4"
    _ = os.WriteFile(fname, video.Video.VideoBytes, 0644)
    log.Printf("Generated video saved to %s\n", fname)
}

REST

# Note: This script uses jq to parse the JSON response.
# It assumes butterfly_video_base64 contains base64-encoded
# video data from a previous generation.

# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
      "prompt": "Track the butterfly into the garden as it lands on an orange origami flower. A fluffy white puppy runs up and gently pats the flower.",
      "video": {"inlineData": {"mimeType": "video/mp4", "data": "'"$butterfly_video_base64"'"}}
    }],
    "parameters": {
      "numberOfVideos": 1,
      "resolution": "720p"
    }
  }' | jq -r .name)

# Poll the operation status until the video is ready
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Extract the download URI from the final response.
    video_uri=$(echo "${status_response}" | jq -r '.response.generateVideoResponse.generatedSamples[0].video.uri')
    echo "Downloading video from: ${video_uri}"

    # Download the video using the URI and API key and follow redirects.
    curl -L -o veo3.1_extension.mp4 -H "x-goog-api-key: $GEMINI_API_KEY" "${video_uri}"
    break
  fi
  # Wait for 10 seconds before checking again.
  sleep 10
done

如需了解如何编写有效的文本提示来生成视频，请参阅 Veo 提示指南。

处理异步操作

视频生成是一项计算密集型任务。当您向 API 发送请求时，它会启动一个长时间运行的作业，并立即返回一个 operation 对象。然后，您必须进行轮询，直到视频准备就绪（以 done 状态为 true 表示）。

此流程的核心是一个轮询循环，用于定期检查作业的状态。

Python

import time
from google import genai
from google.genai import types

client = genai.Client()

# After starting the job, you get an operation object.
operation = client.models.generate_videos(
    model="veo-3.1-generate-preview",
    prompt="A cinematic shot of a majestic lion in the savannah.",
)

# Alternatively, you can use operation.name to get the operation.
operation = types.GenerateVideosOperation(name=operation.name)

# This loop checks the job status every 10 seconds.
while not operation.done:
    time.sleep(10)
    # Refresh the operation object to get the latest status.
    operation = client.operations.get(operation)

# Once done, the result is in operation.response.
# ... process and download your video ...

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

// After starting the job, you get an operation object.
let operation = await ai.models.generateVideos({
  model: "veo-3.1-generate-preview",
  prompt: "A cinematic shot of a majestic lion in the savannah.",
});

// Alternatively, you can use operation.name to get the operation.
// operation = types.GenerateVideosOperation(name=operation.name)

// This loop checks the job status every 10 seconds.
while (!operation.done) {
    await new Promise((resolve) => setTimeout(resolve, 1000));
    // Refresh the operation object to get the latest status.
    operation = await ai.operations.getVideosOperation({ operation });
}

// Once done, the result is in operation.response.
// ... process and download your video ...

Go

package main

import (
    "context"
    "log"
    "time"

    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

    // After starting the job, you get an operation object.
    operation, _ := client.Models.GenerateVideos(
        ctx,
        "veo-3.1-generate-preview",
        "A cinematic shot of a majestic lion in the savannah.",
        nil,
        nil,
    )

    // This loop checks the job status every 10 seconds.
    for !operation.Done {
        time.Sleep(10 * time.Second)
        // Refresh the operation object to get the latest status.
        operation, _ = client.Operations.GetVideosOperation(ctx, operation, nil)
    }

    // Once done, the result is in operation.Response.
    // ... process and download your video ...
}

Java

import com.google.genai.Client;
import com.google.genai.types.GenerateVideosOperation;
import com.google.genai.types.Video;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

class HandleAsync {
  public static void main(String[] args) throws Exception {
    Client client = new Client();

    // After starting the job, you get an operation object.
    GenerateVideosOperation operation =
        client.models.generateVideos(
            "veo-3.1-generate-preview",
            "A cinematic shot of a majestic lion in the savannah.",
            null,
            null);

    // This loop checks the job status every 10 seconds.
    while (!operation.done().isPresent() || !operation.done().get()) {
      Thread.sleep(10000);
      // Refresh the operation object to get the latest status.
      operation = client.operations.getVideosOperation(operation, null);
    }

    // Once done, the result is in operation.response.
    // Download the generated video.
    Video video = operation.response().get().generatedVideos().get().get(0).video().get();
    Path path = Paths.get("async_example.mp4");
    client.files.download(video, path.toString(), null);
    if (video.videoBytes().isPresent()) {
      Files.write(path, video.videoBytes().get());
      System.out.println("Generated video saved to async_example.mp4");
    }
  }
}

REST

# Note: This script uses jq to parse the JSON response.
# GEMINI API Base URL
BASE_URL="https://generativelanguage.googleapis.com/v1beta"

# Send request to generate video and capture the operation name into a variable.
operation_name=$(curl -s "${BASE_URL}/models/veo-3.1-generate-preview:predictLongRunning" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -X "POST" \
  -d '{
    "instances": [{
        "prompt": "A cinematic shot of a majestic lion in the savannah."
      }
    ]
  }' | jq -r .name)

# This loop checks the job status every 10 seconds.
while true; do
  # Get the full JSON status and store it in a variable.
  status_response=$(curl -s -H "x-goog-api-key: $GEMINI_API_KEY" "${BASE_URL}/${operation_name}")

  # Check the "done" field from the JSON stored in the variable.
  is_done=$(echo "${status_response}" | jq .done)

  if [ "${is_done}" = "true" ]; then
    # Once done, the result is in status_response.
    # ... process and download your video ...
    echo "Video generation complete."
    break
  fi
  # Wait for 10 seconds before checking again.
  echo "Waiting for video generation to complete..."
  sleep 10
done

Veo API 参数和规范

您可以在 API 请求中设置以下参数来控制视频生成过程。

参数	说明	Veo 3.1 和 Veo 3.1 Fast	Veo 3 和 Veo 3 Fast	Veo 2
实例
`prompt`	视频的文字说明。支持音频提示。	`string`	`string`	`string`
`image`	要添加动画效果的初始图片。	`Image` 对象	`Image` 对象	`Image` 对象
`lastFrame`	插值视频要过渡到的最终图片。必须与 `image` 参数搭配使用。	`Image` 对象	`Image` 对象	`Image` 对象
`referenceImages`	最多三张图片，用作风格和内容参考。	`VideoGenerationReferenceImage` 个对象（仅限 Veo 3.1）	无	无
`video`	用于视频广告附加信息的视频。	上一代产品的 `Video` 对象	无	无
参数
`aspectRatio`	视频的宽高比。	`"16:9"`（默认值）、 `"9:16"`	`"16:9"`（默认值）、 `"9:16"`	`"16:9"`（默认值）、 `"9:16"`
`durationSeconds`	生成的视频的时长。	`"4"`，`"6"`，`"8"`。使用扩展、参考图片或 1080p 和 4k 分辨率时，必须为“8”	`"4"`，`"6"`，`"8"`。使用扩展、参考图片或 1080p 和 4k 分辨率时，必须为“8”	`"5"`、`"6"`、`"8"`
`personGeneration`	控制人物的生成。（有关地区限制，请参阅限制）	文生视频和扩展功能：仅限 `"allow_all"` 图生视频、插帧和参考图片：仅限 `"allow_adult"`	文生视频：仅限 `"allow_all"` 图生视频：仅限 `"allow_adult"`	文生视频： `"allow_all"`、`"allow_adult"`、`"dont_allow"` 图生视频： `"allow_adult"` 和 `"dont_allow"`
`resolution`	视频的分辨率。	`"720p"`（默认）、 `"1080p"`（仅支持 8 秒时长）、 `"4k"`（仅支持 8 秒时长） `"720p"` 仅适用于扩展服务	`"720p"`（默认）、 `"1080p"`（仅支持 8 秒时长）、 `"4k"`（仅支持 8 秒时长） `"720p"` 仅适用于扩展服务	不支持

请注意，seed 参数也适用于 Veo 3 模型。它不能保证确定性，但可以略微提高确定性。

Veo 提示指南

本部分包含一些示例视频，展示了如何使用 Veo 创建视频，以及如何修改提示以生成不同的结果。

安全过滤器

Veo 会在 Gemini 中应用安全过滤条件，以帮助确保生成的视频和上传的照片不包含冒犯性内容。违反我们条款和准则的提示会被屏蔽。

提示撰写的基础知识

良好的提示应具有描述性且清晰明了。如要充分利用 Veo，请先确定核心创意，然后通过添加关键字和修饰符来完善创意，并在提示中加入视频专用术语。

您的提示中应包含以下元素：

正文：您希望在视频中呈现的对象、人物、动物或场景，例如城市景观、自然、车辆或小狗。
动作：正文正在做的事情（例如，走路、跑步或转头）。
风格：使用特定的电影风格关键字（例如科幻、恐怖片、黑色电影）或动画风格关键字（例如卡通）指定创意方向。
相机位置和运动：[可选] 使用航拍视图、平视、俯拍、轨道拍摄或仰拍等术语控制相机的位置和运动。
构图：[可选] 拍摄镜头的构图方式，例如广角镜头、特写镜头、单人镜头或双人镜头。
对焦和镜头效果：[可选] 使用浅景深、深景深、柔焦、微距镜头和广角镜头等术语来实现特定的视觉效果。
氛围：[可选] 颜色和光线对场景的贡献，例如蓝色调、夜间或暖色调。

有关编写提示的更多技巧

使用描述性语言：使用形容词和副词，为 Veo 描绘清晰的画面。
增强面部细节：指定面部细节作为照片的焦点，例如在提示中使用“portrait”一词。

如需了解更全面的提示策略，请参阅提示设计简介。

提示音频

借助 Veo 3，您可以为音效、环境噪音和对话提供提示。该模型会捕捉这些提示的细微差别，以生成同步的音轨。

对话：使用引号表示具体对话。（例如：“这一定是钥匙，”他低声说道。）
音效 (SFX)：明确描述声音。（示例：轮胎发出刺耳的尖叫声，发动机发出轰鸣声。）
环境噪声：描述环境的声景。（示例：背景中回荡着微弱而诡异的嗡嗡声。）

这些视频展示了如何通过提供越来越详细的提示来让 Veo 3 生成音频。

提示	生成的输出
更多细节（对话和环境音）一个广角镜头，拍摄的是雾气缭绕的太平洋西北森林。两名疲惫的徒步者（一男一女）在蕨类植物丛中艰难前行，突然，男士停下脚步，盯着一棵树。特写：树皮上留有新鲜而深的爪痕。男士：（手放在猎刀上）“那不是普通的熊。”女声：（声音因恐惧而紧绷，目光扫视着树林）“那是什么？”粗糙的树皮、折断的树枝、潮湿泥土上的脚步声。一只孤零零的鸟儿鸣叫着。
细节较少（对话）剪纸动画。新图书管理员：“禁书放在哪里？”老馆长：“我们没有。他们会留着我们。”

提示

生成的输出

更多细节（对话和环境音）
一个广角镜头，拍摄的是雾气缭绕的太平洋西北森林。两名疲惫的徒步者（一男一女）在蕨类植物丛中艰难前行，突然，男士停下脚步，盯着一棵树。特写：树皮上留有新鲜而深的爪痕。男士：（手放在猎刀上）“那不是普通的熊。”女声：（声音因恐惧而紧绷，目光扫视着树林）“那是什么？”粗糙的树皮、折断的树枝、潮湿泥土上的脚步声。一只孤零零的鸟儿鸣叫着。

细节较少（对话）
剪纸动画。新图书管理员：“禁书放在哪里？”老馆长：“我们没有。他们会留着我们。”

不妨亲自尝试一下这些提示，听听音频！试用 Veo 3

使用参考图片进行提示

您可以利用 Veo 的图片转视频功能，使用一张或多张图片作为输入来引导生成的视频。Veo 会使用输入图片作为初始帧。选择一张与您设想的视频首个场景最接近的图片，为日常物品添加动画效果，让绘画作品更加生动，也可以为自然景观增添动感和声音。

提示	生成的输出
输入图片（由 Nano Banana 生成）一张超写实的微距照片，照片中，迷你冲浪者在古朴的石制浴室水槽内乘风破浪。一个老式黄铜水龙头正在流水，营造出永恒的冲浪声。超现实、奇幻、明亮的自然光线。
输出视频（由 Veo 3.1 生成）一段超现实的电影级微距视频。微型冲浪者在石制浴室水槽内乘着永恒的滚滚波浪。一个正在运行的复古黄铜水龙头制造出无尽的冲浪声。镜头缓慢平移，掠过阳光明媚的奇幻场景，微缩人物在碧绿的水面上熟练地雕刻。

借助 Veo 3.1，您可以参考图片或素材来指导生成的视频内容。提供最多三张单个人物、角色或产品的素材资源图片。Veo 会在输出视频中保留主题的外观。

提示	生成的输出
参考图片（由 Nano Banana 生成）一条深海安康鱼潜伏在深暗的水中，露出牙齿，鱼饵发光。
参考图片（由 Nano Banana 生成）一套粉色儿童公主服装，配有魔杖和皇冠，背景为纯色商品背景。
输出视频（由 Veo 3.1 生成）制作一个搞笑的卡通版鱼，让它穿着服装、游泳并挥舞魔杖。

借助 Veo 3.1，您还可以通过指定视频的第一帧和最后一帧来生成视频。

提示	生成的输出
第一张图片（由 Nano Banana 生成）：一只姜黄色猫咪驾驶一辆红色敞篷赛车行驶在法国里维埃拉海岸，这是一张逼真的高画质正面图片。
最后一张图片（由 Nano Banana 生成）显示汽车从悬崖上起飞时会发生什么情况。
输出视频（由 Veo 3.1 生成）可选

借助此功能，您可以定义开始帧和结束帧，从而精确控制镜头的构图。上传图片或使用之前生成的视频中的帧，确保场景的开头和结尾完全符合您的设想。

提示扩展

如需使用 Veo 3.1 延长 Veo 生成的视频，请将该视频用作输入内容，并可选择性地添加文本提示。“延长”功能会完成视频的最后一秒或 24 帧，并继续拍摄动作。

请注意，如果视频的最后 1 秒内没有声音，则无法有效地延长声音。

提示	生成的输出
输入视频（由 Veo 3.1 生成）滑翔伞从山顶起飞，开始向山下俯瞰鲜花覆盖的山谷滑翔。
输出视频（由 Veo 3.1 生成）延长此视频，让滑翔伞缓缓下降。

提示和输出示例

本部分提供了多个提示，重点介绍了描述性细节如何提升每个视频的效果。

冰柱

本视频演示了如何在提示中使用提示撰写基础知识中的元素。

提示	生成的输出
特写镜头（构图）：冰冻的岩壁（背景）上融化的冰柱（正文），冷蓝色调（氛围），放大（相机运动），保持水滴（动作）的特写细节。

一位男士正在打电话

这些视频展示了如何通过添加越来越具体的细节来修改提示，让 Veo 按照您的喜好优化输出内容。

提示	生成的输出
细节较少镜头从远处推近，展现一位身着绿色风衣、神情绝望的男人。他正在用一部绿色霓虹灯照亮的转盘式壁挂电话拨号。看起来像电影场景。
更多细节一个电影特写镜头跟随着一位身着破旧绿色风衣、神情绝望的男人，他正在拨打安装在粗糙砖墙上的转盘式电话，周围笼罩着绿色霓虹灯的诡异光芒。镜头缓缓推进，显示出他下巴的紧张感，以及他努力拨打电话时脸上刻着的绝望。浅景深效果将焦点对准了他紧锁的眉头和黑色转盘电话，模糊的背景则呈现出霓虹色彩和模糊的阴影，营造出一种紧迫感和孤立感。

提示

生成的输出

细节较少
镜头从远处推近，展现一位身着绿色风衣、神情绝望的男人。他正在用一部绿色霓虹灯照亮的转盘式壁挂电话拨号。看起来像电影场景。

更多细节
一个电影特写镜头跟随着一位身着破旧绿色风衣、神情绝望的男人，他正在拨打安装在粗糙砖墙上的转盘式电话，周围笼罩着绿色霓虹灯的诡异光芒。镜头缓缓推进，显示出他下巴的紧张感，以及他努力拨打电话时脸上刻着的绝望。浅景深效果将焦点对准了他紧锁的眉头和黑色转盘电话，模糊的背景则呈现出霓虹色彩和模糊的阴影，营造出一种紧迫感和孤立感。

雪豹

提示	生成的输出
简单提示：一只毛发像雪豹一样可爱的生物在冬季森林中行走，3D 卡通风格渲染。
详细提示：创作一个简短的 3D 动画场景，采用欢快的卡通风格。一只可爱的生物，有着雪豹般的皮毛、富有表现力的大眼睛和圆润友好的身形，在奇幻的冬季森林中欢快地跳跃。场景应包含圆润的雪树、缓缓飘落的雪花，以及透过树枝的温暖阳光。生物的弹跳动作和灿烂笑容应传达出纯粹的喜悦。采用欢快温馨的基调，搭配明亮欢快的色彩和活泼的动画。

提示

生成的输出

简单提示：
一只毛发像雪豹一样可爱的生物在冬季森林中行走，3D 卡通风格渲染。

详细提示：
创作一个简短的 3D 动画场景，采用欢快的卡通风格。一只可爱的生物，有着雪豹般的皮毛、富有表现力的大眼睛和圆润友好的身形，在奇幻的冬季森林中欢快地跳跃。场景应包含圆润的雪树、缓缓飘落的雪花，以及透过树枝的温暖阳光。生物的弹跳动作和灿烂笑容应传达出纯粹的喜悦。采用欢快温馨的基调，搭配明亮欢快的色彩和活泼的动画。

按写作要素划分的示例

以下示例展示了如何根据每个基本元素优化提示。

主题和背景

指定主要焦点（正文）和背景或环境（上下文）。

提示	生成的输出
一栋白色混凝土公寓楼的建筑效果图，具有流畅的有机形状，与茂盛的绿色植物和未来派元素无缝融合
一颗卫星在太空中漂浮，背景是月球和一些星星。

操作

指定正文正在做的事情（例如，走路、跑步或转头）。

提示	生成的输出
广角镜头：一位女士在海滩上行走，在日落时分面朝地平线，看起来很满足和放松。

样式

添加关键字，引导生成器朝着特定美学风格（例如超现实主义、复古、未来主义、黑色电影）生成图片。

提示	生成的输出
黑色电影风格，一男一女走在街上，神秘、电影感、黑白。

相机运动和构图

指定镜头的移动方式（第一人称视角、航拍视图、跟踪无人机视角）以及镜头的取景方式（广角镜头、特写镜头、低角度）。

提示	生成的输出
一个主视角镜头，拍摄的是一辆复古汽车在雨中行驶，加拿大夜景，电影风格。
眼睛的极近特写，眼睛中映出城市。

氛围

调色板和光线会影响情绪。您可以尝试使用“柔和的橙色暖色调”“自然光线”“日出”或“冷色调蓝色”等字词。

提示	生成的输出
在阳光明媚的公园里，一个女孩抱着可爱的金毛猎犬小狗的特写镜头。
电影般的特写镜头：一位悲伤的女性在雨中乘坐公交车，画面采用冷色调蓝色，营造出悲伤的氛围。

否定提示

排除提示用于指定您不希望视频中包含的元素。

❌ 请勿使用“不”或“不要”等指令性语言。（例如 “无墙”）。
✅ 描述您不想看到的内容。（例如 “墙、框架”）。

提示	生成的输出
不使用负提示：生成一段简短的风格化动画，内容是一棵巨大的孤零零的橡树，树叶在强风中剧烈摇摆... [截断]
使用负面提示： [相同提示] 负面提示：城市背景、人造结构、黑暗、暴风雨或威胁性氛围。

宽高比

借助 Veo，您可以指定视频的宽高比。

提示	生成的输出
宽屏 (16:9) 制作一段视频，内容为：一架无人机跟拍一名男子驾驶一辆红色敞篷车在 20 世纪 70 年代的棕榈泉行驶，阳光温暖，阴影拉长。
竖屏 (9:16) 制作一段视频，突出展示茂密热带雨林中壮丽的夏威夷瀑布的流畅动态。重点呈现逼真的水流、细致的树叶和自然光线，营造宁静的氛围。捕捉湍急的水流、雾气弥漫的氛围以及透过茂密树冠的斑驳阳光。使用流畅的电影级镜头移动来展示瀑布及其周围环境。力求营造宁静而真实的氛围，让观看者仿佛置身于夏威夷雨林的宁静美景之中。

提示

生成的输出

宽屏 (16:9)
制作一段视频，内容为：一架无人机跟拍一名男子驾驶一辆红色敞篷车在 20 世纪 70 年代的棕榈泉行驶，阳光温暖，阴影拉长。

竖屏 (9:16)
制作一段视频，突出展示茂密热带雨林中壮丽的夏威夷瀑布的流畅动态。重点呈现逼真的水流、细致的树叶和自然光线，营造宁静的氛围。捕捉湍急的水流、雾气弥漫的氛围以及透过茂密树冠的斑驳阳光。使用流畅的电影级镜头移动来展示瀑布及其周围环境。力求营造宁静而真实的氛围，让观看者仿佛置身于夏威夷雨林的宁静美景之中。

限制

请求延迟时间：最短：11 秒；最长：6 分钟（高峰时段）。
地区限制：在欧盟、英国、瑞士、中东和北非地区，personGeneration 的允许值为：
- Veo 3：仅限 allow_adult。
- Veo 2：dont_allow 和 allow_adult。默认值为 dont_allow。
视频保留期限：生成的视频会在服务器上存储 2 天，之后会被移除。如需保存本地副本，您必须在视频生成后的 2 天内下载。加长版视频会被视为新生成的视频。
添加水印：Veo 创建的视频会使用 SynthID（我们的 AI 生成内容水印添加和识别工具）添加水印。您可以使用 SynthID 验证平台来验证视频。
安全性：生成的视频会通过安全过滤和记忆检查流程，以帮助降低隐私、版权和偏见风险。
音频错误：由于安全过滤条件或音频的其他处理问题，Veo 3.1 有时会阻止视频生成。如果您的视频被阻止生成，我们不会向您收取费用。

模型功能

功能	说明	Veo 3.1 和 Veo 3.1 Fast	Veo 3 和 Veo 3 Fast	Veo 2
音频	原生生成包含音频的视频。	原生生成包含音频的视频。	✔️ 始终开启	❌ 仅限静音
输入模态	用于生成的输入类型。	文生视频、图生视频、视频生视频	文生视频、图生视频	文生视频、图生视频
解决方法	视频的输出分辨率。	720p、1080p（仅限 8 秒时长）、4k（仅限 8 秒时长）使用视频扩展广告时，仅支持 720p。	720p 和 1080p（仅限 16:9）	720p
帧速率	视频的输出帧速率。	24 帧/秒	24 帧/秒	24 帧/秒
视频时长	生成的视频的时长。	8 秒、6 秒、4 秒仅在 1080p 或 4k 或使用参考图片时为 8 秒	8 秒	5-8 秒
每个请求的视频数量	每个请求生成的视频数量。	1	1	1 或 2
状态和详细信息	模型可用性和更多详细信息。	预览	稳定版	稳定版

模型版本

如需详细了解特定于 Veo 模型的用量，请参阅价格和速率限制页面。

借助 Veo Fast 版本，开发者可以创作有声视频，同时保持高画质并针对速度和业务用例进行优化。它们非常适合以编程方式生成广告的后端服务、用于快速对广告素材概念进行 A/B 测试的工具，或需要快速制作社交媒体内容的应用。

Veo 3.1 预览版

属性	说明
模型代码	Gemini API `veo-3.1-generate-preview`
支持的数据类型	输入文字、图片输出带音频的视频
限制	文本输入 1,024 个词元输出视频 1
最新更新	2026 年 1 月

Veo 3.1 Fast 预览版

属性	说明
模型代码	Gemini API `veo-3.1-fast-generate-preview`
支持的数据类型	输入文字、图片输出带音频的视频
限制	文本输入 1,024 个词元输出视频 1
最新更新	2026 年 1 月

Veo 2

属性	说明
模型代码	Gemini API `veo-2.0-generate-001`
支持的数据类型	输入文字、图片输出视频
限制	文本输入不适用图片输入任意分辨率和宽高比，文件大小不超过 20MB 输出视频最多 2 个
最新更新	2025 年 4 月

后续步骤

通过在 Veo 快速入门 Colab 和 Veo 3.1 applet 中进行实验，开始使用 Veo 3.1 API。
如需了解如何撰写更好的提示，请参阅我们的提示设计简介。