Interactions API 现已正式发布。我们建议使用此 API 来访问所有最新功能和模型。

Google uses AI technology to translate content into your preferred language. AI translations can contain errors.

Generating content

Gemini API 支持生成包含图片、音频、代码、工具等的内容。如需详细了解这些功能，请继续阅读并查看以任务为中心的示例代码，或阅读全面的指南。

方法：models.generateContent

根据输入 GenerateContentRequest 生成模型回答。如需了解详细的使用信息，请参阅文本生成指南。输入功能因模型而异，包括经过调优的模型。如需了解详情，请参阅模型指南和调优指南。

端点

post https://generativelanguage.googleapis.com/v1beta/{model=models/*}:generateContent

路径参数

model string

必需。用于生成补全的 Model 的名称。

格式：models/{model}。格式为 models/{model}。

请求正文

请求正文中包含结构如下的数据：

字段

contents[] object (Content)

必需。与模型当前对话的内容。

对于单轮查询，这是单个实例。对于多轮查询（例如聊天），这是包含对话历史记录和最新请求的重复字段。

tools[] object (Tool)

可选。Model 可能用于生成下一个回答的 Tools 列表。

Tool 是一段代码，可让系统与外部系统进行交互，以在 Model 的知识和范围之外执行操作或一组操作。支持的 Tool 为 Function 和 codeExecution。如需了解详情，请参阅函数调用和代码执行指南。

toolConfig object (ToolConfig)

可选。请求中指定的任何 Tool 的工具配置。如需查看使用示例，请参阅函数调用指南。

safetySettings[] object (SafetySetting)

可选。用于屏蔽不安全内容的唯一 SafetySetting 实例的列表。

此限制将在 GenerateContentRequest.contents 和 GenerateContentResponse.candidates 上强制执行。每种 SafetyCategory 类型不应有多个设置。API 会屏蔽任何不符合这些设置所设阈值的内容和回答。此列表会替换 safetySettings 中指定的每个 SafetyCategory 的默认设置。如果列表中未提供指定 SafetyCategory 的 SafetySetting，API 将使用相应类别的默认安全设置。支持的危害类别包括 HARM_CATEGORY_HATE_SPEECH、HARM_CATEGORY_SEXUALLY_EXPLICIT、HARM_CATEGORY_DANGEROUS_CONTENT、HARM_CATEGORY_HARASSMENT、HARM_CATEGORY_CIVIC_INTEGRITY。如需详细了解可用的安全设置，请参阅指南。另请参阅安全指南，了解如何在 AI 应用中纳入安全考虑因素。

systemInstruction object (Content)

可选。开发者设置了系统指令。目前仅支持文本。

generationConfig object (GenerationConfig)

可选。模型生成和输出的配置选项。

cachedContent string

可选。用作提供预测的上下文的缓存内容的名称。格式：cachedContents/{cachedContent}

serviceTier enum (ServiceTier)

可选。请求的服务层级。

store boolean

可选。为指定请求配置日志记录行为。如果设置了此配置，则其优先级高于项目级日志记录配置。

示例请求

文字

Python

from google import genai

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.5-flash", contents="Write a story about a magic backpack."
)
print(response.text)text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: "Write a story about a magic backpack.",
});
console.log(response.text);text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
contents := []*genai.Content{
	genai.NewContentFromText("Write a story about a magic backpack.", genai.RoleUser),
}
response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)text_generation.go

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[{"text": "Write a story about a magic backpack."}]
        }]
       }' 2> /dev/nulltext_generation.sh

Java

Client client = new Client();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                "Write a story about a magic backpack.",
                null);

System.out.println(response.text());TextGeneration.java

图片

Python

from google import genai
import PIL.Image

client = genai.Client()
organ = PIL.Image.open(media / "organ.jpg")
response = client.models.generate_content(
    model="gemini-3.5-flash", contents=["Tell me about this instrument", organ]
)
print(response.text)text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const organ = await ai.files.upload({
  file: path.join(media, "organ.jpg"),
});

const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: [
    createUserContent([
      "Tell me about this instrument", 
      createPartFromUri(organ.uri, organ.mimeType)
    ]),
  ],
});
console.log(response.text);text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "organ.jpg"), 
	&genai.UploadFileConfig{
		MIMEType : "image/jpeg",
	},
)
if err != nil {
	log.Fatal(err)
}
parts := []*genai.Part{
	genai.NewPartFromText("Tell me about this instrument"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}
contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)text_generation.go

Shell

# Use a temporary file to hold the base64 encoded image data
TEMP_B64=$(mktemp)
trap 'rm -f "$TEMP_B64"' EXIT
base64 $B64FLAGS $IMG_PATH > "$TEMP_B64"

# Use a temporary file to hold the JSON payload
TEMP_JSON=$(mktemp)
trap 'rm -f "$TEMP_JSON"' EXIT

cat > "$TEMP_JSON" << EOF
{
  "contents": [{
    "parts":[
      {"text": "Tell me about this instrument"},
      {
        "inline_data": {
          "mime_type":"image/jpeg",
          "data": "$(cat "$TEMP_B64")"
        }
      }
    ]
  }]
}
EOF

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d "@$TEMP_JSON" 2> /dev/nulltext_generation.sh

Java

Client client = new Client();

String path = media_path + "organ.jpg";
byte[] imageData = Files.readAllBytes(Paths.get(path));

Content content =
        Content.fromParts(
                Part.fromText("Tell me about this instrument."),
                Part.fromBytes(imageData, "image/jpeg"));

GenerateContentResponse response = client.models.generateContent("gemini-3.5-flash", content, null);

System.out.println(response.text());TextGeneration.java

音频

Python

from google import genai

client = genai.Client()
sample_audio = client.files.upload(file=media / "sample.mp3")
response = client.models.generate_content(
    model="gemini-3.5-flash",
    contents=["Give me a summary of this audio file.", sample_audio],
)
print(response.text)text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const audio = await ai.files.upload({
  file: path.join(media, "sample.mp3"),
});

const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: [
    createUserContent([
      "Give me a summary of this audio file.",
      createPartFromUri(audio.uri, audio.mimeType),
    ]),
  ],
});
console.log(response.text);text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "sample.mp3"), 
	&genai.UploadFileConfig{
		MIMEType : "audio/mpeg",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this audio file."),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)text_generation.go

Shell

# Use File API to upload audio data to API request.
MIME_TYPE=$(file -b --mime-type "${AUDIO_PATH}")
NUM_BYTES=$(wc -c < "${AUDIO_PATH}")
DISPLAY_NAME=AUDIO

tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${AUDIO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Please describe this file."},
          {"file_data":{"mime_type": "audio/mpeg", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

jq ".candidates[].content.parts[].text" response.jsontext_generation.sh

视频

Python

from google import genai
import time

client = genai.Client()
# Video clip (CC BY 3.0) from https://peach.blender.org/download/
myfile = client.files.upload(file=media / "Big_Buck_Bunny.mp4")
print(f"{myfile=}")

# Poll until the video file is completely processed (state becomes ACTIVE).
while not myfile.state or myfile.state.name != "ACTIVE":
    print("Processing video...")
    print("File state:", myfile.state)
    time.sleep(5)
    myfile = client.files.get(name=myfile.name)

response = client.models.generate_content(
    model="gemini-3.5-flash", contents=[myfile, "Describe this video clip"]
)
print(f"{response.text=}")text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

let video = await ai.files.upload({
  file: path.join(media, 'Big_Buck_Bunny.mp4'),
});

// Poll until the video file is completely processed (state becomes ACTIVE).
while (!video.state || video.state.toString() !== 'ACTIVE') {
  console.log('Processing video...');
  console.log('File state: ', video.state);
  await sleep(5000);
  video = await ai.files.get({name: video.name});
}

const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: [
    createUserContent([
      "Describe this video clip",
      createPartFromUri(video.uri, video.mimeType),
    ]),
  ],
});
console.log(response.text);text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "Big_Buck_Bunny.mp4"), 
	&genai.UploadFileConfig{
		MIMEType : "video/mp4",
	},
)
if err != nil {
	log.Fatal(err)
}

// Poll until the video file is completely processed (state becomes ACTIVE).
for file.State == genai.FileStateUnspecified || file.State != genai.FileStateActive {
	fmt.Println("Processing video...")
	fmt.Println("File state:", file.State)
	time.Sleep(5 * time.Second)

	file, err = client.Files.Get(ctx, file.Name, nil)
	if err != nil {
		log.Fatal(err)
	}
}

parts := []*genai.Part{
	genai.NewPartFromText("Describe this video clip"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)text_generation.go

Shell

# Use File API to upload audio data to API request.
MIME_TYPE=$(file -b --mime-type "${VIDEO_PATH}")
NUM_BYTES=$(wc -c < "${VIDEO_PATH}")
DISPLAY_NAME=VIDEO

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D "${tmp_header_file}" \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${VIDEO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

state=$(jq ".file.state" file_info.json)
echo state=$state

name=$(jq ".file.name" file_info.json)
echo name=$name

while [[ "($state)" = *"PROCESSING"* ]];
do
  echo "Processing video..."
  sleep 5
  # Get the file of interest to check state
  curl https://generativelanguage.googleapis.com/v1beta/files/$name > file_info.json
  state=$(jq ".file.state" file_info.json)
done

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Transcribe the audio from this video, giving timestamps for salient events in the video. Also provide visual descriptions."},
          {"file_data":{"mime_type": "video/mp4", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

jq ".candidates[].content.parts[].text" response.jsontext_generation.sh

PDF

Python

from google import genai

client = genai.Client()
sample_pdf = client.files.upload(file=media / "test.pdf")
response = client.models.generate_content(
    model="gemini-3.5-flash",
    contents=["Give me a summary of this document:", sample_pdf],
)
print(f"{response.text=}")text_generation.py

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "test.pdf"), 
	&genai.UploadFileConfig{
		MIMEType : "application/pdf",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this document:"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)text_generation.go

Shell

MIME_TYPE=$(file -b --mime-type "${PDF_PATH}")
NUM_BYTES=$(wc -c < "${PDF_PATH}")
DISPLAY_NAME=TEXT


echo $MIME_TYPE
tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${PDF_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

# Now generate content using that file
curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Can you add a few more lines to this poem?"},
          {"file_data":{"mime_type": "application/pdf", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

jq ".candidates[].content.parts[].text" response.jsontext_generation.sh

聊天

Python

from google import genai
from google.genai import types

client = genai.Client()
# Pass initial history using the "history" argument
chat = client.chats.create(
    model="gemini-3.5-flash",
    history=[
        types.Content(role="user", parts=[types.Part(text="Hello")]),
        types.Content(
            role="model",
            parts=[
                types.Part(
                    text="Great to meet you. What would you like to know?"
                )
            ],
        ),
    ],
)
response = chat.send_message(message="I have 2 dogs in my house.")
print(response.text)
response = chat.send_message(message="How many paws are in my house?")
print(response.text)chat.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const chat = ai.chats.create({
  model: "gemini-3.5-flash",
  history: [
    {
      role: "user",
      parts: [{ text: "Hello" }],
    },
    {
      role: "model",
      parts: [{ text: "Great to meet you. What would you like to know?" }],
    },
  ],
});

const response1 = await chat.sendMessage({
  message: "I have 2 dogs in my house.",
});
console.log("Chat response 1:", response1.text);

const response2 = await chat.sendMessage({
  message: "How many paws are in my house?",
});
console.log("Chat response 2:", response2.text);chat.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

// Pass initial history using the History field.
history := []*genai.Content{
	genai.NewContentFromText("Hello", genai.RoleUser),
	genai.NewContentFromText("Great to meet you. What would you like to know?", genai.RoleModel),
}

chat, err := client.Chats.Create(ctx, "gemini-3.5-flash", nil, history)
if err != nil {
	log.Fatal(err)
}

firstResp, err := chat.SendMessage(ctx, genai.Part{Text: "I have 2 dogs in my house."})
if err != nil {
	log.Fatal(err)
}
fmt.Println(firstResp.Text())

secondResp, err := chat.SendMessage(ctx, genai.Part{Text: "How many paws are in my house?"})
if err != nil {
	log.Fatal(err)
}
fmt.Println(secondResp.Text())chat.go

Shell

curl https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [
        {"role":"user",
         "parts":[{
           "text": "Hello"}]},
        {"role": "model",
         "parts":[{
           "text": "Great to meet you. What would you like to know?"}]},
        {"role":"user",
         "parts":[{
           "text": "I have two dogs in my house. How many paws are in my house?"}]},
      ]
    }' 2> /dev/null | grep "text"chat.sh

Java

Client client = new Client();

Content userContent = Content.fromParts(Part.fromText("Hello"));
Content modelContent =
        Content.builder()
                .role("model")
                .parts(
                        Collections.singletonList(
                                Part.fromText("Great to meet you. What would you like to know?")
                        )
                ).build();

Chat chat = client.chats.create(
        "gemini-3.5-flash",
        GenerateContentConfig.builder()
                .systemInstruction(userContent)
                .systemInstruction(modelContent)
                .build()
);

GenerateContentResponse response1 = chat.sendMessage("I have 2 dogs in my house.");
System.out.println(response1.text());

GenerateContentResponse response2 = chat.sendMessage("How many paws are in my house?");
System.out.println(response2.text());
ChatSession.java

缓存

Python

from google import genai
from google.genai import types

client = genai.Client()
document = client.files.upload(file=media / "a11.txt")
model_name = "gemini-3.5-flash"

cache = client.caches.create(
    model=model_name,
    config=types.CreateCachedContentConfig(
        contents=[document],
        system_instruction="You are an expert analyzing transcripts.",
    ),
)
print(cache)

response = client.models.generate_content(
    model=model_name,
    contents="Please summarize this transcript",
    config=types.GenerateContentConfig(cached_content=cache.name),
)
print(response.text)cache.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const filePath = path.join(media, "a11.txt");
const document = await ai.files.upload({
  file: filePath,
  config: { mimeType: "text/plain" },
});
console.log("Uploaded file name:", document.name);
const modelName = "gemini-3.5-flash";

const contents = [
  createUserContent(createPartFromUri(document.uri, document.mimeType)),
];

const cache = await ai.caches.create({
  model: modelName,
  config: {
    contents: contents,
    systemInstruction: "You are an expert analyzing transcripts.",
  },
});
console.log("Cache created:", cache);

const response = await ai.models.generateContent({
  model: modelName,
  contents: "Please summarize this transcript",
  config: { cachedContent: cache.name },
});
console.log("Response text:", response.text);cache.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"), 
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

modelName := "gemini-3.5-flash"
document, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "a11.txt"), 
	&genai.UploadFileConfig{
		MIMEType : "text/plain",
	},
)
if err != nil {
	log.Fatal(err)
}
parts := []*genai.Part{
	genai.NewPartFromURI(document.URI, document.MIMEType),
}
contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}
cache, err := client.Caches.Create(ctx, modelName, &genai.CreateCachedContentConfig{
	Contents: contents,
	SystemInstruction: genai.NewContentFromText(
		"You are an expert analyzing transcripts.", genai.RoleUser,
	),
})
if err != nil {
	log.Fatal(err)
}
fmt.Println("Cache created:")
fmt.Println(cache)

// Use the cache for generating content.
response, err := client.Models.GenerateContent(
	ctx,
	modelName,
	genai.Text("Please summarize this transcript"),
	&genai.GenerateContentConfig{
		CachedContent: cache.Name,
	},
)
if err != nil {
	log.Fatal(err)
}
printResponse(response)cache.go

经调整的模型

Python

# With Gemini 2 we're launching a new SDK. See the following doc for details.
# https://ai.google.dev/gemini-api/docs/migrateREADME.md

JSON 模式

Python

from google import genai
from google.genai import types
from typing_extensions import TypedDict

class Recipe(TypedDict):
    recipe_name: str
    ingredients: list[str]

client = genai.Client()
result = client.models.generate_content(
    model="gemini-3.5-flash",
    contents="List a few popular cookie recipes.",
    config=types.GenerateContentConfig(
        response_mime_type="application/json", response_schema=list[Recipe]
    ),
)
print(result)controlled_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: "List a few popular cookie recipes.",
  config: {
    responseMimeType: "application/json",
    responseSchema: {
      type: "array",
      items: {
        type: "object",
        properties: {
          recipeName: { type: "string" },
          ingredients: { type: "array", items: { type: "string" } },
        },
        required: ["recipeName", "ingredients"],
      },
    },
  },
});
console.log(response.text);controlled_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"), 
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

schema := &genai.Schema{
	Type: genai.TypeArray,
	Items: &genai.Schema{
		Type: genai.TypeObject,
		Properties: map[string]*genai.Schema{
			"recipe_name": {Type: genai.TypeString},
			"ingredients": {
				Type:  genai.TypeArray,
				Items: &genai.Schema{Type: genai.TypeString},
			},
		},
		Required: []string{"recipe_name"},
	},
}

config := &genai.GenerateContentConfig{
	ResponseMIMEType: "application/json",
	ResponseSchema:   schema,
}

response, err := client.Models.GenerateContent(
	ctx,
	"gemini-3.5-flash",
	genai.Text("List a few popular cookie recipes."),
	config,
)
if err != nil {
	log.Fatal(err)
}
printResponse(response)controlled_generation.go

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
-H 'Content-Type: application/json' \
-d '{
    "contents": [{
      "parts":[
        {"text": "List 5 popular cookie recipes"}
        ]
    }],
    "generationConfig": {
        "response_mime_type": "application/json",
        "response_schema": {
          "type": "ARRAY",
          "items": {
            "type": "OBJECT",
            "properties": {
              "recipe_name": {"type":"STRING"},
            }
          }
        }
    }
}' 2> /dev/null | headcontrolled_generation.sh

Java

Client client = new Client();

Schema recipeSchema = Schema.builder()
        .type(Array.class.getSimpleName())
        .items(Schema.builder()
                .type(Object.class.getSimpleName())
                .properties(
                        Map.of("recipe_name", Schema.builder()
                                        .type(String.class.getSimpleName())
                                        .build(),
                                "ingredients", Schema.builder()
                                        .type(Array.class.getSimpleName())
                                        .items(Schema.builder()
                                                .type(String.class.getSimpleName())
                                                .build())
                                        .build())
                )
                .required(List.of("recipe_name", "ingredients"))
                .build())
        .build();

GenerateContentConfig config =
        GenerateContentConfig.builder()
                .responseMimeType("application/json")
                .responseSchema(recipeSchema)
                .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                "List a few popular cookie recipes.",
                config);

System.out.println(response.text());ControlledGeneration.java

代码执行

Python

from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.5-flash",
    contents=(
        "Write and execute code that calculates the sum of the first 50 prime numbers. "
        "Ensure that only the executable code and its resulting output are generated."
    ),
)
# Each part may contain text, executable code, or an execution result.
for part in response.candidates[0].content.parts:
    print(part, "\n")

print("-" * 80)
# The .text accessor concatenates the parts into a markdown-formatted text.
print("\n", response.text)code_execution.py

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

response, err := client.Models.GenerateContent(
	ctx,
	"gemini-3.5-flash",
	genai.Text(
		`Write and execute code that calculates the sum of the first 50 prime numbers.
		 Ensure that only the executable code and its resulting output are generated.`,
	),
	&genai.GenerateContentConfig{},
)
if err != nil {
	log.Fatal(err)
}

// Print the response.
printResponse(response)

fmt.Println("--------------------------------------------------------------------------------")
fmt.Println(response.Text())code_execution.go

Java

Client client = new Client();

String prompt = """
        Write and execute code that calculates the sum of the first 50 prime numbers.
        Ensure that only the executable code and its resulting output are generated.
        """;

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                prompt,
                null);

for (Part part : response.candidates().get().getFirst().content().get().parts().get()) {
    System.out.println(part + "\n");
}

System.out.println("-".repeat(80));
System.out.println(response.text());CodeExecution.java

函数调用

Python

from google import genai
from google.genai import types

client = genai.Client()

def add(a: float, b: float) -> float:
    """returns a + b."""
    return a + b

def subtract(a: float, b: float) -> float:
    """returns a - b."""
    return a - b

def multiply(a: float, b: float) -> float:
    """returns a * b."""
    return a * b

def divide(a: float, b: float) -> float:
    """returns a / b."""
    return a / b

# Create a chat session; function calling (via tools) is enabled in the config.
chat = client.chats.create(
    model="gemini-3.5-flash",
    config=types.GenerateContentConfig(tools=[add, subtract, multiply, divide]),
)
response = chat.send_message(
    message="I have 57 cats, each owns 44 mittens, how many mittens is that in total?"
)
print(response.text)function_calling.py

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
modelName := "gemini-3.5-flash"

// Create the function declarations for arithmetic operations.
addDeclaration := createArithmeticToolDeclaration("addNumbers", "Return the result of adding two numbers.")
subtractDeclaration := createArithmeticToolDeclaration("subtractNumbers", "Return the result of subtracting the second number from the first.")
multiplyDeclaration := createArithmeticToolDeclaration("multiplyNumbers", "Return the product of two numbers.")
divideDeclaration := createArithmeticToolDeclaration("divideNumbers", "Return the quotient of dividing the first number by the second.")

// Group the function declarations as a tool.
tools := []*genai.Tool{
	{
		FunctionDeclarations: []*genai.FunctionDeclaration{
			addDeclaration,
			subtractDeclaration,
			multiplyDeclaration,
			divideDeclaration,
		},
	},
}

// Create the content prompt.
contents := []*genai.Content{
	genai.NewContentFromText(
		"I have 57 cats, each owns 44 mittens, how many mittens is that in total?", genai.RoleUser,
	),
}

// Set up the generate content configuration with function calling enabled.
config := &genai.GenerateContentConfig{
	Tools: tools,
	ToolConfig: &genai.ToolConfig{
		FunctionCallingConfig: &genai.FunctionCallingConfig{
			// The mode equivalent to FunctionCallingConfigMode.ANY in JS.
			Mode: genai.FunctionCallingConfigModeAny,
		},
	},
}

genContentResp, err := client.Models.GenerateContent(ctx, modelName, contents, config)
if err != nil {
	log.Fatal(err)
}

// Assume the response includes a list of function calls.
if len(genContentResp.FunctionCalls()) == 0 {
	log.Println("No function call returned from the AI.")
	return nil
}
functionCall := genContentResp.FunctionCalls()[0]
log.Printf("Function call: %+v\n", functionCall)

// Marshal the Args map into JSON bytes.
argsMap, err := json.Marshal(functionCall.Args)
if err != nil {
	log.Fatal(err)
}

// Unmarshal the JSON bytes into the ArithmeticArgs struct.
var args ArithmeticArgs
if err := json.Unmarshal(argsMap, &args); err != nil {
	log.Fatal(err)
}

// Map the function name to the actual arithmetic function.
var result float64
switch functionCall.Name {
	case "addNumbers":
		result = add(args.FirstParam, args.SecondParam)
	case "subtractNumbers":
		result = subtract(args.FirstParam, args.SecondParam)
	case "multiplyNumbers":
		result = multiply(args.FirstParam, args.SecondParam)
	case "divideNumbers":
		result = divide(args.FirstParam, args.SecondParam)
	default:
		return fmt.Errorf("unimplemented function: %s", functionCall.Name)
}
log.Printf("Function result: %v\n", result)

// Prepare the final result message as content.
resultContents := []*genai.Content{
	genai.NewContentFromText("The final result is " + fmt.Sprintf("%v", result), genai.RoleUser),
}

// Use GenerateContent to send the final result.
finalResponse, err := client.Models.GenerateContent(ctx, modelName, resultContents, &genai.GenerateContentConfig{})
if err != nil {
	log.Fatal(err)
}

printResponse(finalResponse)function_calling.go

Node.js

  // Make sure to include the following import:
  // import {GoogleGenAI} from '@google/genai';
  const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

  /**
   * The add function returns the sum of two numbers.
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function add(a, b) {
    return a + b;
  }

  /**
   * The subtract function returns the difference (a - b).
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function subtract(a, b) {
    return a - b;
  }

  /**
   * The multiply function returns the product of two numbers.
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function multiply(a, b) {
    return a * b;
  }

  /**
   * The divide function returns the quotient of a divided by b.
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function divide(a, b) {
    return a / b;
  }

  const addDeclaration = {
    name: "addNumbers",
    parameters: {
      type: "object",
      description: "Return the result of adding two numbers.",
      properties: {
        firstParam: {
          type: "number",
          description:
            "The first parameter which can be an integer or a floating point number.",
        },
        secondParam: {
          type: "number",
          description:
            "The second parameter which can be an integer or a floating point number.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  const subtractDeclaration = {
    name: "subtractNumbers",
    parameters: {
      type: "object",
      description:
        "Return the result of subtracting the second number from the first.",
      properties: {
        firstParam: {
          type: "number",
          description: "The first parameter.",
        },
        secondParam: {
          type: "number",
          description: "The second parameter.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  const multiplyDeclaration = {
    name: "multiplyNumbers",
    parameters: {
      type: "object",
      description: "Return the product of two numbers.",
      properties: {
        firstParam: {
          type: "number",
          description: "The first parameter.",
        },
        secondParam: {
          type: "number",
          description: "The second parameter.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  const divideDeclaration = {
    name: "divideNumbers",
    parameters: {
      type: "object",
      description:
        "Return the quotient of dividing the first number by the second.",
      properties: {
        firstParam: {
          type: "number",
          description: "The first parameter.",
        },
        secondParam: {
          type: "number",
          description: "The second parameter.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  // Step 1: Call generateContent with function calling enabled.
  const generateContentResponse = await ai.models.generateContent({
    model: "gemini-3.5-flash",
    contents:
      "I have 57 cats, each owns 44 mittens, how many mittens is that in total?",
    config: {
      toolConfig: {
        functionCallingConfig: {
          mode: FunctionCallingConfigMode.ANY,
        },
      },
      tools: [
        {
          functionDeclarations: [
            addDeclaration,
            subtractDeclaration,
            multiplyDeclaration,
            divideDeclaration,
          ],
        },
      ],
    },
  });

  // Step 2: Extract the function call.(
  // Assuming the response contains a 'functionCalls' array.
  const functionCall =
    generateContentResponse.functionCalls &&
    generateContentResponse.functionCalls[0];
  console.log(functionCall);

  // Parse the arguments.
  const args = functionCall.args;
  // Expected args format: { firstParam: number, secondParam: number }

  // Step 3: Invoke the actual function based on the function name.
  const functionMapping = {
    addNumbers: add,
    subtractNumbers: subtract,
    multiplyNumbers: multiply,
    divideNumbers: divide,
  };
  const func = functionMapping[functionCall.name];
  if (!func) {
    console.error("Unimplemented error:", functionCall.name);
    return generateContentResponse;
  }
  const resultValue = func(args.firstParam, args.secondParam);
  console.log("Function result:", resultValue);

  // Step 4: Use the chat API to send the result as the final answer.
  const chat = ai.chats.create({ model: "gemini-3.5-flash" });
  const chatResponse = await chat.sendMessage({
    message: "The final result is " + resultValue,
  });
  console.log(chatResponse.text);
  return chatResponse;
}
function_calling.js

Shell


cat > tools.json << EOF
{
  "function_declarations": [
    {
      "name": "enable_lights",
      "description": "Turn on the lighting system."
    },
    {
      "name": "set_light_color",
      "description": "Set the light color. Lights must be enabled for this to work.",
      "parameters": {
        "type": "object",
        "properties": {
          "rgb_hex": {
            "type": "string",
            "description": "The light color as a 6-digit hex string, e.g. ff0000 for red."
          }
        },
        "required": [
          "rgb_hex"
        ]
      }
    },
    {
      "name": "stop_lights",
      "description": "Turn off the lighting system."
    }
  ]
} 
EOF

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
  -H 'Content-Type: application/json' \
  -d @<(echo '
  {
    "system_instruction": {
      "parts": {
        "text": "You are a helpful lighting system bot. You can turn lights on and off, and you can set the color. Do not perform any other tasks."
      }
    },
    "tools": ['$(cat tools.json)'],

    "tool_config": {
      "function_calling_config": {"mode": "auto"}
    },

    "contents": {
      "role": "user",
      "parts": {
        "text": "Turn on the lights please."
      }
    }
  }
') 2>/dev/null |sed -n '/"content"/,/"finishReason"/p'function_calling.sh

Java

Client client = new Client();

FunctionDeclaration addFunction =
        FunctionDeclaration.builder()
                .name("addNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

FunctionDeclaration subtractFunction =
        FunctionDeclaration.builder()
                .name("subtractNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

FunctionDeclaration multiplyFunction =
        FunctionDeclaration.builder()
                .name("multiplyNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

FunctionDeclaration divideFunction =
        FunctionDeclaration.builder()
                .name("divideNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

GenerateContentConfig config = GenerateContentConfig.builder()
        .toolConfig(ToolConfig.builder().functionCallingConfig(
                FunctionCallingConfig.builder().mode("ANY").build()
        ).build())
        .tools(
                Collections.singletonList(
                        Tool.builder().functionDeclarations(
                                Arrays.asList(
                                        addFunction,
                                        subtractFunction,
                                        divideFunction,
                                        multiplyFunction
                                )
                        ).build()

                )
        )
        .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                "I have 57 cats, each owns 44 mittens, how many mittens is that in total?",
                config);


if (response.functionCalls() == null || response.functionCalls().isEmpty()) {
    System.err.println("No function call received");
    return null;
}

var functionCall = response.functionCalls().getFirst();
String functionName = functionCall.name().get();
var arguments = functionCall.args();

Map<String, BiFunction<Double, Double, Double>> functionMapping = new HashMap<>();
functionMapping.put("addNumbers", (a, b) -> a + b);
functionMapping.put("subtractNumbers", (a, b) -> a - b);
functionMapping.put("multiplyNumbers", (a, b) -> a * b);
functionMapping.put("divideNumbers", (a, b) -> b != 0 ? a / b : Double.NaN);

BiFunction<Double, Double, Double> function = functionMapping.get(functionName);

Number firstParam = (Number) arguments.get().get("firstParam");
Number secondParam = (Number) arguments.get().get("secondParam");
Double result = function.apply(firstParam.doubleValue(), secondParam.doubleValue());

System.out.println(result);FunctionCalling.java

生成配置

Python

from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.5-flash",
    contents="Tell me a story about a magic backpack.",
    config=types.GenerateContentConfig(
        candidate_count=1,
        stop_sequences=["x"],
        max_output_tokens=20,
        temperature=1.0,
    ),
)
print(response.text)configure_model_parameters.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: "Tell me a story about a magic backpack.",
  config: {
    candidateCount: 1,
    stopSequences: ["x"],
    maxOutputTokens: 20,
    temperature: 1.0,
  },
});

console.log(response.text);configure_model_parameters.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

// Create local variables for parameters.
candidateCount := int32(1)
maxOutputTokens := int32(20)
temperature := float32(1.0)

response, err := client.Models.GenerateContent(
	ctx,
	"gemini-3.5-flash",
	genai.Text("Tell me a story about a magic backpack."),
	&genai.GenerateContentConfig{
		CandidateCount:  candidateCount,
		StopSequences:   []string{"x"},
		MaxOutputTokens: maxOutputTokens,
		Temperature:     &temperature,
	},
)
if err != nil {
	log.Fatal(err)
}

printResponse(response)configure_model_parameters.go

Shell

curl https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
        "contents": [{
            "parts":[
                {"text": "Explain how AI works"}
            ]
        }],
        "generationConfig": {
            "stopSequences": [
                "Title"
            ],
            "temperature": 1.0,
            "maxOutputTokens": 800,
            "topP": 0.8,
            "topK": 10
        }
    }'  2> /dev/null | grep "text"configure_model_parameters.sh

Java

Client client = new Client();

GenerateContentConfig config =
        GenerateContentConfig.builder()
                .candidateCount(1)
                .stopSequences(List.of("x"))
                .maxOutputTokens(20)
                .temperature(1.0F)
                .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                "Tell me a story about a magic backpack.",
                config);

System.out.println(response.text());ConfigureModelParameters.java

安全设置

Python

from google import genai
from google.genai import types

client = genai.Client()
unsafe_prompt = (
    "I support Martians Soccer Club and I think Jupiterians Football Club sucks! "
    "Write a ironic phrase about them including expletives."
)
response = client.models.generate_content(
    model="gemini-3.5-flash",
    contents=unsafe_prompt,
    config=types.GenerateContentConfig(
        safety_settings=[
            types.SafetySetting(
                category="HARM_CATEGORY_HATE_SPEECH",
                threshold="BLOCK_MEDIUM_AND_ABOVE",
            ),
            types.SafetySetting(
                category="HARM_CATEGORY_HARASSMENT", threshold="BLOCK_ONLY_HIGH"
            ),
        ]
    ),
)
try:
    print(response.text)
except Exception:
    print("No information generated by the model.")

print(response.candidates[0].safety_ratings)safety_settings.py

Node.js

  // Make sure to include the following import:
  // import {GoogleGenAI} from '@google/genai';
  const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
  const unsafePrompt =
    "I support Martians Soccer Club and I think Jupiterians Football Club sucks! Write a ironic phrase about them including expletives.";

  const response = await ai.models.generateContent({
    model: "gemini-3.5-flash",
    contents: unsafePrompt,
    config: {
      safetySettings: [
        {
          category: "HARM_CATEGORY_HATE_SPEECH",
          threshold: "BLOCK_MEDIUM_AND_ABOVE",
        },
        {
          category: "HARM_CATEGORY_HARASSMENT",
          threshold: "BLOCK_ONLY_HIGH",
        },
      ],
    },
  });

  try {
    console.log("Generated text:", response.text);
  } catch (error) {
    console.log("No information generated by the model.");
  }
  console.log("Safety ratings:", response.candidates[0].safetyRatings);
  return response;
}
safety_settings.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

unsafePrompt := "I support Martians Soccer Club and I think Jupiterians Football Club sucks! " +
	"Write a ironic phrase about them including expletives."

config := &genai.GenerateContentConfig{
	SafetySettings: []*genai.SafetySetting{
		{
			Category:  "HARM_CATEGORY_HATE_SPEECH",
			Threshold: "BLOCK_MEDIUM_AND_ABOVE",
		},
		{
			Category:  "HARM_CATEGORY_HARASSMENT",
			Threshold: "BLOCK_ONLY_HIGH",
		},
	},
}
contents := []*genai.Content{
	genai.NewContentFromText(unsafePrompt, genai.RoleUser),
}
response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, config)
if err != nil {
	log.Fatal(err)
}

// Print the generated text.
text := response.Text()
fmt.Println("Generated text:", text)

// Print the and safety ratings from the first candidate.
if len(response.Candidates) > 0 {
	fmt.Println("Finish reason:", response.Candidates[0].FinishReason)
	safetyRatings, err := json.MarshalIndent(response.Candidates[0].SafetyRatings, "", "  ")
	if err != nil {
		return err
	}
	fmt.Println("Safety ratings:", string(safetyRatings))
} else {
	fmt.Println("No candidate returned.")
}safety_settings.go

Shell

echo '{
    "safetySettings": [
        {"category": "HARM_CATEGORY_HARASSMENT", "threshold": "BLOCK_ONLY_HIGH"},
        {"category": "HARM_CATEGORY_HATE_SPEECH", "threshold": "BLOCK_MEDIUM_AND_ABOVE"}
    ],
    "contents": [{
        "parts":[{
            "text": "'I support Martians Soccer Club and I think Jupiterians Football Club sucks! Write a ironic phrase about them.'"}]}]}' > request.json

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d @request.json 2> /dev/nullsafety_settings.sh

Java

Client client = new Client();

String unsafePrompt = """
         I support Martians Soccer Club and I think Jupiterians Football Club sucks!
         Write a ironic phrase about them including expletives.
        """;

GenerateContentConfig config =
        GenerateContentConfig.builder()
                .safetySettings(Arrays.asList(
                        SafetySetting.builder()
                                .category("HARM_CATEGORY_HATE_SPEECH")
                                .threshold("BLOCK_MEDIUM_AND_ABOVE")
                                .build(),
                        SafetySetting.builder()
                                .category("HARM_CATEGORY_HARASSMENT")
                                .threshold("BLOCK_ONLY_HIGH")
                                .build()
                )).build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                unsafePrompt,
                config);

try {
    System.out.println(response.text());
} catch (Exception e) {
    System.out.println("No information generated by the model");
}

System.out.println(response.candidates().get().getFirst().safetyRatings());SafetySettings.java

系统指令

Python

from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.5-flash",
    contents="Good morning! How are you?",
    config=types.GenerateContentConfig(
        system_instruction="You are a cat. Your name is Neko."
    ),
)
print(response.text)system_instruction.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const response = await ai.models.generateContent({
  model: "gemini-3.5-flash",
  contents: "Good morning! How are you?",
  config: {
    systemInstruction: "You are a cat. Your name is Neko.",
  },
});
console.log(response.text);system_instruction.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

// Construct the user message contents.
contents := []*genai.Content{
	genai.NewContentFromText("Good morning! How are you?", genai.RoleUser),
}

// Set the system instruction as a *genai.Content.
config := &genai.GenerateContentConfig{
	SystemInstruction: genai.NewContentFromText("You are a cat. Your name is Neko.", genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.5-flash", contents, config)
if err != nil {
	log.Fatal(err)
}
printResponse(response)system_instruction.go

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
-H 'Content-Type: application/json' \
-d '{ "system_instruction": {
    "parts":
      { "text": "You are a cat. Your name is Neko."}},
    "contents": {
      "parts": {
        "text": "Hello there"}}}'system_instruction.sh

Java

Client client = new Client();

Part textPart = Part.builder().text("You are a cat. Your name is Neko.").build();

Content content = Content.builder().role("system").parts(ImmutableList.of(textPart)).build();

GenerateContentConfig config = GenerateContentConfig.builder()
        .systemInstruction(content)
        .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.5-flash",
                "Good morning! How are you?",
                config);

System.out.println(response.text());SystemInstruction.java

响应正文

如果成功，则响应正文包含一个 GenerateContentResponse 实例。

根据输入 GenerateContentRequest 从模型生成流式回答。

端点

post https://generativelanguage.googleapis.com/v1beta/{model=models/*}:streamGenerateContent

路径参数

model string

必需。用于生成补全的 Model 的名称。

格式：models/{model}。格式为 models/{model}。

请求正文

请求正文中包含结构如下的数据：

字段

contents[] object (Content)

必需。与模型当前对话的内容。

对于单轮查询，这是单个实例。对于多轮查询（例如聊天），这是包含对话历史记录和最新请求的重复字段。

tools[] object (Tool)

可选。Model 可能用于生成下一个回答的 Tools 列表。

toolConfig object (ToolConfig)

可选。请求中指定的任何 Tool 的工具配置。如需查看使用示例，请参阅函数调用指南。

safetySettings[] object (SafetySetting)

可选。用于屏蔽不安全内容的唯一 SafetySetting 实例的列表。

systemInstruction object (Content)

可选。开发者设置了系统指令。目前仅支持文本。

generationConfig object (GenerationConfig)

可选。模型生成和输出的配置选项。

cachedContent string

可选。用作提供预测的上下文的缓存内容的名称。格式：cachedContents/{cachedContent}

serviceTier enum (ServiceTier)

可选。请求的服务层级。

store boolean

可选。为指定请求配置日志记录行为。如果设置了此配置，则其优先级高于项目级日志记录配置。

示例请求

文字

Python

from google import genai

client = genai.Client()
response = client.models.generate_content_stream(
    model="gemini-3.5-flash", contents="Write a story about a magic backpack."
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const response = await ai.models.generateContentStream({
  model: "gemini-3.5-flash",
  contents: "Write a story about a magic backpack.",
});
let text = "";
for await (const chunk of response) {
  console.log(chunk.text);
  text += chunk.text;
}text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
contents := []*genai.Content{
	genai.NewContentFromText("Write a story about a magic backpack.", genai.RoleUser),
}
for response, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.5-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(response.Candidates[0].Content.Parts[0].Text)
}text_generation.go

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=${GEMINI_API_KEY}" \
        -H 'Content-Type: application/json' \
        --no-buffer \
        -d '{ "contents":[{"parts":[{"text": "Write a story about a magic backpack."}]}]}'text_generation.sh

Java

Client client = new Client();

ResponseStream<GenerateContentResponse> responseStream =
        client.models.generateContentStream(
                "gemini-3.5-flash",
                "Write a story about a magic backpack.",
                null);

StringBuilder response = new StringBuilder();
for (GenerateContentResponse res : responseStream) {
    System.out.print(res.text());
    response.append(res.text());
}

responseStream.close();TextGeneration.java

图片

Python

from google import genai
import PIL.Image

client = genai.Client()
organ = PIL.Image.open(media / "organ.jpg")
response = client.models.generate_content_stream(
    model="gemini-3.5-flash", contents=["Tell me about this instrument", organ]
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const organ = await ai.files.upload({
  file: path.join(media, "organ.jpg"),
});

const response = await ai.models.generateContentStream({
  model: "gemini-3.5-flash",
  contents: [
    createUserContent([
      "Tell me about this instrument", 
      createPartFromUri(organ.uri, organ.mimeType)
    ]),
  ],
});
let text = "";
for await (const chunk of response) {
  console.log(chunk.text);
  text += chunk.text;
}text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "organ.jpg"), 
	&genai.UploadFileConfig{
		MIMEType : "image/jpeg",
	},
)
if err != nil {
	log.Fatal(err)
}
parts := []*genai.Part{
	genai.NewPartFromText("Tell me about this instrument"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}
contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}
for response, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.5-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(response.Candidates[0].Content.Parts[0].Text)
}text_generation.go

Shell

cat > "$TEMP_JSON" << EOF
{
  "contents": [{
    "parts":[
      {"text": "Tell me about this instrument"},
      {
        "inline_data": {
          "mime_type":"image/jpeg",
          "data": "$(cat "$TEMP_B64")"
        }
      }
    ]
  }]
}
EOF

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d "@$TEMP_JSON" 2> /dev/nulltext_generation.sh

Java

Client client = new Client();

String path = media_path + "organ.jpg";
byte[] imageData = Files.readAllBytes(Paths.get(path));

Content content =
        Content.fromParts(
                Part.fromText("Tell me about this instrument."),
                Part.fromBytes(imageData, "image/jpeg"));


ResponseStream<GenerateContentResponse> responseStream =
        client.models.generateContentStream(
                "gemini-3.5-flash",
                content,
                null);

StringBuilder response = new StringBuilder();
for (GenerateContentResponse res : responseStream) {
    System.out.print(res.text());
    response.append(res.text());
}

responseStream.close();TextGeneration.java

音频

Python

from google import genai

client = genai.Client()
sample_audio = client.files.upload(file=media / "sample.mp3")
response = client.models.generate_content_stream(
    model="gemini-3.5-flash",
    contents=["Give me a summary of this audio file.", sample_audio],
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)text_generation.py

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "sample.mp3"), 
	&genai.UploadFileConfig{
		MIMEType : "audio/mpeg",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this audio file."),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

for result, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.5-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(result.Candidates[0].Content.Parts[0].Text)
}text_generation.go

Shell

# Use File API to upload audio data to API request.
MIME_TYPE=$(file -b --mime-type "${AUDIO_PATH}")
NUM_BYTES=$(wc -c < "${AUDIO_PATH}")
DISPLAY_NAME=AUDIO

tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${AUDIO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Please describe this file."},
          {"file_data":{"mime_type": "audio/mpeg", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echotext_generation.sh

视频

Python

from google import genai
import time

client = genai.Client()
# Video clip (CC BY 3.0) from https://peach.blender.org/download/
myfile = client.files.upload(file=media / "Big_Buck_Bunny.mp4")
print(f"{myfile=}")

# Poll until the video file is completely processed (state becomes ACTIVE).
while not myfile.state or myfile.state.name != "ACTIVE":
    print("Processing video...")
    print("File state:", myfile.state)
    time.sleep(5)
    myfile = client.files.get(name=myfile.name)

response = client.models.generate_content_stream(
    model="gemini-3.5-flash", contents=[myfile, "Describe this video clip"]
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)text_generation.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

let video = await ai.files.upload({
  file: path.join(media, 'Big_Buck_Bunny.mp4'),
});

// Poll until the video file is completely processed (state becomes ACTIVE).
while (!video.state || video.state.toString() !== 'ACTIVE') {
  console.log('Processing video...');
  console.log('File state: ', video.state);
  await sleep(5000);
  video = await ai.files.get({name: video.name});
}

const response = await ai.models.generateContentStream({
  model: "gemini-3.5-flash",
  contents: [
    createUserContent([
      "Describe this video clip",
      createPartFromUri(video.uri, video.mimeType),
    ]),
  ],
});
let text = "";
for await (const chunk of response) {
  console.log(chunk.text);
  text += chunk.text;
}text_generation.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "Big_Buck_Bunny.mp4"), 
	&genai.UploadFileConfig{
		MIMEType : "video/mp4",
	},
)
if err != nil {
	log.Fatal(err)
}

// Poll until the video file is completely processed (state becomes ACTIVE).
for file.State == genai.FileStateUnspecified || file.State != genai.FileStateActive {
	fmt.Println("Processing video...")
	fmt.Println("File state:", file.State)
	time.Sleep(5 * time.Second)

	file, err = client.Files.Get(ctx, file.Name, nil)
	if err != nil {
		log.Fatal(err)
	}
}

parts := []*genai.Part{
	genai.NewPartFromText("Describe this video clip"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

for result, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.5-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(result.Candidates[0].Content.Parts[0].Text)
}text_generation.go

Shell

# Use File API to upload audio data to API request.
MIME_TYPE=$(file -b --mime-type "${VIDEO_PATH}")
NUM_BYTES=$(wc -c < "${VIDEO_PATH}")
DISPLAY_NAME=VIDEO_PATH

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${VIDEO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

state=$(jq ".file.state" file_info.json)
echo state=$state

while [[ "($state)" = *"PROCESSING"* ]];
do
  echo "Processing video..."
  sleep 5
  # Get the file of interest to check state
  curl https://generativelanguage.googleapis.com/v1beta/files/$name > file_info.json
  state=$(jq ".file.state" file_info.json)
done

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Please describe this file."},
          {"file_data":{"mime_type": "video/mp4", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echotext_generation.sh

PDF

Python

from google import genai

client = genai.Client()
sample_pdf = client.files.upload(file=media / "test.pdf")
response = client.models.generate_content_stream(
    model="gemini-3.5-flash",
    contents=["Give me a summary of this document:", sample_pdf],
)

for chunk in response:
    print(chunk.text)
    print("_" * 80)text_generation.py

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "test.pdf"), 
	&genai.UploadFileConfig{
		MIMEType : "application/pdf",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this document:"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

for result, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.5-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(result.Candidates[0].Content.Parts[0].Text)
}text_generation.go

Shell

MIME_TYPE=$(file -b --mime-type "${PDF_PATH}")
NUM_BYTES=$(wc -c < "${PDF_PATH}")
DISPLAY_NAME=TEXT


echo $MIME_TYPE
tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${PDF_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

# Now generate content using that file
curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Can you add a few more lines to this poem?"},
          {"file_data":{"mime_type": "application/pdf", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echotext_generation.sh

聊天

Python

from google import genai
from google.genai import types

client = genai.Client()
chat = client.chats.create(
    model="gemini-3.5-flash",
    history=[
        types.Content(role="user", parts=[types.Part(text="Hello")]),
        types.Content(
            role="model",
            parts=[
                types.Part(
                    text="Great to meet you. What would you like to know?"
                )
            ],
        ),
    ],
)
response = chat.send_message_stream(message="I have 2 dogs in my house.")
for chunk in response:
    print(chunk.text)
    print("_" * 80)
response = chat.send_message_stream(message="How many paws are in my house?")
for chunk in response:
    print(chunk.text)
    print("_" * 80)

print(chat.get_history())chat.py

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const chat = ai.chats.create({
  model: "gemini-3.5-flash",
  history: [
    {
      role: "user",
      parts: [{ text: "Hello" }],
    },
    {
      role: "model",
      parts: [{ text: "Great to meet you. What would you like to know?" }],
    },
  ],
});

console.log("Streaming response for first message:");
const stream1 = await chat.sendMessageStream({
  message: "I have 2 dogs in my house.",
});
for await (const chunk of stream1) {
  console.log(chunk.text);
  console.log("_".repeat(80));
}

console.log("Streaming response for second message:");
const stream2 = await chat.sendMessageStream({
  message: "How many paws are in my house?",
});
for await (const chunk of stream2) {
  console.log(chunk.text);
  console.log("_".repeat(80));
}

console.log(chat.getHistory());chat.js

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

history := []*genai.Content{
	genai.NewContentFromText("Hello", genai.RoleUser),
	genai.NewContentFromText("Great to meet you. What would you like to know?", genai.RoleModel),
}
chat, err := client.Chats.Create(ctx, "gemini-3.5-flash", nil, history)
if err != nil {
	log.Fatal(err)
}

for chunk, err := range chat.SendMessageStream(ctx, genai.Part{Text: "I have 2 dogs in my house."}) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(chunk.Text())
	fmt.Println(strings.Repeat("_", 64))
}

for chunk, err := range chat.SendMessageStream(ctx, genai.Part{Text: "How many paws are in my house?"}) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(chunk.Text())
	fmt.Println(strings.Repeat("_", 64))
}

fmt.Println(chat.History(false))chat.go

Shell

curl https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [
        {"role":"user",
         "parts":[{
           "text": "Hello"}]},
        {"role": "model",
         "parts":[{
           "text": "Great to meet you. What would you like to know?"}]},
        {"role":"user",
         "parts":[{
           "text": "I have two dogs in my house. How many paws are in my house?"}]},
      ]
    }' 2> /dev/null | grep "text"chat.sh

响应正文

如果成功，响应正文将包含 GenerateContentResponse 实例数据流。

GenerateContentResponse

JSON 表示法
PromptFeedback
- JSON 表示法
BlockReason
UsageMetadata
- JSON 表示法
ModelStatus
- JSON 表示法
ModelStage

支持多个候选回答的模型的回答。

系统会针对 GenerateContentResponse.prompt_feedback 中的每个提示和 finishReason 及 safetyRatings 中的每个候选答案报告安全等级和内容过滤情况。该 API： - 要么返回所有请求的候选内容，要么不返回任何候选内容 - 仅当提示存在问题时（检查 promptFeedback），才不会返回任何候选内容 - 在 finishReason 和 safetyRatings 中报告有关每个候选内容的反馈。

字段

candidates[] object (Candidate)

模型给出的候选回答。

promptFeedback object (PromptFeedback)

返回与内容过滤器相关的提示反馈。

usageMetadata object (UsageMetadata)

仅限输出。有关生成请求的 token 使用情况的元数据。

modelVersion string

仅限输出。用于生成回答的模型版本。

responseId string

仅限输出。responseId 用于标识每个响应。

modelStatus object (ModelStatus)

仅限输出。相应模型的当前模型状态。

JSON 表示法

JSON 表示法
{ "candidates": [ { object (`Candidate`) } ], "promptFeedback": { object (`PromptFeedback`) }, "usageMetadata": { object (`UsageMetadata`) }, "modelVersion": string, "responseId": string, "modelStatus": { object (`ModelStatus`) } }

{
  "candidates": [
    {
      object (Candidate)
    }
  ],
  "promptFeedback": {
    object (PromptFeedback)
  },
  "usageMetadata": {
    object (UsageMetadata)
  },
  "modelVersion": string,
  "responseId": string,
  "modelStatus": {
    object (ModelStatus)
  }
}

PromptFeedback

提示在 GenerateContentRequest.content 中指定的一组反馈元数据。

字段

blockReason enum (BlockReason)

可选。如果设置了此字段，则表示提示已被屏蔽，并且不会返回任何候选结果。改述提示。

safetyRatings[] object (SafetyRating)

提示的安全等级。每个类别最多只能有一个分级。

JSON 表示法
{ "blockReason": enum (`BlockReason`), "safetyRatings": [ { object (`SafetyRating`) } ] }

BlockReason

指定屏蔽提示的原因。

枚举
`BLOCK_REASON_UNSPECIFIED`	默认值。此值未使用。
`SAFETY`	出于安全原因，系统屏蔽了相应提示。检查 `safetyRatings` 以了解是哪个安全类别屏蔽了相应输出。
`OTHER`	提示因未知原因被屏蔽。
`BLOCKLIST`	提示因包含术语屏蔽名单中的术语而被屏蔽。
`PROHIBITED_CONTENT`	提示因包含禁止的内容而被屏蔽。
`IMAGE_SAFETY`	因生成不安全的图片内容而屏蔽了候选回答。

UsageMetadata

有关生成请求的 token 使用情况的元数据。

字段

promptTokenCount integer

提示中的 token 数量。如果设置了 cachedContent，这仍然是有效提示的总大小，这意味着它包含缓存内容中的词元数。

cachedContentTokenCount integer

提示的缓存部分（即缓存的内容）中的 token 数量

candidatesTokenCount integer

所有生成的回答候选项中的 token 总数。

toolUsePromptTokenCount integer

仅限输出。工具使用提示中的 token 数量。

thoughtsTokenCount integer

仅限输出。思考模型的思考 token 数。

totalTokenCount integer

生成请求（提示 + 思路 + 回答候选）的总 token 数。

promptTokensDetails[] object (ModalityTokenCount)

仅限输出。请求输入中处理的模态列表。

cacheTokensDetails[] object (ModalityTokenCount)

仅限输出。请求输入中缓存内容的模态列表。

candidatesTokensDetails[] object (ModalityTokenCount)

仅限输出。响应中返回的模态列表。

toolUsePromptTokensDetails[] object (ModalityTokenCount)

仅限输出。为工具使用请求输入处理的模态列表。

serviceTier enum (ServiceTier)

仅限输出。请求的服务等级。

JSON 表示法

JSON 表示法
{ "promptTokenCount": integer, "cachedContentTokenCount": integer, "candidatesTokenCount": integer, "toolUsePromptTokenCount": integer, "thoughtsTokenCount": integer, "totalTokenCount": integer, "promptTokensDetails": [ { object (`ModalityTokenCount`) } ], "cacheTokensDetails": [ { object (`ModalityTokenCount`) } ], "candidatesTokensDetails": [ { object (`ModalityTokenCount`) } ], "toolUsePromptTokensDetails": [ { object (`ModalityTokenCount`) } ], "serviceTier": enum (`ServiceTier`) }

{
  "promptTokenCount": integer,
  "cachedContentTokenCount": integer,
  "candidatesTokenCount": integer,
  "toolUsePromptTokenCount": integer,
  "thoughtsTokenCount": integer,
  "totalTokenCount": integer,
  "promptTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "cacheTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "candidatesTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "toolUsePromptTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "serviceTier": enum (ServiceTier)
}

ModelStatus

底层模型的状态。用于指示基础模型的阶段以及退役时间（如适用）。

字段

modelStage enum (ModelStage)

底层模型的阶段。

retirementTime string (Timestamp format)

模型退役的时间。

采用 RFC 3339 标准，生成的输出将始终进行 Z 规范化（即转换为 UTC 零时区格式并在末尾附加 Z），并使用 0、3、6 或 9 个小数位。不进行“Z”归一化处理的偏差时间也是可以接受的。示例："2014-10-02T15:01:23Z"、"2014-10-02T15:01:23.045123456Z" 或 "2014-10-02T15:01:23+05:30"。

message string

说明模型状态的消息。

JSON 表示法
{ "modelStage": enum (`ModelStage`), "retirementTime": string, "message": string }

ModelStage

定义底层模型的阶段。

枚举
`MODEL_STAGE_UNSPECIFIED`	未指定模型阶段。
`UNSTABLE_EXPERIMENTAL`	底层模型会进行大量调整。此项已弃用！
`EXPERIMENTAL`	此阶段的模型仅用于实验目的。
`PREVIEW`	此阶段的模型比实验性模型更成熟。
`STABLE`	此阶段的模型被认为是稳定的，可用于生产环境。
`LEGACY`	如果模型处于此阶段，则表示该模型在不久的将来会弃用。只有现有客户可以使用此模型。
`DEPRECATED`	此阶段中的模型已被弃用。这些模型无法使用。此项已弃用！
`RETIRED`	此阶段的模型已弃用。这些模型无法使用。

候选人

模型生成的候选回答。

字段

content object (Content)

仅限输出。模型返回的生成内容。

finishReason enum (FinishReason)

可选。仅限输出。模型停止生成 token 的原因。

如果为空，则模型尚未停止生成词元。

safetyRatings[] object (SafetyRating)

候选回答的安全评分列表。

每个类别最多只能有一个分级。

citationMetadata object (CitationMetadata)

仅限输出。模型生成的候选回答的引用信息。

此字段可能会填充 content 中包含的任何文本的朗读信息。这些内容是从基础 LLM 的训练数据中的受版权保护的材料中“背诵”出来的。

tokenCount integer

仅限输出。相应候选对象的 token 数。

groundingAttributions[] object (GroundingAttribution)

仅限输出。为有依据的回答做出贡献的来源的提供方信息。

系统会针对 GenerateAnswer 调用填充此字段。

groundingMetadata object (GroundingMetadata)

仅限输出。候选人的接地元数据。

系统会针对 GenerateContent 调用填充此字段。

avgLogprobs number

仅限输出。候选者的平均对数概率得分。

logprobsResult object (LogprobsResult)

仅限输出。回答 token 和热门 token 的对数似然得分

urlContextMetadata object (UrlContextMetadata)

仅限输出。与网址上下文检索工具相关的元数据。

index integer

仅限输出。响应候选列表中的候选索引。

finishMessage string

可选。仅限输出。详细说明了模型停止生成词元的原因。仅当设置了 finishReason 时，才会填充此字段。

JSON 表示法

JSON 表示法
{ "content": { object (`Content`) }, "finishReason": enum (`FinishReason`), "safetyRatings": [ { object (`SafetyRating`) } ], "citationMetadata": { object (`CitationMetadata`) }, "tokenCount": integer, "groundingAttributions": [ { object (`GroundingAttribution`) } ], "groundingMetadata": { object (`GroundingMetadata`) }, "avgLogprobs": number, "logprobsResult": { object (`LogprobsResult`) }, "urlContextMetadata": { object (`UrlContextMetadata`) }, "index": integer, "finishMessage": string }

{
  "content": {
    object (Content)
  },
  "finishReason": enum (FinishReason),
  "safetyRatings": [
    {
      object (SafetyRating)
    }
  ],
  "citationMetadata": {
    object (CitationMetadata)
  },
  "tokenCount": integer,
  "groundingAttributions": [
    {
      object (GroundingAttribution)
    }
  ],
  "groundingMetadata": {
    object (GroundingMetadata)
  },
  "avgLogprobs": number,
  "logprobsResult": {
    object (LogprobsResult)
  },
  "urlContextMetadata": {
    object (UrlContextMetadata)
  },
  "index": integer,
  "finishMessage": string
}

FinishReason

定义模型停止生成令牌的原因。

枚举
`FINISH_REASON_UNSPECIFIED`	默认值。此值未使用。
`STOP`	模型的自然停止点或提供的停止序列。
`MAX_TOKENS`	已达到请求中指定的 token 数量上限。
`SAFETY`	回答候选内容因安全原因而被标记。
`RECITATION`	回答候选内容因背诵原因而被标记。
`LANGUAGE`	系统标记了候选回答内容，原因是其使用了不受支持的语言。
`OTHER`	原因未知。
`BLOCKLIST`	由于内容包含违禁字词，因此 token 生成操作已停止。
`PROHIBITED_CONTENT`	由于可能包含禁止的内容，因此 token 生成操作已停止。
`SPII`	由于内容可能包含敏感的个人身份信息 (SPII)，因此 token 生成操作已停止。
`MALFORMED_FUNCTION_CALL`	模型生成的函数调用无效。
`IMAGE_SAFETY`	由于生成的图片包含违规内容，词元生成已停止。
`IMAGE_PROHIBITED_CONTENT`	图片生成已停止，因为生成的图片包含其他禁止的内容。
`IMAGE_OTHER`	由于其他杂项问题，图片生成已停止。
`NO_IMAGE`	模型本应生成图片，但却未生成任何图片。
`IMAGE_RECITATION`	由于存在重复内容，图片生成操作已停止。
`UNEXPECTED_TOOL_CALL`	模型生成了工具调用，但请求中未启用任何工具。
`TOO_MANY_TOOL_CALLS`	模型连续调用了过多的工具，因此系统退出了执行。
`MISSING_THOUGHT_SIGNATURE`	请求至少缺少一个思路签名。
`MALFORMED_RESPONSE`	因响应格式不正确而完成。

GroundingAttribution

为促成回答的来源提供的提供方信息。

字段

sourceId object (AttributionSourceId)

仅限输出。促成相应归因的来源的标识符。

content object (Content)

构成此归因的接地源内容。

JSON 表示法
{ "sourceId": { object (`AttributionSourceId`) }, "content": { object (`Content`) } }

AttributionSourceId

促成相应归因的来源的标识符。

字段

source Union type

source 只能是下列其中一项：

groundingPassage object (GroundingPassageId)

内嵌段落的标识符。

semanticRetrieverChunk object (SemanticRetrieverChunk)

通过语义检索器提取的 Chunk 的标识符。

JSON 表示法
{ // source "groundingPassage": { object (`GroundingPassageId`) }, "semanticRetrieverChunk": { object (`SemanticRetrieverChunk`) } // Union type }

GroundingPassageId

GroundingPassage 中某个部分的标识符。

字段

passageId string

仅限输出。与 GenerateAnswerRequest 的 GroundingPassage.id 相匹配的段落的 ID。

partIndex integer

仅限输出。GenerateAnswerRequest 的 GroundingPassage.content 中相应部分的索引。

JSON 表示法
{ "passageId": string, "partIndex": integer }

SemanticRetrieverChunk

通过 SemanticRetrieverConfig 使用 GenerateAnswerRequest 中指定的语义检索器检索到的 Chunk 的标识符。

字段

source string

仅限输出。与请求的 SemanticRetrieverConfig.source 匹配的来源的名称。示例：corpora/123 或 corpora/123/documents/abc

chunk string

仅限输出。包含归因文本的 Chunk 的名称。示例：corpora/123/documents/abc/chunks/xyz

JSON 表示法
{ "source": string, "chunk": string }

GroundingMetadata

启用接地时返回给客户端的元数据。

字段

groundingChunks[] object (GroundingChunk)

从指定的事实依据来源检索到的支持性参考资料的列表。在流式传输时，此字段仅包含尚未包含在之前响应的接地元数据中的接地块。

groundingSupports[] object (GroundingSupport)

接地支持列表。

webSearchQueries[] string

后续网络搜索的网页搜索查询。

imageSearchQueries[] string

用于建立依据的图片搜索查询。

searchEntryPoint object (SearchEntryPoint)

可选。Google 搜索条目，用于后续的网页搜索。

retrievalMetadata object (RetrievalMetadata)

与接地流程中的检索相关的元数据。

googleMapsWidgetContextToken string

可选。Google 地图 widget 上下文令牌的资源名称，可与 PlacesContextElement widget 搭配使用，以渲染上下文数据。仅在启用 Grounding with Google Maps 时填充。

JSON 表示法

JSON 表示法
{ "groundingChunks": [ { object (`GroundingChunk`) } ], "groundingSupports": [ { object (`GroundingSupport`) } ], "webSearchQueries": [ string ], "imageSearchQueries": [ string ], "searchEntryPoint": { object (`SearchEntryPoint`) }, "retrievalMetadata": { object (`RetrievalMetadata`) }, "googleMapsWidgetContextToken": string }

{
  "groundingChunks": [
    {
      object (GroundingChunk)
    }
  ],
  "groundingSupports": [
    {
      object (GroundingSupport)
    }
  ],
  "webSearchQueries": [
    string
  ],
  "imageSearchQueries": [
    string
  ],
  "searchEntryPoint": {
    object (SearchEntryPoint)
  },
  "retrievalMetadata": {
    object (RetrievalMetadata)
  },
  "googleMapsWidgetContextToken": string
}

SearchEntryPoint

Google 搜索入口点。

字段

renderedContent string

可选。可嵌入网页或应用 WebView 中的 Web 内容代码段。

sdkBlob string (bytes format)

可选。以 Base64 编码的 JSON，表示 <搜索字词、搜索网址> 元组的数组。

使用 base64 编码的字符串。

JSON 表示法
{ "renderedContent": string, "sdkBlob": string }

GroundingChunk

GroundingChunk 表示支持模型回答的证据片段。它可以是来自网页的文本块、从文件中检索到的上下文，也可以是来自 Google 地图的信息。

字段

chunk_type Union type

分块类型。chunk_type 只能是下列其中一项：

web object (Web)

来自网络的接地块。

image object (Image)

可选。来自图片搜索的接地块。

retrievedContext object (RetrievedContext)

可选。通过文件搜索工具检索到的上下文中的标准答案块。

maps object (Maps)

可选。来自 Google 地图的接地块。

JSON 表示法
{ // chunk_type "web": { object (`Web`) }, "image": { object (`Image`) }, "retrievedContext": { object (`RetrievedContext`) }, "maps": { object (`Maps`) } // Union type }

Web

来自网络的块。

字段

uri string

仅限输出。块的 URI 引用。

title string

仅限输出。块的标题。

JSON 表示法
{ "uri": string, "title": string }

图片

图片搜索中的块。

字段

sourceUri string

用于归因的网页 URI。

imageUri string

图片素材资源的网址。

title string

图片来源网页的标题。

domain string

相应图片所在的网页的根域名，例如“example.com”。

JSON 表示法
{ "sourceUri": string, "imageUri": string, "title": string, "domain": string }

RetrievedContext

通过文件搜索工具检索到的上下文中的块。

字段

customMetadata[] object (CustomMetadata)

可选。用户提供的有关检索到的上下文的元数据。

uri string

可选。语义检索文档的 URI 引用。

title string

可选。文档的标题。

text string

可选。块的文本。

fileSearchStore string

可选。包含相应文档的 FileSearchStore 的名称。示例：fileSearchStores/123

pageNumber integer

可选。检索到的上下文的页码（如果适用）。

mediaId string

可选。多模态文件搜索结果的媒体 blob 资源名称。格式：fileSearchStores/{file_search_store_id}/media/{blobId}

JSON 表示法
{ "customMetadata": [ { object (`CustomMetadata`) } ], "uri": string, "title": string, "text": string, "fileSearchStore": string, "pageNumber": integer, "mediaId": string }

CustomMetadata

用户提供的有关 GroundingFact 的元数据。

字段

key string

元数据的键。

value Union type

元数据的值。可以是字符串、字符串列表或数字。value 只能是下列其中一项：

stringValue string

可选。元数据的字符串值。

stringListValue object (StringList)

可选。元数据的字符串值列表。

numericValue number

可选。元数据的数值。此值的预期范围取决于所用的具体 key。

JSON 表示法
{ "key": string, // value "stringValue": string, "stringListValue": { object (`StringList`) }, "numericValue": number // Union type }

StringList

字符串值列表。

字段

values[] string

列表的字符串值。

JSON 表示法
{ "values": [ string ] }

地图

来自 Google 地图的接地块。一个地图块对应于一个地点。

字段

uri string

地点的 URI 引用。

title string

地点的名称。

text string

地点答案的文字说明。

placeId string

地点的 ID，采用 places/{placeId} 格式。用户可以使用此 ID 查找相应地点。

placeAnswerSources object (PlaceAnswerSources)

提供有关 Google 地图中特定地点特征的回答的来源。

JSON 表示法
{ "uri": string, "title": string, "text": string, "placeId": string, "placeAnswerSources": { object (`PlaceAnswerSources`) } }

PlaceAnswerSources

提供有关 Google 地图中指定地点的特征的答案的来源集合。每个 PlaceAnswerSources 消息都对应 Google 地图中的特定地点。Google 地图工具使用这些来源来回答有关地点特征的问题（例如：“Bar Foo 是否提供 Wi-Fi”或“Foo Bar 是否适合轮椅使用者？”）。目前，我们仅支持将评价摘要作为来源。

字段

reviewSnippets[] object (ReviewSnippet)

用于生成有关 Google 地图中指定地点的特征的回答的评价摘要。

JSON 表示法
{ "reviewSnippets": [ { object (`ReviewSnippet`) } ] }

ReviewSnippet

封装了用户评价的一段内容，其中回答了有关 Google 地图中特定地点的功能的问题。

字段

reviewId string

评价摘要的 ID。

googleMapsUri string

与 Google 地图上的用户评价对应的链接。

title string

评价的标题。

JSON 表示法
{ "reviewId": string, "googleMapsUri": string, "title": string }

GroundingSupport

接地支持。

字段

groundingChunkIndices[] integer

可选。一个索引（指向 response.candidate.grounding_metadata 中的“grounding_chunk”）列表，用于指定与声明关联的引用。例如，[1,3,4] 表示 grounding_chunk[1]、grounding_chunk[3]、grounding_chunk[4] 是归因于相应声明的检索到的内容。如果响应是流式传输的，则 groundingChunkIndices 是指所有响应中的索引。客户端有责任从所有响应中累积 grounding 块（同时保持相同的顺序）。

confidenceScores[] number

可选。支持参考资料的置信度分数。范围为 0 到 1。1 表示最有信心。此列表的大小必须与 groundingChunkIndices 相同。

renderedParts[] integer

仅限输出。候选人内容的 parts 字段中的索引。这些索引用于指定哪些渲染部分与此支持来源相关联。

segment object (Segment)

相应支持所涉及的内容片段。

JSON 表示法
{ "groundingChunkIndices": [ integer ], "confidenceScores": [ number ], "renderedParts": [ integer ], "segment": { object (`Segment`) } }

Segment

内容片段。

字段

partIndex integer

相应 Part 对象在其父 Content 对象中的索引。

startIndex integer

指定 Part 中的起始索引（以字节为单位）。从 Part 开始的偏移量（含），从零开始。

endIndex integer

指定 Part 中的结束索引（以字节为单位）。从相应部分的开头开始的偏移量（不含），从零开始。

text string

响应中与相应细分对应的文本。

JSON 表示法
{ "partIndex": integer, "startIndex": integer, "endIndex": integer, "text": string }

RetrievalMetadata

与接地流程中的检索相关的元数据。

字段

googleSearchDynamicRetrievalScore number

可选。一个分数，用于指示 Google 搜索中的信息有多大可能有助于回答提示。得分介于 [0, 1] 范围内，其中 0 表示可能性最低，1 表示可能性最高。仅当启用 Google 搜索接地和动态检索时，系统才会填充此得分。系统会将该值与阈值进行比较，以确定是否触发 Google 搜索。

JSON 表示法
{ "googleSearchDynamicRetrievalScore": number }

LogprobsResult

Logprobs 结果

字段

topCandidates[] object (TopCandidates)

长度 = 解码步总数。

chosenCandidates[] object (Candidate)

长度 = 解码步总数。所选候选词元可能位于 topCandidates 中，也可能不在其中。

logProbabilitySum number

所有 token 的对数概率之和。

JSON 表示法
{ "topCandidates": [ { object (`TopCandidates`) } ], "chosenCandidates": [ { object (`Candidate`) } ], "logProbabilitySum": number }

TopCandidates

每个解码步骤中具有最高对数概率的候选对象。

字段

candidates[] object (Candidate)

按对数概率降序排序。

JSON 表示法
{ "candidates": [ { object (`Candidate`) } ] }

候选人

logprobs token 和得分的候选对象。

字段

token string

候选令牌字符串值。

tokenId integer

候选 token 的 ID 值。

logProbability number

候选词元的对数概率。

JSON 表示法
{ "token": string, "tokenId": integer, "logProbability": number }

UrlContextMetadata

与网址上下文检索工具相关的元数据。

字段

urlMetadata[] object (UrlMetadata)

网址上下文列表。

JSON 表示法
{ "urlMetadata": [ { object (`UrlMetadata`) } ] }

UrlMetadata

单个网址检索的上下文。

字段

retrievedUrl string

由工具检索到的网址。

urlRetrievalStatus enum (UrlRetrievalStatus)

网址检索的状态。

JSON 表示法
{ "retrievedUrl": string, "urlRetrievalStatus": enum (`UrlRetrievalStatus`) }

UrlRetrievalStatus

网址检索的状态。

枚举
`URL_RETRIEVAL_STATUS_UNSPECIFIED`	默认值。此值未使用。
`URL_RETRIEVAL_STATUS_SUCCESS`	网址检索成功。
`URL_RETRIEVAL_STATUS_ERROR`	由于出错，网址检索失败。
`URL_RETRIEVAL_STATUS_PAYWALL`	由于内容受付费墙保护，网址检索失败。
`URL_RETRIEVAL_STATUS_UNSAFE`	由于内容不安全，网址检索失败。

CitationMetadata

JSON 表示法
CitationSource
- JSON 表示法

一段内容的一组来源归因。

字段

citationSources[] object (CitationSource)

特定回答的来源引用。

JSON 表示法
{ "citationSources": [ { object (`CitationSource`) } ] }

CitationSource

特定回答中某部分内容的来源引用。

字段

startIndex integer

可选。归因于相应来源的回答部分的起始位置。

索引指示段落的开始，以字节为单位衡量。

endIndex integer

可选。归因段落的结束，不包括此索引。

uri string

可选。归因于部分文本的来源的 URI。

license string

可选。归因片段的 GitHub 项目的许可。

代码引用需要许可信息。

JSON 表示法
{ "startIndex": integer, "endIndex": integer, "uri": string, "license": string }

GenerationConfig

模型生成和输出的配置选项。并非所有模型的参数都可以配置。

字段

stopSequences[] string

可选。将停止输出生成的字符序列集（最多 5 个）。如果指定了此参数，API 将在首次出现 stop_sequence 时停止。停止序列不会包含在回答中。

responseMimeType string

可选。生成的候选文本的 MIME 类型。支持的 MIME 类型包括：text/plain：（默认）文本输出。application/json：响应候选项中的 JSON 响应。text/x.enum：响应候选项中以字符串形式表示的 ENUM。如需查看所有受支持的文本 MIME 类型的列表，请参阅文档。

responseSchema object (Schema)

可选。生成的候选文本的输出架构。架构必须是 OpenAPI 架构的子集，并且可以是对象、基元或数组。

如果设置了此字段，则还必须设置兼容的 responseMimeType。兼容的 MIME 类型：application/json：JSON 响应的架构。如需了解详情，请参阅 JSON 文本生成指南。

_responseJsonSchema value (Value format)

可选。生成的回答的输出架构。这是 responseSchema 的替代方案，可接受 JSON 架构。

如果设置了此参数，则必须省略 responseSchema，但需要设置 responseMimeType。

虽然可以发送完整的 JSON 架构，但并非所有功能都受支持。具体来说，仅支持以下属性：

$id
$defs
$ref
$anchor
type
format
title
description
enum（适用于字符串和数字）
items
prefixItems
minItems
maxItems
minimum
maximum
anyOf
oneOf（与 anyOf 的解读方式相同）
properties
additionalProperties
required

还可以设置非标准 propertyOrdering 属性。

循环引用会展开到一定程度，因此只能在非必需属性中使用。（可为 null 的属性不足。）如果子架构上设置了 $ref，则除了以 $ 开头的属性之外，不得设置任何其他属性。

responseJsonSchema value (Value format)

可选。内部详细信息。请使用 responseJsonSchema，而不是此字段。

responseModalities[] enum (Modality)

可选。所请求的响应模态。表示模型可以返回且应在响应中预期的模态集合。这与回答的模态完全匹配。

一个模型可能支持多种模态组合。如果请求的模态与任何支持的组合都不匹配，则会返回错误。

空列表相当于仅请求文本。

candidateCount integer

可选。要返回的生成响应数量。如果未设置，则默认为 1。请注意，此功能不适用于上一代模型（Gemini 1.0 系列）

maxOutputTokens integer

可选。候选回答中包含的 token 数量上限。

注意：默认值因模型而异，请参阅 getModel 函数返回的 Model 的 Model.output_token_limit 属性。

temperature number

可选。控制输出的随机性。

注意：默认值因模型而异，请参阅 getModel 函数返回的 Model 的 Model.temperature 属性。

值可介于 [0.0, 2.0] 之间。

topP number

可选。抽样时要考虑的 token 的最大累积概率。

该模型使用 Top-k 和 Top-p（核）采样相结合的方式。

系统会根据词元的分配概率对其进行排序，以便仅考虑最有可能的词元。Top-k 采样直接限制要考虑的 token 的数量上限，而核采样则根据累积概率限制 token 的数量。

注意：默认值因 Model 而异，由 getModel 函数返回的 Model.top_p 属性指定。如果 topK 属性为空，则表示模型不应用 top-k 抽样，并且不允许在请求中设置 topK。

topK integer

可选。抽样时要考虑的令牌数量上限。

Gemini 模型使用 Top-p（核）采样或 Top-k 与核采样的组合。Top-k 抽样会考虑概率最高的 topK 个 token。采用核采样的模型不允许设置 topK。

seed integer

可选。解码中使用的种子。如果未设置，请求会使用随机生成的种子。

presencePenalty number

可选。如果下一个令牌已在响应中出现，则应用于该令牌的 logprobs 的存在惩罚。

此惩罚是二元（开启/关闭）的，不取决于令牌的使用次数（首次使用后）。使用 frequencyPenalty 表示每次使用都会增加的惩罚。

正值惩罚会阻止使用已在回答中使用的令牌，从而增加词汇量。

负惩罚会鼓励使用已在回答中使用的令牌，从而减少词汇量。

frequencyPenalty number

可选。应用于下一个词元的对数概率的频次惩罚，乘以每个词元在目前为止的回答中出现的次数。

正惩罚会抑制对已使用过的 token 的使用，抑制程度与 token 的使用次数成正比：token 的使用次数越多，模型就越难再次使用该 token，从而增加回答的词汇量。

注意：负惩罚会促使模型重复使用 token，重复使用的次数与 token 的使用次数成正比。较小的负值会减少回答的词汇量。负值越大，模型开始重复常见令牌的次数就越多，直到达到 maxOutputTokens 限制。

responseLogprobs boolean

可选。如果为 true，则在响应中导出 logprobs 结果。

logprobs integer

可选。仅在 responseLogprobs=True 时有效。此参数用于设置在 Candidate.logprobs_result 中每个解码步骤中返回的对数概率最高的候选词元数量（包括所选候选词元）。该数字必须介于 [0, 20] 之间。

enableEnhancedCivicAnswers boolean

可选。启用增强型公民信息回答。此功能可能仅适用于部分型号。

speechConfig object (SpeechConfig)

可选。语音生成配置。

thinkingConfig object (ThinkingConfig)

可选。思考功能的配置。如果为不支持思考的模型设置此字段，系统将返回错误。

imageConfig object (ImageConfig)

可选。图片生成配置。如果为不支持这些配置选项的模型设置此字段，系统将返回错误。

mediaResolution enum (MediaResolution)

可选。如果指定，系统将使用指定的媒体分辨率。

responseFormat object (ResponseFormatConfig)

可选。响应输出格式的配置。允许以扁平结构指定每种模态（文本、音频、图片）的输出配置。

JSON 表示法

JSON 表示法
{ "stopSequences": [ string ], "responseMimeType": string, "responseSchema": { object (`Schema`) }, "_responseJsonSchema": value, "responseJsonSchema": value, "responseModalities": [ enum (`Modality`) ], "candidateCount": integer, "maxOutputTokens": integer, "temperature": number, "topP": number, "topK": integer, "seed": integer, "presencePenalty": number, "frequencyPenalty": number, "responseLogprobs": boolean, "logprobs": integer, "enableEnhancedCivicAnswers": boolean, "speechConfig": { object (`SpeechConfig`) }, "thinkingConfig": { object (`ThinkingConfig`) }, "imageConfig": { object (`ImageConfig`) }, "mediaResolution": enum (`MediaResolution`), "responseFormat": { object (`ResponseFormatConfig`) } }

{
  "stopSequences": [
    string
  ],
  "responseMimeType": string,
  "responseSchema": {
    object (Schema)
  },
  "_responseJsonSchema": value,
  "responseJsonSchema": value,
  "responseModalities": [
    enum (Modality)
  ],
  "candidateCount": integer,
  "maxOutputTokens": integer,
  "temperature": number,
  "topP": number,
  "topK": integer,
  "seed": integer,
  "presencePenalty": number,
  "frequencyPenalty": number,
  "responseLogprobs": boolean,
  "logprobs": integer,
  "enableEnhancedCivicAnswers": boolean,
  "speechConfig": {
    object (SpeechConfig)
  },
  "thinkingConfig": {
    object (ThinkingConfig)
  },
  "imageConfig": {
    object (ImageConfig)
  },
  "mediaResolution": enum (MediaResolution),
  "responseFormat": {
    object (ResponseFormatConfig)
  }
}

模态

支持的响应模态。

枚举
`MODALITY_UNSPECIFIED`	默认值。
`TEXT`	表示模型应返回文本。
`IMAGE`	表示模型应返回图片。
`AUDIO`	表示模型应返回音频。

SpeechConfig

语音生成和转写配置。

字段

voiceConfig object (VoiceConfig)

单语音输出时的配置。

multiSpeakerVoiceConfig object (MultiSpeakerVoiceConfig)

可选。多音箱设置的配置。它与 voiceConfig 字段互斥。

languageCode string

可选。用户配置应用使用的 IETF BCP-47 语言代码。用于语音识别和语音合成。

有效值包括：de-DE、en-AU、en-GB、en-IN、en-US、es-US、fr-FR、hi-IN、pt-BR、ar-XA、es-ES、fr-CA、id-ID、it-IT、ja-JP、tr-TR、vi-VN、bn-IN、gu-IN、kn-IN、ml-IN、mr-IN、ta-IN、te-IN、nl-NL、ko-KR、cmn-CN、pl-PL、ru-RU 和 th-TH。

JSON 表示法
{ "voiceConfig": { object (`VoiceConfig`) }, "multiSpeakerVoiceConfig": { object (`MultiSpeakerVoiceConfig`) }, "languageCode": string }

VoiceConfig

要使用的语音的配置。

字段

voice_config Union type

要使用的音箱配置。voice_config 只能是下列其中一项：

prebuiltVoiceConfig object (PrebuiltVoiceConfig)

要使用的预构建语音的配置。

JSON 表示法
{ // voice_config "prebuiltVoiceConfig": { object (`PrebuiltVoiceConfig`) } // Union type }

PrebuiltVoiceConfig

要使用的预构建扬声器的配置。

字段

voiceName string

要使用的预设语音的名称。

JSON 表示法
{ "voiceName": string }

MultiSpeakerVoiceConfig

多音箱设置的配置。

字段

speakerVoiceConfigs[] object (SpeakerVoiceConfig)

必需。所有已启用的朗读语音。

JSON 表示法
{ "speakerVoiceConfigs": [ { object (`SpeakerVoiceConfig`) } ] }

SpeakerVoiceConfig

多音箱设置中单个音箱的配置。

字段

speaker string

必需。要使用的扬声器的名称。应与提示中的内容相同。

voiceConfig object (VoiceConfig)

必需。要使用的语音的配置。

JSON 表示法
{ "speaker": string, "voiceConfig": { object (`VoiceConfig`) } }

ThinkingConfig

思考功能的配置。

字段

includeThoughts boolean

指示是否在回答中包含思考过程。如果为 true，则仅在有想法时返回想法。

thinkingBudget integer

模型应生成的想法 token 的数量。

thinkingLevel enum (ThinkingLevel)

可选。控制模型在生成回答之前执行的内部推理过程的最大深度。默认值取决于型号。如需了解详情，请参阅思维水平指南。建议用于 Gemini 3 或更高版本的模型。与较早型号搭配使用会导致错误。

JSON 表示法
{ "includeThoughts": boolean, "thinkingBudget": integer, "thinkingLevel": enum (`ThinkingLevel`) }

ThinkingLevel

允许用户使用枚举而非整数预算来指定思考量。

枚举
`THINKING_LEVEL_UNSPECIFIED`	默认值。
`MINIMAL`	几乎没有思考。
`LOW`	低思考等级。
`MEDIUM`	中等思考等级。
`HIGH`	高思考等级。

ImageConfig

图片生成功能的配置。

字段

aspectRatio string

可选。要生成的图片的宽高比。支持的宽高比：1:1、1:4、4:1、1:8、8:1、2:3、3:2、3:4、4:3、4:5、5:4、9:16、16:9 或 21:9。

如果未指定，模型将根据提供的任何参考图片选择默认宽高比。

imageSize string

可选。指定生成的图片的大小。支持的值为 512、1K、2K、4K。如果未指定，模型将使用默认值 1K。

JSON 表示法
{ "aspectRatio": string, "imageSize": string }

MediaResolution

输入媒体的媒体分辨率。

枚举
`MEDIA_RESOLUTION_UNSPECIFIED`	媒体分辨率尚未设置。
`MEDIA_RESOLUTION_LOW`	媒体分辨率设置为低（64 个 token）。
`MEDIA_RESOLUTION_MEDIUM`	媒体分辨率设置为中等（256 个 token）。
`MEDIA_RESOLUTION_HIGH`	媒体分辨率设置为高（缩放重构，256 个 token）。

ResponseFormatConfig

响应输出格式的配置。这是一个扁平对象，其中每个可选子字段都用于配置特定的输出模态。

字段

text object (TextResponseFormat)

可选。文本输出格式配置。

audio object (AudioResponseFormat)

可选。音频输出格式配置。

image object (ImageResponseFormat)

可选。图片输出格式配置。

JSON 表示法
{ "text": { object (`TextResponseFormat`) }, "audio": { object (`AudioResponseFormat`) }, "image": { object (`ImageResponseFormat`) } }

TextResponseFormat

文本输出格式的配置。

字段

mimeType enum (MimeType)

可选。文本输出的 MIME 类型。

schema value (Value format)

可选。输出应遵循的 JSON 架构。仅在 mimeType 为 APPLICATION_JSON 时适用。

JSON 表示法
{ "mimeType": enum (`MimeType`), "schema": value }

MimeType

支持的文本输出 MIME 类型。

枚举
`MIME_TYPE_UNSPECIFIED`	默认值。此值未使用。
`APPLICATION_JSON`	JSON 输出格式。
`TEXT_PLAIN`	纯文本输出格式。

AudioResponseFormat

音频输出格式的配置。

字段

mimeType enum (MimeType)

可选。音频输出的 MIME 类型。

delivery enum (Delivery)

可选。音频输出的传送模式。

sampleRate integer

可选。采样率（以 Hz 为单位）。

bitRate integer

可选。比特率，以每秒比特数 (bps) 为单位。仅适用于压缩格式（MP3、Opus）。

JSON 表示法
{ "mimeType": enum (`MimeType`), "delivery": enum (`Delivery`), "sampleRate": integer, "bitRate": integer }

MimeType

音频输出支持的 MIME 类型。

枚举
`MIME_TYPE_UNSPECIFIED`	默认值。此值未使用。
`AUDIO_MP3`	MP3 音频格式。
`AUDIO_OGG_OPUS`	OGG Opus 音频格式。
`AUDIO_L16`	原始 PCM (L16) 音频格式。
`AUDIO_WAV`	WAV 音频格式。
`AUDIO_ALAW`	A-law 音频格式。
`AUDIO_MULAW`	Mu-law 音频格式。

传送

音频输出的传送模式。

枚举
`DELIVERY_UNSPECIFIED`	默认值。此值未使用。
`INLINE`	音频数据以内嵌方式在响应中返回。
`URI`	音频数据以 URI 形式返回。

ImageResponseFormat

图片输出格式的配置。

字段

mimeType enum (MimeType)

可选。图片输出的 MIME 类型。

delivery enum (Delivery)

可选。图片输出的传送模式。

aspectRatio enum (AspectRatio)

可选。图片输出的宽高比。

imageSize enum (ImageSize)

可选。输出图片的尺寸。

JSON 表示法
{ "mimeType": enum (`MimeType`), "delivery": enum (`Delivery`), "aspectRatio": enum (`AspectRatio`), "imageSize": enum (`ImageSize`) }

MimeType

支持的图片输出 MIME 类型。

枚举
`MIME_TYPE_UNSPECIFIED`	默认值。此值未使用。
`IMAGE_JPEG`	JPEG 图片格式。

传送

图片输出的传送模式。

枚举
`DELIVERY_UNSPECIFIED`	默认值。此值未使用。
`INLINE`	图片数据以内嵌方式在响应中返回。
`URI`	图片数据以 URI 形式返回。

AspectRatio

支持的图片输出宽高比。

枚举
`ASPECT_RATIO_UNSPECIFIED`	默认值。此值未使用。
`ASPECT_RATIO_ONE_BY_ONE`	1:1 的宽高比。
`ASPECT_RATIO_TWO_BY_THREE`	宽高比为 2:3。
`ASPECT_RATIO_THREE_BY_TWO`	3:2 宽高比。
`ASPECT_RATIO_THREE_BY_FOUR`	3:4 宽高比。
`ASPECT_RATIO_FOUR_BY_THREE`	4:3 宽高比。
`ASPECT_RATIO_FOUR_BY_FIVE`	宽高比为 4:5。
`ASPECT_RATIO_FIVE_BY_FOUR`	5:4 宽高比。
`ASPECT_RATIO_NINE_BY_SIXTEEN`	9:16 宽高比。
`ASPECT_RATIO_SIXTEEN_BY_NINE`	宽高比：16:9。
`ASPECT_RATIO_TWENTY_ONE_BY_NINE`	21:9 宽高比。
`ASPECT_RATIO_ONE_BY_EIGHT`	宽高比为 1:8。
`ASPECT_RATIO_EIGHT_BY_ONE`	宽高比为 8:1。
`ASPECT_RATIO_ONE_BY_FOUR`	宽高比为 1:4。
`ASPECT_RATIO_FOUR_BY_ONE`	宽高比为 4:1。

ImageSize

图片输出支持的图片大小。

枚举
`IMAGE_SIZE_UNSPECIFIED`	默认值。此值未使用。
`IMAGE_SIZE_FIVE_TWELVE`	512 像素的图片大小。
`IMAGE_SIZE_ONE_K`	1K 图片大小。
`IMAGE_SIZE_TWO_K`	2K 图片大小。
`IMAGE_SIZE_FOUR_K`	4K 图像大小。

HarmCategory

评分的类别。

这些类别涵盖了开发者可能希望调整的各种类型的危害。

枚举
`HARM_CATEGORY_UNSPECIFIED`	未指定类别。
`HARM_CATEGORY_DEROGATORY`	PaLM - 针对身份和/或受保护属性的负面或有害评论。
`HARM_CATEGORY_TOXICITY`	PaLM - 粗鲁、无礼或亵渎性的内容。
`HARM_CATEGORY_VIOLENCE`	PaLM - 描述描绘针对个人或团体的暴力行为的场景，或一般性血腥描述。
`HARM_CATEGORY_SEXUAL`	PaLM - 包含对性行为或其他淫秽内容的引用。
`HARM_CATEGORY_MEDICAL`	PaLM - 宣传未经核实的医疗建议。
`HARM_CATEGORY_DANGEROUS`	PaLM - 宣扬、助长或鼓励有害行为的危险内容。
`HARM_CATEGORY_HARASSMENT`	Gemini - 骚扰内容。
`HARM_CATEGORY_HATE_SPEECH`	Gemini - 仇恨言论和内容。
`HARM_CATEGORY_SEXUALLY_EXPLICIT`	Gemini - 露骨色情内容。
`HARM_CATEGORY_DANGEROUS_CONTENT`	Gemini - 危险内容。
`HARM_CATEGORY_CIVIC_INTEGRITY`	Gemini - 可能被用于损害公民诚信的内容。已弃用：请改用 enableEnhancedCivicAnswers。此项已弃用！

ModalityTokenCount

JSON 表示法
模态

表示单个模态的令牌计数信息。

字段

modality enum (Modality)

与此令牌数量关联的模态。

tokenCount integer

令牌数量。

JSON 表示法
{ "modality": enum (`Modality`), "tokenCount": integer }

模态

内容部分的模态

枚举
`MODALITY_UNSPECIFIED`	未指定模态。
`TEXT`	纯文本。
`IMAGE`	图片。
`VIDEO`	视频。
`AUDIO`	音频。
`DOCUMENT`	文档，例如 PDF。

SafetyRating

JSON 表示法
HarmProbability

内容的安全评级。

安全评级包含内容所属的危害类别以及该类别中的危害概率级别。内容会根据多个危害类别进行安全分类，并在此处显示危害分类的概率。

字段

category enum (HarmCategory)

必需。相应评分的类别。

probability enum (HarmProbability)

必需。相应内容的有害概率。

blocked boolean

此内容是否因该评级而被屏蔽？

JSON 表示法
{ "category": enum (`HarmCategory`), "probability": enum (`HarmProbability`), "blocked": boolean }

HarmProbability

内容有害的概率。

分类系统会给出内容不安全的概率。这并不表示内容的危害严重程度。

枚举
`HARM_PROBABILITY_UNSPECIFIED`	概率未指定。
`NEGLIGIBLE`	内容不安全的概率可忽略不计。
`LOW`	内容不安全的概率较低。
`MEDIUM`	内容不安全的可能性为中等。
`HIGH`	内容不安全的概率较高。

SafetySetting

JSON 表示法
HarmBlockThreshold

安全设置，会影响安全屏蔽行为。

为某个类别传递安全设置会更改允许的内容屏蔽概率。

字段

category enum (HarmCategory)

必需。相应设置的类别。

threshold enum (HarmBlockThreshold)

必需。控制屏蔽有害内容的概率阈值。

JSON 表示法
{ "category": enum (`HarmCategory`), "threshold": enum (`HarmBlockThreshold`) }

HarmBlockThreshold

在达到或超过指定危害概率时进行屏蔽。

枚举
`HARM_BLOCK_THRESHOLD_UNSPECIFIED`	阈值未指定。
`BLOCK_LOW_AND_ABOVE`	内容评级为“可忽略”的视频将获准投放广告。
`BLOCK_MEDIUM_AND_ABOVE`	系统将允许发布风险为“可忽略”和“低”的内容。
`BLOCK_ONLY_HIGH`	内容风险等级为“可忽略”“低”和“中”时，将允许发布。
`BLOCK_NONE`	允许所有内容。
`OFF`	关闭安全过滤条件。

ServiceTier

请求的服务等级。

枚举
`unspecified`	默认服务层级，即标准层级。
`standard`	标准服务层级。
`flex`	Flex 服务层级。
`priority`	优先服务层级。