The Interactions API is now generally available. We recommend using this API to access all the latest features and models.

Generating content

The Gemini API supports content generation with images, audio, code, tools, and more. For details on each of these features, read on and check out the task-focused sample code, or read the comprehensive guides.

Method: models.generateContent

Generates a model response given an input GenerateContentRequest. Refer to the text generation guide for detailed usage information. Input capabilities differ between models, including tuned models. Refer to the model guide and tuning guide for details.

Endpoint

POST https://generativelanguage.googleapis.com/v1beta/{model=models/*}:generateContent

Path parameters

model string

Required. The name of the Model to use for generating the completion.

Format: models/{model}.
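As a rough illustration of how the path parameter fits into the endpoint, the sketch below builds the request URL from a model name. The helper name generate_content_url is hypothetical, and the model ID is only an example taken from the shell samples on this page.

```python
BASE_URL = "https://generativelanguage.googleapis.com/v1beta"


def generate_content_url(model: str) -> str:
    """Return the generateContent URL for a model ID or a full models/{model} name."""
    # Accept both "my-model" and "models/my-model"; the path needs the prefixed form.
    name = model if model.startswith("models/") else f"models/{model}"
    return f"{BASE_URL}/{name}:generateContent"


print(generate_content_url("gemini-2.0-flash"))
```

Accepting both forms is a convenience choice for this sketch; the API itself expects the models/{model} form in the path.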

Request body

The request body contains data with the following structure:

Fields

contents[] object (Content)

Required. The content of the current conversation with the model.

For single-turn queries, this is a single instance. For multi-turn queries such as chat, this is a repeated field that contains the conversation history and the latest request.
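The difference between the two shapes can be sketched with plain dictionaries. The helper text_content is hypothetical; the role and parts layout mirrors the JSON used in the shell chat sample on this page.

```python
import json


def text_content(role: str, text: str) -> dict:
    """Build one Content entry holding a single text part."""
    return {"role": role, "parts": [{"text": text}]}


# Single-turn: contents holds exactly one entry.
single_turn = {
    "contents": [text_content("user", "Write a story about a magic backpack.")]
}

# Multi-turn: contents holds the history in order, ending with the latest request.
multi_turn = {
    "contents": [
        text_content("user", "Hello"),
        text_content("model", "Great to meet you. What would you like to know?"),
        text_content("user", "I have two dogs in my house. How many paws are in my house?"),
    ]
}

print(json.dumps(multi_turn, indent=2))
```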

tools[] object (Tool)

Optional. A list of Tools the Model may use to generate the next response.

A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of the knowledge and scope of the Model. Supported Tools are Function and codeExecution. Refer to the function calling and code execution guides to learn more.

toolConfig object (ToolConfig)

Optional. Tool configuration for any Tool specified in the request. Refer to the function calling guide for a usage example.

safetySettings[] object (SafetySetting)

Optional. A list of unique SafetySetting instances for blocking unsafe content.

This is enforced on GenerateContentRequest.contents and GenerateContentResponse.candidates. There should be no more than one setting for each SafetyCategory type. The API blocks any content and responses that fail to meet the thresholds set by these settings. This list overrides the default settings for each SafetyCategory specified in safetySettings. If the list contains no SafetySetting for a given SafetyCategory, the API uses the default safety setting for that category. The harm categories HARM_CATEGORY_HATE_SPEECH, HARM_CATEGORY_SEXUALLY_EXPLICIT, HARM_CATEGORY_DANGEROUS_CONTENT, HARM_CATEGORY_HARASSMENT, HARM_CATEGORY_CIVIC_INTEGRITY, and HARM_CATEGORY_JAILBREAK are supported. Refer to the guide for detailed information on available safety settings. Also refer to the safety guidance to learn how to incorporate safety considerations in your AI applications.
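The one-setting-per-category rule can be checked on the client before sending a request. The sketch below is a hypothetical helper, not part of any SDK; the threshold values shown are assumptions that do not appear in this section.

```python
from collections import Counter


def check_safety_settings(settings: list[dict]) -> None:
    """Raise ValueError if any harm category appears more than once."""
    counts = Counter(setting["category"] for setting in settings)
    duplicates = sorted(cat for cat, n in counts.items() if n > 1)
    if duplicates:
        raise ValueError(f"more than one setting for: {', '.join(duplicates)}")


settings = [
    # Threshold names are assumed for illustration only.
    {"category": "HARM_CATEGORY_HATE_SPEECH", "threshold": "BLOCK_ONLY_HIGH"},
    {"category": "HARM_CATEGORY_HARASSMENT", "threshold": "BLOCK_MEDIUM_AND_ABOVE"},
]
check_safety_settings(settings)  # passes: each category appears once
print("safety settings are unique per category")
```

Categories left out of the list fall back to the API defaults, so the list only needs the categories you want to override.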

systemInstruction object (Content)

Optional. Developer-set system instructions. Currently, text only.

generationConfig object (GenerationConfig)

Optional. Configuration options for model generation and outputs.

cachedContent string

Optional. The name of the cached content to use as context to serve the prediction. Format: cachedContents/{cachedContent}

serviceTier enum (ServiceTier)

Optional. The service tier of the request.

store boolean

Optional. Configures the logging behavior for this request. If set, it takes precedence over the project-level logging configuration.
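To show how the required and optional fields combine, here is a minimal sketch that assembles a request body and leaves out any optional field that was not set. The build_request helper and the instruction text are hypothetical; the field names come from the list above, and the generationConfig key follows the shell JSON sample on this page.

```python
import json


def build_request(contents, *, system_instruction=None, generation_config=None,
                  cached_content=None, store=None):
    """Assemble a generateContent request body, omitting unset optional fields."""
    body = {"contents": contents}
    optional = {
        "systemInstruction": system_instruction,
        "generationConfig": generation_config,
        "cachedContent": cached_content,
        "store": store,
    }
    # Keep an explicit False for "store"; drop only fields that were never set.
    body.update({key: value for key, value in optional.items() if value is not None})
    return body


request = build_request(
    [{"role": "user", "parts": [{"text": "List a few popular cookie recipes."}]}],
    # systemInstruction is a Content object; text only, per the field description.
    system_instruction={"parts": [{"text": "You are a concise baking assistant."}]},
    generation_config={"response_mime_type": "application/json"},
    store=False,
)
print(json.dumps(request, indent=2))
```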

Example request

Text

Python

from google import genai

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.6-flash", contents="Write a story about a magic backpack."
)
print(response.text)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: "Write a story about a magic backpack.",
});
console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
contents := []*genai.Content{
	genai.NewContentFromText("Write a story about a magic backpack.", genai.RoleUser),
}
response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[{"text": "Write a story about a magic backpack."}]
        }]
       }' 2> /dev/null

Java

Client client = new Client();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                "Write a story about a magic backpack.",
                null);

System.out.println(response.text());

Image

Python

from google import genai
import PIL.Image

client = genai.Client()
organ = PIL.Image.open(media / "organ.jpg")
response = client.models.generate_content(
    model="gemini-3.6-flash", contents=["Tell me about this instrument", organ]
)
print(response.text)

Node.js

// Make sure to include the following imports:
// import {GoogleGenAI, createUserContent, createPartFromUri} from '@google/genai';
// import path from 'path';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const organ = await ai.files.upload({
  file: path.join(media, "organ.jpg"),
});

const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: [
    createUserContent([
      "Tell me about this instrument", 
      createPartFromUri(organ.uri, organ.mimeType)
    ]),
  ],
});
console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "organ.jpg"), 
	&genai.UploadFileConfig{
		MIMEType : "image/jpeg",
	},
)
if err != nil {
	log.Fatal(err)
}
parts := []*genai.Part{
	genai.NewPartFromText("Tell me about this instrument"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}
contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

# Use a temporary file to hold the base64 encoded image data
TEMP_B64=$(mktemp)
trap 'rm -f "$TEMP_B64"' EXIT
base64 $B64FLAGS $IMG_PATH > "$TEMP_B64"

# Use a temporary file to hold the JSON payload
TEMP_JSON=$(mktemp)
# A second EXIT trap replaces the first, so clean up both temporary files here.
trap 'rm -f "$TEMP_B64" "$TEMP_JSON"' EXIT

cat > "$TEMP_JSON" << EOF
{
  "contents": [{
    "parts":[
      {"text": "Tell me about this instrument"},
      {
        "inline_data": {
          "mime_type":"image/jpeg",
          "data": "$(cat "$TEMP_B64")"
        }
      }
    ]
  }]
}
EOF

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d "@$TEMP_JSON" 2> /dev/null

Java

Client client = new Client();

String path = media_path + "organ.jpg";
byte[] imageData = Files.readAllBytes(Paths.get(path));

Content content =
        Content.fromParts(
                Part.fromText("Tell me about this instrument."),
                Part.fromBytes(imageData, "image/jpeg"));

GenerateContentResponse response = client.models.generateContent("gemini-3.6-flash", content, null);

System.out.println(response.text());

Audio

Python

from google import genai

client = genai.Client()
sample_audio = client.files.upload(file=media / "sample.mp3")
response = client.models.generate_content(
    model="gemini-3.6-flash",
    contents=["Give me a summary of this audio file.", sample_audio],
)
print(response.text)

Node.js

// Make sure to include the following imports:
// import {GoogleGenAI, createUserContent, createPartFromUri} from '@google/genai';
// import path from 'path';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const audio = await ai.files.upload({
  file: path.join(media, "sample.mp3"),
});

const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: [
    createUserContent([
      "Give me a summary of this audio file.",
      createPartFromUri(audio.uri, audio.mimeType),
    ]),
  ],
});
console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "sample.mp3"), 
	&genai.UploadFileConfig{
		MIMEType : "audio/mpeg",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this audio file."),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

# Use File API to upload audio data to API request.
MIME_TYPE=$(file -b --mime-type "${AUDIO_PATH}")
NUM_BYTES=$(wc -c < "${AUDIO_PATH}")
DISPLAY_NAME=AUDIO

tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${AUDIO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Please describe this file."},
          {"file_data":{"mime_type": "audio/mpeg", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

jq ".candidates[].content.parts[].text" response.json

Video

Python

from google import genai
import time

client = genai.Client()
# Video clip (CC BY 3.0) from https://peach.blender.org/download/
myfile = client.files.upload(file=media / "Big_Buck_Bunny.mp4")
print(f"{myfile=}")

# Poll until the video file is completely processed (state becomes ACTIVE).
while not myfile.state or myfile.state.name != "ACTIVE":
    print("Processing video...")
    print("File state:", myfile.state)
    time.sleep(5)
    myfile = client.files.get(name=myfile.name)

response = client.models.generate_content(
    model="gemini-3.6-flash", contents=[myfile, "Describe this video clip"]
)
print(f"{response.text=}")

Node.js

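// Make sure to include the following imports:
// import {GoogleGenAI, createUserContent, createPartFromUri} from '@google/genai';
// import path from 'path';
// The polling loop below also assumes a sleep helper such as:
// const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));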
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

let video = await ai.files.upload({
  file: path.join(media, 'Big_Buck_Bunny.mp4'),
});

// Poll until the video file is completely processed (state becomes ACTIVE).
while (!video.state || video.state.toString() !== 'ACTIVE') {
  console.log('Processing video...');
  console.log('File state: ', video.state);
  await sleep(5000);
  video = await ai.files.get({name: video.name});
}

const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: [
    createUserContent([
      "Describe this video clip",
      createPartFromUri(video.uri, video.mimeType),
    ]),
  ],
});
console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "Big_Buck_Bunny.mp4"), 
	&genai.UploadFileConfig{
		MIMEType : "video/mp4",
	},
)
if err != nil {
	log.Fatal(err)
}

// Poll until the video file is completely processed (state becomes ACTIVE).
for file.State != genai.FileStateActive {
	fmt.Println("Processing video...")
	fmt.Println("File state:", file.State)
	time.Sleep(5 * time.Second)

	file, err = client.Files.Get(ctx, file.Name, nil)
	if err != nil {
		log.Fatal(err)
	}
}

parts := []*genai.Part{
	genai.NewPartFromText("Describe this video clip"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

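# Use File API to upload video data to API request.
MIME_TYPE=$(file -b --mime-type "${VIDEO_PATH}")
NUM_BYTES=$(wc -c < "${VIDEO_PATH}")
DISPLAY_NAME=VIDEO

# File that receives the response headers of the initial upload request.
tmp_header_file=upload-header.tmp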

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D "${tmp_header_file}" \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${VIDEO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

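state=$(jq ".file.state" file_info.json)
echo state=$state

# -r strips the JSON quotes so the resource name (files/...) can be used in a URL.
name=$(jq -r ".file.name" file_info.json)
echo name=$name

while [[ "($state)" = *"PROCESSING"* ]];
do
  echo "Processing video..."
  sleep 5
  # Get the file of interest to check state. The name already includes the
  # "files/" prefix, and files.get returns the File resource at the top level.
  curl "https://generativelanguage.googleapis.com/v1beta/${name}?key=${GEMINI_API_KEY}" 2> /dev/null > file_info.json
  state=$(jq ".state" file_info.json)
done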

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Transcribe the audio from this video, giving timestamps for salient events in the video. Also provide visual descriptions."},
          {"file_data":{"mime_type": "video/mp4", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

jq ".candidates[].content.parts[].text" response.json

PDF

Python

from google import genai

client = genai.Client()
sample_pdf = client.files.upload(file=media / "test.pdf")
response = client.models.generate_content(
    model="gemini-3.6-flash",
    contents=["Give me a summary of this document:", sample_pdf],
)
print(f"{response.text=}")

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "test.pdf"), 
	&genai.UploadFileConfig{
		MIMEType : "application/pdf",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this document:"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, nil)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

MIME_TYPE=$(file -b --mime-type "${PDF_PATH}")
NUM_BYTES=$(wc -c < "${PDF_PATH}")
DISPLAY_NAME=TEXT


echo $MIME_TYPE
tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${PDF_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

# Now generate content using that file
curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Can you add a few more lines to this poem?"},
          {"file_data":{"mime_type": "application/pdf", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

jq ".candidates[].content.parts[].text" response.json

Chat

Python

from google import genai
from google.genai import types

client = genai.Client()
# Pass initial history using the "history" argument
chat = client.chats.create(
    model="gemini-3.6-flash",
    history=[
        types.Content(role="user", parts=[types.Part(text="Hello")]),
        types.Content(
            role="model",
            parts=[
                types.Part(
                    text="Great to meet you. What would you like to know?"
                )
            ],
        ),
    ],
)
response = chat.send_message(message="I have 2 dogs in my house.")
print(response.text)
response = chat.send_message(message="How many paws are in my house?")
print(response.text)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const chat = ai.chats.create({
  model: "gemini-3.6-flash",
  history: [
    {
      role: "user",
      parts: [{ text: "Hello" }],
    },
    {
      role: "model",
      parts: [{ text: "Great to meet you. What would you like to know?" }],
    },
  ],
});

const response1 = await chat.sendMessage({
  message: "I have 2 dogs in my house.",
});
console.log("Chat response 1:", response1.text);

const response2 = await chat.sendMessage({
  message: "How many paws are in my house?",
});
console.log("Chat response 2:", response2.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

// Pass initial history using the History field.
history := []*genai.Content{
	genai.NewContentFromText("Hello", genai.RoleUser),
	genai.NewContentFromText("Great to meet you. What would you like to know?", genai.RoleModel),
}

chat, err := client.Chats.Create(ctx, "gemini-3.6-flash", nil, history)
if err != nil {
	log.Fatal(err)
}

firstResp, err := chat.SendMessage(ctx, genai.Part{Text: "I have 2 dogs in my house."})
if err != nil {
	log.Fatal(err)
}
fmt.Println(firstResp.Text())

secondResp, err := chat.SendMessage(ctx, genai.Part{Text: "How many paws are in my house?"})
if err != nil {
	log.Fatal(err)
}
fmt.Println(secondResp.Text())

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [
        {"role":"user",
         "parts":[{
           "text": "Hello"}]},
        {"role": "model",
         "parts":[{
           "text": "Great to meet you. What would you like to know?"}]},
        {"role":"user",
         "parts":[{
           "text": "I have two dogs in my house. How many paws are in my house?"}]}
      ]
    }' 2> /dev/null | grep "text"

Java

Client client = new Client();

Content userContent = Content.fromParts(Part.fromText("Hello"));
Content modelContent =
        Content.builder()
                .role("model")
                .parts(
                        Collections.singletonList(
                                Part.fromText("Great to meet you. What would you like to know?")
                        )
                ).build();

Chat chat = client.chats.create(
        "gemini-3.6-flash",
        GenerateContentConfig.builder()
                .systemInstruction(userContent)
                .systemInstruction(modelContent)
                .build()
);

GenerateContentResponse response1 = chat.sendMessage("I have 2 dogs in my house.");
System.out.println(response1.text());

GenerateContentResponse response2 = chat.sendMessage("How many paws are in my house?");
System.out.println(response2.text());

Cache

Python

from google import genai
from google.genai import types

client = genai.Client()
document = client.files.upload(file=media / "a11.txt")
model_name = "gemini-3.6-flash"

cache = client.caches.create(
    model=model_name,
    config=types.CreateCachedContentConfig(
        contents=[document],
        system_instruction="You are an expert analyzing transcripts.",
    ),
)
print(cache)

response = client.models.generate_content(
    model=model_name,
    contents="Please summarize this transcript",
    config=types.GenerateContentConfig(cached_content=cache.name),
)
print(response.text)

Node.js

// Make sure to include the following imports:
// import {GoogleGenAI, createUserContent, createPartFromUri} from '@google/genai';
// import path from 'path';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const filePath = path.join(media, "a11.txt");
const document = await ai.files.upload({
  file: filePath,
  config: { mimeType: "text/plain" },
});
console.log("Uploaded file name:", document.name);
const modelName = "gemini-3.6-flash";

const contents = [
  createUserContent(createPartFromUri(document.uri, document.mimeType)),
];

const cache = await ai.caches.create({
  model: modelName,
  config: {
    contents: contents,
    systemInstruction: "You are an expert analyzing transcripts.",
  },
});
console.log("Cache created:", cache);

const response = await ai.models.generateContent({
  model: modelName,
  contents: "Please summarize this transcript",
  config: { cachedContent: cache.name },
});
console.log("Response text:", response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"), 
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

modelName := "gemini-3.6-flash"
document, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "a11.txt"), 
	&genai.UploadFileConfig{
		MIMEType : "text/plain",
	},
)
if err != nil {
	log.Fatal(err)
}
parts := []*genai.Part{
	genai.NewPartFromURI(document.URI, document.MIMEType),
}
contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}
cache, err := client.Caches.Create(ctx, modelName, &genai.CreateCachedContentConfig{
	Contents: contents,
	SystemInstruction: genai.NewContentFromText(
		"You are an expert analyzing transcripts.", genai.RoleUser,
	),
})
if err != nil {
	log.Fatal(err)
}
fmt.Println("Cache created:")
fmt.Println(cache)

// Use the cache for generating content.
response, err := client.Models.GenerateContent(
	ctx,
	modelName,
	genai.Text("Please summarize this transcript"),
	&genai.GenerateContentConfig{
		CachedContent: cache.Name,
	},
)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Tuned model

Python

# With Gemini 2 we're launching a new SDK. See the following doc for details.
# https://ai.google.dev/gemini-api/docs/migrate

JSON mode

Python

from google import genai
from google.genai import types
from typing_extensions import TypedDict

class Recipe(TypedDict):
    recipe_name: str
    ingredients: list[str]

client = genai.Client()
result = client.models.generate_content(
    model="gemini-3.6-flash",
    contents="List a few popular cookie recipes.",
    config=types.GenerateContentConfig(
        response_mime_type="application/json", response_schema=list[Recipe]
    ),
)
print(result)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: "List a few popular cookie recipes.",
  config: {
    responseMimeType: "application/json",
    responseSchema: {
      type: "array",
      items: {
        type: "object",
        properties: {
          recipeName: { type: "string" },
          ingredients: { type: "array", items: { type: "string" } },
        },
        required: ["recipeName", "ingredients"],
      },
    },
  },
});
console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"), 
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

schema := &genai.Schema{
	Type: genai.TypeArray,
	Items: &genai.Schema{
		Type: genai.TypeObject,
		Properties: map[string]*genai.Schema{
			"recipe_name": {Type: genai.TypeString},
			"ingredients": {
				Type:  genai.TypeArray,
				Items: &genai.Schema{Type: genai.TypeString},
			},
		},
		Required: []string{"recipe_name"},
	},
}

config := &genai.GenerateContentConfig{
	ResponseMIMEType: "application/json",
	ResponseSchema:   schema,
}

response, err := client.Models.GenerateContent(
	ctx,
	"gemini-3.6-flash",
	genai.Text("List a few popular cookie recipes."),
	config,
)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
-H 'Content-Type: application/json' \
-d '{
    "contents": [{
      "parts":[
        {"text": "List 5 popular cookie recipes"}
        ]
    }],
    "generationConfig": {
        "response_mime_type": "application/json",
        "response_schema": {
          "type": "ARRAY",
          "items": {
            "type": "OBJECT",
            "properties": {
              "recipe_name": {"type":"STRING"}
            }
          }
        }
    }
}' 2> /dev/null | head

Java

Client client = new Client();

Schema recipeSchema = Schema.builder()
        .type(Array.class.getSimpleName())
        .items(Schema.builder()
                .type(Object.class.getSimpleName())
                .properties(
                        Map.of("recipe_name", Schema.builder()
                                        .type(String.class.getSimpleName())
                                        .build(),
                                "ingredients", Schema.builder()
                                        .type(Array.class.getSimpleName())
                                        .items(Schema.builder()
                                                .type(String.class.getSimpleName())
                                                .build())
                                        .build())
                )
                .required(List.of("recipe_name", "ingredients"))
                .build())
        .build();

GenerateContentConfig config =
        GenerateContentConfig.builder()
                .responseMimeType("application/json")
                .responseSchema(recipeSchema)
                .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                "List a few popular cookie recipes.",
                config);

System.out.println(response.text());

Code execution

Python

from google import genai
from google.genai import types

client = genai.Client()
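response = client.models.generate_content(
    model="gemini-3.6-flash",
    contents=(
        "Write and execute code that calculates the sum of the first 50 prime numbers. "
        "Ensure that only the executable code and its resulting output are generated."
    ),
    # Enable the code execution tool so the model can run the code it writes.
    config=types.GenerateContentConfig(
        tools=[types.Tool(code_execution=types.ToolCodeExecution())],
    ),
)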
# Each part may contain text, executable code, or an execution result.
for part in response.candidates[0].content.parts:
    print(part, "\n")

print("-" * 80)
# The .text accessor concatenates the parts into a markdown-formatted text.
print("\n", response.text)

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

response, err := client.Models.GenerateContent(
	ctx,
	"gemini-3.6-flash",
	genai.Text(
		`Write and execute code that calculates the sum of the first 50 prime numbers.
		 Ensure that only the executable code and its resulting output are generated.`,
	),
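	// Enable the code execution tool so the model can run the code it writes.
	&genai.GenerateContentConfig{
		Tools: []*genai.Tool{
			{CodeExecution: &genai.ToolCodeExecution{}},
		},
	},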
)
if err != nil {
	log.Fatal(err)
}

// Print the response.
printResponse(response)

fmt.Println("--------------------------------------------------------------------------------")
fmt.Println(response.Text())

Java

Client client = new Client();

String prompt = """
        Write and execute code that calculates the sum of the first 50 prime numbers.
        Ensure that only the executable code and its resulting output are generated.
        """;

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                prompt,
                null);

for (Part part : response.candidates().get().getFirst().content().get().parts().get()) {
    System.out.println(part + "\n");
}

System.out.println("-".repeat(80));
System.out.println(response.text());

Function calling

Python

from google import genai
from google.genai import types

client = genai.Client()

def add(a: float, b: float) -> float:
    """returns a + b."""
    return a + b

def subtract(a: float, b: float) -> float:
    """returns a - b."""
    return a - b

def multiply(a: float, b: float) -> float:
    """returns a * b."""
    return a * b

def divide(a: float, b: float) -> float:
    """returns a / b."""
    return a / b

# Create a chat session; function calling (via tools) is enabled in the config.
chat = client.chats.create(
    model="gemini-3.6-flash",
    config=types.GenerateContentConfig(tools=[add, subtract, multiply, divide]),
)
response = chat.send_message(
    message="I have 57 cats, each owns 44 mittens, how many mittens is that in total?"
)
print(response.text)

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
modelName := "gemini-3.6-flash"

// Create the function declarations for arithmetic operations.
addDeclaration := createArithmeticToolDeclaration("addNumbers", "Return the result of adding two numbers.")
subtractDeclaration := createArithmeticToolDeclaration("subtractNumbers", "Return the result of subtracting the second number from the first.")
multiplyDeclaration := createArithmeticToolDeclaration("multiplyNumbers", "Return the product of two numbers.")
divideDeclaration := createArithmeticToolDeclaration("divideNumbers", "Return the quotient of dividing the first number by the second.")

// Group the function declarations as a tool.
tools := []*genai.Tool{
	{
		FunctionDeclarations: []*genai.FunctionDeclaration{
			addDeclaration,
			subtractDeclaration,
			multiplyDeclaration,
			divideDeclaration,
		},
	},
}

// Create the content prompt.
contents := []*genai.Content{
	genai.NewContentFromText(
		"I have 57 cats, each owns 44 mittens, how many mittens is that in total?", genai.RoleUser,
	),
}

// Set up the generate content configuration with function calling enabled.
config := &genai.GenerateContentConfig{
	Tools: tools,
	ToolConfig: &genai.ToolConfig{
		FunctionCallingConfig: &genai.FunctionCallingConfig{
			// The mode equivalent to FunctionCallingConfigMode.ANY in JS.
			Mode: genai.FunctionCallingConfigModeAny,
		},
	},
}

genContentResp, err := client.Models.GenerateContent(ctx, modelName, contents, config)
if err != nil {
	log.Fatal(err)
}

// Assume the response includes a list of function calls.
if len(genContentResp.FunctionCalls()) == 0 {
	log.Println("No function call returned from the AI.")
	return nil
}
functionCall := genContentResp.FunctionCalls()[0]
log.Printf("Function call: %+v\n", functionCall)

// Marshal the Args map into JSON bytes.
argsMap, err := json.Marshal(functionCall.Args)
if err != nil {
	log.Fatal(err)
}

// Unmarshal the JSON bytes into the ArithmeticArgs struct.
var args ArithmeticArgs
if err := json.Unmarshal(argsMap, &args); err != nil {
	log.Fatal(err)
}

// Map the function name to the actual arithmetic function.
var result float64
switch functionCall.Name {
	case "addNumbers":
		result = add(args.FirstParam, args.SecondParam)
	case "subtractNumbers":
		result = subtract(args.FirstParam, args.SecondParam)
	case "multiplyNumbers":
		result = multiply(args.FirstParam, args.SecondParam)
	case "divideNumbers":
		result = divide(args.FirstParam, args.SecondParam)
	default:
		return fmt.Errorf("unimplemented function: %s", functionCall.Name)
}
log.Printf("Function result: %v\n", result)

// Prepare the final result message as content.
resultContents := []*genai.Content{
	genai.NewContentFromText("The final result is " + fmt.Sprintf("%v", result), genai.RoleUser),
}

// Use GenerateContent to send the final result.
finalResponse, err := client.Models.GenerateContent(ctx, modelName, resultContents, &genai.GenerateContentConfig{})
if err != nil {
	log.Fatal(err)
}

printResponse(finalResponse)

Node.js

  // Make sure to include the following import:
  // import {GoogleGenAI, FunctionCallingConfigMode} from '@google/genai';
  const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

  /**
   * The add function returns the sum of two numbers.
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function add(a, b) {
    return a + b;
  }

  /**
   * The subtract function returns the difference (a - b).
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function subtract(a, b) {
    return a - b;
  }

  /**
   * The multiply function returns the product of two numbers.
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function multiply(a, b) {
    return a * b;
  }

  /**
   * The divide function returns the quotient of a divided by b.
   * @param {number} a
   * @param {number} b
   * @returns {number}
   */
  function divide(a, b) {
    return a / b;
  }

  const addDeclaration = {
    name: "addNumbers",
    parameters: {
      type: "object",
      description: "Return the result of adding two numbers.",
      properties: {
        firstParam: {
          type: "number",
          description:
            "The first parameter which can be an integer or a floating point number.",
        },
        secondParam: {
          type: "number",
          description:
            "The second parameter which can be an integer or a floating point number.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  const subtractDeclaration = {
    name: "subtractNumbers",
    parameters: {
      type: "object",
      description:
        "Return the result of subtracting the second number from the first.",
      properties: {
        firstParam: {
          type: "number",
          description: "The first parameter.",
        },
        secondParam: {
          type: "number",
          description: "The second parameter.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  const multiplyDeclaration = {
    name: "multiplyNumbers",
    parameters: {
      type: "object",
      description: "Return the product of two numbers.",
      properties: {
        firstParam: {
          type: "number",
          description: "The first parameter.",
        },
        secondParam: {
          type: "number",
          description: "The second parameter.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  const divideDeclaration = {
    name: "divideNumbers",
    parameters: {
      type: "object",
      description:
        "Return the quotient of dividing the first number by the second.",
      properties: {
        firstParam: {
          type: "number",
          description: "The first parameter.",
        },
        secondParam: {
          type: "number",
          description: "The second parameter.",
        },
      },
      required: ["firstParam", "secondParam"],
    },
  };

  // Step 1: Call generateContent with function calling enabled.
  const generateContentResponse = await ai.models.generateContent({
    model: "gemini-3.6-flash",
    contents:
      "I have 57 cats, each owns 44 mittens, how many mittens is that in total?",
    config: {
      toolConfig: {
        functionCallingConfig: {
          mode: FunctionCallingConfigMode.ANY,
        },
      },
      tools: [
        {
          functionDeclarations: [
            addDeclaration,
            subtractDeclaration,
            multiplyDeclaration,
            divideDeclaration,
          ],
        },
      ],
    },
  });

  // Step 2: Extract the function call.
  // Assuming the response contains a 'functionCalls' array.
  const functionCall =
    generateContentResponse.functionCalls &&
    generateContentResponse.functionCalls[0];
  if (!functionCall) {
    console.error("No function call returned by the model.");
    return generateContentResponse;
  }
  console.log(functionCall);

  // Parse the arguments.
  const args = functionCall.args;
  // Expected args format: { firstParam: number, secondParam: number }

  // Step 3: Invoke the actual function based on the function name.
  const functionMapping = {
    addNumbers: add,
    subtractNumbers: subtract,
    multiplyNumbers: multiply,
    divideNumbers: divide,
  };
  const func = functionMapping[functionCall.name];
  if (!func) {
    console.error("Unimplemented error:", functionCall.name);
    return generateContentResponse;
  }
  const resultValue = func(args.firstParam, args.secondParam);
  console.log("Function result:", resultValue);

  // Step 4: Use the chat API to send the result as the final answer.
  const chat = ai.chats.create({ model: "gemini-3.6-flash" });
  const chatResponse = await chat.sendMessage({
    message: "The final result is " + resultValue,
  });
  console.log(chatResponse.text);
  return chatResponse;
}

Shell


cat > tools.json << EOF
{
  "function_declarations": [
    {
      "name": "enable_lights",
      "description": "Turn on the lighting system."
    },
    {
      "name": "set_light_color",
      "description": "Set the light color. Lights must be enabled for this to work.",
      "parameters": {
        "type": "object",
        "properties": {
          "rgb_hex": {
            "type": "string",
            "description": "The light color as a 6-digit hex string, e.g. ff0000 for red."
          }
        },
        "required": [
          "rgb_hex"
        ]
      }
    },
    {
      "name": "stop_lights",
      "description": "Turn off the lighting system."
    }
  ]
} 
EOF

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
  -H 'Content-Type: application/json' \
  -d @<(echo '
  {
    "system_instruction": {
      "parts": {
        "text": "You are a helpful lighting system bot. You can turn lights on and off, and you can set the color. Do not perform any other tasks."
      }
    },
    "tools": ['$(cat tools.json)'],

    "tool_config": {
      "function_calling_config": {"mode": "auto"}
    },

    "contents": {
      "role": "user",
      "parts": {
        "text": "Turn on the lights please."
      }
    }
  }
') 2>/dev/null |sed -n '/"content"/,/"finishReason"/p'

Java

Client client = new Client();

FunctionDeclaration addFunction =
        FunctionDeclaration.builder()
                .name("addNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

FunctionDeclaration subtractFunction =
        FunctionDeclaration.builder()
                .name("subtractNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

FunctionDeclaration multiplyFunction =
        FunctionDeclaration.builder()
                .name("multiplyNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

FunctionDeclaration divideFunction =
        FunctionDeclaration.builder()
                .name("divideNumbers")
                .parameters(
                        Schema.builder()
                                .type("object")
                                .properties(Map.of(
                                        "firstParam", Schema.builder().type("number").description("First number").build(),
                                        "secondParam", Schema.builder().type("number").description("Second number").build()))
                                .required(Arrays.asList("firstParam", "secondParam"))
                                .build())
                .build();

GenerateContentConfig config = GenerateContentConfig.builder()
        .toolConfig(ToolConfig.builder().functionCallingConfig(
                FunctionCallingConfig.builder().mode("ANY").build()
        ).build())
        .tools(
                Collections.singletonList(
                        Tool.builder().functionDeclarations(
                                Arrays.asList(
                                        addFunction,
                                        subtractFunction,
                                        divideFunction,
                                        multiplyFunction
                                )
                        ).build()

                )
        )
        .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                "I have 57 cats, each owns 44 mittens, how many mittens is that in total?",
                config);


if (response.functionCalls() == null || response.functionCalls().isEmpty()) {
    System.err.println("No function call received");
    return null;
}

var functionCall = response.functionCalls().getFirst();
String functionName = functionCall.name().get();
var arguments = functionCall.args();

Map<String, BiFunction<Double, Double, Double>> functionMapping = new HashMap<>();
functionMapping.put("addNumbers", (a, b) -> a + b);
functionMapping.put("subtractNumbers", (a, b) -> a - b);
functionMapping.put("multiplyNumbers", (a, b) -> a * b);
functionMapping.put("divideNumbers", (a, b) -> b != 0 ? a / b : Double.NaN);

BiFunction<Double, Double, Double> function = functionMapping.get(functionName);

Number firstParam = (Number) arguments.get().get("firstParam");
Number secondParam = (Number) arguments.get().get("secondParam");
Double result = function.apply(firstParam.doubleValue(), secondParam.doubleValue());

System.out.println(result);
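
The Go, Node.js, and Java samples above all follow the same pattern: read the function name and arguments returned by the model, then dispatch to a local implementation. The following is a minimal sketch of that dispatch step alone, with no API call. The function names and the firstParam/secondParam argument names are taken from the samples above; the dispatch helper and the hand-written function call are hypothetical.

```python
from typing import Callable, Dict

# Local implementations keyed by the declared function names.
ARITHMETIC: Dict[str, Callable[[float, float], float]] = {
    "addNumbers": lambda a, b: a + b,
    "subtractNumbers": lambda a, b: a - b,
    "multiplyNumbers": lambda a, b: a * b,
    "divideNumbers": lambda a, b: a / b,
}


def dispatch(function_call: dict) -> float:
    """Run the local function named in a function-call-shaped dict."""
    name = function_call["name"]
    if name not in ARITHMETIC:
        raise ValueError(f"unimplemented function: {name}")
    args = function_call["args"]
    return ARITHMETIC[name](float(args["firstParam"]), float(args["secondParam"]))


# Hand-written stand-in for the first function call in a response.
call = {"name": "multiplyNumbers", "args": {"firstParam": 57, "secondParam": 44}}
print(dispatch(call))  # 2508.0
```

A lookup table keeps the set of callable functions explicit, so an unexpected name from the model fails loudly instead of running arbitrary code.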

Generation config

Python

from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.6-flash",
    contents="Tell me a story about a magic backpack.",
    config=types.GenerateContentConfig(
        candidate_count=1,
        stop_sequences=["x"],
        max_output_tokens=20,
        temperature=1.0,
    ),
)
print(response.text)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: "Tell me a story about a magic backpack.",
  config: {
    candidateCount: 1,
    stopSequences: ["x"],
    maxOutputTokens: 20,
    temperature: 1.0,
  },
});

console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

// Create local variables for parameters.
candidateCount := int32(1)
maxOutputTokens := int32(20)
temperature := float32(1.0)

response, err := client.Models.GenerateContent(
	ctx,
	"gemini-3.6-flash",
	genai.Text("Tell me a story about a magic backpack."),
	&genai.GenerateContentConfig{
		CandidateCount:  candidateCount,
		StopSequences:   []string{"x"},
		MaxOutputTokens: maxOutputTokens,
		Temperature:     &temperature,
	},
)
if err != nil {
	log.Fatal(err)
}

printResponse(response)

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
        "contents": [{
            "parts":[
                {"text": "Explain how AI works"}
            ]
        }],
        "generationConfig": {
            "stopSequences": [
                "Title"
            ],
            "temperature": 1.0,
            "maxOutputTokens": 800,
            "topP": 0.8,
            "topK": 10
        }
    }'  2> /dev/null | grep "text"

Java

Client client = new Client();

GenerateContentConfig config =
        GenerateContentConfig.builder()
                .candidateCount(1)
                .stopSequences(List.of("x"))
                .maxOutputTokens(20)
                .temperature(1.0F)
                .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                "Tell me a story about a magic backpack.",
                config);

System.out.println(response.text());
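
The samples above spell the same settings differently: the Python SDK uses snake_case names such as max_output_tokens, while the REST generationConfig object uses camelCase names such as maxOutputTokens. The following is a minimal sketch of converting one form to the other. Both helpers are hypothetical and only rename keys; they do not validate values.

```python
def to_camel_case(name: str) -> str:
    """Convert a snake_case name to camelCase, e.g. top_p -> topP."""
    head, *rest = name.split("_")
    return head + "".join(word.capitalize() for word in rest)


def to_rest_generation_config(config: dict) -> dict:
    """Rename every key of a snake_case config dict to camelCase."""
    return {to_camel_case(key): value for key, value in config.items()}


python_style = {
    "candidate_count": 1,
    "stop_sequences": ["x"],
    "max_output_tokens": 20,
    "temperature": 1.0,
}
print(to_rest_generation_config(python_style))
```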

Safety settings

Python

from google import genai
from google.genai import types

client = genai.Client()
unsafe_prompt = (
    "I support Martians Soccer Club and I think Jupiterians Football Club sucks! "
    "Write an ironic phrase about them including expletives."
)
response = client.models.generate_content(
    model="gemini-3.6-flash",
    contents=unsafe_prompt,
    config=types.GenerateContentConfig(
        safety_settings=[
            types.SafetySetting(
                category="HARM_CATEGORY_HATE_SPEECH",
                threshold="BLOCK_MEDIUM_AND_ABOVE",
            ),
            types.SafetySetting(
                category="HARM_CATEGORY_HARASSMENT", threshold="BLOCK_ONLY_HIGH"
            ),
        ]
    ),
)
try:
    print(response.text)
except Exception:
    print("No information generated by the model.")

print(response.candidates[0].safety_ratings)

Node.js

  // Make sure to include the following import:
  // import {GoogleGenAI} from '@google/genai';
  const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
  const unsafePrompt =
    "I support Martians Soccer Club and I think Jupiterians Football Club sucks! Write an ironic phrase about them including expletives.";

  const response = await ai.models.generateContent({
    model: "gemini-3.6-flash",
    contents: unsafePrompt,
    config: {
      safetySettings: [
        {
          category: "HARM_CATEGORY_HATE_SPEECH",
          threshold: "BLOCK_MEDIUM_AND_ABOVE",
        },
        {
          category: "HARM_CATEGORY_HARASSMENT",
          threshold: "BLOCK_ONLY_HIGH",
        },
      ],
    },
  });

  try {
    console.log("Generated text:", response.text);
  } catch (error) {
    console.log("No information generated by the model.");
  }
  console.log("Safety ratings:", response.candidates[0].safetyRatings);
  return response;
}

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

unsafePrompt := "I support Martians Soccer Club and I think Jupiterians Football Club sucks! " +
	"Write an ironic phrase about them including expletives."

config := &genai.GenerateContentConfig{
	SafetySettings: []*genai.SafetySetting{
		{
			Category:  "HARM_CATEGORY_HATE_SPEECH",
			Threshold: "BLOCK_MEDIUM_AND_ABOVE",
		},
		{
			Category:  "HARM_CATEGORY_HARASSMENT",
			Threshold: "BLOCK_ONLY_HIGH",
		},
	},
}
contents := []*genai.Content{
	genai.NewContentFromText(unsafePrompt, genai.RoleUser),
}
response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, config)
if err != nil {
	log.Fatal(err)
}

// Print the generated text.
text := response.Text()
fmt.Println("Generated text:", text)

// Print the finish reason and safety ratings from the first candidate.
if len(response.Candidates) > 0 {
	fmt.Println("Finish reason:", response.Candidates[0].FinishReason)
	safetyRatings, err := json.MarshalIndent(response.Candidates[0].SafetyRatings, "", "  ")
	if err != nil {
		return err
	}
	fmt.Println("Safety ratings:", string(safetyRatings))
} else {
	fmt.Println("No candidate returned.")
}

Shell

echo '{
    "safetySettings": [
        {"category": "HARM_CATEGORY_HARASSMENT", "threshold": "BLOCK_ONLY_HIGH"},
        {"category": "HARM_CATEGORY_HATE_SPEECH", "threshold": "BLOCK_MEDIUM_AND_ABOVE"}
    ],
    "contents": [{
        "parts":[{
            "text": "I support Martians Soccer Club and I think Jupiterians Football Club sucks! Write an ironic phrase about them."}]}]}' > request.json

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d @request.json 2> /dev/null

Java

Client client = new Client();

String unsafePrompt = """
         I support Martians Soccer Club and I think Jupiterians Football Club sucks!
         Write an ironic phrase about them including expletives.
        """;

GenerateContentConfig config =
        GenerateContentConfig.builder()
                .safetySettings(Arrays.asList(
                        SafetySetting.builder()
                                .category("HARM_CATEGORY_HATE_SPEECH")
                                .threshold("BLOCK_MEDIUM_AND_ABOVE")
                                .build(),
                        SafetySetting.builder()
                                .category("HARM_CATEGORY_HARASSMENT")
                                .threshold("BLOCK_ONLY_HIGH")
                                .build()
                )).build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                unsafePrompt,
                config);

try {
    System.out.println(response.text());
} catch (Exception e) {
    System.out.println("No information generated by the model");
}

System.out.println(response.candidates().get().getFirst().safetyRatings());
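
The safetySettings field is described as a list of unique SafetySetting instances, so it can help to check for repeated categories before sending a request. The following is a minimal sketch of such a client-side check. Both helpers are hypothetical; the category and threshold strings are the ones used in the samples above.

```python
def build_safety_settings(thresholds: dict) -> list:
    """Turn {category: threshold} into a list of SafetySetting-shaped dicts."""
    return [
        {"category": category, "threshold": threshold}
        for category, threshold in thresholds.items()
    ]


def find_duplicate_categories(settings: list) -> list:
    """Return every category that appears more than once, in first-seen order."""
    seen, duplicates = set(), []
    for setting in settings:
        category = setting["category"]
        if category in seen and category not in duplicates:
            duplicates.append(category)
        seen.add(category)
    return duplicates


settings = build_safety_settings({
    "HARM_CATEGORY_HATE_SPEECH": "BLOCK_MEDIUM_AND_ABOVE",
    "HARM_CATEGORY_HARASSMENT": "BLOCK_ONLY_HIGH",
})
print(find_duplicate_categories(settings))  # []
```

Building the list from a dictionary keyed by category makes duplicates impossible by construction; the separate check is useful for lists assembled elsewhere.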

System Instruction

Python

from google import genai
from google.genai import types

client = genai.Client()
response = client.models.generate_content(
    model="gemini-3.6-flash",
    contents="Good morning! How are you?",
    config=types.GenerateContentConfig(
        system_instruction="You are a cat. Your name is Neko."
    ),
)
print(response.text)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const response = await ai.models.generateContent({
  model: "gemini-3.6-flash",
  contents: "Good morning! How are you?",
  config: {
    systemInstruction: "You are a cat. Your name is Neko.",
  },
});
console.log(response.text);

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

// Construct the user message contents.
contents := []*genai.Content{
	genai.NewContentFromText("Good morning! How are you?", genai.RoleUser),
}

// Set the system instruction as a *genai.Content.
config := &genai.GenerateContentConfig{
	SystemInstruction: genai.NewContentFromText("You are a cat. Your name is Neko.", genai.RoleUser),
}

response, err := client.Models.GenerateContent(ctx, "gemini-3.6-flash", contents, config)
if err != nil {
	log.Fatal(err)
}
printResponse(response)

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent?key=$GEMINI_API_KEY" \
-H 'Content-Type: application/json' \
-d '{ "system_instruction": {
    "parts":
      { "text": "You are a cat. Your name is Neko."}},
    "contents": {
      "parts": {
        "text": "Hello there"}}}'

Java

Client client = new Client();

Part textPart = Part.builder().text("You are a cat. Your name is Neko.").build();

Content content = Content.builder().role("system").parts(ImmutableList.of(textPart)).build();

GenerateContentConfig config = GenerateContentConfig.builder()
        .systemInstruction(content)
        .build();

GenerateContentResponse response =
        client.models.generateContent(
                "gemini-3.6-flash",
                "Good morning! How are you?",
                config);

System.out.println(response.text());

Response body

If successful, the response body contains an instance of GenerateContentResponse.

Method: models.streamGenerateContent

Generates a streamed response from the model given an input GenerateContentRequest.

Endpoint

POST https://generativelanguage.googleapis.com/v1beta/{model=models/*}:streamGenerateContent

Path parameters

model string

Required. The name of the Model to use for generating the completion.

Format: models/{model}.
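
The endpoint template and the models/{model} format can be combined into a concrete URL. The following is a minimal sketch that assumes the base URL shown in the shell samples on this page. model_resource_name and endpoint_url are hypothetical helpers, and the API key is left out on purpose; the shell samples pass it as a separate key query parameter.

```python
from urllib.parse import urlencode

BASE_URL = "https://generativelanguage.googleapis.com/v1beta"


def model_resource_name(model: str) -> str:
    """Return the model in models/{model} form, adding the prefix if missing."""
    return model if model.startswith("models/") else "models/" + model


def endpoint_url(model: str, method: str = "streamGenerateContent", sse: bool = True) -> str:
    """Build the request URL for a model method, optionally asking for SSE output."""
    url = f"{BASE_URL}/{model_resource_name(model)}:{method}"
    if sse:
        url += "?" + urlencode({"alt": "sse"})
    return url


print(endpoint_url("gemini-2.0-flash"))
```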

Request body

The request body contains data with the following structure:

Fields

contents[] object (Content)

Required. The content of the current conversation with the model.

tools[] object (Tool)

Optional. A list of Tools the Model may use to generate the next response.

toolConfig object (ToolConfig)

safetySettings[] object (SafetySetting)

Optional. A list of unique SafetySetting instances for blocking unsafe content.

systemInstruction object (Content)

Optional. Developer-set system instructions. Currently, text only.

generationConfig object (GenerationConfig)

Optional. Configuration options for model generation and outputs.

cachedContent string

serviceTier enum (ServiceTier)

Optional. The service tier of the request.

store boolean
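
The fields above map directly onto a JSON request body. The following is a minimal sketch that assembles one from plain Python values, assuming the camelCase field names listed above. build_request_body is a hypothetical helper; it covers only contents, systemInstruction, generationConfig, and safetySettings, and leaves out optional fields that were not supplied.

```python
import json
from typing import Optional


def build_request_body(
    prompt: str,
    system_instruction: Optional[str] = None,
    generation_config: Optional[dict] = None,
    safety_settings: Optional[list] = None,
) -> dict:
    """Assemble a request body with a single user turn."""
    body = {"contents": [{"role": "user", "parts": [{"text": prompt}]}]}
    if system_instruction is not None:
        body["systemInstruction"] = {"parts": [{"text": system_instruction}]}
    if generation_config:
        body["generationConfig"] = generation_config
    if safety_settings:
        body["safetySettings"] = safety_settings
    return body


body = build_request_body(
    "Good morning! How are you?",
    system_instruction="You are a cat. Your name is Neko.",
    generation_config={"temperature": 1.0, "maxOutputTokens": 20},
)
print(json.dumps(body, indent=2))
```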

Example request

Text

Python

from google import genai

client = genai.Client()
response = client.models.generate_content_stream(
    model="gemini-3.6-flash", contents="Write a story about a magic backpack."
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const response = await ai.models.generateContentStream({
  model: "gemini-3.6-flash",
  contents: "Write a story about a magic backpack.",
});
let text = "";
for await (const chunk of response) {
  console.log(chunk.text);
  text += chunk.text;
}

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
contents := []*genai.Content{
	genai.NewContentFromText("Write a story about a magic backpack.", genai.RoleUser),
}
for response, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.6-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(response.Candidates[0].Content.Parts[0].Text)
}

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=${GEMINI_API_KEY}" \
        -H 'Content-Type: application/json' \
        --no-buffer \
        -d '{ "contents":[{"parts":[{"text": "Write a story about a magic backpack."}]}]}'

Java

Client client = new Client();

ResponseStream<GenerateContentResponse> responseStream =
        client.models.generateContentStream(
                "gemini-3.6-flash",
                "Write a story about a magic backpack.",
                null);

StringBuilder response = new StringBuilder();
for (GenerateContentResponse res : responseStream) {
    System.out.print(res.text());
    response.append(res.text());
}

responseStream.close();
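
The shell sample above requests the stream with alt=sse, which delivers each chunk as a server-sent event on a line starting with "data:". The following is a minimal sketch of extracting the text from such lines without an SDK. It assumes each event carries one JSON object with the candidates, content, parts, text layout that the Go sample reads; iter_sse_text is a hypothetical helper, and the sample lines are hand-written rather than captured from the API.

```python
import json
from typing import Iterable, Iterator


def iter_sse_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield the text of every part found in the 'data:' lines of an SSE stream."""
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and other SSE fields
        payload = json.loads(line[len("data:"):])
        for candidate in payload.get("candidates", []):
            for part in candidate.get("content", {}).get("parts", []):
                if "text" in part:
                    yield part["text"]


# Hand-written stand-in for the lines of a streamed HTTP response.
sample_stream = [
    'data: {"candidates": [{"content": {"parts": [{"text": "Once upon "}]}}]}',
    "",
    'data: {"candidates": [{"content": {"parts": [{"text": "a time."}]}}]}',
    "",
]
print("".join(iter_sse_text(sample_stream)))  # Once upon a time.
```

Using .get with defaults avoids an exception on chunks that carry no content, such as a final chunk that only reports metadata.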

Image

Python

from google import genai
import PIL.Image

client = genai.Client()
organ = PIL.Image.open(media / "organ.jpg")
response = client.models.generate_content_stream(
    model="gemini-3.6-flash", contents=["Tell me about this instrument", organ]
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)

Node.js

// Make sure to include the following import:
// import {GoogleGenAI, createUserContent, createPartFromUri} from '@google/genai';
// import path from 'path';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

const organ = await ai.files.upload({
  file: path.join(media, "organ.jpg"),
});

const response = await ai.models.generateContentStream({
  model: "gemini-3.6-flash",
  contents: [
    createUserContent([
      "Tell me about this instrument", 
      createPartFromUri(organ.uri, organ.mimeType)
    ]),
  ],
});
let text = "";
for await (const chunk of response) {
  console.log(chunk.text);
  text += chunk.text;
}

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}
file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "organ.jpg"), 
	&genai.UploadFileConfig{
		MIMEType : "image/jpeg",
	},
)
if err != nil {
	log.Fatal(err)
}
parts := []*genai.Part{
	genai.NewPartFromText("Tell me about this instrument"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}
contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}
for response, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.6-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(response.Candidates[0].Content.Parts[0].Text)
}

Shell

cat > "$TEMP_JSON" << EOF
{
  "contents": [{
    "parts":[
      {"text": "Tell me about this instrument"},
      {
        "inline_data": {
          "mime_type":"image/jpeg",
          "data": "$(cat "$TEMP_B64")"
        }
      }
    ]
  }]
}
EOF

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d "@$TEMP_JSON" 2> /dev/null

Java

Client client = new Client();

String path = media_path + "organ.jpg";
byte[] imageData = Files.readAllBytes(Paths.get(path));

Content content =
        Content.fromParts(
                Part.fromText("Tell me about this instrument."),
                Part.fromBytes(imageData, "image/jpeg"));


ResponseStream<GenerateContentResponse> responseStream =
        client.models.generateContentStream(
                "gemini-3.6-flash",
                content,
                null);

StringBuilder response = new StringBuilder();
for (GenerateContentResponse res : responseStream) {
    System.out.print(res.text());
    response.append(res.text());
}

responseStream.close();
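
The shell sample above embeds the image in the request as base64 text inside an inline_data part. The following is a minimal sketch of building that structure from raw bytes, using the inline_data, mime_type, and data field names from the shell sample. inline_image_part and build_image_request are hypothetical helpers, and the three placeholder bytes stand in for a real JPEG file.

```python
import base64


def inline_image_part(data: bytes, mime_type: str = "image/jpeg") -> dict:
    """Wrap raw image bytes as a base64-encoded inline_data part."""
    return {
        "inline_data": {
            "mime_type": mime_type,
            "data": base64.b64encode(data).decode("ascii"),
        }
    }


def build_image_request(prompt: str, image_bytes: bytes) -> dict:
    """Build a request body with a text part followed by an inline image part."""
    return {
        "contents": [
            {"parts": [{"text": prompt}, inline_image_part(image_bytes)]}
        ]
    }


# Placeholder bytes; a real call would read them from the image file.
request = build_image_request("Tell me about this instrument", b"abc")
print(request["contents"][0]["parts"][1]["inline_data"]["data"])  # YWJj
```

Inline data suits small files; the Go and Node.js samples above upload the file first and reference it by URI instead.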

Audio

Python

from google import genai

client = genai.Client()
sample_audio = client.files.upload(file=media / "sample.mp3")
response = client.models.generate_content_stream(
    model="gemini-3.6-flash",
    contents=["Give me a summary of this audio file.", sample_audio],
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "sample.mp3"), 
	&genai.UploadFileConfig{
		MIMEType : "audio/mpeg",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this audio file."),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

for result, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.6-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(result.Candidates[0].Content.Parts[0].Text)
}

Shell

# Use File API to upload audio data to API request.
MIME_TYPE=$(file -b --mime-type "${AUDIO_PATH}")
NUM_BYTES=$(wc -c < "${AUDIO_PATH}")
DISPLAY_NAME=AUDIO

tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${AUDIO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Please describe this file."},
          {"file_data":{"mime_type": "audio/mpeg", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

Video

Python

from google import genai
import time

client = genai.Client()
# Video clip (CC BY 3.0) from https://peach.blender.org/download/
myfile = client.files.upload(file=media / "Big_Buck_Bunny.mp4")
print(f"{myfile=}")

# Poll until the video file is completely processed (state becomes ACTIVE).
while not myfile.state or myfile.state.name != "ACTIVE":
    print("Processing video...")
    print("File state:", myfile.state)
    time.sleep(5)
    myfile = client.files.get(name=myfile.name)

response = client.models.generate_content_stream(
    model="gemini-3.6-flash", contents=[myfile, "Describe this video clip"]
)
for chunk in response:
    print(chunk.text)
    print("_" * 80)

Node.js

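// Make sure to include the following imports:
// import {GoogleGenAI, createUserContent, createPartFromUri} from '@google/genai';
// import path from 'path';
// `media` (sample directory) and `sleep(ms)` are helpers assumed to be defined elsewhere.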
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

let video = await ai.files.upload({
  file: path.join(media, 'Big_Buck_Bunny.mp4'),
});

// Poll until the video file is completely processed (state becomes ACTIVE).
while (!video.state || video.state.toString() !== 'ACTIVE') {
  console.log('Processing video...');
  console.log('File state: ', video.state);
  await sleep(5000);
  video = await ai.files.get({name: video.name});
}

const response = await ai.models.generateContentStream({
  model: "gemini-3.6-flash",
  contents: [
    createUserContent([
      "Describe this video clip",
      createPartFromUri(video.uri, video.mimeType),
    ]),
  ],
});
let text = "";
for await (const chunk of response) {
  console.log(chunk.text);
  text += chunk.text;
}

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "Big_Buck_Bunny.mp4"), 
	&genai.UploadFileConfig{
		MIMEType : "video/mp4",
	},
)
if err != nil {
	log.Fatal(err)
}

// Poll until the video file is completely processed (state becomes ACTIVE).
for file.State == genai.FileStateUnspecified || file.State != genai.FileStateActive {
	fmt.Println("Processing video...")
	fmt.Println("File state:", file.State)
	time.Sleep(5 * time.Second)

	file, err = client.Files.Get(ctx, file.Name, nil)
	if err != nil {
		log.Fatal(err)
	}
}

parts := []*genai.Part{
	genai.NewPartFromText("Describe this video clip"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

for result, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.6-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(result.Candidates[0].Content.Parts[0].Text)
}

Shell

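# Use File API to upload video data to API request.
MIME_TYPE=$(file -b --mime-type "${VIDEO_PATH}")
NUM_BYTES=$(wc -c < "${VIDEO_PATH}")
DISPLAY_NAME=VIDEO_PATH

# File that receives the response headers of the initial upload request.
tmp_header_file=upload-header.tmp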

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${VIDEO_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

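state=$(jq ".file.state" file_info.json)
echo state=$state

# Resource name of the uploaded file (expected to look like "files/abc-123"), used for polling.
name=$(jq -r ".file.name" file_info.json)

while [[ "($state)" = *"PROCESSING"* ]];
do
  echo "Processing video..."
  sleep 5
  # Get the file of interest to check state
  curl "https://generativelanguage.googleapis.com/v1beta/${name}?key=${GEMINI_API_KEY}" 2> /dev/null > file_info.json
  # Accept both a bare File object and one wrapped in "file".
  state=$(jq ".state // .file.state" file_info.json)
done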

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Please describe this file."},
          {"file_data":{"mime_type": "video/mp4", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

PDF

Python

from google import genai

client = genai.Client()
sample_pdf = client.files.upload(file=media / "test.pdf")
response = client.models.generate_content_stream(
    model="gemini-3.6-flash",
    contents=["Give me a summary of this document:", sample_pdf],
)

for chunk in response:
    print(chunk.text)
    print("_" * 80)

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

file, err := client.Files.UploadFromPath(
	ctx, 
	filepath.Join(getMedia(), "test.pdf"), 
	&genai.UploadFileConfig{
		MIMEType : "application/pdf",
	},
)
if err != nil {
	log.Fatal(err)
}

parts := []*genai.Part{
	genai.NewPartFromText("Give me a summary of this document:"),
	genai.NewPartFromURI(file.URI, file.MIMEType),
}

contents := []*genai.Content{
	genai.NewContentFromParts(parts, genai.RoleUser),
}

for result, err := range client.Models.GenerateContentStream(
	ctx,
	"gemini-3.6-flash",
	contents,
	nil,
) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(result.Candidates[0].Content.Parts[0].Text)
}

Shell

MIME_TYPE=$(file -b --mime-type "${PDF_PATH}")
NUM_BYTES=$(wc -c < "${PDF_PATH}")
DISPLAY_NAME=TEXT


echo $MIME_TYPE
tmp_header_file=upload-header.tmp

# Initial resumable request defining metadata.
# The upload url is in the response headers dump them to a file.
curl "${BASE_URL}/upload/v1beta/files?key=${GEMINI_API_KEY}" \
  -D upload-header.tmp \
  -H "X-Goog-Upload-Protocol: resumable" \
  -H "X-Goog-Upload-Command: start" \
  -H "X-Goog-Upload-Header-Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Header-Content-Type: ${MIME_TYPE}" \
  -H "Content-Type: application/json" \
  -d "{'file': {'display_name': '${DISPLAY_NAME}'}}" 2> /dev/null

upload_url=$(grep -i "x-goog-upload-url: " "${tmp_header_file}" | cut -d" " -f2 | tr -d "\r")
rm "${tmp_header_file}"

# Upload the actual bytes.
curl "${upload_url}" \
  -H "Content-Length: ${NUM_BYTES}" \
  -H "X-Goog-Upload-Offset: 0" \
  -H "X-Goog-Upload-Command: upload, finalize" \
  --data-binary "@${PDF_PATH}" 2> /dev/null > file_info.json

file_uri=$(jq ".file.uri" file_info.json)
echo file_uri=$file_uri

# Now generate content using that file
curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [{
        "parts":[
          {"text": "Can you add a few more lines to this poem?"},
          {"file_data":{"mime_type": "application/pdf", "file_uri": '$file_uri'}}]
        }]
       }' 2> /dev/null > response.json

cat response.json
echo

Chat

Python

from google import genai
from google.genai import types

client = genai.Client()
chat = client.chats.create(
    model="gemini-3.6-flash",
    history=[
        types.Content(role="user", parts=[types.Part(text="Hello")]),
        types.Content(
            role="model",
            parts=[
                types.Part(
                    text="Great to meet you. What would you like to know?"
                )
            ],
        ),
    ],
)
response = chat.send_message_stream(message="I have 2 dogs in my house.")
for chunk in response:
    print(chunk.text)
    print("_" * 80)
response = chat.send_message_stream(message="How many paws are in my house?")
for chunk in response:
    print(chunk.text)
    print("_" * 80)

print(chat.get_history())

Node.js

// Make sure to include the following import:
// import {GoogleGenAI} from '@google/genai';
const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });
const chat = ai.chats.create({
  model: "gemini-3.6-flash",
  history: [
    {
      role: "user",
      parts: [{ text: "Hello" }],
    },
    {
      role: "model",
      parts: [{ text: "Great to meet you. What would you like to know?" }],
    },
  ],
});

console.log("Streaming response for first message:");
const stream1 = await chat.sendMessageStream({
  message: "I have 2 dogs in my house.",
});
for await (const chunk of stream1) {
  console.log(chunk.text);
  console.log("_".repeat(80));
}

console.log("Streaming response for second message:");
const stream2 = await chat.sendMessageStream({
  message: "How many paws are in my house?",
});
for await (const chunk of stream2) {
  console.log(chunk.text);
  console.log("_".repeat(80));
}

console.log(chat.getHistory());

Go

ctx := context.Background()
client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:  os.Getenv("GEMINI_API_KEY"),
	Backend: genai.BackendGeminiAPI,
})
if err != nil {
	log.Fatal(err)
}

history := []*genai.Content{
	genai.NewContentFromText("Hello", genai.RoleUser),
	genai.NewContentFromText("Great to meet you. What would you like to know?", genai.RoleModel),
}
chat, err := client.Chats.Create(ctx, "gemini-3.6-flash", nil, history)
if err != nil {
	log.Fatal(err)
}

for chunk, err := range chat.SendMessageStream(ctx, genai.Part{Text: "I have 2 dogs in my house."}) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(chunk.Text())
	fmt.Println(strings.Repeat("_", 64))
}

for chunk, err := range chat.SendMessageStream(ctx, genai.Part{Text: "How many paws are in my house?"}) {
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(chunk.Text())
	fmt.Println(strings.Repeat("_", 64))
}

fmt.Println(chat.History(false))

Shell

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:streamGenerateContent?alt=sse&key=$GEMINI_API_KEY" \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
      "contents": [
        {"role":"user",
         "parts":[{
           "text": "Hello"}]},
        {"role": "model",
         "parts":[{
           "text": "Great to meet you. What would you like to know?"}]},
        {"role":"user",
         "parts":[{
           "text": "I have two dogs in my house. How many paws are in my house?"}]}
      ]
    }' 2> /dev/null | grep "text"

Response body

If successful, the response body contains a stream of GenerateContentResponse instances.
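
A minimal sketch of consuming such a stream without an SDK. It assumes the `alt=sse` form used by the shell examples, where each GenerateContentResponse is taken to arrive on its own `data: {...}` line; the helper names and the sample payload are illustrative and not part of the API.

```python
import json
from typing import Iterable, Iterator


def iter_responses(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed GenerateContentResponse dict per SSE `data:` line."""
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and non-data fields
        yield json.loads(line[len("data:"):].strip())


def collect_text(responses: Iterable[dict]) -> str:
    """Concatenate the text parts of the first candidate of every chunk."""
    pieces = []
    for response in responses:
        for candidate in response.get("candidates", [])[:1]:
            for part in candidate.get("content", {}).get("parts", []):
                pieces.append(part.get("text", ""))
    return "".join(pieces)


# Hypothetical stream body; real chunks carry more fields.
sample_stream = [
    'data: {"candidates": [{"content": {"parts": [{"text": "Eight"}]}}]}',
    "",
    'data: {"candidates": [{"content": {"parts": [{"text": " paws."}]}, "finishReason": "STOP"}]}',
]
print(collect_text(iter_responses(sample_stream)))
```

Reading only the first candidate mirrors the SDK examples above, which print `Candidates[0]`.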

GenerateContentResponse

Response from the model supporting multiple candidate responses.

Safety ratings and content filtering are reported both for the prompt, in GenerateContentResponse.prompt_feedback, and for each candidate, in finishReason and safetyRatings. The API:

- Returns either all requested candidates or none of them
- Returns no candidates at all only if there was something wrong with the prompt (check promptFeedback)
- Reports feedback on each candidate in finishReason and safetyRatings
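
The all-or-none rule suggests a simple check on a parsed response. The sketch below works on plain dictionaries shaped like the JSON representation of GenerateContentResponse; `summarize_response` and the sample data are hypothetical.

```python
def summarize_response(response: dict) -> str:
    """Describe whether a GenerateContentResponse dict carries candidates."""
    feedback = response.get("promptFeedback", {})
    block_reason = feedback.get("blockReason")
    candidates = response.get("candidates", [])
    if not candidates:
        # No candidates at all: the prompt feedback explains why.
        return f"prompt blocked: {block_reason or 'unknown reason'}"
    reasons = [c.get("finishReason", "not finished") for c in candidates]
    return f"{len(candidates)} candidate(s): " + ", ".join(reasons)


# Hypothetical responses, reduced to the fields used here.
blocked = {"promptFeedback": {"blockReason": "SAFETY"}}
answered = {"candidates": [{"finishReason": "STOP"}, {"finishReason": "MAX_TOKENS"}]}
print(summarize_response(blocked))
print(summarize_response(answered))
```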

Fields

candidates[] object (Candidate)

Candidate responses from the model.

promptFeedback object (PromptFeedback)

Returns the prompt's feedback related to the content filters.

usageMetadata object (UsageMetadata)

Output only. Metadata on the generation requests' token usage.

modelVersion string

Output only. The model version used to generate the response.

responseId string

Output only. responseId is used to identify each response.

modelStatus object (ModelStatus)

Output only. The current status of this model.

JSON representation

{
  "candidates": [
    {
      object (Candidate)
    }
  ],
  "promptFeedback": {
    object (PromptFeedback)
  },
  "usageMetadata": {
    object (UsageMetadata)
  },
  "modelVersion": string,
  "responseId": string,
  "modelStatus": {
    object (ModelStatus)
  }
}

PromptFeedback

A set of feedback metadata for the prompt specified in GenerateContentRequest.content.

Fields

blockReason enum (BlockReason)

Optional. If set, the prompt was blocked and no candidates are returned. Rephrase the prompt.

safetyRatings[] object (SafetyRating)

Ratings for safety of the prompt. There is at most one rating per category.

JSON representation

{
  "blockReason": enum (BlockReason),
  "safetyRatings": [
    {
      object (SafetyRating)
    }
  ]
}

BlockReason

Specifies the reason why the prompt was blocked.

Enums
`BLOCK_REASON_UNSPECIFIED`	Default value. This value is unused.
`SAFETY`	The prompt was blocked for safety reasons. Inspect `safetyRatings` to see which safety category blocked it.
`OTHER`	The prompt was blocked for unknown reasons.
`BLOCKLIST`	The prompt was blocked because it contains terms from the terminology blocklist.
`PROHIBITED_CONTENT`	The prompt was blocked because of prohibited content.
`IMAGE_SAFETY`	Candidates were blocked because of unsafe image generation content.

UsageMetadata

Metadata on the generation request's token usage.

Fields

promptTokenCount integer

Number of tokens in the prompt. When cachedContent is set, this is still the total effective prompt size, meaning it includes the number of tokens in the cached content.

cachedContentTokenCount integer

Number of tokens in the cached part of the prompt (the cached content).

candidatesTokenCount integer

Total number of tokens across all the generated response candidates.

toolUsePromptTokenCount integer

Output only. Number of tokens present in tool-use prompts.

thoughtsTokenCount integer

Output only. Number of thought tokens for thinking models.

totalTokenCount integer

Total token count for the generation request (prompt + thoughts + response candidates).

promptTokensDetails[] object (ModalityTokenCount)

Output only. List of modalities that were processed in the request input.

cacheTokensDetails[] object (ModalityTokenCount)

Output only. List of modalities of the cached content in the request input.

candidatesTokensDetails[] object (ModalityTokenCount)

Output only. List of modalities that were returned in the response.

toolUsePromptTokensDetails[] object (ModalityTokenCount)

Output only. List of modalities that were processed for tool-use request inputs.

serviceTier enum (ServiceTier)

Output only. Service tier of the request.

JSON representation

{
  "promptTokenCount": integer,
  "cachedContentTokenCount": integer,
  "candidatesTokenCount": integer,
  "toolUsePromptTokenCount": integer,
  "thoughtsTokenCount": integer,
  "totalTokenCount": integer,
  "promptTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "cacheTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "candidatesTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "toolUsePromptTokensDetails": [
    {
      object (ModalityTokenCount)
    }
  ],
  "serviceTier": enum (ServiceTier)
}
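
The relationships stated for the counters above can be sketched as two small helpers. They follow only what the field descriptions say (the total is prompt + thoughts + candidates, and the prompt count includes cached tokens); whether tool-use prompt tokens are also counted in the total is not stated here, so the sketch leaves them out. Function names and sample numbers are hypothetical.

```python
def expected_total(usage: dict) -> int:
    """Sum prompt, thoughts and candidate tokens; missing fields count as 0."""
    return (
        usage.get("promptTokenCount", 0)
        + usage.get("thoughtsTokenCount", 0)
        + usage.get("candidatesTokenCount", 0)
    )


def uncached_prompt_tokens(usage: dict) -> int:
    """Prompt tokens that were not served from cached content."""
    return usage.get("promptTokenCount", 0) - usage.get("cachedContentTokenCount", 0)


# Hypothetical usageMetadata object.
usage = {
    "promptTokenCount": 1200,
    "cachedContentTokenCount": 1000,
    "candidatesTokenCount": 85,
    "thoughtsTokenCount": 40,
    "totalTokenCount": 1325,
}
print(expected_total(usage) == usage["totalTokenCount"])
print(uncached_prompt_tokens(usage))
```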

ModelStatus

The status of the underlying model. It indicates the stage of the underlying model and its retirement time, if applicable.

Fields

modelStage enum (ModelStage)

The stage of the underlying model.

retirementTime string (Timestamp format)

The time at which the model will be retired.

Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".

message string

A message explaining the model status.

JSON representation

{
  "modelStage": enum (ModelStage),
  "retirementTime": string,
  "message": string
}
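
Timestamps such as retirementTime can carry up to nine fractional digits, which is more than Python's `datetime` stores. The following sketch parses the three example forms given above with the standard library only, truncating to microseconds; `parse_timestamp` and the sample status object are hypothetical.

```python
import re
from datetime import datetime, timezone

_RFC3339 = re.compile(
    r"^(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})(?:\.(\d+))?(Z|[+-]\d{2}:\d{2})$"
)


def parse_timestamp(value: str) -> datetime:
    """Parse an RFC 3339 timestamp into an aware datetime (microsecond precision)."""
    match = _RFC3339.match(value)
    if match is None:
        raise ValueError(f"not an RFC 3339 timestamp: {value!r}")
    base, fraction, offset = match.groups()
    # datetime keeps at most 6 fractional digits, so nanoseconds are truncated.
    micros = (fraction or "").ljust(6, "0")[:6]
    offset = "+00:00" if offset == "Z" else offset
    return datetime.fromisoformat(f"{base}.{micros}{offset}")


# Hypothetical ModelStatus object.
status = {"modelStage": "LEGACY", "retirementTime": "2014-10-02T15:01:23+05:30"}
retires_at = parse_timestamp(status["retirementTime"])
print(retires_at.astimezone(timezone.utc).isoformat())
```

Comparing the result with `datetime.now(timezone.utc)` then tells whether the retirement time has already passed.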

ModelStage

Defines the stage of the underlying model.

Enums
`MODEL_STAGE_UNSPECIFIED`	Unspecified model stage.
`UNSTABLE_EXPERIMENTAL`	The underlying model is subject to many tunings. This enum value is deprecated.
`EXPERIMENTAL`	Models in this stage are for experimental purposes only.
`PREVIEW`	Models in this stage are more mature than experimental models.
`STABLE`	Models in this stage are considered stable and ready for production use.
`LEGACY`	A model in this stage will be deprecated in the near future. Only existing customers can use this model.
`DEPRECATED`	Models in this stage are deprecated and cannot be used. This enum value is deprecated.
`RETIRED`	Models in this stage are retired and cannot be used.

Candidate

A response candidate generated from the model.

Fields

content object (Content)

Output only. Generated content returned from the model.

finishReason enum (FinishReason)

Optional. Output only. The reason why the model stopped generating tokens.

If empty, the model has not stopped generating tokens.

safetyRatings[] object (SafetyRating)

List of ratings for the safety of a response candidate.

There is at most one rating per category.

citationMetadata object (CitationMetadata)

Output only. Citation information for the model-generated candidate.

This field may be populated with recitation information for any text included in content. These are passages that are "recited" from copyrighted material in the foundational LLM's training data.

tokenCount integer

Output only. Token count for this candidate.

groundingAttributions[] object (GroundingAttribution)

Output only. Attribution information for sources that contributed to a grounded answer.

This field is populated for GenerateAnswer calls.

groundingMetadata object (GroundingMetadata)

Output only. Grounding metadata for the candidate.

This field is populated for GenerateContent calls.

avgLogprobs number

Output only. Average log probability score of the candidate.

logprobsResult object (LogprobsResult)

Output only. Log-likelihood scores for the response tokens and top tokens.

urlContextMetadata object (UrlContextMetadata)

Output only. Metadata related to the URL context retrieval tool.

index integer

Output only. Index of the candidate in the list of response candidates.

finishMessage string

Optional. Output only. Details the reason why the model stopped generating tokens. This is populated only when finishReason is set.

JSON representation

{
  "content": {
    object (Content)
  },
  "finishReason": enum (FinishReason),
  "safetyRatings": [
    {
      object (SafetyRating)
    }
  ],
  "citationMetadata": {
    object (CitationMetadata)
  },
  "tokenCount": integer,
  "groundingAttributions": [
    {
      object (GroundingAttribution)
    }
  ],
  "groundingMetadata": {
    object (GroundingMetadata)
  },
  "avgLogprobs": number,
  "logprobsResult": {
    object (LogprobsResult)
  },
  "urlContextMetadata": {
    object (UrlContextMetadata)
  },
  "index": integer,
  "finishMessage": string
}

FinishReason

Defines the reason why the model stopped generating tokens.

Enums
`FINISH_REASON_UNSPECIFIED`	Default value. This value is unused.
`STOP`	Natural stop point of the model or a provided stop sequence.
`MAX_TOKENS`	The maximum number of tokens specified in the request was reached.
`SAFETY`	The response candidate content was flagged for safety reasons.
`RECITATION`	The response candidate content was flagged for recitation reasons.
`LANGUAGE`	The response candidate content was flagged for using an unsupported language.
`OTHER`	Unknown reason.
`BLOCKLIST`	Token generation stopped because the content contains forbidden terms.
`PROHIBITED_CONTENT`	Token generation stopped because the content potentially contains prohibited content.
`SPII`	Token generation stopped because the content potentially contains Sensitive Personally Identifiable Information (SPII).
`MALFORMED_FUNCTION_CALL`	The function call generated by the model is invalid.
`IMAGE_SAFETY`	Token generation stopped because the generated images contain safety violations.
`IMAGE_PROHIBITED_CONTENT`	Image generation stopped because the generated images contain prohibited content.
`IMAGE_OTHER`	Image generation stopped because of other miscellaneous issues.
`NO_IMAGE`	The model was expected to generate an image, but none was generated.
`IMAGE_RECITATION`	Image generation stopped because of recitation.
`UNEXPECTED_TOOL_CALL`	The model generated a tool call, but no tools were enabled in the request.
`TOO_MANY_TOOL_CALLS`	The model called too many tools consecutively, so the system stopped execution.
`MISSING_THOUGHT_SIGNATURE`	The request is missing at least one thought signature.
`MALFORMED_RESPONSE`	Finished because of a malformed response.
`ESCALATION`	The request was filtered by an escalation rule.
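
One way a client might act on these values is sketched below. The grouping into "complete", "truncated" and "stopped early" is an illustrative choice rather than something the API defines; the only facts relied on are that an empty finishReason means generation has not stopped and that finishMessage is populated only when finishReason is set.

```python
def describe_finish(candidate: dict) -> str:
    """Turn a Candidate dict's finishReason into a short human-readable status."""
    reason = candidate.get("finishReason")
    if reason is None:
        return "still generating"
    if reason == "STOP":
        return "complete"
    if reason == "MAX_TOKENS":
        return "truncated"
    message = candidate.get("finishMessage")
    detail = f": {message}" if message else ""
    return f"stopped early ({reason}){detail}"


# Hypothetical candidates, reduced to the fields used here.
print(describe_finish({}))
print(describe_finish({"finishReason": "STOP"}))
print(describe_finish({"finishReason": "SAFETY", "finishMessage": "flagged"}))
```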

GroundingAttribution

Attribution for a source that contributed to an answer.

Fields

sourceId object (AttributionSourceId)

Output only. Identifier for the source contributing to this attribution.

content object (Content)

Grounding source content that makes up this attribution.

JSON representation

{
  "sourceId": {
    object (AttributionSourceId)
  },
  "content": {
    object (Content)
  }
}

AttributionSourceId

Identifier for the source contributing to this attribution.

Fields

source Union type

source can be only one of the following:

groundingPassage object (GroundingPassageId)

Identifier for an inline passage.

semanticRetrieverChunk object (SemanticRetrieverChunk)

Identifier for a Chunk fetched via Semantic Retriever.

JSON representation

{
  // source
  "groundingPassage": {
    object (GroundingPassageId)
  },
  "semanticRetrieverChunk": {
    object (SemanticRetrieverChunk)
  }
  // Union type
}

GroundingPassageId

Identifier for a part within a GroundingPassage.

Fields

passageId string

Output only. ID of the passage matching the GenerateAnswerRequest's GroundingPassage.id.

partIndex integer

Output only. Index of the part within the GenerateAnswerRequest's GroundingPassage.content.

JSON representation

{
  "passageId": string,
  "partIndex": integer
}

SemanticRetrieverChunk

Identifier for a Chunk retrieved via Semantic Retriever, specified in the GenerateAnswerRequest using SemanticRetrieverConfig.

Fields

source string

Output only. Name of the source matching the request's SemanticRetrieverConfig.source. Example: corpora/123 or corpora/123/documents/abc

chunk string

Output only. Name of the Chunk containing the attributed text. Example: corpora/123/documents/abc/chunks/xyz

JSON representation

{
  "source": string,
  "chunk": string
}
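
The example names above alternate collection and ID segments, so they can be split into their components. This is a sketch that assumes every resource name follows that alternating pattern, as the examples do; `parse_resource_name` is a hypothetical helper.

```python
def parse_resource_name(name: str) -> dict:
    """Split 'corpora/123/documents/abc' into {'corpora': '123', 'documents': 'abc'}."""
    segments = name.split("/")
    if len(segments) % 2 != 0 or not all(segments):
        raise ValueError(f"unexpected resource name: {name!r}")
    # Even positions hold collection names, odd positions hold IDs.
    return dict(zip(segments[0::2], segments[1::2]))


chunk = parse_resource_name("corpora/123/documents/abc/chunks/xyz")
print(chunk["documents"], chunk["chunks"])
```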

GroundingMetadata

Metadata returned to the client when grounding is enabled.

Fields

groundingChunks[] object (GroundingChunk)

List of supporting references retrieved from the specified grounding source. When streaming, this contains only the grounding chunks that were not included in the grounding metadata of previous responses.

groundingSupports[] object (GroundingSupport)

List of grounding supports.

webSearchQueries[] string

Web search queries for the follow-up web search.

imageSearchQueries[] string

Image search queries used for grounding.

searchEntryPoint object (SearchEntryPoint)

Optional. Google Search entry point for the follow-up web searches.

retrievalMetadata object (RetrievalMetadata)

Metadata related to retrieval in the grounding flow.

googleMapsWidgetContextToken string

Optional. Resource name of the Google Maps widget context token that can be used with the PlacesContextElement widget to render contextual data. Populated only when grounding with Google Maps is enabled.

JSON representation

{
  "groundingChunks": [
    {
      object (GroundingChunk)
    }
  ],
  "groundingSupports": [
    {
      object (GroundingSupport)
    }
  ],
  "webSearchQueries": [
    string
  ],
  "imageSearchQueries": [
    string
  ],
  "searchEntryPoint": {
    object (SearchEntryPoint)
  },
  "retrievalMetadata": {
    object (RetrievalMetadata)
  },
  "googleMapsWidgetContextToken": string
}
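
Because a streamed response carries only the grounding chunks not already sent, a client that wants the full list has to accumulate them. A minimal sketch over parsed response dictionaries follows; it reads only the first candidate, and the helper name and sample stream are hypothetical.

```python
def merge_grounding_chunks(responses: list) -> list:
    """Collect groundingChunks from every streamed response, in arrival order."""
    chunks = []
    for response in responses:
        for candidate in response.get("candidates", [])[:1]:
            metadata = candidate.get("groundingMetadata", {})
            chunks.extend(metadata.get("groundingChunks", []))
    return chunks


# Hypothetical stream: the middle response has no grounding metadata.
stream = [
    {"candidates": [{"groundingMetadata": {"groundingChunks": [
        {"web": {"uri": "https://example.com/a", "title": "A"}}]}}]},
    {"candidates": [{"content": {"parts": [{"text": "..."}]}}]},
    {"candidates": [{"groundingMetadata": {"groundingChunks": [
        {"web": {"uri": "https://example.com/b", "title": "B"}}]}}]},
]
all_chunks = merge_grounding_chunks(stream)
print([chunk["web"]["title"] for chunk in all_chunks])
```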

SearchEntryPoint

Google Search entry point.

Fields

renderedContent string

Optional. Web content snippet that can be embedded in a web page or an app webview.

sdkBlob string (bytes format)

Optional. Base64-encoded JSON representing an array of <search term, search URL> tuples.

A base64-encoded string.

JSON representation

{
  "renderedContent": string,
  "sdkBlob": string
}
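
Decoding sdkBlob might look like the sketch below. The description says only that the payload is a JSON array of <search term, search URL> tuples; that each tuple is serialized as a two-element JSON array is an assumption made here, and the sample blob is built locally rather than taken from a real response.

```python
import base64
import json


def decode_sdk_blob(sdk_blob: str) -> list:
    """Decode sdkBlob into a list of (search term, search URL) pairs."""
    payload = json.loads(base64.b64decode(sdk_blob))
    # Assumption: every entry is a two-element array [term, url].
    return [(term, url) for term, url in payload]


# Hypothetical blob so the example runs offline.
pairs = [["dog paws", "https://www.google.com/search?q=dog+paws"]]
sample_blob = base64.b64encode(json.dumps(pairs).encode("utf-8")).decode("ascii")
print(decode_sdk_blob(sample_blob))
```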

GroundingChunk

A GroundingChunk represents a segment of supporting evidence that grounds the model's response. It can be a chunk from the web, a retrieved context from a file, or information from Google Maps.

Fields

chunk_type Union type

Chunk type. chunk_type can be only one of the following:

web object (Web)

Grounding chunk from the web.

image object (Image)

Optional. Grounding chunk from image search.

retrievedContext object (RetrievedContext)

Optional. Grounding chunk from context retrieved by the file search tool.

maps object (Maps)

Optional. Grounding chunk from Google Maps.

JSON representation

{
  // chunk_type
  "web": {
    object (Web)
  },
  "image": {
    object (Image)
  },
  "retrievedContext": {
    object (RetrievedContext)
  },
  "maps": {
    object (Maps)
  }
  // Union type
}
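
Since chunk_type is a union, exactly one of the four members is expected on any given chunk. A small dispatch helper, sketched here on plain dictionaries with hypothetical names and data, makes that explicit:

```python
CHUNK_TYPES = ("web", "image", "retrievedContext", "maps")


def chunk_type(chunk: dict) -> str:
    """Return the name of the single chunk_type member set on a GroundingChunk."""
    present = [name for name in CHUNK_TYPES if name in chunk]
    if len(present) != 1:
        raise ValueError(f"expected exactly one chunk type, found {present}")
    return present[0]


def chunk_title(chunk: dict) -> str:
    """Pick a display title, falling back when the member has no title."""
    return chunk[chunk_type(chunk)].get("title", "(untitled)")


# Hypothetical grounding chunks.
chunks = [
    {"web": {"uri": "https://example.com/dogs", "title": "Dogs"}},
    {"retrievedContext": {"text": "Dogs have four paws.", "pageNumber": 3}},
]
for chunk in chunks:
    print(f"{chunk_type(chunk)}: {chunk_title(chunk)}")
```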

Web

Chunk from the web.

Fields

uri string

Output only. URI reference of the chunk.

title string

Output only. Title of the chunk.

JSON representation

{
  "uri": string,
  "title": string
}

Image

Chunk from image search.

Fields

sourceUri string

The web page URI for attribution.

imageUri string

The image asset URL.

title string

The title of the web page that the image is from.

domain string

The root domain of the web page that the image is from, for example "example.com".

JSON representation

{
  "sourceUri": string,
  "imageUri": string,
  "title": string,
  "domain": string
}

RetrievedContext

Chunk from context retrieved by the file search tool.

Fields

customMetadata[] object ( CustomMetadata )

Optional. User-provided metadata about the retrieved context.

uri string

Optional. URI reference of the semantic retrieval document.

title string

Optional. Title of the document.

text string

Optional. Text of the chunk.

fileSearchStore string

Optional. Name of the FileSearchStore containing the document. Example: fileSearchStores/123

pageNumber integer

Optional. Page number of the retrieved context, if applicable.

mediaId string

Optional. The media blob resource name for multimodal file search results. Format: fileSearchStores/{file_search_store_id}/media/{blobId}

JSON representation
{ "customMetadata": [ { object (`CustomMetadata`) } ], "uri": string, "title": string, "text": string, "fileSearchStore": string, "pageNumber": integer, "mediaId": string }

CustomMetadata

User-provided metadata about the GroundingFact.

Fields

key string

The key of the metadata.

value Union type

The value of the metadata. Can be a string, a list of strings, or a number. value can be only one of the following:

stringValue string

Optional. The string value of the metadata.

stringListValue object ( StringList )

Optional. A list of string values for the metadata.

numericValue number

Optional. The numeric value of the metadata. The expected range for this value depends on the specific key used.

JSON representation

{
  "key": string,

  // value
  "stringValue": string,
  "stringListValue": {
    object (StringList)
  },
  "numericValue": number
  // Union type
}

StringList

A list of string values.

Fields

values[] string

The string values of the list.

JSON representation
{ "values": [ string ] }
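As an illustration of reading the value union of CustomMetadata (including a StringList), the following sketch flattens a list of CustomMetadata objects, parsed into dicts, into an ordinary key/value mapping. The helper names and the sample keys are hypothetical.

```python
def metadata_value(entry: dict):
    """Return the value of one CustomMetadata object as a plain Python value."""
    if "stringValue" in entry:
        return entry["stringValue"]
    if "stringListValue" in entry:
        # StringList wraps its strings in a "values" array.
        return list(entry["stringListValue"].get("values", []))
    if "numericValue" in entry:
        return entry["numericValue"]
    return None


def metadata_to_dict(entries: list) -> dict:
    """Flatten customMetadata[] into {key: value}."""
    return {entry["key"]: metadata_value(entry) for entry in entries}


# Hypothetical metadata for illustration.
custom_metadata = [
    {"key": "author", "stringValue": "A. Writer"},
    {"key": "tags", "stringListValue": {"values": ["draft", "internal"]}},
    {"key": "year", "numericValue": 2024},
]
print(metadata_to_dict(custom_metadata))
```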

Maps

A grounding chunk from Google Maps. A Maps chunk corresponds to a single place.

Fields

uri string

URI reference of the place.

title string

Title of the place.

text string

Text description of the place answer.

placeId string

The ID of the place, in places/{placeId} format. A user can use this ID to look up that place.

placeAnswerSources object ( PlaceAnswerSources )

Sources that provide answers about the features of a given place in Google Maps.

JSON representation
{ "uri": string, "title": string, "text": string, "placeId": string, "placeAnswerSources": { object (`PlaceAnswerSources`) } }

PlaceAnswerSources

Collection of sources that provide answers about the features of a given place in Google Maps. Each PlaceAnswerSources message corresponds to a specific place in Google Maps. The Google Maps tool used these sources to answer questions about features of the place (for example, "Does Bar Foo have Wi-Fi?" or "Is Bar Foo wheelchair accessible?"). Currently, only review snippets are supported as sources.

Fields

reviewSnippets[] object ( ReviewSnippet )

Snippets of reviews that are used to generate answers about the features of a given place in Google Maps.

JSON representation
{ "reviewSnippets": [ { object (`ReviewSnippet`) } ] }

ReviewSnippet

Encapsulates a snippet of a user review that answers a question about the features of a specific place in Google Maps.

Fields

reviewId string

The ID of the review snippet.

googleMapsUri string

A link that corresponds to the user review on Google Maps.

title string

Title of the review.

JSON representation
{ "reviewId": string, "googleMapsUri": string, "title": string }

GroundingSupport

Grounding support.

Fields

groundingChunkIndices[] integer

Optional. A list of indices (into 'grounding_chunk' in response.candidate.grounding_metadata ) specifying the citations associated with the claim. For instance, [1,3,4] means that grounding_chunk[1], grounding_chunk[3], grounding_chunk[4] are the retrieved content attributed to the claim. If the response is streaming, groundingChunkIndices refer to the indices across all responses. It is the client's responsibility to accumulate the grounding chunks from all responses (while maintaining the same order).

confidenceScores[] number

Optional. Confidence score of the support references. Ranges from 0 to 1, where 1 is the most confident. This list must have the same size as groundingChunkIndices.

renderedParts[] integer

Output only. Indices into the parts field of the candidate's content. These indices specify which rendered parts are associated with this support source.

segment object ( Segment )

Segment of the content this support belongs to.

JSON representation
{ "groundingChunkIndices": [ integer ], "confidenceScores": [ number ], "renderedParts": [ integer ], "segment": { object (`Segment`) } }
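The index bookkeeping described for groundingChunkIndices can be sketched as follows. The example assumes the grounding chunks have already been parsed into dicts and appended to one list in arrival order, as required for streaming; `resolve_support` and the sample data are hypothetical.

```python
from typing import Optional


def resolve_support(support: dict, chunks: list) -> list:
    """Pair each grounding chunk referenced by a GroundingSupport with its confidence score."""
    indices = support.get("groundingChunkIndices", [])
    scores = support.get("confidenceScores", [])
    if scores and len(scores) != len(indices):
        raise ValueError("confidenceScores must match groundingChunkIndices in length")
    resolved = []
    for position, index in enumerate(indices):
        score: Optional[float] = scores[position] if scores else None
        resolved.append((chunks[index], score))
    return resolved


# Chunks accumulated in order across two streamed responses (hypothetical data).
accumulated = []
for streamed_chunks in (
    [{"web": {"uri": "https://a.example", "title": "A"}}],
    [{"web": {"uri": "https://b.example", "title": "B"}}],
):
    accumulated.extend(streamed_chunks)

support = {"groundingChunkIndices": [1], "confidenceScores": [0.9]}
print(resolve_support(support, accumulated))
```

Index 1 only resolves correctly because the chunks from the first response were kept ahead of those from the second.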

Segment

Segment of the content.

Fields

partIndex integer

The index of a Part object within its parent Content object.

startIndex integer

Start index in the given Part, measured in bytes. Offset from the start of the Part, inclusive, starting at zero.

endIndex integer

End index in the given Part, measured in bytes. Offset from the start of the Part, exclusive, starting at zero.

text string

The text corresponding to the segment from the response.

JSON representation
{ "partIndex": integer, "startIndex": integer, "endIndex": integer, "text": string }
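Since startIndex and endIndex are byte offsets, slicing a Python string directly by these values can go wrong as soon as the text contains multi-byte characters. The sketch below assumes the offsets refer to the UTF-8 encoding of the part text (the encoding is not stated in this reference); `segment_text` is a hypothetical helper.

```python
def segment_text(part_text: str, segment: dict) -> str:
    """Extract the text covered by a Segment, treating indices as UTF-8 byte offsets."""
    data = part_text.encode("utf-8")
    start = segment.get("startIndex", 0)          # inclusive
    end = segment.get("endIndex", len(data))      # exclusive
    return data[start:end].decode("utf-8")


text = "Café is open."
# "é" occupies two bytes in UTF-8, so "Café" spans bytes 0..5.
print(segment_text(text, {"partIndex": 0, "startIndex": 0, "endIndex": 5}))
```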

RetrievalMetadata

Metadata related to retrieval in the grounding flow.

Fields

googleSearchDynamicRetrievalScore number

Optional. Score indicating how likely it is that information from Google Search could help answer the prompt. The score is in the range [0, 1], where 0 is the least likely and 1 is the most likely. This score is only populated when Google Search grounding and dynamic retrieval are enabled. It is compared to the threshold to determine whether to trigger Google Search.

JSON representation
{ "googleSearchDynamicRetrievalScore": number }

LogprobsResult

Logprobs result.

Fields

topCandidates[] object ( TopCandidates )

Length = total number of decoding steps.

chosenCandidates[] object ( Candidate )

Length = total number of decoding steps. The chosen candidates may or may not be in topCandidates.

logProbabilitySum number

Sum of log probabilities for all tokens.

JSON representation
{ "topCandidates": [ { object (`TopCandidates`) } ], "chosenCandidates": [ { object (`Candidate`) } ], "logProbabilitySum": number }

TopCandidates

Candidates with top log probabilities at each decoding step.

Fields

candidates[] object ( Candidate )

Sorted by log probability in descending order.

JSON representation
{ "candidates": [ { object (`Candidate`) } ] }

Candidate

Candidate for the logprobs token and score.

Fields

token string

The candidate's token string value.

tokenId integer

The candidate's token ID value.

logProbability number

The candidate's log probability.

JSON representation
{ "token": string, "tokenId": integer, "logProbability": number }
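A small sketch of working with a LogprobsResult parsed into a dict: it sums the per-step log probabilities of the chosen tokens on the client and checks, step by step, whether the chosen token also appears among the top candidates. The helper names and sample values are hypothetical.

```python
import math


def chosen_log_probability(logprobs_result: dict) -> float:
    """Sum logProbability over the chosen candidates, one per decoding step."""
    return sum(c["logProbability"] for c in logprobs_result.get("chosenCandidates", []))


def chosen_in_top(logprobs_result: dict) -> list:
    """For each decoding step, report whether the chosen token is among the top candidates."""
    flags = []
    steps = zip(logprobs_result.get("chosenCandidates", []),
                logprobs_result.get("topCandidates", []))
    for chosen, top in steps:
        top_ids = {c["tokenId"] for c in top.get("candidates", [])}
        flags.append(chosen["tokenId"] in top_ids)
    return flags


# Hypothetical two-step result for illustration.
sample_logprobs = {
    "chosenCandidates": [
        {"token": "Hi", "tokenId": 1, "logProbability": -0.5},
        {"token": "!", "tokenId": 2, "logProbability": -1.5},
    ],
    "topCandidates": [
        {"candidates": [{"token": "Hi", "tokenId": 1, "logProbability": -0.5}]},
        {"candidates": [{"token": ".", "tokenId": 3, "logProbability": -0.25}]},
    ],
}
total = chosen_log_probability(sample_logprobs)
print(total, math.exp(total), chosen_in_top(sample_logprobs))
```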

UrlContextMetadata

Metadata related to the URL context retrieval tool.

Fields

urlMetadata[] object ( UrlMetadata )

List of URL contexts.

JSON representation
{ "urlMetadata": [ { object (`UrlMetadata`) } ] }

UrlMetadata

Context of a single URL retrieval.

Fields

retrievedUrl string

The URL retrieved by the tool.

urlRetrievalStatus enum ( UrlRetrievalStatus )

Status of the URL retrieval.

JSON representation
{ "retrievedUrl": string, "urlRetrievalStatus": enum (`UrlRetrievalStatus`) }

UrlRetrievalStatus

Status of the URL retrieval.

Enums
`URL_RETRIEVAL_STATUS_UNSPECIFIED`	Default value. This value is unused.
`URL_RETRIEVAL_STATUS_SUCCESS`	URL retrieval succeeded.
`URL_RETRIEVAL_STATUS_ERROR`	URL retrieval failed due to an error.
`URL_RETRIEVAL_STATUS_PAYWALL`	URL retrieval failed because the content is behind a paywall.
`URL_RETRIEVAL_STATUS_UNSAFE`	URL retrieval failed because the content is unsafe.
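As a sketch of how a client might use these statuses, the following collects every URL in a UrlContextMetadata (parsed into a dict) that was not retrieved successfully. `failed_urls` and the sample URLs are hypothetical.

```python
def failed_urls(url_context_metadata: dict) -> dict:
    """Map each URL that was not retrieved successfully to its urlRetrievalStatus."""
    failures = {}
    for item in url_context_metadata.get("urlMetadata", []):
        status = item.get("urlRetrievalStatus", "URL_RETRIEVAL_STATUS_UNSPECIFIED")
        if status != "URL_RETRIEVAL_STATUS_SUCCESS":
            failures[item["retrievedUrl"]] = status
    return failures


# Hypothetical metadata for illustration.
url_meta = {
    "urlMetadata": [
        {"retrievedUrl": "https://example.com/a",
         "urlRetrievalStatus": "URL_RETRIEVAL_STATUS_SUCCESS"},
        {"retrievedUrl": "https://example.com/b",
         "urlRetrievalStatus": "URL_RETRIEVAL_STATUS_PAYWALL"},
    ]
}
print(failed_urls(url_meta))
```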

CitationMetadata

A collection of source attributions for a piece of content.

Fields

citationSources[] object ( CitationSource )

Citations to sources for a specific response.

JSON representation
{ "citationSources": [ { object (`CitationSource`) } ] }

CitationSource

A citation to a source for a portion of a specific response.

Fields

startIndex integer

Optional. Start of segment of the response that is attributed to this source.

Index indicates the start of the segment, measured in bytes.

endIndex integer

Optional. End of the attributed segment, exclusive.

uri string

Optional. URI that is attributed as a source for a portion of the text.

license string

Optional. License for the GitHub project that is attributed as a source for segment.

License info is required for code citations.

JSON representation
{ "startIndex": integer, "endIndex": integer, "uri": string, "license": string }

HarmCategory

Harm categories that can be detected in user input and model responses.

Enums
`HARM_CATEGORY_UNSPECIFIED`	Default value. This value is unused.
`HARM_CATEGORY_HATE_SPEECH`	Content that promotes violence or incites hatred against individuals or groups based on certain attributes.
`HARM_CATEGORY_DANGEROUS_CONTENT`	Content that promotes, facilitates, or enables dangerous activities.
`HARM_CATEGORY_HARASSMENT`	Abusive, threatening, or content intended to bully, torment, or ridicule.
`HARM_CATEGORY_SEXUALLY_EXPLICIT`	Content that contains sexually explicit material.
`HARM_CATEGORY_CIVIC_INTEGRITY`	Deprecated: Election filter is no longer supported. The harm category is civic integrity.
`HARM_CATEGORY_IMAGE_HATE`	Images that contain hate speech.
`HARM_CATEGORY_IMAGE_DANGEROUS_CONTENT`	Images that contain dangerous content.
`HARM_CATEGORY_IMAGE_HARASSMENT`	Images that contain harassment.
`HARM_CATEGORY_IMAGE_SEXUALLY_EXPLICIT`	Images that contain sexually explicit content.
`HARM_CATEGORY_JAILBREAK`	Prompts designed to bypass safety filters.

ModalityTokenCount

Represents token counting info for a single modality.

Fields

modality enum ( Modality )

The modality associated with this token count.

tokenCount integer

Number of tokens.

JSON representation
{ "modality": enum (`Modality`), "tokenCount": integer }

Modality

Content Part modality

Enums
`MODALITY_UNSPECIFIED`	Unspecified modality.
`TEXT`	Plain text.
`IMAGE`	Image.
`VIDEO`	Video.
`AUDIO`	Audio.
`DOCUMENT`	Document, e.g. PDF.
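For illustration, a list of ModalityTokenCount objects (parsed into dicts) can be totalled per modality as sketched below. `tokens_by_modality` and the sample counts are hypothetical.

```python
from collections import Counter


def tokens_by_modality(counts: list) -> dict:
    """Sum tokenCount per modality over a list of ModalityTokenCount objects."""
    totals = Counter()
    for entry in counts:
        totals[entry.get("modality", "MODALITY_UNSPECIFIED")] += entry.get("tokenCount", 0)
    return dict(totals)


# Hypothetical token details for illustration.
details = [
    {"modality": "TEXT", "tokenCount": 12},
    {"modality": "IMAGE", "tokenCount": 258},
    {"modality": "TEXT", "tokenCount": 3},
]
print(tokens_by_modality(details))
```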

SafetyRating

Safety rating for a piece of content.

The safety rating contains the category of harm and the harm probability level in that category for a piece of content. Content is classified for safety across a number of harm categories and the probability of the harm classification is included here.

Fields

category enum ( HarmCategory )

Required. The category for this rating.

probability enum ( HarmProbability )

Required. The probability of harm for this content.

blocked boolean

Was this content blocked because of this rating?

JSON representation
{ "category": enum (`HarmCategory`), "probability": enum (`HarmProbability`), "blocked": boolean }

HarmProbability

The probability that a piece of content is harmful.

The classification system gives the probability of the content being unsafe. This does not indicate the severity of harm for a piece of content.

Enums
`HARM_PROBABILITY_UNSPECIFIED`	Probability is unspecified.
`NEGLIGIBLE`	Content has a negligible chance of being unsafe.
`LOW`	Content has a low chance of being unsafe.
`MEDIUM`	Content has a medium chance of being unsafe.
`HIGH`	Content has a high chance of being unsafe.

SafetySetting

Safety setting, affecting the safety-blocking behavior.

Passing a safety setting for a category changes the allowed probability that content is blocked.

Fields

category enum ( HarmCategory )

Required. The category for this setting.

threshold enum ( HarmBlockThreshold )

Required. Controls the probability threshold at which harm is blocked.

JSON representation
{ "category": enum (`HarmCategory`), "threshold": enum (`HarmBlockThreshold`) }

HarmBlockThreshold

Block at and beyond a specified harm probability.

Enums
`HARM_BLOCK_THRESHOLD_UNSPECIFIED`	Threshold is unspecified.
`BLOCK_LOW_AND_ABOVE`	Content with NEGLIGIBLE will be allowed.
`BLOCK_MEDIUM_AND_ABOVE`	Content with NEGLIGIBLE and LOW will be allowed.
`BLOCK_ONLY_HIGH`	Content with NEGLIGIBLE, LOW, and MEDIUM will be allowed.
`BLOCK_NONE`	All content will be allowed.
`OFF`	Turn off the safety filter.
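The table above can be read as a comparison between a rating's HarmProbability and the configured threshold. The following is a local illustration of that reading only, not the service's actual filtering logic; `is_blocked` and both lookup tables are hypothetical, and HARM_BLOCK_THRESHOLD_UNSPECIFIED is deliberately left unmodeled because its behavior is not stated here.

```python
# HarmProbability levels from least to most likely to be unsafe.
PROBABILITY_ORDER = ["NEGLIGIBLE", "LOW", "MEDIUM", "HIGH"]

# Lowest probability level that each threshold blocks; None means nothing is blocked.
BLOCK_FROM = {
    "BLOCK_LOW_AND_ABOVE": "LOW",
    "BLOCK_MEDIUM_AND_ABOVE": "MEDIUM",
    "BLOCK_ONLY_HIGH": "HIGH",
    "BLOCK_NONE": None,
    "OFF": None,
}


def is_blocked(threshold: str, probability: str) -> bool:
    """Return True if content with this probability would be blocked under the threshold."""
    if threshold not in BLOCK_FROM:
        raise ValueError(f"unmodeled threshold: {threshold}")
    floor = BLOCK_FROM[threshold]
    if floor is None:
        return False
    return PROBABILITY_ORDER.index(probability) >= PROBABILITY_ORDER.index(floor)


print(is_blocked("BLOCK_MEDIUM_AND_ABOVE", "LOW"))
print(is_blocked("BLOCK_MEDIUM_AND_ABOVE", "MEDIUM"))
```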

ServiceTier

The service tier of the interaction.

Enums
`SERVICE_TIER_UNSPECIFIED`	Default service tier, which is standard.
`SERVICE_TIER_FLEX`	Flex service tier.
`SERVICE_TIER_STANDARD`	Standard service tier.
`SERVICE_TIER_PRIORITY`	Priority service tier.

AllowedTools

The configuration for allowed tools.

Fields

mode enum ( ToolChoiceType )

The mode of the tool choice.

tools[] string

The names of the allowed tools.

JSON representation
{ "mode": enum (`ToolChoiceType`), "tools": [ string ] }

Annotation

Citation information for model-generated content.

Fields

startIndex integer

Start of segment of the response that is attributed to this source.

Index indicates the start of the segment, measured in bytes.

endIndex integer

End of the attributed segment, exclusive.

type Union type

The type of annotation. type can be only one of the following:

urlCitation object ( UrlCitation )

A URL citation annotation.

fileCitation object ( FileCitation )

A file citation annotation.

placeCitation object ( PlaceCitation )

A place citation annotation.

JSON representation

{
  "startIndex": integer,
  "endIndex": integer,

  // type
  "urlCitation": {
    object (UrlCitation)
  },
  "fileCitation": {
    object (FileCitation)
  },
  "placeCitation": {
    object (PlaceCitation)
  }
  // Union type
}

UrlCitation

A URL citation annotation.

Fields

url string

The URL.

title string

The title of the URL.

JSON representation
{ "url": string, "title": string }

FileCitation

A file citation annotation.

Fields

documentUri string

The URI of the file.

fileName string

The name of the file.

source string

Source attributed for a portion of the text.

customMetadata object ( Struct )

User provided metadata about the retrieved context.

pageNumber integer

Page number of the cited document, if applicable.

mediaId string

Media ID in case of image citations, if applicable.

JSON representation
{ "documentUri": string, "fileName": string, "source": string, "customMetadata": { object (`Struct`) }, "pageNumber": integer, "mediaId": string }

PlaceCitation

A place citation annotation.

Fields

placeId string

The ID of the place, in places/{placeId} format.

name string

Title of the place.

url string

URI reference of the place.

reviewSnippets[] object ( ReviewSnippet )

Snippets of reviews that are used to generate answers about the features of a given place in Google Maps.

JSON representation
{ "placeId": string, "name": string, "url": string, "reviewSnippets": [ { object (`ReviewSnippet`) } ] }
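As a sketch of consuming the type union of an Annotation, the helper below turns one annotation (parsed into a dict) into a short human-readable label, depending on whether it carries a urlCitation, fileCitation or placeCitation. `citation_label`, the label formats and the sample values are hypothetical.

```python
def citation_label(annotation: dict) -> str:
    """Build a short display label from whichever citation type is set."""
    if "urlCitation" in annotation:
        citation = annotation["urlCitation"]
        return f'{citation.get("title", "")} <{citation.get("url", "")}>'
    if "fileCitation" in annotation:
        citation = annotation["fileCitation"]
        page = citation.get("pageNumber")
        suffix = f", p. {page}" if page is not None else ""
        return f'{citation.get("fileName", "")}{suffix}'
    if "placeCitation" in annotation:
        citation = annotation["placeCitation"]
        return f'{citation.get("name", "")} ({citation.get("placeId", "")})'
    raise ValueError("annotation has no citation type set")


url_annotation = {
    "startIndex": 0,
    "endIndex": 4,
    "urlCitation": {"url": "https://example.com", "title": "Example"},
}
print(citation_label(url_annotation))
```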

AspectRatio

Supported aspect ratios for image output.

Enums
`ASPECT_RATIO_UNSPECIFIED`	Default value. This value is unused.
`ASPECT_RATIO_ONE_BY_ONE`	1:1 aspect ratio.
`ASPECT_RATIO_TWO_BY_THREE`	2:3 aspect ratio.
`ASPECT_RATIO_THREE_BY_TWO`	3:2 aspect ratio.
`ASPECT_RATIO_THREE_BY_FOUR`	3:4 aspect ratio.
`ASPECT_RATIO_FOUR_BY_THREE`	4:3 aspect ratio.
`ASPECT_RATIO_FOUR_BY_FIVE`	4:5 aspect ratio.
`ASPECT_RATIO_FIVE_BY_FOUR`	5:4 aspect ratio.
`ASPECT_RATIO_NINE_BY_SIXTEEN`	9:16 aspect ratio.
`ASPECT_RATIO_SIXTEEN_BY_NINE`	16:9 aspect ratio.
`ASPECT_RATIO_TWENTY_ONE_BY_NINE`	21:9 aspect ratio.
`ASPECT_RATIO_ONE_BY_EIGHT`	1:8 aspect ratio.
`ASPECT_RATIO_EIGHT_BY_ONE`	8:1 aspect ratio.
`ASPECT_RATIO_ONE_BY_FOUR`	1:4 aspect ratio.
`ASPECT_RATIO_FOUR_BY_ONE`	4:1 aspect ratio.

AudioResponseFormat

Configuration for audio output format.

Fields

mimeType enum ( MimeType )

The MIME type of the audio output.

delivery enum ( Delivery )

The delivery mode for the audio output.

sampleRate integer

Sample rate in Hz.

bitRate integer

Bit rate in bits per second (bps). Only applicable for compressed formats (MP3, Opus).

JSON representation
{ "mimeType": enum (`MimeType`), "delivery": enum (`Delivery`), "sampleRate": integer, "bitRate": integer }

CodeExecution

This type has no fields.

A tool that can be used by the model to execute code.

CodeExecutionCallStep

Code execution call step.

Fields

arguments object ( CodeExecutionCallStepArguments )

Required. The arguments to pass to the code execution.

JSON representation
{ "arguments": { object (`CodeExecutionCallStepArguments`) } }

CodeExecutionCallStepArguments

The arguments to pass to the code execution.

Fields

language enum ( Language )

Programming language of the code.

code string

The code to be executed.

JSON representation
{ "language": enum (`Language`), "code": string }

CodeExecutionResultStep

Code execution result step.

Fields

result string

Required. The output of the code execution.

isError boolean

Whether the code execution resulted in an error.

JSON representation
{ "result": string, "isError": boolean }

ComputerUse

A tool that can be used by the model to interact with the computer.

Fields

environment enum ( Environment )

The environment being operated.

excludedPredefinedFunctions[] string

The list of predefined functions that are excluded from the model call.

enablePromptInjectionDetection boolean

Whether to enable the prompt injection detection check on computer-use requests.

disabledSafetyPolicies[] enum ( SafetyPolicy )

Optional. Disabled safety policies for computer use.

JSON representation
{ "environment": enum (`Environment`), "excludedPredefinedFunctions": [ string ], "enablePromptInjectionDetection": boolean, "disabledSafetyPolicies": [ enum (`SafetyPolicy`) ] }

Content

The content of the response.

Fields

type Union type

type can be only one of the following:

text object ( TextContent )

image object ( ImageContent )

audio object ( AudioContent )

document object ( DocumentContent )

video object ( VideoContent )

thought (deprecated) object ( ThoughtContent )

toolCall (deprecated) object ( ToolCallContent )

toolResult (deprecated) object ( ToolResultContent )

JSON representation

{

  // type
  "text": {
    object (TextContent)
  },
  "image": {
    object (ImageContent)
  },
  "audio": {
    object (AudioContent)
  },
  "document": {
    object (DocumentContent)
  },
  "video": {
    object (VideoContent)
  },
  "thought": {
    object (ThoughtContent)
  },
  "toolCall": {
    object (ToolCallContent)
  },
  "toolResult": {
    object (ToolResultContent)
  }
  // Union type
}

TextContent

A text content block.

Fields

text string

Required. The text content.

annotations[] object ( Annotation )

Citation information for model-generated content.

JSON representation
{ "text": string, "annotations": [ { object (`Annotation`) } ] }

ImageContent

An image content block.

Fields

mimeType enum ( MimeType )

The mime type of the image.

resolution enum ( MediaResolution )

The resolution of the media.

data_or_uri Union type

The image content. data_or_uri can be only one of the following:

data string ( bytes format)

The image content.

A base64-encoded string.

uri string

The URI of the image.

JSON representation

{
  "mimeType": enum (MimeType),
  "resolution": enum (MediaResolution),

  // data_or_uri
  "data": string,
  "uri": string
  // Union type
}
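Because data_or_uri is a union, an ImageContent carries either inline base64 data or a URI, never both. A minimal sketch of building and reading such a block as a plain dict follows; the helper names are hypothetical, and "image/png" is an assumed MimeType value, since the enum's values are not listed in this section.

```python
import base64


def image_content_from_bytes(data: bytes, mime_type: str) -> dict:
    """Build an ImageContent dict that carries the image inline as base64."""
    return {"mimeType": mime_type, "data": base64.b64encode(data).decode("ascii")}


def image_content_from_uri(uri: str, mime_type: str) -> dict:
    """Build an ImageContent dict that references the image by URI."""
    return {"mimeType": mime_type, "uri": uri}


def image_bytes(content: dict) -> bytes:
    """Return the inline image bytes, or fail if the block only references a URI."""
    if "data" not in content:
        raise ValueError("content references a URI; fetch it separately")
    return base64.b64decode(content["data"])


# "image/png" is an assumed MimeType value used for illustration.
inline = image_content_from_bytes(b"\x89PNG", "image/png")
print(inline)
```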

AudioContent

An audio content block.

Fields

mimeType enum ( MimeType )

The mime type of the audio.

channels integer

The number of audio channels.

sampleRate integer

The sample rate of the audio.

data_or_uri Union type

The audio content. data_or_uri can be only one of the following:

data string ( bytes format)

The audio content.

A base64-encoded string.

uri string

The URI of the audio.

JSON representation

{
  "mimeType": enum (MimeType),
  "channels": integer,
  "sampleRate": integer,

  // data_or_uri
  "data": string,
  "uri": string
  // Union type
}

DocumentContent

A document content block.

Fields

mimeType enum ( MimeType )

The mime type of the document.

data_or_uri Union type

The document content. data_or_uri can be only one of the following:

data string ( bytes format)

The document content.

A base64-encoded string.

uri string

The URI of the document.

JSON representation

{
  "mimeType": enum (MimeType),

  // data_or_uri
  "data": string,
  "uri": string
  // Union type
}

VideoContent

A video content block.

Fields

mimeType enum ( MimeType )

The mime type of the video.

resolution enum ( MediaResolution )

The resolution of the media.

data_or_uri Union type

The video content. data_or_uri can be only one of the following:

data string ( bytes format)

The video content.

A base64-encoded string.

uri string

The URI of the video.

JSON representation

{
  "mimeType": enum (MimeType),
  "resolution": enum (MediaResolution),

  // data_or_uri
  "data": string,
  "uri": string
  // Union type
}

ThoughtContent

A thought content block.

Fields

signature string ( bytes format)

Signature to match the backend source to be part of the generation.

A base64-encoded string.

summary[] object ( ThoughtSummaryContent )

A summary of the thought.

JSON representation
{ "signature": string, "summary": [ { object (`ThoughtSummaryContent`) } ] }

ThoughtSummaryContent

Fields

type Union type

type can be only one of the following:

text object ( TextContent )

image object ( ImageContent )

JSON representation

{

  // type
  "text": {
    object (TextContent)
  },
  "image": {
    object (ImageContent)
  }
  // Union type
}

ToolCallContent

Tool call content.

Fields

id string

Required. A unique ID for this specific tool call.

signature string ( bytes format)

A signature hash for backend validation.

A base64-encoded string.

type Union type

type can be only one of the following:

functionCall object ( FunctionCallContent )

codeExecutionCall object ( CodeExecutionCallContent )

urlContextCall object ( UrlContextCallContent )

mcpServerToolCall object ( McpServerToolCallContent )

googleSearchCall object ( GoogleSearchCallContent )

fileSearchCall object ( FileSearchCallContent )

googleMapsCall object ( GoogleMapsCallContent )

JSON representation

{
  "id": string,
  "signature": string,

  // type
  "functionCall": {
    object (FunctionCallContent)
  },
  "codeExecutionCall": {
    object (CodeExecutionCallContent)
  },
  "urlContextCall": {
    object (UrlContextCallContent)
  },
  "mcpServerToolCall": {
    object (McpServerToolCallContent)
  },
  "googleSearchCall": {
    object (GoogleSearchCallContent)
  },
  "fileSearchCall": {
    object (FileSearchCallContent)
  },
  "googleMapsCall": {
    object (GoogleMapsCallContent)
  }
  // Union type
}

FunctionCallContent

A function tool call content block.

Fields

name string

Required. The name of the tool to call.

arguments object ( Struct )

Required. The arguments to pass to the function.

JSON representation
{ "name": string, "arguments": { object (`Struct`) } }

CodeExecutionCallContent

Code execution content.

Fields

arguments object ( CodeExecutionCallArguments )

Required. The arguments to pass to the code execution.

JSON representation
{ "arguments": { object (`CodeExecutionCallArguments`) } }

CodeExecutionCallArguments

The arguments to pass to the code execution.

Fields

language enum ( Language )

Programming language of the code.

code string

The code to be executed.

JSON representation
{ "language": enum (`Language`), "code": string }

UrlContextCallContent

URL context content.

Fields

arguments object ( UrlContextCallArguments )

Required. The arguments to pass to the URL context.

JSON representation
{ "arguments": { object (`UrlContextCallArguments`) } }

UrlContextCallArguments

The arguments to pass to the URL context.

Fields

urls[] string

The URLs to fetch.

JSON representation
{ "urls": [ string ] }

McpServerToolCallContent

MCPServer tool call content.

Fields

name string

Required. The name of the tool which was called.

serverName string

Required. The name of the used MCP server.

arguments object ( Struct )

Required. The JSON object of arguments for the function.

JSON representation
{ "name": string, "serverName": string, "arguments": { object (`Struct`) } }

GoogleSearchCallContent

Google Search content.

Fields

arguments object ( GoogleSearchCallArguments )

Required. The arguments to pass to Google Search.

searchType enum ( SearchType )

The type of search grounding enabled.

JSON representation
{ "arguments": { object (`GoogleSearchCallArguments`) }, "searchType": enum (`SearchType`) }

GoogleSearchCallArguments

The arguments to pass to Google Search.

Fields

queries[] string

Web search queries for the follow-up web search.

JSON representation
{ "queries": [ string ] }

FileSearchCallContent

This type has no fields.

File Search content.

GoogleMapsCallContent

Google Maps content.

Fields

arguments object ( GoogleMapsCallArguments )

The arguments to pass to the Google Maps tool.

JSON representation
{ "arguments": { object (`GoogleMapsCallArguments`) } }

GoogleMapsCallArguments

The arguments to pass to the Google Maps tool.

Fields

queries[] string

The queries to be executed.

JSON representation
{ "queries": [ string ] }

ToolResultContent

Tool result content.

Fields

callId string

Required. ID to match the ID from the function call block.

signature string ( bytes format)

A signature hash for backend validation.

A base64-encoded string.

type Union type

type can be only one of the following:

functionResult object ( FunctionResultContent )

codeExecutionResult object ( CodeExecutionResultContent )

urlContextResult object ( UrlContextResultContent )

googleSearchResult object ( GoogleSearchResultContent )

mcpServerToolResult object ( McpServerToolResultContent )

fileSearchResult object ( FileSearchResultContent )

googleMapsResult object ( GoogleMapsResultContent )

JSON representation

{
  "callId": string,
  "signature": string,

  // type
  "functionResult": {
    object (FunctionResultContent)
  },
  "codeExecutionResult": {
    object (CodeExecutionResultContent)
  },
  "urlContextResult": {
    object (UrlContextResultContent)
  },
  "googleSearchResult": {
    object (GoogleSearchResultContent)
  },
  "mcpServerToolResult": {
    object (McpServerToolResultContent)
  },
  "fileSearchResult": {
    object (FileSearchResultContent)
  },
  "googleMapsResult": {
    object (GoogleMapsResultContent)
  }
  // Union type
}
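Since callId on a ToolResultContent matches the id of a ToolCallContent, results can be paired with their calls by a simple lookup. A sketch under the assumption that both have been parsed into dicts; `pair_calls_with_results`, the tool name `get_weather` and the sample IDs are hypothetical.

```python
def pair_calls_with_results(calls: list, results: list) -> list:
    """Pair each ToolCallContent with the ToolResultContent whose callId equals its id."""
    results_by_id = {result["callId"]: result for result in results}
    # Calls without a matching result are paired with None.
    return [(call, results_by_id.get(call["id"])) for call in calls]


# Hypothetical tool calls and results for illustration.
calls = [
    {"id": "call-1", "functionCall": {"name": "get_weather", "arguments": {"city": "Paris"}}},
    {"id": "call-2", "urlContextCall": {"arguments": {"urls": ["https://example.com"]}}},
]
results = [
    {"callId": "call-1", "functionResult": {"name": "get_weather", "stringResult": "sunny"}},
]
for call, result in pair_calls_with_results(calls, results):
    print(call["id"], "->", "pending" if result is None else "answered")
```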

FunctionResultContent

A function tool result content block.

Fields

name string

The name of the tool that was called.

isError boolean

Whether the tool call resulted in an error.

result Union type

The result of the tool call. result can be only one of the following:

structResult object ( Struct )

contentList object ( FunctionResultSubcontentList )

stringResult string

JSON representation

{
  "name": string,
  "isError": boolean,

  // result
  "structResult": {
    object (Struct)
  },
  "contentList": {
    object (FunctionResultSubcontentList)
  },
  "stringResult": string
  // Union type
}

FunctionResultSubcontentList

Fields

contents[] object ( FunctionResultSubcontent )

JSON representation
{ "contents": [ { object (`FunctionResultSubcontent`) } ] }

FunctionResultSubcontent

Fields

type Union type

type can be only one of the following:

text object ( TextContent )

image object ( ImageContent )

JSON representation

{

  // type
  "text": {
    object (TextContent)
  },
  "image": {
    object (ImageContent)
  }
  // Union type
}
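One way to fill the result union of a FunctionResultContent is to choose the member from the Python type of the value being returned. This is only a sketch of that idea: `function_result` is a hypothetical helper, the mapping from Python types to union members is an assumption, and the tool name is invented for the example.

```python
def function_result(name: str, value, is_error: bool = False) -> dict:
    """Build a FunctionResultContent dict, choosing the result member by value type."""
    content = {"name": name, "isError": is_error}
    if isinstance(value, str):
        content["stringResult"] = value
    elif isinstance(value, dict):
        content["structResult"] = value
    elif isinstance(value, list):
        # A list is treated as FunctionResultSubcontent items wrapped in a contents array.
        content["contentList"] = {"contents": value}
    else:
        raise TypeError(f"unsupported result type: {type(value).__name__}")
    return content


print(function_result("get_weather", {"tempC": 21}))
print(function_result("get_weather", [{"text": {"text": "Sunny, 21 C"}}]))
```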

CodeExecutionResultContent

Code execution result content.

Fields

result string

Required. The output of the code execution.

isError boolean

Whether the code execution resulted in an error.

JSON representation
{ "result": string, "isError": boolean }

UrlContextResultContent

URL context result content.

Fields

result[] object ( UrlContextResult )

Required. The results of the URL context.

isError boolean

Whether the URL context resulted in an error.

JSON representation
{ "result": [ { object (`UrlContextResult`) } ], "isError": boolean }

UrlContextResult

The result of the URL context.

Fields

url string

The URL that was fetched.

status enum ( Status )

The status of the URL retrieval.

JSON representation
{ "url": string, "status": enum (`Status`) }

GoogleSearchResultContent

Google Search result content.

Fields

result[] object ( GoogleSearchResult )

Required. The results of the Google Search.

isError boolean

Whether the Google Search resulted in an error.

JSON representation
{ "result": [ { object (`GoogleSearchResult`) } ], "isError": boolean }

GoogleSearchResult

The result of the Google Search.

Fields

searchSuggestions string

Web content snippet that can be embedded in a web page or an app webview.

JSON representation
{ "searchSuggestions": string }

McpServerToolResultContent

MCPServer tool result content.

Fields

name string

Name of the tool which is called for this specific tool call.

serverName string

The name of the used MCP server.

result Union type

The output from the MCP server call. Can be simple text or rich content. result can be only one of the following:

structResult object ( Struct )

contentList object ( FunctionResultSubcontentList )

stringResult string

JSON representation

{
  "name": string,
  "serverName": string,

  // result
  "structResult": {
    object (Struct)
  },
  "contentList": {
    object (FunctionResultSubcontentList)
  },
  "stringResult": string
  // Union type
}

FileSearchResultContent

File Search result content.

Fields

result[] object ( FileSearchResult )

Optional. The results of the File Search.

JSON representation
{ "result": [ { object (`FileSearchResult`) } ] }

FileSearchResult

This type has no fields.

The result of the File Search.

GoogleMapsResultContent

Google Maps result content.

Fields

result[] object ( GoogleMapsResult )

Required. The results of the Google Maps.

JSON representation
{ "result": [ { object (`GoogleMapsResult`) } ] }

GoogleMapsResult

The result of the Google Maps.

Fields

places[] object ( Places )

The places that were found.

widgetContextToken string

Resource name of the Google Maps widget context token.

JSON representation
{ "places": [ { object (`Places`) } ], "widgetContextToken": string }

Places

Fields

placeId string

The ID of the place, in places/{placeId} format.

name string

Title of the place.

url string

URI reference of the place.

reviewSnippets[] object ( ReviewSnippet )

Snippets of reviews that are used to generate answers about the features of a given place in Google Maps.

JSON representation
{ "placeId": string, "name": string, "url": string, "reviewSnippets": [ { object (`ReviewSnippet`) } ] }

ContentList

A list of Content.

Fields

contents[] object ( Content )

The contents of the list.

JSON representation
{ "contents": [ { object (`Content`) } ] }

CreateInteractionRequest

Configuration parameters for creating an interaction.

Fields

stream boolean

Input only. Whether the interaction will be streamed.

store boolean

Input only. Whether to store the response and request for later retrieval.

interaction object ( Interaction )

The interaction to create.

background boolean

Input only. Whether to run the model interaction in the background.

JSON representation
{ "stream": boolean, "store": boolean, "interaction": { object (`Interaction`) }, "background": boolean }
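
As a minimal sketch of how the fields above fit together, the following Python builds a CreateInteractionRequest body as a plain dictionary and serializes it to JSON. The helper name `build_create_interaction_request` and the model name `models/example-model` are hypothetical placeholders, and the sketch does not send the request anywhere.

```python
import json


def build_create_interaction_request(prompt, model, *, stream=False, store=True, background=False):
    """Assemble a CreateInteractionRequest body as a plain dict (illustrative helper)."""
    return {
        "stream": stream,
        "store": store,
        "background": background,
        "interaction": {
            # `input` union: exactly one member is set, here stringContent.
            "stringContent": prompt,
            # `request_type` union: modelInteraction selects a model-backed interaction.
            "modelInteraction": {"model": model},
        },
    }


# "models/example-model" is a placeholder, not a real model name.
body = build_create_interaction_request("Hello", "models/example-model")
print(json.dumps(body, indent=2))
```

Keeping the body as a plain dictionary makes it easy to add optional fields such as systemInstruction or tools before serializing.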

Interaction

Response for InteractionService.CreateInteraction.

Fields

id string

Required. Output only. A unique identifier for the interaction completion.

status enum ( Status )

Required. Output only. The status of the interaction.

created string

Required. Output only. The time at which the response was created in ISO 8601 format (YYYY-MM-DDThh:mm:ssZ).

updated string

Required. Output only. The time at which the response was last updated in ISO 8601 format (YYYY-MM-DDThh:mm:ssZ).

role (deprecated) string

Output only. The role of the interaction.

outputs[] (deprecated) object ( Content )

Output only. Responses from the model.

systemInstruction string

System instruction for the interaction.

tools[] object ( Tool )

A list of tool declarations the model may call during interaction.

usage object ( Usage )

Output only. Statistics on the interaction request's token usage.

responseModalities[] (deprecated) enum ( ResponseModality )

The requested modalities of the response (TEXT, IMAGE, AUDIO).

responseMimeType (deprecated) string

The mime type of the response. This is required if responseFormat is set.

previousInteractionId string

The ID of the previous interaction, if any.

environmentId string

Output only. The environment ID for the interaction. Only populated if environment config is set in the request.

serviceTier enum ( ServiceTier )

The service tier for the interaction.

webhookConfig object ( WebhookConfig )

Optional. Webhook configuration for receiving notifications when the interaction completes.

steps[] object ( Step )

Required. Output only. The steps that make up the interaction.

input Union type

The input for the interaction. input can be only one of the following:

contentList (deprecated) object ( ContentList )

The inputs for the interaction.

stringContent string

A string input for the interaction, it will be processed as a single text input.

turnList (deprecated) object ( TurnList )

The turns for the interaction.

stepList object ( StepList )

Input only. The steps for the interaction.

content object ( Content )

The content for the interaction.

response_format_config Union type

response_format_config can be only one of the following:

responseFormat (deprecated) object ( Value )

Enforces that the generated response is a JSON object that complies with the JSON schema specified in this field.

responseFormatList object ( ResponseFormatList )

responseFormatSingleton object ( ResponseFormat )

request_type Union type

The request type for the interaction. request_type can be only one of the following:

modelInteraction object ( ModelInteraction )

Interaction for generating the completion using models.

agentInteraction object ( AgentInteraction )

Interaction for generating the completion using agents.

environment Union type

The environment configuration for the interaction. environment can be only one of the following:

envId string

The environment ID for the interaction. Can be 'remote' for default environment.

remoteEnvironment object ( EnvironmentConfig )

localEnvironment object ( LocalEnvironmentConfig )

The agent's environment lives on the client connection: its built-in environment operations (filesystem operations and running commands) are yielded to the client to execute, instead of running in a server-managed sandbox. Mutually exclusive with remoteEnvironment. (Independent of any client-declared function tools, which are always executed on the client regardless of this field.)

JSON representation

{
  "id": string,
  "status": enum (Status),
  "created": string,
  "updated": string,
  "role": string,
  "outputs": [
    {
      object (Content)
    }
  ],
  "systemInstruction": string,
  "tools": [
    {
      object (Tool)
    }
  ],
  "usage": {
    object (Usage)
  },
  "responseModalities": [
    enum (ResponseModality)
  ],
  "responseMimeType": string,
  "previousInteractionId": string,
  "environmentId": string,
  "serviceTier": enum (ServiceTier),
  "webhookConfig": {
    object (WebhookConfig)
  },
  "steps": [
    {
      object (Step)
    }
  ],

  // input
  "contentList": {
    object (ContentList)
  },
  "stringContent": string,
  "turnList": {
    object (TurnList)
  },
  "stepList": {
    object (StepList)
  },
  "content": {
    object (Content)
  }
  // Union type

  // response_format_config
  "responseFormat": {
    object (Value)
  },
  "responseFormatList": {
    object (ResponseFormatList)
  },
  "responseFormatSingleton": {
    object (ResponseFormat)
  }
  // Union type

  // request_type
  "modelInteraction": {
    object (ModelInteraction)
  },
  "agentInteraction": {
    object (AgentInteraction)
  }
  // Union type

  // environment
  "envId": string,
  "remoteEnvironment": {
    object (EnvironmentConfig)
  },
  "localEnvironment": {
    object (LocalEnvironmentConfig)
  }
  // Union type
}
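
Because each union group above accepts only one member, a client may want to check a payload before sending it. The following is a minimal sketch of such a check; `UNION_GROUPS` and `find_union_conflicts` are hypothetical names, and the group membership is copied from the field list above.

```python
# Union groups of Interaction, as listed in the field descriptions above.
UNION_GROUPS = {
    "input": ("contentList", "stringContent", "turnList", "stepList", "content"),
    "response_format_config": ("responseFormat", "responseFormatList", "responseFormatSingleton"),
    "request_type": ("modelInteraction", "agentInteraction"),
    "environment": ("envId", "remoteEnvironment", "localEnvironment"),
}


def find_union_conflicts(interaction):
    """Return {group: [members present]} for every group with more than one member set."""
    conflicts = {}
    for group, members in UNION_GROUPS.items():
        present = [member for member in members if member in interaction]
        if len(present) > 1:
            conflicts[group] = present
    return conflicts


valid = {"stringContent": "hi", "modelInteraction": {"model": "models/example-model"}}
invalid = {"stringContent": "hi", "envId": "remote", "localEnvironment": {}}
print(find_union_conflicts(valid))
print(find_union_conflicts(invalid))
```

The check is purely client-side; how the server reports a conflicting payload is not described in this reference.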

TurnList

A list of Turns.

Fields

turns[] object ( Turn )

JSON representation
{ "turns": [ { object (`Turn`) } ] }

Turn

Fields

role string

The originator of this turn. Must be user for input or model for model output.

content Union type

content can be only one of the following:

contentList object ( ContentList )

The content of the turn. An array of Content objects.

contentString string

The content of the turn. A single string.

JSON representation

{
  "role": string,

  // content
  "contentList": {
    object (ContentList)
  },
  "contentString": string
  // Union type
}

StepList

A list of Steps.

Fields

steps[] object ( Step )

The steps of the list.

JSON representation
{ "steps": [ { object (`Step`) } ] }

Step

A step in the interaction.

Fields

type Union type

type can be only one of the following:

thought object ( ThoughtStep )

toolCall object ( ToolCallStep )

toolResult object ( ToolResultStep )

userInput object ( UserInputStep )

DO NOT USE -- These are for 3P JSON only

modelOutput object ( ModelOutputStep )

text (deprecated) object ( LegacyTextContent )

image (deprecated) object ( LegacyImageContent )

audio (deprecated) object ( LegacyAudioContent )

document (deprecated) object ( LegacyDocumentContent )

video (deprecated) object ( LegacyVideoContent )

JSON representation

{

  // type
  "thought": {
    object (ThoughtStep)
  },
  "toolCall": {
    object (ToolCallStep)
  },
  "toolResult": {
    object (ToolResultStep)
  },
  "userInput": {
    object (UserInputStep)
  },
  "modelOutput": {
    object (ModelOutputStep)
  },
  "text": {
    object (LegacyTextContent)
  },
  "image": {
    object (LegacyImageContent)
  },
  "audio": {
    object (LegacyAudioContent)
  },
  "document": {
    object (LegacyDocumentContent)
  },
  "video": {
    object (LegacyVideoContent)
  }
  // Union type
}

ThoughtStep

A thought step.

Fields

signature string ( bytes format)

A signature hash for backend validation.

A base64-encoded string.

summary[] object ( Content )

A summary of the thought.

JSON representation
{ "signature": string, "summary": [ { object (`Content`) } ] }

ToolCallStep

Tool call step.

Fields

id string

Required. A unique ID for this specific tool call.

signature string ( bytes format)

A signature hash for backend validation.

A base64-encoded string.

type Union type

type can be only one of the following:

functionCall object ( FunctionCallStep )

codeExecutionCall object ( CodeExecutionCallStep )

urlContextCall object ( UrlContextCallStep )

mcpServerToolCall object ( McpServerToolCallStep )

googleSearchCall object ( GoogleSearchCallStep )

fileSearchCall object ( FileSearchCallStep )

googleMapsCall object ( GoogleMapsCallStep )

retrievalCall object ( RetrievalCallStep )

JSON representation

{
  "id": string,
  "signature": string,

  // type
  "functionCall": {
    object (FunctionCallStep)
  },
  "codeExecutionCall": {
    object (CodeExecutionCallStep)
  },
  "urlContextCall": {
    object (UrlContextCallStep)
  },
  "mcpServerToolCall": {
    object (McpServerToolCallStep)
  },
  "googleSearchCall": {
    object (GoogleSearchCallStep)
  },
  "fileSearchCall": {
    object (FileSearchCallStep)
  },
  "googleMapsCall": {
    object (GoogleMapsCallStep)
  },
  "retrievalCall": {
    object (RetrievalCallStep)
  }
  // Union type
}

FunctionCallStep

A function tool call step.

Fields

name string

Required. The name of the tool to call.

arguments object ( Struct )

Required. The arguments to pass to the function.

JSON representation
{ "name": string, "arguments": { object (`Struct`) } }

UrlContextCallStep

URL context call step.

Fields

arguments object ( UrlContextCallStepArguments )

Required. The arguments to pass to the URL context.

JSON representation
{ "arguments": { object (`UrlContextCallStepArguments`) } }

UrlContextCallStepArguments

The arguments to pass to the URL context.

Fields

urls[] string

The URLs to fetch.

JSON representation
{ "urls": [ string ] }

McpServerToolCallStep

MCPServer tool call step.

Fields

name string

Required. The name of the tool which was called.

serverName string

Required. The name of the used MCP server.

arguments object ( Struct )

Required. The JSON object of arguments for the function.

JSON representation
{ "name": string, "serverName": string, "arguments": { object (`Struct`) } }

GoogleSearchCallStep

Google Search call step.

Fields

arguments object ( GoogleSearchCallStepArguments )

Required. The arguments to pass to Google Search.

searchType enum ( SearchType )

The type of search grounding enabled.

JSON representation
{ "arguments": { object (`GoogleSearchCallStepArguments`) }, "searchType": enum (`SearchType`) }

GoogleSearchCallStepArguments

The arguments to pass to Google Search.

Fields

queries[] string

Web search queries for the follow-up web search.

JSON representation
{ "queries": [ string ] }

FileSearchCallStep

This type has no fields.

File Search call step.

GoogleMapsCallStep

Google Maps call step.

Fields

arguments object ( GoogleMapsCallStepArguments )

The arguments to pass to the Google Maps tool.

JSON representation
{ "arguments": { object (`GoogleMapsCallStepArguments`) } }

GoogleMapsCallStepArguments

The arguments to pass to the Google Maps tool.

Fields

queries[] string

The queries to be executed.

JSON representation
{ "queries": [ string ] }

ToolResultStep

Tool result step.

Fields

callId string

Required. ID to match the ID from the function call block.

signature string ( bytes format)

A signature hash for backend validation.

A base64-encoded string.

type Union type

type can be only one of the following:

functionResult object ( FunctionResultStep )

codeExecutionResult object ( CodeExecutionResultStep )

urlContextResult object ( UrlContextResultStep )

googleSearchResult object ( GoogleSearchResultStep )

mcpServerToolResult object ( McpServerToolResultStep )

fileSearchResult object ( FileSearchResultStep )

googleMapsResult object ( GoogleMapsResultStep )

retrievalResult object ( RetrievalResultStep )

JSON representation

{
  "callId": string,
  "signature": string,

  // type
  "functionResult": {
    object (FunctionResultStep)
  },
  "codeExecutionResult": {
    object (CodeExecutionResultStep)
  },
  "urlContextResult": {
    object (UrlContextResultStep)
  },
  "googleSearchResult": {
    object (GoogleSearchResultStep)
  },
  "mcpServerToolResult": {
    object (McpServerToolResultStep)
  },
  "fileSearchResult": {
    object (FileSearchResultStep)
  },
  "googleMapsResult": {
    object (GoogleMapsResultStep)
  },
  "retrievalResult": {
    object (RetrievalResultStep)
  }
  // Union type
}
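
Since a ToolResultStep refers back to its ToolCallStep through callId, a client can pair the two when walking a list of steps. The sketch below assumes each step is a dictionary shaped like the Step JSON representation; the helper name `pair_tool_calls` and the sample call IDs and payloads are hypothetical.

```python
def pair_tool_calls(steps):
    """Match toolCall steps to toolResult steps by id/callId.

    Returns (paired, unanswered): `paired` maps a call id to a
    (toolCall, toolResult) tuple, `unanswered` lists call ids without a result.
    """
    calls = {}
    results = {}
    for step in steps:
        if "toolCall" in step:
            calls[step["toolCall"]["id"]] = step["toolCall"]
        elif "toolResult" in step:
            results[step["toolResult"]["callId"]] = step["toolResult"]
    paired = {cid: (call, results[cid]) for cid, call in calls.items() if cid in results}
    unanswered = [cid for cid in calls if cid not in results]
    return paired, unanswered


# Sample steps with made-up ids and payloads.
steps = [
    {"userInput": {"contentString": "What is the weather in Oslo?"}},
    {"toolCall": {"id": "call-1", "functionCall": {"name": "get_weather", "arguments": {"city": "Oslo"}}}},
    {"toolResult": {"callId": "call-1", "functionResult": {"name": "get_weather", "result": {"tempC": 4}}}},
    {"toolCall": {"id": "call-2", "googleSearchCall": {"arguments": {"queries": ["oslo weather"]}}}},
]
paired, unanswered = pair_tool_calls(steps)
print(sorted(paired), unanswered)
```

Calls that appear in `unanswered` are the ones a client would still need to execute, for example function calls awaiting a functionResult.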

FunctionResultStep

Result of a function tool call.

Fields

name string

The name of the tool that was called.

isError boolean

Whether the tool call resulted in an error.

result object ( Value )

Required. The result of the tool call.

JSON representation
{ "name": string, "isError": boolean, "result": { object (`Value`) } }

UrlContextResultStep

URL context result step.

Fields

result[] object ( UrlContextResultItem )

Required. The results of the URL context.

isError boolean

Whether the URL context resulted in an error.

JSON representation
{ "result": [ { object (`UrlContextResultItem`) } ], "isError": boolean }

UrlContextResultItem

The result of the URL context.

Fields

url string

The URL that was fetched.

status enum ( Status )

The status of the URL retrieval.

JSON representation
{ "url": string, "status": enum (`Status`) }

GoogleSearchResultStep

Google Search result step.

Fields

result[] object ( GoogleSearchResultItem )

Required. The results of the Google Search.

isError boolean

Whether the Google Search resulted in an error.

JSON representation
{ "result": [ { object (`GoogleSearchResultItem`) } ], "isError": boolean }

GoogleSearchResultItem

The result of the Google Search.

Fields

searchSuggestions string

Web content snippet that can be embedded in a web page or an app webview.

JSON representation
{ "searchSuggestions": string }

McpServerToolResultStep

MCPServer tool result step.

Fields

name string

Name of the tool which is called for this specific tool call.

serverName string

The name of the used MCP server.

result object ( Value )

Required. The output from the MCP server call. Can be simple text or rich content.

JSON representation
{ "name": string, "serverName": string, "result": { object (`Value`) } }

FileSearchResultStep

This type has no fields.

File Search result step.

GoogleMapsResultStep

Google Maps result step.

Fields

result[] object ( GoogleMapsResultItem )

JSON representation
{ "result": [ { object (`GoogleMapsResultItem`) } ] }

GoogleMapsResultItem

The result of the Google Maps.

Fields

places[] object ( GoogleMapsResultPlaces )

widgetContextToken string

JSON representation
{ "places": [ { object (`GoogleMapsResultPlaces`) } ], "widgetContextToken": string }

GoogleMapsResultPlaces

Fields

placeId string

name string

url string

reviewSnippets[] object ( ReviewSnippet )

JSON representation
{ "placeId": string, "name": string, "url": string, "reviewSnippets": [ { object (`ReviewSnippet`) } ] }

UserInputStep

Input provided by the user.

Fields

content Union type

content can be only one of the following:

contentList object ( ContentList )

The content of the step. An array of Content objects.

contentString string

The content of the step. A single string.

JSON representation

{

  // content
  "contentList": {
    object (ContentList)
  },
  "contentString": string
  // Union type
}

ModelOutputStep

Output generated by the model.

Fields

content[] object ( Content )

JSON representation
{ "content": [ { object (`Content`) } ] }

ResponseFormatList

Fields

responseFormats[] object ( ResponseFormat )

JSON representation
{ "responseFormats": [ { object (`ResponseFormat`) } ] }

ResponseFormat

Fields

type Union type

type can be only one of the following:

audio object ( AudioResponseFormat )

text object ( TextResponseFormat )

image object ( ImageResponseFormat )

video object ( VideoResponseFormat )

structValue object ( Struct )

Multi-discriminator values is already enabled in GAOS

JSON representation

{

  // type
  "audio": {
    object (AudioResponseFormat)
  },
  "text": {
    object (TextResponseFormat)
  },
  "image": {
    object (ImageResponseFormat)
  },
  "video": {
    object (VideoResponseFormat)
  },
  "structValue": {
    object (Struct)
  }
  // Union type
}

TextResponseFormat

Configuration for text output format.

Fields

mimeType enum ( MimeType )

The MIME type of the text output.

schema object ( Struct )

The JSON schema that the output should conform to. Only applicable when mimeType is application/json.

JSON representation
{ "mimeType": enum (`MimeType`), "schema": { object (`Struct`) } }

ImageResponseFormat

Configuration for image output format.

Fields

mimeType enum ( MimeType )

The MIME type of the image output.

delivery enum ( Delivery )

The delivery mode for the image output.

aspectRatio enum ( AspectRatio )

The aspect ratio for the image output.

imageSize enum ( ImageSize )

The size of the image output.

JSON representation
{ "mimeType": enum (`MimeType`), "delivery": enum (`Delivery`), "aspectRatio": enum (`AspectRatio`), "imageSize": enum (`ImageSize`) }

VideoResponseFormat

Configuration for video output format.

Fields

delivery enum ( Delivery )

The delivery mode for the video output.

aspectRatio enum ( AspectRatio )

The aspect ratio for the video output.

duration string ( Duration format)

The duration for the video output.

A duration in seconds with up to nine fractional digits, ending with 's'. Example: "3.5s".

JSON representation
{ "delivery": enum (`Delivery`), "aspectRatio": enum (`AspectRatio`), "duration": string }
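
The Duration string format described above (seconds with up to nine fractional digits, ending with 's') can be produced and parsed with a few lines of Python. This is a sketch only; `to_duration_string` and `parse_duration_string` are hypothetical helper names, not part of any SDK.

```python
from decimal import Decimal


def to_duration_string(seconds):
    """Format a number of seconds as a Duration string such as "3.5s"."""
    # Round to nine fractional digits, the maximum the format allows.
    quantized = Decimal(str(seconds)).quantize(Decimal("0.000000001"))
    text = format(quantized, "f").rstrip("0").rstrip(".")
    return text + "s"


def parse_duration_string(text):
    """Parse a Duration string such as "3.5s" into a Decimal number of seconds."""
    if not text.endswith("s"):
        raise ValueError("a Duration string must end with 's'")
    return Decimal(text[:-1])


video_format = {"video": {"duration": to_duration_string(3.5)}}
print(video_format)
```

Using Decimal rather than float avoids binary rounding noise when the value is converted back to text.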

ModelInteraction

Interaction for generating the completion using models.

Fields

model string

The name of the Model used for generating the completion.

generationConfig object ( GenerationConfig )

Input only. Configuration parameters for the model interaction.

JSON representation
{ "model": string, "generationConfig": { object (`GenerationConfig`) } }

GenerationConfig

Configuration parameters for model interactions.

Fields

temperature number

Controls the randomness of the output.

topP number

The maximum cumulative probability of tokens to consider when sampling.

seed integer

Seed used in decoding for reproducibility.

stopSequences[] string

A list of character sequences that will stop output interaction.

thinkingLevel enum ( ThinkingLevel )

The level of thought tokens that the model should generate.

thinkingSummaries enum ( ThinkingSummaries )

Whether to include thought summaries in the response.

maxOutputTokens integer

The maximum number of tokens to include in the response.

speechConfig[] object ( SpeechConfig )

Configuration for speech interaction.

imageConfig (deprecated) object ( ImageConfig )

Configuration for image interaction.

videoConfig object ( VideoConfig )

Configuration for video generation.

tool_choice Union type

The tool choice configuration. tool_choice can be only one of the following:

toolChoiceMode enum ( ToolChoiceType )

The mode of the tool choice.

toolChoiceConfig object ( ToolChoiceConfig )

The config for the tool choice.

JSON representation

{
  "temperature": number,
  "topP": number,
  "seed": integer,
  "stopSequences": [
    string
  ],
  "thinkingLevel": enum (ThinkingLevel),
  "thinkingSummaries": enum (ThinkingSummaries),
  "maxOutputTokens": integer,
  "speechConfig": [
    {
      object (SpeechConfig)
    }
  ],
  "imageConfig": {
    object (ImageConfig)
  },
  "videoConfig": {
    object (VideoConfig)
  },

  // tool_choice
  "toolChoiceMode": enum (ToolChoiceType),
  "toolChoiceConfig": {
    object (ToolChoiceConfig)
  }
  // Union type
}
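
As an illustration of the structure above, the following sketch builds a generationConfig dictionary that contains only the fields a caller sets and rejects payloads that set both members of the tool_choice union. The helper name `build_generation_config` and the sample values are hypothetical; enum-typed fields such as thinkingLevel are left out because their allowed values are not listed here.

```python
def build_generation_config(*, temperature=None, top_p=None, seed=None,
                            stop_sequences=None, max_output_tokens=None,
                            tool_choice_mode=None, tool_choice_config=None):
    """Return a generationConfig dict containing only the fields that were set."""
    if tool_choice_mode is not None and tool_choice_config is not None:
        raise ValueError("tool_choice is a union: set toolChoiceMode or toolChoiceConfig, not both")
    candidates = {
        "temperature": temperature,
        "topP": top_p,
        "seed": seed,
        "stopSequences": stop_sequences,
        "maxOutputTokens": max_output_tokens,
        "toolChoiceMode": tool_choice_mode,
        "toolChoiceConfig": tool_choice_config,
    }
    # Drop unset fields so the server-side defaults apply.
    return {key: value for key, value in candidates.items() if value is not None}


config = build_generation_config(temperature=0.2, seed=7, stop_sequences=["END"], max_output_tokens=256)
print(config)
```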

ToolChoiceConfig

The tool choice configuration containing allowed tools.

Fields

allowedTools object ( AllowedTools )

The allowed tools.

JSON representation
{ "allowedTools": { object (`AllowedTools`) } }

SpeechConfig

The configuration for speech interaction.

Fields

voice string

The voice of the speaker.

language string

The language of the speech.

speaker string

The speaker's name, it should match the speaker name given in the prompt.

JSON representation
{ "voice": string, "language": string, "speaker": string }

ImageConfig

The configuration for image interaction.

Fields

aspectRatio string

The aspect ratio of the image to generate. Supported aspect ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 9:16, 16:9, 21:9.

If not specified, the model will choose a default aspect ratio based on any reference images provided.

imageSize string

Specifies the size of generated images. Supported values are 1K, 2K, 4K. If not specified, the model will use the default value 1K.

JSON representation
{ "aspectRatio": string, "imageSize": string }

VideoConfig

Configuration options for video generation.

Fields

task enum ( Task )

Optional task mode for video generation. If not specified, the model automatically determines the appropriate mode based on the provided text prompt and input media.

JSON representation
{ "task": enum (`Task`) }

EnvironmentConfig

Configuration for a custom environment.

Fields

sources[] object ( Source )

environmentId string

Optional. The environment ID for the interaction. If specified, the request will update the existing environment instead of creating a new one.

network Union type

Network configuration for the environment. network can be only one of the following:

networkAllowlist object ( EnvironmentNetworkEgressAllowlist )

Allow only specific domains.

networkMode enum ( NetworkMode )

Network egress mode.

JSON representation

{
  "sources": [
    {
      object (Source)
    }
  ],
  "environmentId": string,

  // network
  "networkAllowlist": {
    object (EnvironmentNetworkEgressAllowlist)
  },
  "networkMode": enum (NetworkMode)
  // Union type
}

EnvironmentNetworkEgressAllowlist

Network egress configuration for the environment.

Fields

allowlist[] object ( EgressRule )

List of allowed domains and their configurations.

JSON representation
{ "allowlist": [ { object (`EgressRule`) } ] }

EgressRule

A network egress rule that controls which external domains the environment is allowed to reach. Each rule identifies a target domain and, optionally, a set of HTTP headers to inject into every matching outbound request.

Fields

domain string

The domain pattern to match for this rule. Use an exact hostname (e.g., github.com), a wildcard prefix (e.g., *.googleapis.com), or * to match all domains.

transform map (key: string, value: string)

Headers to inject into requests matching this rule. Key: header name (e.g., "Authorization"). Value: header value (e.g., "Bearer your-token").

An object containing a list of "key": value pairs. Example: { "name": "wrench", "mass": "1.3kg", "count": "3" }.

JSON representation
{ "domain": string, "transform": { string: string, ... } }
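
To make the three domain pattern forms concrete, here is a small sketch of how a client could test a hostname against a pattern locally. The exact matching semantics used by the service are not specified in this reference: the sketch assumes that matching is case-insensitive and that a wildcard prefix matches any subdomain but not the bare domain. `domain_matches` is a hypothetical helper.

```python
def domain_matches(pattern, host):
    """Check a hostname against an EgressRule domain pattern (assumed semantics)."""
    pattern = pattern.lower()
    host = host.lower()
    if pattern == "*":
        return True  # matches all domains
    if pattern.startswith("*."):
        suffix = pattern[1:]  # e.g. ".googleapis.com"
        # Assumption: the wildcard needs at least one label before the suffix.
        return host.endswith(suffix) and host != suffix
    return host == pattern  # exact hostname


rule = {"domain": "*.googleapis.com", "transform": {"Authorization": "Bearer your-token"}}
print(domain_matches(rule["domain"], "storage.googleapis.com"))
print(domain_matches(rule["domain"], "googleapis.com"))
```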

Source

A source to be mounted into the environment.

Fields

type enum ( Type )

source string

The source of the environment. For GCS, this is the GCS path. For GitHub, this is the GitHub path.

target string

Where the source should appear in the environment.

content string

The inline content if type is INLINE.

encoding string

Optional encoding for inline content (e.g., base64).

JSON representation
{ "type": enum (`Type`), "source": string, "target": string, "content": string, "encoding": string }
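
A short sketch of an inline Source follows, with the content base64-encoded so that arbitrary bytes survive JSON transport. The helper name `inline_source` and the target path are hypothetical; the `INLINE` type and the `base64` encoding value are taken from the field descriptions above.

```python
import base64


def inline_source(target, data):
    """Build a Source dict that mounts `data` (bytes) inline at `target`."""
    return {
        "type": "INLINE",
        "target": target,
        "content": base64.b64encode(data).decode("ascii"),
        "encoding": "base64",
    }


# "/workspace/hello.txt" is a placeholder path.
source = inline_source("/workspace/hello.txt", b"hello\n")
print(source["content"])
```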

LocalEnvironmentConfig

This type has no fields.

Configuration for an environment that lives on the client connection rather than in a server-managed sandbox.

When set (via Interaction.local_environment), the agent's filesystem and shell are treated as living on the client: the agent's built-in environment operations (e.g., reading, listing, and editing files and running commands) are suspended on the server and yielded back to the client to execute, with their results returned on a subsequent turn. This is mutually exclusive with a server-managed EnvironmentConfig (remoteEnvironment), since the environment is either on the client or in a server sandbox, never both.

This governs only the agent's built-in environment. Client-declared function tools are always executed on the client regardless of this field.

Tool

A tool that can be used by the model.

Fields

type Union type

The tool to use. type can be only one of the following:

function object ( Function )

A function that can be used by the model.

codeExecution object ( CodeExecution )

A tool that can be used by the model to execute code.

urlContext object ( UrlContext )

A tool that can be used by the model to fetch URL context.

computerUse object ( ComputerUse )

Tool to support the model interacting directly with the computer.

mcpServer object ( McpServer )

An MCPServer is a server that can be called by the model to perform actions.

googleSearch object ( GoogleSearch )

A tool that can be used by the model to search Google.

fileSearch object ( FileSearch )

A tool that can be used by the model to search files.

googleMaps object ( GoogleMaps )

A tool that can be used by the model to search Google Maps.

retrieval object ( Retrieval )

A tool that can be used by the model to retrieve files.

JSON representation

{

  // type
  "function": {
    object (Function)
  },
  "codeExecution": {
    object (CodeExecution)
  },
  "urlContext": {
    object (UrlContext)
  },
  "computerUse": {
    object (ComputerUse)
  },
  "mcpServer": {
    object (McpServer)
  },
  "googleSearch": {
    object (GoogleSearch)
  },
  "fileSearch": {
    object (FileSearch)
  },
  "googleMaps": {
    object (GoogleMaps)
  },
  "retrieval": {
    object (Retrieval)
  }
  // Union type
}

Function

A tool that can be used by the model.

Fields

name string

The name of the function.

description string

A description of the function.

parameters object ( Value )

The JSON Schema for the function's parameters.

JSON representation
{ "name": string, "description": string, "parameters": { object (`Value`) } }
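
As a sketch of a function declaration, the following wraps a name, a description, and a JSON Schema for the parameters in the `function` member of the Tool union. The `function_tool` helper and the `get_weather` function are hypothetical examples.

```python
import json


def function_tool(name, description, parameters_schema):
    """Wrap a function declaration in the Tool union under its `function` member."""
    return {
        "function": {
            "name": name,
            "description": description,
            "parameters": parameters_schema,
        }
    }


# A made-up function with a single required string parameter.
get_weather = function_tool(
    "get_weather",
    "Look up the current weather for a city.",
    {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
)
tools = [get_weather]
print(json.dumps(tools, indent=2))
```

The resulting list has the shape expected by the tools[] field of an Interaction.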

UrlContext

This type has no fields.

A tool that can be used by the model to fetch URL context.

McpServer

An MCPServer is a server that can be called by the model to perform actions.

Fields

name string

The name of the MCPServer.

url string

The full URL for the MCPServer endpoint. Example: "https://api.example.com/mcp"

headers map (key: string, value: string)

Optional: Fields for authentication headers, timeouts, etc., if needed.

An object containing a list of "key": value pairs. Example: { "name": "wrench", "mass": "1.3kg", "count": "3" }.

allowedTools[] object ( AllowedTools )

The allowed tools.

JSON representation
{ "name": string, "url": string, "headers": { string: string, ... }, "allowedTools": [ { object (`AllowedTools`) } ] }

GoogleSearch

A tool that can be used by the model to search Google.

Fields

searchTypes[] enum ( SearchType )

The types of search grounding to enable.

JSON representation
{ "searchTypes": [ enum (`SearchType`) ] }

FileSearch

A tool that can be used by the model to search files.

Fields

fileSearchStoreNames[] string

The file search store names to search.

topK integer

The number of semantic retrieval chunks to retrieve.

metadataFilter string

Metadata filter to apply to the semantic retrieval documents and chunks.

JSON representation
{ "fileSearchStoreNames": [ string ], "topK": integer, "metadataFilter": string }

GoogleMaps

A tool that can be used by the model to call Google Maps.

Fields

enableWidget boolean

Whether to return a widget context token in the tool call result of the response.

latitude number

The latitude of the user's location.

longitude number

The longitude of the user's location.

JSON representation
{ "enableWidget": boolean, "latitude": number, "longitude": number }

Usage

Statistics on the interaction request's token usage.

Fields

totalInputTokens integer

Number of tokens in the prompt (context).

inputTokensByModality[] object ( ModalityTokens )

A breakdown of input token usage by modality.

totalCachedTokens integer

Number of tokens in the cached part of the prompt (the cached content).

cachedTokensByModality[] object ( ModalityTokens )

A breakdown of cached token usage by modality.

totalOutputTokens integer

Total number of tokens across all the generated responses.

outputTokensByModality[] object ( ModalityTokens )

A breakdown of output token usage by modality.

totalToolUseTokens integer

Number of tokens present in tool-use prompt(s).

toolUseTokensByModality[] object ( ModalityTokens )

A breakdown of tool-use token usage by modality.

totalThoughtTokens integer

Number of tokens of thoughts for thinking models.

totalTokens integer

Total token count for the interaction request (prompt + responses + other internal tokens).

groundingToolCount[] object ( GroundingToolCount )

Grounding tool count.

JSON representation

{
  "totalInputTokens": integer,
  "inputTokensByModality": [
    {
      object (ModalityTokens)
    }
  ],
  "totalCachedTokens": integer,
  "cachedTokensByModality": [
    {
      object (ModalityTokens)
    }
  ],
  "totalOutputTokens": integer,
  "outputTokensByModality": [
    {
      object (ModalityTokens)
    }
  ],
  "totalToolUseTokens": integer,
  "toolUseTokensByModality": [
    {
      object (ModalityTokens)
    }
  ],
  "totalThoughtTokens": integer,
  "totalTokens": integer,
  "groundingToolCount": [
    {
      object (GroundingToolCount)
    }
  ]
}
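
As a rough illustration of how these counters fit together, the sketch below reads a Usage object that has already been parsed from JSON into a Python dict. The helper names `tokens_by_modality` and `unaccounted_tokens` are hypothetical, the sample numbers are invented, and treating the remainder of totalTokens as "other internal tokens" is an assumption based on the totalTokens description rather than documented arithmetic.

```python
from collections import defaultdict


def tokens_by_modality(usage: dict, field: str) -> dict:
    """Sum the `tokens` of every ModalityTokens entry in one breakdown field."""
    totals = defaultdict(int)
    for entry in usage.get(field, []):
        modality = entry.get("modality", "RESPONSE_MODALITY_UNSPECIFIED")
        totals[modality] += entry.get("tokens", 0)
    return dict(totals)


def unaccounted_tokens(usage: dict) -> int:
    """totalTokens minus the input, output, tool-use and thought totals.

    Assumption: whatever is left corresponds to "other internal tokens".
    """
    known = sum(
        usage.get(key, 0)
        for key in (
            "totalInputTokens",
            "totalOutputTokens",
            "totalToolUseTokens",
            "totalThoughtTokens",
        )
    )
    return usage.get("totalTokens", 0) - known


# Hypothetical values, for illustration only.
example_usage = {
    "totalInputTokens": 120,
    "inputTokensByModality": [
        {"modality": "TEXT", "tokens": 100},
        {"modality": "IMAGE", "tokens": 20},
    ],
    "totalOutputTokens": 40,
    "totalThoughtTokens": 10,
    "totalTokens": 170,
}

print(tokens_by_modality(example_usage, "inputTokensByModality"))
print(unaccounted_tokens(example_usage))
```

Missing fields are read as zero, so the helpers also work on partial Usage objects.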

ModalityTokens

The token count for a single response modality.

Fields

modality enum ( ResponseModality )

The modality associated with the token count.

tokens integer

Number of tokens for the modality.

JSON representation
{ "modality": enum (`ResponseModality`), "tokens": integer }

GroundingToolCount

The number of times a grounding tool of a given type was used.

Fields

type enum ( Type )

The grounding tool type associated with the count.

count integer

The number of times the grounding tool was used.

JSON representation
{ "type": enum (`Type`), "count": integer }

WebhookConfig

Message for configuring webhook events for a request.

Fields

uris[] string

Optional. If set, these webhook URIs will be used for webhook events instead of the registered webhooks.

userMetadata object ( Struct format)

Optional. The user metadata that will be returned on each event emission to the webhooks.

JSON representation
{ "uris": [ string ], "userMetadata": { object } }

SafetySetting

A safety setting that affects the safety-blocking behavior.

A SafetySetting consists of a harm category and a threshold for that category.

Fields

type enum ( HarmCategory )

Required. The type of harm category to be blocked.

threshold enum ( HarmBlockThreshold )

Required. The threshold for blocking content. If the harm probability exceeds this threshold, the content will be blocked.

method enum ( HarmBlockMethod )

Optional. The method for blocking content. If not specified, the default behavior is to use the probability score.

JSON representation
{ "type": enum (`HarmCategory`), "threshold": enum (`HarmBlockThreshold`), "method": enum (`HarmBlockMethod`) }
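
A minimal sketch of assembling a SafetySetting on the client side, checking the threshold and method against the HarmBlockThreshold and HarmBlockMethod values listed on this page. The helper name `make_safety_setting` is hypothetical, and the `HARM_CATEGORY_HARASSMENT` value in the usage line is an assumption, since the HarmCategory enum is not listed in this section.

```python
from typing import Optional

# Values copied from the HarmBlockThreshold and HarmBlockMethod tables on this page.
HARM_BLOCK_THRESHOLDS = {
    "BLOCK_LOW_AND_ABOVE",
    "BLOCK_MEDIUM_AND_ABOVE",
    "BLOCK_ONLY_HIGH",
    "BLOCK_NONE",
    "OFF",
}
HARM_BLOCK_METHODS = {"SEVERITY", "PROBABILITY"}


def make_safety_setting(harm_type: str, threshold: str, method: Optional[str] = None) -> dict:
    """Build a SafetySetting dict; `type` and `threshold` are required, `method` is optional."""
    if threshold not in HARM_BLOCK_THRESHOLDS:
        raise ValueError(f"unknown HarmBlockThreshold: {threshold}")
    setting = {"type": harm_type, "threshold": threshold}
    if method is not None:
        if method not in HARM_BLOCK_METHODS:
            raise ValueError(f"unknown HarmBlockMethod: {method}")
        setting["method"] = method
    return setting


# The category name below is an assumed HarmCategory value.
print(make_safety_setting("HARM_CATEGORY_HARASSMENT", "BLOCK_ONLY_HIGH"))
```

Leaving `method` out of the dict relies on the documented default, which uses the probability score.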

Delivery

Delivery mode for audio output.

Enums
`DELIVERY_UNSPECIFIED`	Default value. This value is unused.
`INLINE`	Audio data is returned inline in the response.
`URI`	Audio data is returned as a URI.

Environment

Represents the environment being operated, such as a web browser.

Enums
`ENVIRONMENT_UNSPECIFIED`	Defaults to browser.
`BROWSER`	Operates in a web browser.
`MOBILE`	Operates in a mobile environment.
`DESKTOP`	Operates in a desktop environment.

HarmBlockMethod

The method for blocking content.

Enums
`HARM_BLOCK_METHOD_UNSPECIFIED`	The harm block method is unspecified.
`SEVERITY`	The harm block method uses both probability and severity scores.
`PROBABILITY`	The harm block method uses the probability score.

HarmBlockThreshold

Thresholds for blocking content based on harm probability.

Enums
`HARM_BLOCK_THRESHOLD_UNSPECIFIED`	The harm block threshold is unspecified.
`BLOCK_LOW_AND_ABOVE`	Block content with a low harm probability or higher.
`BLOCK_MEDIUM_AND_ABOVE`	Block content with a medium harm probability or higher.
`BLOCK_ONLY_HIGH`	Block content with a high harm probability.
`BLOCK_NONE`	Do not block any content, regardless of its harm probability.
`OFF`	Turn off the safety filter entirely.

ImageSize

Supported image sizes for image output.

Enums
`IMAGE_SIZE_UNSPECIFIED`	Default value. This value is unused.
`IMAGE_SIZE_FIVE_TWELVE`	512px image size.
`IMAGE_SIZE_ONE_K`	1K image size.
`IMAGE_SIZE_TWO_K`	2K image size.
`IMAGE_SIZE_FOUR_K`	4K image size.

Language

Supported programming languages for the generated code.

Enums
`LANGUAGE_UNSPECIFIED`	Unspecified language. This value should not be used.
`PYTHON`	Python >= 3.10, with numpy and simpy available.

MediaResolution

Resolution for input media (images/video).

Enums
`MEDIA_RESOLUTION_UNSPECIFIED`	Default value. This value is unused.
`LOW`	Low resolution.
`MEDIUM`	Medium resolution.
`HIGH`	High resolution.
`ULTRA_HIGH`	Ultra high resolution.

MimeType

Enums
`TYPE_UNSPECIFIED`
`TYPE_WAV`	WAV audio format
`TYPE_MP3`	MP3 audio format
`TYPE_AIFF`	AIFF audio format
`TYPE_AAC`	AAC audio format
`TYPE_OGG`	OGG audio format
`TYPE_FLAC`	FLAC audio format
`TYPE_MPEG`	MPEG audio format
`TYPE_M4A`	M4A audio format
`TYPE_L16`	L16 audio format
`TYPE_OPUS`	OPUS audio format
`TYPE_ALAW`	ALAW audio format
`TYPE_MULAW`	MULAW audio format

Mode

Defines the depth and thoroughness of the find session.

Enums
`MODE_UNSPECIFIED`	Default value. This value is unused.
`MODE_SCAN`	Fast scan using only the initial classifier.
`MODE_VERIFY`	Performs classification followed by detailed investigation.

NetworkMode

Network egress mode for non-allowlist configurations.

Enums
`NETWORK_MODE_UNSPECIFIED`	Default value. Unused.
`DISABLED`	All network egress is blocked.

ResponseModality

The modality of the response.

Enums
`RESPONSE_MODALITY_UNSPECIFIED`	Default value. This value is unused.
`TEXT`	Indicates the model should return text.
`IMAGE`	Indicates the model should return images.
`AUDIO`	Indicates the model should return audio.
`VIDEO`	Indicates the model should return video.
`DOCUMENT`	Indicates the model should return documents.

ReviewSnippet

Encapsulates a snippet of a user review that answers a question about the features of a specific place in Google Maps.

Fields

title string

Title of the review.

url string

A link that corresponds to the user review on Google Maps.

reviewId string

The ID of the review snippet.

JSON representation
{ "title": string, "url": string, "reviewId": string }

SafetyPolicy

Enums
`SAFETY_POLICY_UNSPECIFIED`	Unspecified safety policy.
`FINANCIAL_TRANSACTIONS`	Safety policy for financial transactions.
`SENSITIVE_DATA_MODIFICATION`	Safety policy for sensitive data modification.
`COMMUNICATION_TOOL`	Safety policy for communication tools (e.g., Gmail, Chat, Meet).
`ACCOUNT_CREATION`	Safety policy for account creation.
`DATA_MODIFICATION`	Safety policy for data modification.
`USER_CONSENT_MANAGEMENT`	Safety policy for user consent management.
`LEGAL_TERMS_AND_AGREEMENTS`	Safety policy for legal terms and agreements.

Schema

The Schema object allows the definition of input and output data types. These types can be objects, but also primitives and arrays. Represents a select subset of an OpenAPI 3.0 schema object .

Fields

type enum ( Type )

Required. Data type.

format string

Optional. The format of the data. Any value is allowed, but most do not trigger any special functionality.

title string

Optional. The title of the schema.

description string

Optional. A brief description of the parameter. This could contain examples of use. Parameter description may be formatted as Markdown.

nullable boolean

Optional. Indicates if the value may be null.

enum[] string

Optional. Possible values of the element of Type.STRING with enum format. For example, an enum Direction can be defined as: {type:STRING, format:enum, enum:["EAST", "NORTH", "SOUTH", "WEST"]}

maxItems string ( int64 format)

Optional. Maximum number of the elements for Type.ARRAY.

minItems string ( int64 format)

Optional. Minimum number of the elements for Type.ARRAY.

properties map (key: string, value: object ( Schema ))

Optional. Properties of Type.OBJECT.

An object containing a list of "key": value pairs. Example: { "name": "wrench", "mass": "1.3kg", "count": "3" } .

required[] string

Optional. Required properties of Type.OBJECT.

minProperties string ( int64 format)

Optional. Minimum number of the properties for Type.OBJECT.

maxProperties string ( int64 format)

Optional. Maximum number of the properties for Type.OBJECT.

minLength string ( int64 format)

Optional. Minimum length of the Type.STRING.

maxLength string ( int64 format)

Optional. Maximum length of the Type.STRING

pattern string

Optional. Pattern of the Type.STRING to restrict a string to a regular expression.

example value ( Value format)

Optional. Example of the object. Will only be populated when the object is the root.

anyOf[] object ( Schema )

Optional. The value should be validated against any (one or more) of the subschemas in the list.

propertyOrdering[] string

Optional. The order of the properties. Not a standard field in the OpenAPI spec. Used to determine the order of the properties in the response.

default value ( Value format)

Optional. Default value of the field. Per JSON Schema, this field is intended for documentation generators and doesn't affect validation. Thus it's included here and ignored so that developers who send schemas with a default field don't get unknown-field errors.

items object ( Schema )

Optional. Schema of the elements of Type.ARRAY.

minimum number

Optional. Minimum value of the Type.INTEGER and Type.NUMBER.

maximum number

Optional. Maximum value of the Type.INTEGER and Type.NUMBER

JSON representation

{
  "type": enum (Type),
  "format": string,
  "title": string,
  "description": string,
  "nullable": boolean,
  "enum": [
    string
  ],
  "maxItems": string,
  "minItems": string,
  "properties": {
    string: {
      object (Schema)
    },
    ...
  },
  "required": [
    string
  ],
  "minProperties": string,
  "maxProperties": string,
  "minLength": string,
  "maxLength": string,
  "pattern": string,
  "example": value,
  "anyOf": [
    {
      object (Schema)
    }
  ],
  "propertyOrdering": [
    string
  ],
  "default": value,
  "items": {
    object (Schema)
  },
  "minimum": number,
  "maximum": number
}
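
To make the field descriptions concrete, the sketch below writes the Direction enum from the enum[] description as a Schema, nests it in an object Schema, and checks values against a small subset of the fields (type, nullable, enum, properties, required, items). The `validate` helper and the `move` schema are hypothetical illustrations; this is a client-side approximation, not how the service itself validates.

```python
def validate(value, schema: dict) -> bool:
    """Check `value` against a small subset of the Schema fields."""
    kind = schema.get("type")
    if value is None:
        return bool(schema.get("nullable")) or kind == "NULL"
    if kind == "STRING":
        return isinstance(value, str) and ("enum" not in schema or value in schema["enum"])
    if kind == "INTEGER":
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, int) and not isinstance(value, bool)
    if kind == "NUMBER":
        return isinstance(value, (int, float)) and not isinstance(value, bool)
    if kind == "BOOLEAN":
        return isinstance(value, bool)
    if kind == "ARRAY":
        items = schema.get("items")
        return isinstance(value, list) and all(
            items is None or validate(element, items) for element in value
        )
    if kind == "OBJECT":
        if not isinstance(value, dict):
            return False
        if any(name not in value for name in schema.get("required", [])):
            return False
        properties = schema.get("properties", {})
        return all(
            validate(item, properties[key]) for key, item in value.items() if key in properties
        )
    return False


direction = {"type": "STRING", "format": "enum", "enum": ["EAST", "NORTH", "SOUTH", "WEST"]}
move = {
    "type": "OBJECT",
    "properties": {"direction": direction, "steps": {"type": "INTEGER"}},
    "required": ["direction"],
    "propertyOrdering": ["direction", "steps"],
}

print(validate({"direction": "EAST", "steps": 3}, move))
print(validate({"direction": "UP"}, move))
```

Properties that are not declared in `properties` are ignored here; a stricter check would be a separate design choice.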

Type

Type contains the list of OpenAPI data types as defined by https://spec.openapis.org/oas/v3.0.3#data-types

Enums
`TYPE_UNSPECIFIED`	Not specified, should not be used.
`STRING`	String type.
`NUMBER`	Number type.
`INTEGER`	Integer type.
`BOOLEAN`	Boolean type.
`ARRAY`	Array type.
`OBJECT`	Object type.
`NULL`	Null type.

SearchType

The types of search grounding to enable.

Enums
`SEARCH_TYPE_UNSPECIFIED`	Unspecified search type. This value should not be used.
`SEARCH_TYPE_WEB_SEARCH`	Setting this field enables web search. Only text results are returned.
`SEARCH_TYPE_IMAGE_SEARCH`	Setting this field enables image search. Image bytes are returned.

Struct

Struct represents a structured data value, consisting of fields which map to dynamically typed values.

Fields

fields[] object ( Field )

Dynamically typed fields. List instead of map because LLMs are sensitive to ordering, and we want to give users full control.

JSON representation
{ "fields": [ { object (`Field`) } ] }

Field

Represents a single field in a struct.

Fields

name string

value object ( Value )

JSON representation
{ "name": string, "value": { object (`Value`) } }

Task

Supported video generation tasks.

Enums
`TASK_UNSPECIFIED`	Unspecified task. The task is inferred from the input prompt and media.
`TEXT_TO_VIDEO`	Generates video solely from a text prompt.
`IMAGE_TO_VIDEO`	Generates video from one or two source images. The first image defines the starting frame, and the optional second image defines the ending frame.
`REFERENCE_TO_VIDEO`	Generates video using reference media (such as images, audio, or video).
`EDIT`	Modifies an existing input video.

ThinkingLevel

The level of thought tokens that the model should generate.

Enums
`THINKING_LEVEL_UNSPECIFIED`	Default value. This value is unused.
`THINKING_LEVEL_MINIMAL`	Little to no thinking.
`THINKING_LEVEL_LOW`	Low thinking level.
`THINKING_LEVEL_MEDIUM`	Medium thinking level.
`THINKING_LEVEL_HIGH`	High thinking level.

ThinkingSummaries

Whether to include thought summaries in the response.

Enums
`THINKING_SUMMARIES_UNSPECIFIED`	Default value. This value is unused.
`THINKING_SUMMARIES_AUTO`	Auto thinking summaries.
`THINKING_SUMMARIES_NONE`	No thinking summaries.

Tool

Tool details that the model may use to generate response.

A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model.

Fields

functionDeclarations[] object ( FunctionDeclaration )

Optional. A list of FunctionDeclarations available to the model that can be used for function calling.

The model or system does not execute the function. Instead, the defined function may be returned as a FunctionCall with arguments to the client side for execution. The model may decide to call a subset of these functions by populating FunctionCall in the response. The next conversation turn may contain a FunctionResponse with the Content.role "function", which provides generation context for the next model turn.

googleSearchRetrieval object ( GoogleSearchRetrieval )

Optional. Retrieval tool that is powered by Google search.

codeExecution object ( CodeExecution )

Optional. Enables the model to execute code as part of generation.

googleSearch object ( GoogleSearch )

Optional. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google.

computerUse object ( ComputerUse )

Optional. Tool to support the model interacting directly with the computer. If enabled, it automatically populates computer-use specific Function Declarations.

urlContext object ( UrlContext )

Optional. Tool to support URL context retrieval.

fileSearch object ( FileSearch )

Optional. FileSearch tool type. Tool to retrieve knowledge from Semantic Retrieval corpora.

mcpServers[] object ( McpServer )

Optional. MCP Servers to connect to.

googleMaps object ( GoogleMaps )

Optional. Tool that allows grounding the model's response with geospatial context related to the user's query.

JSON representation

{
  "functionDeclarations": [
    {
      object (FunctionDeclaration)
    }
  ],
  "googleSearchRetrieval": {
    object (GoogleSearchRetrieval)
  },
  "codeExecution": {
    object (CodeExecution)
  },
  "googleSearch": {
    object (GoogleSearch)
  },
  "computerUse": {
    object (ComputerUse)
  },
  "urlContext": {
    object (UrlContext)
  },
  "fileSearch": {
    object (FileSearch)
  },
  "mcpServers": [
    {
      object (McpServer)
    }
  ],
  "googleMaps": {
    object (GoogleMaps)
  }
}

FunctionDeclaration

Structured representation of a function declaration as defined by the OpenAPI 3.0.3 specification. Included in this declaration are the function name and parameters. This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client.

Fields

name string

Required. The name of the function. Must be a-z, A-Z, 0-9, or contain underscores, colons, dots, and dashes, with a maximum length of 128.

description string

Required. A brief description of the function.

behavior enum ( Behavior )

Optional. Specifies the function Behavior. Currently only supported by the BidiGenerateContent method.

parameters object ( Schema )

Optional. Describes the parameters to this function. Reflects the OpenAPI 3.0.3 Parameter Object. Key (string): the name of the parameter; parameter names are case sensitive. Value (Schema): the Schema defining the type used for the parameter.

parametersJsonSchema value ( Value format)

Optional. Describes the parameters to the function in JSON Schema format. The schema must describe an object where the properties are the parameters to the function. For example:

{
  "type": "object",
  "properties": {
    "name": { "type": "string" },
    "age": { "type": "integer" }
  },
  "additionalProperties": false,
  "required": ["name", "age"],
  "propertyOrdering": ["name", "age"]
}

This field is mutually exclusive with parameters .

response object ( Schema )

Optional. Describes the output from this function in JSON Schema format. Reflects the OpenAPI 3.0.3 Response Object. The Schema defines the type used for the response value of the function.

responseJsonSchema value ( Value format)

Optional. Describes the output from this function in JSON Schema format. The value specified by the schema is the response value of the function.

This field is mutually exclusive with response .

JSON representation

{
  "name": string,
  "description": string,
  "behavior": enum (Behavior),
  "parameters": {
    object (Schema)
  },
  "parametersJsonSchema": value,
  "response": {
    object (Schema)
  },
  "responseJsonSchema": value
}
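
The naming rule and the mutual exclusivity of parameters and parametersJsonSchema can be checked before a request is sent. The sketch below is one reading of those rules; the helper `make_function_declaration`, the regular expression, and the `get_person` function are hypothetical, and the service may enforce additional constraints.

```python
import re

# One reading of the naming rule: letters, digits, underscores, colons,
# dots and dashes, with at most 128 characters.
_NAME_RE = re.compile(r"[A-Za-z0-9_:.\-]{1,128}")


def make_function_declaration(name, description, parameters=None, parameters_json_schema=None):
    """Build a FunctionDeclaration dict with basic client-side checks."""
    if not _NAME_RE.fullmatch(name):
        raise ValueError(f"invalid function name: {name!r}")
    if parameters is not None and parameters_json_schema is not None:
        raise ValueError("parameters and parametersJsonSchema are mutually exclusive")
    declaration = {"name": name, "description": description}
    if parameters is not None:
        declaration["parameters"] = parameters
    if parameters_json_schema is not None:
        declaration["parametersJsonSchema"] = parameters_json_schema
    return declaration


# JSON Schema copied from the parametersJsonSchema example above.
person_schema = {
    "type": "object",
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
    "additionalProperties": False,
    "required": ["name", "age"],
    "propertyOrdering": ["name", "age"],
}

declaration = make_function_declaration(
    "get_person", "Looks up a person by name and age.", parameters_json_schema=person_schema
)
print(declaration["name"])
```

The resulting dict can be placed in the functionDeclarations[] list of a Tool.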

Behavior

Defines the function behavior. Defaults to BLOCKING .

Enums
`UNSPECIFIED`	This value is unused.
`BLOCKING`	If set, the system will wait to receive the function response before continuing the conversation.
`NON_BLOCKING`	If set, the system will not wait to receive the function response. Instead, it will attempt to handle function responses as they become available while maintaining the conversation between the user and the model.

GoogleSearchRetrieval

Tool to retrieve public web data for grounding, powered by Google.

Fields

dynamicRetrievalConfig object ( DynamicRetrievalConfig )

Specifies the dynamic retrieval configuration for the given source.

JSON representation
{ "dynamicRetrievalConfig": { object (`DynamicRetrievalConfig`) } }

DynamicRetrievalConfig

Describes the options to customize dynamic retrieval.

Fields

mode enum ( Mode )

The mode of the predictor to be used in dynamic retrieval.

dynamicThreshold number

The threshold to be used in dynamic retrieval. If not set, a system default value is used.

JSON representation
{ "mode": enum (`Mode`), "dynamicThreshold": number }

Mode

The mode of the predictor to be used in dynamic retrieval.

Enums
`MODE_UNSPECIFIED`	Always trigger retrieval.
`MODE_DYNAMIC`	Run retrieval only when system decides it is necessary.

CodeExecution

This type has no fields.

Tool that executes code generated by the model, and automatically returns the result to the model.

See also ExecutableCode and CodeExecutionResult which are only generated when using this tool.

GoogleSearch

GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google.

Fields

timeRangeFilter object ( Interval )

Optional. Filter search results to a specific time range. If customers set a start time, they must set an end time (and vice versa).

searchTypes object ( SearchTypes )

Optional. The set of search types to enable. If not set, web search is enabled by default.

JSON representation
{ "timeRangeFilter": { object (`Interval`) }, "searchTypes": { object (`SearchTypes`) } }

Interval

Represents a time interval, encoded as a Timestamp start (inclusive) and a Timestamp end (exclusive).

The start must be less than or equal to the end. When the start equals the end, the interval is empty (matches no time). When both start and end are unspecified, the interval matches any time.

Fields

startTime string ( Timestamp format)

Optional. Inclusive start of the interval.

If specified, a Timestamp matching this interval will have to be the same or after the start.

Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z" , "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30" .

endTime string ( Timestamp format)

Optional. Exclusive end of the interval.

If specified, a Timestamp matching this interval will have to be before the end.

JSON representation
{ "startTime": string, "endTime": string }
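
The interval semantics above (inclusive start, exclusive end, empty when the two are equal, unbounded when a bound is unset) can be sketched with the standard library. The helper names `to_timestamp`, `make_interval`, and `interval_contains` are hypothetical; `to_timestamp` truncates to whole seconds, which corresponds to the zero-fractional-digit form.

```python
from datetime import datetime, timedelta, timezone


def to_timestamp(moment: datetime) -> str:
    """Format an aware datetime as a Z-normalized RFC 3339 string (whole seconds)."""
    if moment.tzinfo is None:
        raise ValueError("datetime must be timezone-aware")
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def make_interval(start: datetime, end: datetime) -> dict:
    """Build an Interval dict; the start must be less than or equal to the end."""
    if start > end:
        raise ValueError("start must be less than or equal to end")
    return {"startTime": to_timestamp(start), "endTime": to_timestamp(end)}


def interval_contains(start, end, moment) -> bool:
    """Start is inclusive, end is exclusive; a bound of None is treated as unset."""
    if start is not None and moment < start:
        return False
    if end is not None and moment >= end:
        return False
    return True


start = datetime(2014, 10, 2, 15, 1, 23, tzinfo=timezone.utc)
end = start + timedelta(days=1)
print(make_interval(start, end))

# An offset other than "Z" is normalized to UTC on output.
offset = timezone(timedelta(hours=5, minutes=30))
print(to_timestamp(datetime(2014, 10, 2, 15, 1, 23, tzinfo=offset)))
```

Because the end is exclusive, `interval_contains(start, start, start)` is false, matching the rule that an interval whose start equals its end is empty.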

SearchTypes

Different types of search that can be enabled on the GoogleSearch tool.

Fields

webSearch object ( WebSearch )

Optional. Enables web search. Only text results are returned.

imageSearch object ( ImageSearch )

Optional. Enables image search. Image bytes are returned.

JSON representation
{ "webSearch": { object (`WebSearch`) }, "imageSearch": { object (`ImageSearch`) } }

WebSearch

This type has no fields.

Standard web search for grounding and related configurations.

ImageSearch

This type has no fields.

Image search for grounding and related configurations.

ComputerUse

Computer Use tool type.

Fields

environment enum ( Environment )

Required. The environment being operated.

excludedPredefinedFunctions[] string

Optional. By default, predefined functions are included in the final model call. Some of them can be explicitly excluded from being automatically included. This can serve two purposes: 1. Using a more restricted / different action space. 2. Improving the definitions / instructions of predefined functions.

enablePromptInjectionDetection boolean

Optional. Whether to enable the prompt injection detection check on computer-use requests.

disabledSafetyPolicies[] enum ( SafetyPolicy )

Optional. Disabled safety policies for computer use.

JSON representation
{ "environment": enum (`Environment`), "excludedPredefinedFunctions": [ string ], "enablePromptInjectionDetection": boolean, "disabledSafetyPolicies": [ enum (`SafetyPolicy`) ] }

Environment

Represents the environment being operated, such as a web browser.

Enums
`ENVIRONMENT_UNSPECIFIED`	Defaults to browser.
`ENVIRONMENT_BROWSER`	Operates in a web browser.
`ENVIRONMENT_MOBILE`	Operates in a mobile environment.
`ENVIRONMENT_DESKTOP`	Operates in a desktop environment.

SafetyPolicy

Predefined safety policies for computer use.

Enums
`SAFETY_POLICY_UNSPECIFIED`	Unspecified safety policy.
`FINANCIAL_TRANSACTIONS`	Safety policy for financial transactions.
`SENSITIVE_DATA_MODIFICATION`	Safety policy for sensitive data modification.
`COMMUNICATION_TOOL`	Safety policy for communication tools (e.g., Gmail, Chat, Meet).
`ACCOUNT_CREATION`	Safety policy for account creation.
`DATA_MODIFICATION`	Safety policy for data modification.
`USER_CONSENT_MANAGEMENT`	Safety policy for user consent management.
`LEGAL_TERMS_AND_AGREEMENTS`	Safety policy for legal terms and agreements.

UrlContext

This type has no fields.

Tool to support URL context retrieval.

FileSearch

The FileSearch tool that retrieves knowledge from Semantic Retrieval corpora. Files are imported to Semantic Retrieval corpora using the ImportFile API.

Fields

fileSearchStoreNames[] string

Required. The names of the fileSearchStores to retrieve from. Example: fileSearchStores/my-file-search-store-123

metadataFilter string

Optional. Metadata filter to apply to the semantic retrieval documents and chunks.

topK integer

Optional. The number of semantic retrieval chunks to retrieve.

JSON representation
{ "fileSearchStoreNames": [ string ], "metadataFilter": string, "topK": integer }

McpServer

An MCPServer is a server that the model can call to perform actions. It implements the MCP protocol.

Fields

name string

The name of the MCPServer.

transport Union type

The transport to use to connect to the MCPServer. transport can be only one of the following:

streamableHttpTransport object ( StreamableHttpTransport )

A transport that can stream HTTP requests and responses.

JSON representation

{
  "name": string,

  // transport
  "streamableHttpTransport": {
    object (StreamableHttpTransport)
  }
  // Union type
}

StreamableHttpTransport

A transport that can stream HTTP requests and responses.

Fields

url string

The full URL for the MCPServer endpoint. Example: "https://api.example.com/mcp"

headers map (key: string, value: string)

Optional: Fields for authentication headers, timeouts, etc., if needed.

An object containing a list of "key": value pairs. Example: { "name": "wrench", "mass": "1.3kg", "count": "3" } .

timeout string ( Duration format)

HTTP timeout for regular operations.

A duration in seconds with up to nine fractional digits, ending with 's'. Example: "3.5s".

sseReadTimeout string ( Duration format)

Timeout for SSE read operations.

A duration in seconds with up to nine fractional digits, ending with 's'. Example: "3.5s".

terminateOnClose boolean

Whether to close the client session when the transport closes.

JSON representation
{ "url": string, "headers": { string: string, ... }, "timeout": string, "sseReadTimeout": string, "terminateOnClose": boolean }
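
The Duration strings used by timeout and sseReadTimeout can be produced from a number of seconds. The sketch below does that and assembles an McpServer entry; the helper names `to_duration` and `make_mcp_server` and the server name `docs` are hypothetical, and the URL is the example given for the url field.

```python
from decimal import Decimal


def to_duration(seconds) -> str:
    """Render seconds as a Duration string such as "3.5s" (at most nine fractional digits)."""
    value = Decimal(str(seconds))
    if value < 0:
        raise ValueError("a timeout cannot be negative")
    # Round to nine fractional digits, then drop trailing zeros and a bare dot.
    text = format(value.quantize(Decimal("0.000000001")), "f")
    return text.rstrip("0").rstrip(".") + "s"


def make_mcp_server(name, url, timeout_s=None, sse_read_timeout_s=None, headers=None):
    """Build an McpServer dict that uses the streamableHttpTransport variant."""
    transport = {"url": url}
    if headers:
        transport["headers"] = dict(headers)
    if timeout_s is not None:
        transport["timeout"] = to_duration(timeout_s)
    if sse_read_timeout_s is not None:
        transport["sseReadTimeout"] = to_duration(sse_read_timeout_s)
    return {"name": name, "streamableHttpTransport": transport}


print(to_duration(3.5))
print(make_mcp_server("docs", "https://api.example.com/mcp", timeout_s=30))
```

Going through `Decimal` avoids binary floating-point noise in the rendered string.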

GoogleMaps

The GoogleMaps Tool that provides geospatial context for the user's query.

Fields

enableWidget boolean

Optional. Whether to return a widget context token in the GroundingMetadata of the response. Developers can use the widget context token to render a Google Maps widget with geospatial context related to the places that the model references in the response.

JSON representation
{ "enableWidget": boolean }

ToolChoiceType

The type of tool choice.

Enums
`TOOL_CHOICE_TYPE_UNSPECIFIED`	Default value. This value is unused.
`AUTO`	Auto tool choice.
`ANY`	Any tool choice.
`NONE`	No tool choice.
`VALIDATED`	Validated tool choice.

Value

Value represents a dynamically typed value which can be either null, a number, a string, a boolean, a recursive struct value, or a list of values. A producer of value is expected to set one of these variants. Absence of any variant indicates an error.

Fields

kind Union type

The kind of value. kind can be only one of the following:

nullValue null

Represents a null value.

numberValue number

Represents a double value.

stringValue string

Represents a string value.

boolValue boolean

Represents a boolean value.

structValue object ( Struct )

Represents a structured value.

listValue object ( ListValue )

Represents a repeated Value .

contentValue object ( Content )

Represents rich content (text, image, etc.).

JSON representation

{

  // kind
  "nullValue": null,
  "numberValue": number,
  "stringValue": string,
  "boolValue": boolean,
  "structValue": {
    object (Struct)
  },
  "listValue": {
    object (ListValue)
  },
  "contentValue": {
    object (Content)
  }
  // Union type
}

ListValue

ListValue is a wrapper around a repeated field of values.

Fields

values[] object ( Value )

Repeated field of dynamically typed values.

JSON representation
{ "values": [ { object (`Value`) } ] }
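
As an illustration of how Value, ListValue, Struct, and Field nest, the sketch below converts plain Python data into the JSON shapes described on this page, with a Struct written as an ordered list of Field objects. The helper `to_value` is hypothetical, and the contentValue variant is not covered.

```python
def to_value(obj) -> dict:
    """Convert Python data into a Value dict with exactly one `kind` variant set."""
    if obj is None:
        return {"nullValue": None}
    if isinstance(obj, bool):  # must be tested before int, since bool is an int subclass
        return {"boolValue": obj}
    if isinstance(obj, (int, float)):
        return {"numberValue": float(obj)}  # numberValue represents a double
    if isinstance(obj, str):
        return {"stringValue": obj}
    if isinstance(obj, dict):
        # Struct keeps its fields in a list, so the dict's insertion order is preserved.
        fields = [{"name": str(key), "value": to_value(item)} for key, item in obj.items()]
        return {"structValue": {"fields": fields}}
    if isinstance(obj, (list, tuple)):
        return {"listValue": {"values": [to_value(item) for item in obj]}}
    raise TypeError(f"unsupported type: {type(obj).__name__}")


print(to_value({"name": "wrench", "count": 3, "tags": ["tool", None]}))
```

Each returned dict sets a single variant, in line with the rule that a producer is expected to set exactly one of them.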

VisualizationMode

Enum for visualization mode. Eventually we will support an interactive mode where the user can choose whether to include HTML visualizations in the response.

Enums
`UNSPECIFIED`	The default visualization mode. Will default to AUTO.
`OFF`	Do not include visualizations.
`AUTO`	Automatically include visualizations.

REST Resource: auth_tokens

Resource: AuthToken
- JSON representation
BidiGenerateContentSetup
- JSON representation
GenerationConfig
- JSON representation
Modality
SpeechConfig
- JSON representation
VoiceConfig
- JSON representation
PrebuiltVoiceConfig
- JSON representation
MultiSpeakerVoiceConfig
- JSON representation
SpeakerVoiceConfig
- JSON representation
ThinkingConfig
- JSON representation
ThinkingLevel
ImageConfig
- JSON representation
MediaResolution
ResponseFormatConfig
- JSON representation
TextResponseFormat
- JSON representation
MimeType
AudioResponseFormat
- JSON representation
MimeType
Delivery
ImageResponseFormat
- JSON representation
MimeType
Delivery
AspectRatio
ImageSize
TranslationConfig
- JSON representation
RealtimeInputConfig
- JSON representation
AutomaticActivityDetection
- JSON representation
StartSensitivity
EndSensitivity
ActivityHandling
TurnCoverage
SessionResumptionConfig
- JSON representation
ContextWindowCompressionConfig
- JSON representation
SlidingWindow
- JSON representation
AudioTranscriptionConfig
- JSON representation
LanguageAuto
LanguageHints
- JSON representation
HistoryConfig
- JSON representation
Methods

Resource: AuthToken

A request to create an ephemeral authentication token.

Fields

name string

Output only. Identifier. The token itself.

expireTime string ( Timestamp format)

Optional. Input only. Immutable. An optional time after which, when using the resulting token, messages in BidiGenerateContent sessions will be rejected. (Gemini may preemptively close the session after this time.)

If not set then this defaults to 30 minutes in the future. If set, this value must be less than 20 hours in the future.

newSessionExpireTime string ( Timestamp format)

Optional. Input only. Immutable. The time after which new Live API sessions using the token resulting from this request will be rejected.

If not set this defaults to 60 seconds in the future. If set, this value must be less than 20 hours in the future.

fieldMask string ( FieldMask format)

Optional. Input only. Immutable. If fieldMask is empty, and bidiGenerateContentSetup is not present, then the effective BidiGenerateContentSetup message is taken from the Live API connection.

If fieldMask is empty, and bidiGenerateContentSetup is present, then the effective BidiGenerateContentSetup message is taken entirely from bidiGenerateContentSetup in this request. The setup message from the Live API connection is ignored.

If fieldMask is not empty, then the corresponding fields from bidiGenerateContentSetup will overwrite the fields from the setup message in the Live API connection.

This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo" .

config Union type

The method-specific configuration for the resulting token. config can be only one of the following:

bidiGenerateContentSetup object ( BidiGenerateContentSetup )

Optional. Input only. Immutable. Configuration specific to BidiGenerateContent .

uses integer

Optional. Input only. Immutable. The number of times the token can be used. If this value is zero then no limit is applied. Resuming a Live API session does not count as a use. If unspecified, the default is 1.

JSON representation

{
  "name": string,
  "expireTime": string,
  "newSessionExpireTime": string,
  "fieldMask": string,

  // config
  "bidiGenerateContentSetup": {
    object (BidiGenerateContentSetup)
  }
  // Union type
  "uses": integer
}
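
The expiry rules above (defaults of 30 minutes and 60 seconds, an upper bound of 20 hours, and a default of one use) can be applied when building the request body. The sketch below assumes the caller supplies the current time; `make_auth_token_request` is a hypothetical helper, and it sets the fields explicitly rather than relying on server-side defaults.

```python
from datetime import datetime, timedelta, timezone

MAX_LIFETIME = timedelta(hours=20)  # documented upper bound for both expiry fields


def _stamp(moment: datetime) -> str:
    """Format an aware datetime as a Z-normalized Timestamp string."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def make_auth_token_request(
    now: datetime,
    expire_in: timedelta = timedelta(minutes=30),
    new_session_in: timedelta = timedelta(seconds=60),
    uses: int = 1,
) -> dict:
    """Build an AuthToken request body; the defaults mirror the documented defaults."""
    if now.tzinfo is None:
        raise ValueError("now must be timezone-aware")
    for label, delta in (("expireTime", expire_in), ("newSessionExpireTime", new_session_in)):
        if not timedelta(0) < delta < MAX_LIFETIME:
            raise ValueError(f"{label} must be less than 20 hours in the future")
    if uses < 0:
        raise ValueError("uses cannot be negative (zero means no limit)")
    return {
        "expireTime": _stamp(now + expire_in),
        "newSessionExpireTime": _stamp(now + new_session_in),
        "uses": uses,
    }


now = datetime(2025, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
print(make_auth_token_request(now))
```

Passing `now` in explicitly keeps the helper deterministic, which makes it easier to test.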

BidiGenerateContentSetup

Message to be sent in the first (and only in the first) BidiGenerateContentClientMessage . Contains configuration that will apply for the duration of the streaming RPC.

Clients should wait for a BidiGenerateContentSetupComplete message before sending any additional messages.

Fields

model string

Required. The model's resource name. This serves as an ID for the Model to use.

Format: models/{model}

generationConfig object ( GenerationConfig )

Optional. Generation config.

The following fields are not supported:

responseLogprobs
responseMimeType
logprobs
responseSchema
responseJsonSchema
stop_sequence
skipResponseCache
routing_config
audio_timestamp

systemInstruction object ( Content )

Optional. The user provided system instructions for the model.

Note: Only text should be used in parts and content in each part will be in a separate paragraph.

tools[] object ( Tool )

Optional. A list of Tools the model may use to generate the next response.

A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model.

realtimeInputConfig object ( RealtimeInputConfig )

Optional. Configures the handling of realtime input.

sessionResumption object ( SessionResumptionConfig )

Optional. Configures session resumption mechanism.

If included, the server will send SessionResumptionUpdate messages.

contextWindowCompression object ( ContextWindowCompressionConfig )

Optional. Configures a context window compression mechanism.

If included, the server will automatically reduce the size of the context when it exceeds the configured length.

inputAudioTranscription object ( AudioTranscriptionConfig )

Optional. If set, enables transcription of voice input. The transcription aligns with the input audio language, if configured.

outputAudioTranscription object ( AudioTranscriptionConfig )

Optional. If set, enables transcription of the model's audio output. The transcription aligns with the language code specified for the output audio, if configured.

historyConfig object ( HistoryConfig )

Optional. Configures the exchange of history between the client and the server.

JSON representation

{
  "model": string,
  "generationConfig": {
    object (GenerationConfig)
  },
  "systemInstruction": {
    object (Content)
  },
  "tools": [
    {
      object (Tool)
    }
  ],
  "realtimeInputConfig": {
    object (RealtimeInputConfig)
  },
  "sessionResumption": {
    object (SessionResumptionConfig)
  },
  "contextWindowCompression": {
    object (ContextWindowCompressionConfig)
  },
  "inputAudioTranscription": {
    object (AudioTranscriptionConfig)
  },
  "outputAudioTranscription": {
    object (AudioTranscriptionConfig)
  },
  "historyConfig": {
    object (HistoryConfig)
  }
}
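
As an illustration of the structure above, the following Python sketch builds a BidiGenerateContentSetup payload as a plain dictionary. The helper name build_setup and the model name models/example-live-model are hypothetical placeholders, and the parts/text layout of systemInstruction is assumed to follow the Content type; nothing here opens a connection.

```python
import json
from typing import Optional


def build_setup(model: str, system_text: Optional[str] = None) -> dict:
    """Build a BidiGenerateContentSetup payload as a plain dict (hypothetical helper)."""
    if not model.startswith("models/"):
        raise ValueError("model must use the format models/{model}")
    setup: dict = {"model": model}
    if system_text is not None:
        # Only text should be used in the systemInstruction parts.
        setup["systemInstruction"] = {"parts": [{"text": system_text}]}
    # Including sessionResumption asks the server to send SessionResumptionUpdate messages.
    setup["sessionResumption"] = {}
    return setup


# "models/example-live-model" is a placeholder, not a real model name.
setup = build_setup("models/example-live-model", "Answer briefly.")
print(json.dumps(setup, indent=2))
```

The client would send this in its first message and then wait for BidiGenerateContentSetupComplete before sending anything else.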

GenerationConfig

Configuration options for model generation and outputs. Not all parameters are configurable for every model.

Fields

stopSequences[] string

Optional. The set of character sequences (up to 5) that will stop output generation. If specified, the API stops at the first appearance of a stop sequence. The stop sequence is not included in the response.
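
The behavior described for stopSequences can be pictured with a small client-side sketch. The API applies stop sequences on the server; the function truncate_at_stop below is a hypothetical stand-in that only mimics the documented effect on a finished string.

```python
from typing import List


def truncate_at_stop(text: str, stop_sequences: List[str]) -> str:
    """Cut text at the earliest occurrence of any stop sequence (illustration only)."""
    if len(stop_sequences) > 5:
        raise ValueError("at most 5 stop sequences are allowed")
    # Collect the position of every stop sequence that actually occurs.
    hits = [i for i in (text.find(s) for s in stop_sequences if s) if i != -1]
    # The stop sequence itself is not part of the result.
    return text[: min(hits)] if hits else text


print(repr(truncate_at_stop("Hello END world", ["END"])))
```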

responseMimeType string

Optional. MIME type of the generated candidate text. Supported MIME types are:

text/plain: (default) Text output.
application/json: JSON response in the response candidates.
text/x.enum: ENUM as a string response in the response candidates.

Refer to the docs for a list of all supported text MIME types.

responseSchema (deprecated) object ( Schema )

Optional. Output schema of the generated candidate text. Schemas must be a subset of the OpenAPI schema and can be objects, primitives or arrays.

If set, a compatible responseMimeType must also be set. Compatible MIME types: application/json : Schema for JSON response. Refer to the JSON text generation guide for more details.

_responseJsonSchema (deprecated) value ( Value format)

Optional. Output schema of the generated response. This is an alternative to responseSchema that accepts JSON Schema .

If set, responseSchema must be omitted, but responseMimeType is required.

While the full JSON Schema may be sent, not all features are supported. Specifically, only the following properties are supported:

$id
$defs
$ref
$anchor
type
format
title
description
enum (for strings and numbers)
items
prefixItems
minItems
maxItems
minimum
maximum
anyOf
oneOf (interpreted the same as anyOf )
properties
additionalProperties
required

The non-standard propertyOrdering property may also be set.

Cyclic references are unrolled to a limited degree and, as such, may only be used within non-required properties. (Nullable properties are not sufficient.) If $ref is set on a sub-schema, no other properties, except those starting with $, may be set.
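
Because only a subset of JSON Schema is supported, it can help to check a schema locally before sending it. The sketch below walks a schema and reports keywords outside the list above; unsupported_keywords is a hypothetical helper, and it does not check the cyclic-reference or $ref rules.

```python
SUPPORTED = {
    "$id", "$defs", "$ref", "$anchor", "type", "format", "title", "description",
    "enum", "items", "prefixItems", "minItems", "maxItems", "minimum", "maximum",
    "anyOf", "oneOf", "properties", "additionalProperties", "required",
    "propertyOrdering",  # non-standard, but may also be set
}


def unsupported_keywords(schema, path="#"):
    """Return (path, keyword) pairs for keywords outside the supported list."""
    found = []
    if not isinstance(schema, dict):
        return found  # e.g. additionalProperties: false
    for key, value in schema.items():
        if key not in SUPPORTED:
            found.append((path, key))
        elif key in ("properties", "$defs") and isinstance(value, dict):
            for name, sub in value.items():
                found += unsupported_keywords(sub, f"{path}/{key}/{name}")
        elif key in ("items", "additionalProperties"):
            found += unsupported_keywords(value, f"{path}/{key}")
        elif key in ("anyOf", "oneOf", "prefixItems") and isinstance(value, list):
            for i, sub in enumerate(value):
                found += unsupported_keywords(sub, f"{path}/{key}/{i}")
    return found


schema = {
    "type": "object",
    "properties": {"name": {"type": "string", "pattern": "^a"}},
    "required": ["name"],
}
print(unsupported_keywords(schema))
```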

responseJsonSchema value ( Value format)

Optional. An internal detail. Use responseJsonSchema rather than this field.

responseModalities[] enum ( Modality )

Optional. The requested modalities of the response. Represents the set of modalities that the model can return, and should be expected in the response. This is an exact match to the modalities of the response.

A model may have multiple combinations of supported modalities. If the requested modalities do not match any of the supported combinations, an error will be returned.

An empty list is equivalent to requesting only text.

candidateCount integer

Optional. Number of generated responses to return. If unset, this defaults to 1. Note that this doesn't work for previous-generation models (Gemini 1.0 family).

maxOutputTokens integer

Optional. The maximum number of tokens to include in a response candidate.

Note: The default value varies by model, see the Model.output_token_limit attribute of the Model returned from the getModel function.

temperature number

Optional. Controls the randomness of the output.

Note: The default value varies by model, see the Model.temperature attribute of the Model returned from the getModel function.

Values can range from [0.0, 2.0].

topP number

Optional. The maximum cumulative probability of tokens to consider when sampling.

The model uses combined Top-k and Top-p (nucleus) sampling.

Tokens are sorted based on their assigned probabilities so that only the most likely tokens are considered. Top-k sampling directly limits the maximum number of tokens to consider, while Nucleus sampling limits the number of tokens based on the cumulative probability.

Note: The default value varies by Model and is specified by the Model.top_p attribute returned from the getModel function. An empty topK attribute indicates that the model doesn't apply top-k sampling and doesn't allow setting topK on requests.

topK integer

Optional. The maximum number of tokens to consider when sampling.

Gemini models use Top-p (nucleus) sampling or a combination of Top-k and nucleus sampling. Top-k sampling considers the set of topK most probable tokens. Models running with nucleus sampling don't allow topK setting.
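
The combined top-k and nucleus filtering described for topP and topK can be sketched on a toy distribution. This is a common textbook formulation, not necessarily the service's exact implementation; the function name and the renormalization step are assumptions.

```python
from typing import Dict, Optional


def top_k_top_p_filter(
    probs: Dict[str, float], top_k: Optional[int], top_p: float
) -> Dict[str, float]:
    """Keep the most likely tokens under top-k and top-p, then renormalize."""
    # Sort tokens from most to least probable.
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    if top_k is not None:
        ranked = ranked[:top_k]  # top-k: hard cap on the number of candidates
    kept, cumulative = [], 0.0
    for token, p in ranked:
        kept.append((token, p))
        cumulative += p
        if cumulative >= top_p:  # top-p: stop once enough probability mass is covered
            break
    total = sum(p for _, p in kept)
    return {token: p / total for token, p in kept}


probs = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}
filtered = top_k_top_p_filter(probs, top_k=3, top_p=0.7)
print(filtered)
```

A sampler would then draw the next token from the filtered distribution.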

seed integer

Optional. Seed used in decoding. If not set, the request uses a randomly generated seed.

presencePenalty number

Optional. Presence penalty applied to the next token's logprobs if the token has already been seen in the response.

This penalty is binary (on/off) and does not depend on the number of times the token is used (after the first). Use frequencyPenalty for a penalty that increases with each use.

A positive penalty will discourage the use of tokens that have already been used in the response, increasing the vocabulary.

A negative penalty will encourage the use of tokens that have already been used in the response, decreasing the vocabulary.

frequencyPenalty number

Optional. Frequency penalty applied to the next token's logprobs, multiplied by the number of times each token has been seen in the response so far.

A positive penalty will discourage the use of tokens that have already been used, proportional to the number of times the token has been used: the more a token is used, the more difficult it is for the model to use that token again, increasing the vocabulary of responses.

Caution: A negative penalty will encourage the model to reuse tokens proportional to the number of times the token has been used. Small negative values will reduce the vocabulary of a response. Larger negative values will cause the model to start repeating a common token until it hits the maxOutputTokens limit.
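
The difference between the two penalties can be shown with a short sketch. The reference does not give the exact formula, so subtracting the penalties from the logprobs as below is an assumption based on the common formulation; apply_penalties is a hypothetical name.

```python
from collections import Counter
from typing import Dict, List


def apply_penalties(
    logprobs: Dict[str, float],
    generated: List[str],
    presence_penalty: float = 0.0,
    frequency_penalty: float = 0.0,
) -> Dict[str, float]:
    """Adjust next-token logprobs using tokens already generated (assumed formula)."""
    counts = Counter(generated)
    adjusted = {}
    for token, lp in logprobs.items():
        n = counts.get(token, 0)
        seen = 1 if n > 0 else 0
        # Presence: flat, applied once if the token has appeared at all.
        # Frequency: grows with the number of appearances.
        adjusted[token] = lp - presence_penalty * seen - frequency_penalty * n
    return adjusted


adjusted = apply_penalties(
    {"the": -1.0, "cat": -2.0},
    generated=["the", "the"],
    presence_penalty=0.5,
    frequency_penalty=0.25,
)
print(adjusted)
```

With positive values, the repeated token "the" is pushed down while the unseen token "cat" is unchanged; negative values would have the opposite effect.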

responseLogprobs boolean

Optional. If true, export the logprobs results in response.

logprobs integer

Optional. Only valid if responseLogprobs=True. This sets the number of top logprobs, including the chosen candidate, to return at each decoding step in Candidate.logprobs_result. The number must be in the range [0, 20].

enableEnhancedCivicAnswers boolean

Optional. Enables enhanced civic answers. It may not be available for all models.

speechConfig object ( SpeechConfig )

Optional. The speech generation config.

thinkingConfig object ( ThinkingConfig )

Optional. Config for thinking features. An error will be returned if this field is set for models that don't support thinking.

imageConfig object ( ImageConfig )

Optional. Config for image generation. An error will be returned if this field is set for models that don't support these config options.

mediaResolution enum ( MediaResolution )

Optional. If specified, this media resolution is used.

enableAffectiveDialog boolean

Optional. If enabled, the model will detect emotions and adapt its responses accordingly.

responseFormat object ( ResponseFormatConfig )

Optional. Configuration for the response output format. Allows specifying output configuration per modality (text, audio, image) in a flat structure.

translationConfig object ( TranslationConfig )

Optional. Config for translation.

JSON representation

{
  "stopSequences": [
    string
  ],
  "responseMimeType": string,
  "responseSchema": {
    object (Schema)
  },
  "_responseJsonSchema": value,
  "responseJsonSchema": value,
  "responseModalities": [
    enum (Modality)
  ],
  "candidateCount": integer,
  "maxOutputTokens": integer,
  "temperature": number,
  "topP": number,
  "topK": integer,
  "seed": integer,
  "presencePenalty": number,
  "frequencyPenalty": number,
  "responseLogprobs": boolean,
  "logprobs": integer,
  "enableEnhancedCivicAnswers": boolean,
  "speechConfig": {
    object (SpeechConfig)
  },
  "thinkingConfig": {
    object (ThinkingConfig)
  },
  "imageConfig": {
    object (ImageConfig)
  },
  "mediaResolution": enum (MediaResolution),
  "enableAffectiveDialog": boolean,
  "responseFormat": {
    object (ResponseFormatConfig)
  },
  "translationConfig": {
    object (TranslationConfig)
  }
}
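
Several GenerationConfig constraints stated above can be checked on the client before a request is sent. The following sketch covers only the limits spelled out in this section (stop sequence count, temperature range, logprobs rules, schema and MIME type pairing); validate_generation_config is a hypothetical helper, and the server remains the authority on what is valid.

```python
from typing import List


def validate_generation_config(cfg: dict) -> List[str]:
    """Return human-readable problems found in a generationConfig dict."""
    problems = []
    if len(cfg.get("stopSequences", [])) > 5:
        problems.append("stopSequences: at most 5 sequences")
    temperature = cfg.get("temperature")
    if temperature is not None and not 0.0 <= temperature <= 2.0:
        problems.append("temperature: must be in [0.0, 2.0]")
    if "logprobs" in cfg:
        if not cfg.get("responseLogprobs"):
            problems.append("logprobs: only valid if responseLogprobs is true")
        if not 0 <= cfg["logprobs"] <= 20:
            problems.append("logprobs: must be in [0, 20]")
    has_schema = "responseSchema" in cfg
    has_json_schema = "_responseJsonSchema" in cfg
    if has_schema and has_json_schema:
        problems.append("responseSchema must be omitted when _responseJsonSchema is set")
    if (has_schema or has_json_schema) and "responseMimeType" not in cfg:
        problems.append("responseMimeType: required when a schema is set")
    return problems


config = {"temperature": 2.5, "logprobs": 3, "responseSchema": {"type": "STRING"}}
for problem in validate_generation_config(config):
    print(problem)
```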

Modality

Supported modalities of the response.

Enums
`MODALITY_UNSPECIFIED`	Default value.
`TEXT`	Indicates the model should return text.
`IMAGE`	Indicates the model should return images.
`AUDIO`	Indicates the model should return audio.

SpeechConfig

Config for speech generation and transcription.

Fields

voiceConfig object ( VoiceConfig )

The configuration in case of single-voice output.

multiSpeakerVoiceConfig object ( MultiSpeakerVoiceConfig )

Optional. The configuration for the multi-speaker setup. It is mutually exclusive with the voiceConfig field.

languageCode string

Optional. The IETF BCP-47 language code that the user configured the app to use. Used for speech recognition and synthesis.

Valid values are: de-DE, en-AU, en-GB, en-IN, en-US, es-US, fr-FR, hi-IN, pt-BR, ar-XA, es-ES, fr-CA, id-ID, it-IT, ja-JP, tr-TR, vi-VN, bn-IN, gu-IN, kn-IN, ml-IN, mr-IN, ta-IN, te-IN, nl-NL, ko-KR, cmn-CN, pl-PL, ru-RU, and th-TH.

JSON representation
{
  "voiceConfig": {
    object (VoiceConfig)
  },
  "multiSpeakerVoiceConfig": {
    object (MultiSpeakerVoiceConfig)
  },
  "languageCode": string
}

VoiceConfig

The configuration for the voice to use.

Fields

voice_config Union type

The configuration for the speaker to use. voice_config can be only one of the following:

prebuiltVoiceConfig object ( PrebuiltVoiceConfig )

The configuration for the prebuilt voice to use.

JSON representation
{
  // voice_config
  "prebuiltVoiceConfig": {
    object (PrebuiltVoiceConfig)
  }
  // Union type
}

PrebuiltVoiceConfig

The configuration for the prebuilt speaker to use.

Fields

voiceName string

The name of the preset voice to use.

JSON representation
{
  "voiceName": string
}

MultiSpeakerVoiceConfig

The configuration for the multi-speaker setup.

Fields

speakerVoiceConfigs[] object ( SpeakerVoiceConfig )

Required. All the enabled speaker voices.

JSON representation
{
  "speakerVoiceConfigs": [
    {
      object (SpeakerVoiceConfig)
    }
  ]
}

SpeakerVoiceConfig

The configuration for a single speaker in a multi speaker setup.

Fields

speaker string

Required. The name of the speaker to use. Should be the same as in the prompt.

voiceConfig object ( VoiceConfig )

Required. The configuration for the voice to use.

JSON representation
{
  "speaker": string,
  "voiceConfig": {
    object (VoiceConfig)
  }
}
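
To show how the speech types nest, the sketch below assembles a SpeechConfig dictionary for either one voice or several speakers, respecting the rule that voiceConfig and multiSpeakerVoiceConfig are mutually exclusive. The helper name, the speaker names, and the voice names ("voice-a", "voice-b") are hypothetical placeholders, not real preset voices.

```python
from typing import Dict, Optional


def build_speech_config(voices: Dict[str, str], language_code: Optional[str] = None) -> dict:
    """Map speaker name (as used in the prompt) to a preset voice name."""
    if not voices:
        raise ValueError("at least one voice is required")

    def voice(name: str) -> dict:
        return {"prebuiltVoiceConfig": {"voiceName": name}}

    if len(voices) == 1:
        # Single-voice output: only voiceConfig is set.
        (voice_name,) = voices.values()
        config = {"voiceConfig": voice(voice_name)}
    else:
        # Multi-speaker output: only multiSpeakerVoiceConfig is set.
        config = {
            "multiSpeakerVoiceConfig": {
                "speakerVoiceConfigs": [
                    {"speaker": speaker, "voiceConfig": voice(name)}
                    for speaker, name in voices.items()
                ]
            }
        }
    if language_code is not None:
        config["languageCode"] = language_code
    return config


speech = build_speech_config({"Host": "voice-a", "Guest": "voice-b"}, "en-US")
print(speech)
```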

ThinkingConfig

Config for thinking features.

Fields

includeThoughts boolean

Indicates whether to include thoughts in the response. If true, thoughts are returned only when available.

thinkingBudget integer

The number of thought tokens that the model should generate.

thinkingLevel enum ( ThinkingLevel )

Optional. Controls the maximum depth of the model's internal reasoning process before it produces a response. The default value is model-dependent. Refer to the Thinking levels guide for more details. Recommended for Gemini 3 or later models. Using it with earlier models results in an error.

JSON representation
{
  "includeThoughts": boolean,
  "thinkingBudget": integer,
  "thinkingLevel": enum (ThinkingLevel)
}

ThinkingLevel

Allows the user to specify how much the model should think using an enum instead of an integer budget.

Enums
`THINKING_LEVEL_UNSPECIFIED`	Default value.
`MINIMAL`	Little to no thinking.
`LOW`	Low thinking level.
`MEDIUM`	Medium thinking level.
`HIGH`	High thinking level.

ImageConfig

Config for image generation features.

Fields

aspectRatio string

Optional. The aspect ratio of the image to generate. Supported aspect ratios: 1:1, 1:4, 4:1, 1:8, 8:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, or 21:9.

If not specified, the model will choose a default aspect ratio based on any reference images provided.

imageSize string

Optional. Specifies the size of generated images. Supported values are 512, 1K, 2K, 4K. If not specified, the model will use the default value 1K.

JSON representation
{
  "aspectRatio": string,
  "imageSize": string
}

MediaResolution

Media resolution for the input media.

Enums
`MEDIA_RESOLUTION_UNSPECIFIED`	Media resolution has not been set.
`MEDIA_RESOLUTION_LOW`	Media resolution set to low (64 tokens).
`MEDIA_RESOLUTION_MEDIUM`	Media resolution set to medium (256 tokens).
`MEDIA_RESOLUTION_HIGH`	Media resolution set to high (zoomed reframing with 256 tokens).

ResponseFormatConfig

Configuration for the response output format. This is a flat object where each optional sub-field configures a specific output modality.

Fields

text object ( TextResponseFormat )

Optional. Text output format configuration.

audio object ( AudioResponseFormat )

Optional. Audio output format configuration.

image object ( ImageResponseFormat )

Optional. Image output format configuration.

JSON representation
{
  "text": {
    object (TextResponseFormat)
  },
  "audio": {
    object (AudioResponseFormat)
  },
  "image": {
    object (ImageResponseFormat)
  }
}

TextResponseFormat

Configuration for text output format.

Fields

mimeType enum ( MimeType )

Optional. The MIME type of the text output.

schema value ( Value format)

Optional. The JSON schema that the output should conform to. Only applicable when mimeType is APPLICATION_JSON.

JSON representation
{
  "mimeType": enum (MimeType),
  "schema": value
}

MimeType

Supported MIME types for text output.

Enums
`MIME_TYPE_UNSPECIFIED`	Default value. This value is unused.
`APPLICATION_JSON`	JSON output format.
`TEXT_PLAIN`	Plain text output format.

AudioResponseFormat

Configuration for audio output format.

Fields

mimeType enum ( MimeType )

Optional. The MIME type of the audio output.

delivery enum ( Delivery )

Optional. The delivery mode for the audio output.

sampleRate integer

Optional. Sample rate in Hz.

bitRate integer

Optional. Bit rate in bits per second (bps). Only applicable for compressed formats (MP3, Opus).

JSON representation
{
  "mimeType": enum (MimeType),
  "delivery": enum (Delivery),
  "sampleRate": integer,
  "bitRate": integer
}

MimeType

Supported MIME types for audio output.

Enums
`MIME_TYPE_UNSPECIFIED`	Default value. This value is unused.
`AUDIO_MP3`	MP3 audio format.
`AUDIO_OGG_OPUS`	OGG Opus audio format.
`AUDIO_L16`	Raw PCM (L16) audio format.
`AUDIO_WAV`	WAV audio format.
`AUDIO_ALAW`	A-law audio format.
`AUDIO_MULAW`	Mu-law audio format.

Delivery

Delivery mode for audio output.

Enums
`DELIVERY_UNSPECIFIED`	Default value. This value is unused.
`INLINE`	Audio data is returned inline in the response.
`URI`	Audio data is returned as a URI.

ImageResponseFormat

Configuration for image output format.

Fields

mimeType enum ( MimeType )

Optional. The MIME type of the image output.

delivery enum ( Delivery )

Optional. The delivery mode for the image output.

aspectRatio enum ( AspectRatio )

Optional. The aspect ratio for the image output.

imageSize enum ( ImageSize )

Optional. The size of the image output.

JSON representation
{
  "mimeType": enum (MimeType),
  "delivery": enum (Delivery),
  "aspectRatio": enum (AspectRatio),
  "imageSize": enum (ImageSize)
}

MimeType

Supported MIME types for image output.

Enums
`MIME_TYPE_UNSPECIFIED`	Default value. This value is unused.
`IMAGE_JPEG`	JPEG image format.

Delivery

Delivery mode for image output.

Enums
`DELIVERY_UNSPECIFIED`	Default value. This value is unused.
`INLINE`	Image data is returned inline in the response.
`URI`	Image data is returned as a URI.

AspectRatio

Supported aspect ratios for image output.

Enums
`ASPECT_RATIO_UNSPECIFIED`	Default value. This value is unused.
`ASPECT_RATIO_ONE_BY_ONE`	1:1 aspect ratio.
`ASPECT_RATIO_TWO_BY_THREE`	2:3 aspect ratio.
`ASPECT_RATIO_THREE_BY_TWO`	3:2 aspect ratio.
`ASPECT_RATIO_THREE_BY_FOUR`	3:4 aspect ratio.
`ASPECT_RATIO_FOUR_BY_THREE`	4:3 aspect ratio.
`ASPECT_RATIO_FOUR_BY_FIVE`	4:5 aspect ratio.
`ASPECT_RATIO_FIVE_BY_FOUR`	5:4 aspect ratio.
`ASPECT_RATIO_NINE_BY_SIXTEEN`	9:16 aspect ratio.
`ASPECT_RATIO_SIXTEEN_BY_NINE`	16:9 aspect ratio.
`ASPECT_RATIO_TWENTY_ONE_BY_NINE`	21:9 aspect ratio.
`ASPECT_RATIO_ONE_BY_EIGHT`	1:8 aspect ratio.
`ASPECT_RATIO_EIGHT_BY_ONE`	8:1 aspect ratio.
`ASPECT_RATIO_ONE_BY_FOUR`	1:4 aspect ratio.
`ASPECT_RATIO_FOUR_BY_ONE`	4:1 aspect ratio.

ImageSize

Supported image sizes for image output.

Enums
`IMAGE_SIZE_UNSPECIFIED`	Default value. This value is unused.
`IMAGE_SIZE_FIVE_TWELVE`	512px image size.
`IMAGE_SIZE_ONE_K`	1K image size.
`IMAGE_SIZE_TWO_K`	2K image size.
`IMAGE_SIZE_FOUR_K`	4K image size.

TranslationConfig

Config for translation features.

Fields

targetLanguageCode string

Required. The target language for translation. Supported values are BCP-47 language codes (e.g. "en", "es", "fr").

echoTargetLanguage boolean

Optional. If true, the model generates audio when the target language is spoken, essentially parroting the input. If false, no audio is produced for the target language.

JSON representation
{
  "targetLanguageCode": string,
  "echoTargetLanguage": boolean
}

RealtimeInputConfig

Configures the realtime input behavior in BidiGenerateContent .

Fields

automaticActivityDetection object ( AutomaticActivityDetection )

Optional. If not set, automatic activity detection is enabled by default. If automatic voice detection is disabled, the client must send activity signals.

activityHandling enum ( ActivityHandling )

Optional. Defines what effect activity has.

turnCoverage enum ( TurnCoverage )

Optional. Defines which input is included in the user's turn.

JSON representation
{
  "automaticActivityDetection": {
    object (AutomaticActivityDetection)
  },
  "activityHandling": enum (ActivityHandling),
  "turnCoverage": enum (TurnCoverage)
}

AutomaticActivityDetection

Configures automatic detection of activity.

Fields

disabled boolean

Optional. If enabled (the default), detected voice and text input count as activity. If disabled, the client must send activity signals.

startOfSpeechSensitivity enum ( StartSensitivity )

Optional. Determines how likely speech is to be detected.

prefixPaddingMs integer

Optional. The required duration of detected speech before start-of-speech is committed. The lower this value, the more sensitive the start-of-speech detection is and shorter speech can be recognized. However, this also increases the probability of false positives.

endOfSpeechSensitivity enum ( EndSensitivity )

Optional. Determines how likely detected speech is to be treated as ended.

silenceDurationMs integer

Optional. The required duration of detected non-speech (e.g. silence) before end-of-speech is committed. The larger this value, the longer speech gaps can be without interrupting the user's activity, but this will increase the model's latency.

JSON representation
{
  "disabled": boolean,
  "startOfSpeechSensitivity": enum (StartSensitivity),
  "prefixPaddingMs": integer,
  "endOfSpeechSensitivity": enum (EndSensitivity),
  "silenceDurationMs": integer
}

StartSensitivity

Determines how start of speech is detected.

Enums
`START_SENSITIVITY_UNSPECIFIED`	The default is START_SENSITIVITY_HIGH.
`START_SENSITIVITY_HIGH`	Automatic detection will detect the start of speech more often.
`START_SENSITIVITY_LOW`	Automatic detection will detect the start of speech less often.

EndSensitivity

Determines how end of speech is detected.

Enums
`END_SENSITIVITY_UNSPECIFIED`	The default is END_SENSITIVITY_HIGH.
`END_SENSITIVITY_HIGH`	Automatic detection ends speech more often.
`END_SENSITIVITY_LOW`	Automatic detection ends speech less often.

ActivityHandling

The different ways of handling user activity.

Enums
`ACTIVITY_HANDLING_UNSPECIFIED`	If unspecified, the default behavior is `START_OF_ACTIVITY_INTERRUPTS`.
`START_OF_ACTIVITY_INTERRUPTS`	Start of activity will interrupt the model's response (also called "barge in"). The model's current response will be cut off at the moment of the interruption. This is the default behavior.
`NO_INTERRUPTION`	The model's response will not be interrupted.

TurnCoverage

Options about which input is included in the user's turn.

Enums
`TURN_COVERAGE_UNSPECIFIED`	If unspecified, a default behavior is selected based on the model. For example, for Gemini 2.5 the default is `TURN_INCLUDES_ONLY_ACTIVITY`, while for Gemini 3.1 and onwards it is `TURN_INCLUDES_AUDIO_ACTIVITY_AND_ALL_VIDEO`.
`TURN_INCLUDES_ONLY_ACTIVITY`	Includes activity since the last turn, excluding inactivity (e.g. silence on the audio stream).
`TURN_INCLUDES_ALL_INPUT`	Includes all realtime input since the last turn, including inactivity (e.g. silence on the audio stream).
`TURN_INCLUDES_AUDIO_ACTIVITY_AND_ALL_VIDEO`	Includes audio activity and all video since the last turn. With automatic activity detection, audio activity means speech and excludes silence.
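
As a rough illustration of how the realtime input settings combine, the sketch below builds a RealtimeInputConfig dictionary. The function name and its parameters are hypothetical; only the JSON field names and enum values come from this reference. When manual_activity is true, automatic detection is disabled and the client is expected to send activity signals itself.

```python
from typing import Optional

ACTIVITY_HANDLING = {"START_OF_ACTIVITY_INTERRUPTS", "NO_INTERRUPTION"}
END_SENSITIVITY = {"END_SENSITIVITY_HIGH", "END_SENSITIVITY_LOW"}


def build_realtime_input_config(
    silence_duration_ms: Optional[int] = None,
    end_sensitivity: Optional[str] = None,
    activity_handling: Optional[str] = None,
    manual_activity: bool = False,
) -> dict:
    """Assemble a realtimeInputConfig dict, leaving unset fields to server defaults."""
    detection: dict = {}
    if manual_activity:
        # The client must send activity signals when detection is disabled.
        detection["disabled"] = True
    else:
        if end_sensitivity is not None:
            if end_sensitivity not in END_SENSITIVITY:
                raise ValueError(f"unknown EndSensitivity: {end_sensitivity}")
            detection["endOfSpeechSensitivity"] = end_sensitivity
        if silence_duration_ms is not None:
            if silence_duration_ms < 0:
                raise ValueError("silenceDurationMs must not be negative")
            detection["silenceDurationMs"] = silence_duration_ms
    config: dict = {}
    if detection:
        config["automaticActivityDetection"] = detection
    if activity_handling is not None:
        if activity_handling not in ACTIVITY_HANDLING:
            raise ValueError(f"unknown ActivityHandling: {activity_handling}")
        config["activityHandling"] = activity_handling
    return config


# Tolerate longer pauses and never interrupt the model's response.
realtime = build_realtime_input_config(
    silence_duration_ms=800,
    end_sensitivity="END_SENSITIVITY_LOW",
    activity_handling="NO_INTERRUPTION",
)
print(realtime)
```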

SessionResumptionConfig

Session resumption configuration.

This message is included in the session configuration as BidiGenerateContentSetup.session_resumption. If configured, the server will send SessionResumptionUpdate messages.

Fields

handle string

The handle of a previous session. If not present then a new session is created.

Session handles come from SessionResumptionUpdate.token values in previous connections.

JSON representation
{
  "handle": string
}

ContextWindowCompressionConfig

Enables context window compression, a mechanism for managing the model's context window so that it does not exceed a given length.

Fields

compression_mechanism Union type

The context window compression mechanism used. compression_mechanism can be only one of the following:

slidingWindow object ( SlidingWindow )

A sliding-window mechanism.

triggerTokens string ( int64 format)

The number of tokens (before running a turn) required to trigger a context window compression.

This can be used to balance quality against latency as shorter context windows may result in faster model responses. However, any compression operation will cause a temporary latency increase, so they should not be triggered frequently.

If not set, the default is 80% of the model's context window limit. This leaves 20% for the next user request/model response.

JSON representation
{
  // compression_mechanism
  "slidingWindow": {
    object (SlidingWindow)
  }
  // Union type
  "triggerTokens": string
}

SlidingWindow

The SlidingWindow method operates by discarding content at the beginning of the context window. The resulting context will always begin at the start of a USER role turn. System instructions and any BidiGenerateContentSetup.prefix_turns will always remain at the beginning of the result.

Fields

targetTokens string ( int64 format)

The target number of tokens to keep. The default value is triggerTokens/2.

Discarding parts of the context window causes a temporary latency increase so this value should be calibrated to avoid frequent compression operations.

JSON representation
{
  "targetTokens": string
}
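
The defaults described above (triggerTokens at 80% of the context window, targetTokens at triggerTokens/2) can be worked through in a short sketch. The function name and the 128000-token window are hypothetical, and the integer rounding is an assumption. Because both fields use the int64 format, they are encoded as JSON strings.

```python
from typing import Optional


def compression_config(
    context_window_tokens: int,
    trigger_tokens: Optional[int] = None,
    target_tokens: Optional[int] = None,
) -> dict:
    """Build a ContextWindowCompressionConfig dict with the documented defaults made explicit."""
    if trigger_tokens is None:
        # Default: 80% of the context window, leaving 20% for the next request/response.
        trigger_tokens = context_window_tokens * 80 // 100
    if target_tokens is None:
        # Default: half of the trigger threshold.
        target_tokens = trigger_tokens // 2
    if not 0 < target_tokens < trigger_tokens:
        raise ValueError("targetTokens must be positive and below triggerTokens")
    return {
        "slidingWindow": {"targetTokens": str(target_tokens)},
        "triggerTokens": str(trigger_tokens),  # int64 values travel as strings
    }


compression = compression_config(128000)
print(compression)
```

A larger gap between the two values means each compression discards more context but compressions happen less often.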

AudioTranscriptionConfig

The audio transcription configuration.

Fields

adaptationPhrases[] (deprecated) string

Optional. A list of phrases used for speech adaptation, which biases the ASR model to improve recognition of these specific terms.

customVocabulary[] string

Optional. A list of custom vocabulary phrases to bias the speech recognition model toward recognizing specific terms (product names, proper nouns, jargon).

language_config Union type

The language config for the audio transcription. It is required for ASR models; an error is returned if it is not set. language_config can be only one of the following:

languageAuto object ( LanguageAuto )

Optional. The model will detect the language automatically.

languageHints object ( LanguageHints )

Optional. Specifies one or more languages in the audio.

JSON representation
{
  "adaptationPhrases": [
    string
  ],
  "customVocabulary": [
    string
  ],

  // language_config
  "languageAuto": {
    object (LanguageAuto)
  },
  "languageHints": {
    object (LanguageHints)
  }
  // Union type
}

LanguageAuto

This type has no fields.

Indicates the language of the audio should be automatically detected.

LanguageHints

Provides hints to the model about possible languages present in the audio.

Fields

languageCodes[] string

Required. BCP-47 language codes.

JSON representation
{
  "languageCodes": [
    string
  ]
}

HistoryConfig

History configuration.

This message is included in the session configuration as BidiGenerateContentSetup.history_config. Configures the exchange of history messages.

Fields

initialHistoryInClientContent boolean

Optional. If true, after sending setupComplete, the server first waits for and processes clientContent messages until turnComplete is true. This initial history will not trigger a model call and may end with role MODEL. After turnComplete is true, the client can start the realtime conversation via realtimeInput.

JSON representation
{
  "initialHistoryInClientContent": boolean
}
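
The message order implied by initialHistoryInClientContent can be sketched as a list of client messages. This is only an outline: the "setup" wrapper key, the "turns" field, the lowercase role strings, and the model name are assumptions not stated in this section, and the client would still wait for setupComplete between the two messages.

```python
import json
from typing import List, Tuple


def initial_history_messages(model: str, history: List[Tuple[str, str]]) -> List[dict]:
    """Return the setup message and one clientContent message carrying prior turns."""
    setup_message = {
        "setup": {  # assumed wrapper key for BidiGenerateContentSetup
            "model": model,
            "historyConfig": {"initialHistoryInClientContent": True},
        }
    }
    history_message = {
        "clientContent": {
            # "turns" and the role values are assumptions; text-only parts for brevity.
            "turns": [
                {"role": role, "parts": [{"text": text}]} for role, text in history
            ],
            # Marks the end of the initial history; no model call is triggered.
            "turnComplete": True,
        }
    }
    return [setup_message, history_message]


messages = initial_history_messages(
    "models/example-live-model",  # placeholder model name
    [("user", "Hi"), ("model", "Hello! How can I help?")],
)
for message in messages:
    print(json.dumps(message))
```

After these two messages, the realtime conversation would continue through realtimeInput.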

Method: auth_tokens.create

Creates a token that can be used to constrain the behavior of a BidiGenerateContent session.

Endpoint

POST https://generativelanguage.googleapis.com/v1beta/auth_tokens

Request body

The request body contains an instance of AuthToken .

Fields

expireTime string ( Timestamp format)

If not set then this defaults to 30 minutes in the future. If set, this value must be less than 20 hours in the future.

newSessionExpireTime string ( Timestamp format)

Optional. Input only. Immutable. The time after which new Live API sessions using the token resulting from this request will be rejected.

If not set this defaults to 60 seconds in the future. If set, this value must be less than 20 hours in the future.

fieldMask string ( FieldMask format)

Optional. Input only. Immutable. If fieldMask is empty, and bidiGenerateContentSetup is not present, then the effective BidiGenerateContentSetup message is taken from the Live API connection.

If fieldMask is not empty, then the corresponding fields from bidiGenerateContentSetup will overwrite the fields from the setup message in the Live API connection.

This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo".

config Union type

The method-specific configuration for the resulting token. config can be only one of the following:

bidiGenerateContentSetup object ( BidiGenerateContentSetup )

Optional. Input only. Immutable. Configuration specific to BidiGenerateContent .

uses integer

Response body

If successful, the response body contains a newly created instance of AuthToken .
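
The expiry rules for the request body can be illustrated with a small sketch that computes expireTime and newSessionExpireTime and enforces the 20-hour limit. It only builds the JSON body and sends nothing; the function names are hypothetical, and the Timestamp format is assumed to be an RFC 3339 UTC string.

```python
from datetime import datetime, timedelta, timezone

MAX_LIFETIME = timedelta(hours=20)


def rfc3339(moment: datetime) -> str:
    """Format an aware datetime as an RFC 3339 UTC timestamp."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def build_auth_token_request(
    now: datetime,
    expire_in: timedelta = timedelta(minutes=30),      # documented default
    new_session_in: timedelta = timedelta(seconds=60),  # documented default
) -> dict:
    """Build an AuthToken request body with explicit expiry timestamps."""
    for name, delta in (("expireTime", expire_in), ("newSessionExpireTime", new_session_in)):
        if not timedelta(0) < delta < MAX_LIFETIME:
            raise ValueError(f"{name} must be less than 20 hours in the future")
    return {
        "expireTime": rfc3339(now + expire_in),
        "newSessionExpireTime": rfc3339(now + new_session_in),
    }


body = build_auth_token_request(datetime(2025, 1, 1, tzinfo=timezone.utc))
print(body)
```

Passing the current time in as a parameter keeps the helper deterministic and easy to test.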