The Gemini API supports batch APIs, which let you process multiple requests in a single call. For more details, see the Batch API guide.
Method: models.batchGenerateContent
Enqueues a batch of models.generateContent requests for batch processing.
Endpoint
posthttps:
Path parameters
batch.modelstring
Required. The name of the Model to use for generating the completion.
Format: models/{model}. It takes the form models/{model}.
Request body
The request body contains data with the following structure:
batch.namestring
Output only. Identifier. Resource name of the batch.
Format: batches/{batchId}.
batch.displayNamestring
Required. The user-defined name of this batch.
Required. Input configuration of the instances on which batch processing are performed.
Output only. The output of the batch request.
Output only. The time at which the batch was created.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. The time at which the batch processing completed.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. The time at which the batch was last updated.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. Stats about the batch.
Output only. The state of the batch.
Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0.
Response body
If successful, the response body contains an instance of Operation.
Method: models.asyncBatchEmbedContent
Enqueues a batch of models.embedContent requests for batch processing. We have a models.batchEmbedContents handler in GenerativeService, but it was synchronized. So we name this one to be Async to avoid confusion.
Endpoint
posthttps:
Path parameters
batch.modelstring
Required. The name of the Model to use for generating the completion.
Format: models/{model}. It takes the form models/{model}.
Request body
The request body contains data with the following structure:
batch.namestring
Output only. Identifier. Resource name of the batch.
Format: batches/{batchId}.
batch.displayNamestring
Required. The user-defined name of this batch.
Required. Input configuration of the instances on which batch processing are performed.
Output only. The output of the batch request.
Output only. The time at which the batch was created.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. The time at which the batch processing completed.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. The time at which the batch was last updated.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. Stats about the batch.
Output only. The state of the batch.
Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0.
Response body
If successful, the response body contains an instance of Operation.
Method: batches.get
Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service.
Endpoint
gethttps:
Path parameters
namestring
The name of the operation resource. It takes the form batches/{batches}.
Request body
The request body must be empty.
Response body
If successful, the response body contains an instance of Operation.
Method: batches.list
Lists operations that match the specified filter in the request. If the server doesn't support this method, it returns UNIMPLEMENTED.
Endpoint
gethttps:
Path parameters
namestring
The name of the operation's parent resource. It takes the form batches.
Query parameters
filterstring
The standard list filter.
pageSizeinteger
The standard list page size.
pageTokenstring
The standard list page token.
returnPartialSuccessboolean
When set to true, operations that are reachable are returned as normal, and those that are unreachable are returned in the [ListOperationsResponse.unreachable] field.
This can only be true when reading across collections e.g. when parent is set to "projects/example/locations/-".
This field is not by default supported and will result in an UNIMPLEMENTED error if set unless explicitly documented otherwise in service or product specific documentation.
Request body
The request body must be empty.
Response body
If successful, the response body contains an instance of ListOperationsResponse.
Method: batches.cancel
Starts asynchronous cancellation on a long-running operation. The server makes a best effort to cancel the operation, but success is not guaranteed. If the server doesn't support this method, it returns google.rpc.Code.UNIMPLEMENTED. Clients can use Operations.GetOperation or other methods to check whether the cancellation succeeded or whether the operation completed despite cancellation. On successful cancellation, the operation is not deleted; instead, it becomes an operation with an Operation.error value with a google.rpc.Status.code of 1, corresponding to Code.CANCELLED.
Endpoint
posthttps:
Path parameters
namestring
The name of the operation resource to be cancelled. It takes the form batches/{batches}.
Request body
The request body must be empty.
Response body
If successful, the response body is an empty JSON object.
Method: batches.delete
Deletes a long-running operation. This method indicates that the client is no longer interested in the operation result. It does not cancel the operation. If the server doesn't support this method, it returns google.rpc.Code.UNIMPLEMENTED.
Endpoint
deletehttps:
Path parameters
namestring
The name of the operation resource to be deleted. It takes the form batches/{batches}.
Request body
The request body must be empty.
Response body
If successful, the response body is an empty JSON object.
GenerateContentBatch
- JSON representation
- InputConfig
- InlinedRequests
- InlinedRequest
- GenerateContentBatchOutput
- InlinedResponses
- InlinedResponse
- BatchStats
A resource representing a batch of GenerateContent requests.
modelstring
Required. The name of the Model to use for generating the completion.
Format: models/{model}.
namestring
Output only. Identifier. Resource name of the batch.
Format: batches/{batchId}.
displayNamestring
Required. The user-defined name of this batch.
Required. Input configuration of the instances on which batch processing are performed.
Output only. The output of the batch request.
Output only. The time at which the batch was created.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. The time at which the batch processing completed.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. The time at which the batch was last updated.
Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: "2014-10-02T15:01:23Z", "2014-10-02T15:01:23.045123456Z" or "2014-10-02T15:01:23+05:30".
Output only. Stats about the batch.
Output only. The state of the batch.
Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0.
| JSON representation | 
|---|
| { "model": string, "name": string, "displayName": string, "inputConfig": { object ( | 
InputConfig
Configures the input to the batch request.
sourceUnion type
source can be only one of the following:fileNamestring
The name of the File containing the input requests.
The requests to be processed in the batch.
| JSON representation | 
|---|
| {
  // source
  "fileName": string,
  "requests": {
    object ( | 
InlinedRequests
The requests to be processed in the batch if provided as part of the batch creation request.
Required. The requests to be processed in the batch.
| JSON representation | 
|---|
| {
  "requests": [
    {
      object ( | 
InlinedRequest
The request to be processed in the batch.
Required. The request to be processed in the batch.
Optional. The metadata to be associated with the request.
| JSON representation | 
|---|
| {
  "request": {
    object ( | 
GenerateContentBatchOutput
The output of a batch request. This is returned in the BatchGenerateContentResponse or the GenerateContentBatch.output field.
outputUnion type
output can be only one of the following:responsesFilestring
Output only. The file ID of the file containing the responses. The file will be a JSONL file with a single response per line. The responses will be GenerateContentResponse messages formatted as JSON. The responses will be written in the same order as the input requests.
Output only. The responses to the requests in the batch. Returned when the batch was built using inlined requests. The responses will be in the same order as the input requests.
| JSON representation | 
|---|
| {
  // output
  "responsesFile": string,
  "inlinedResponses": {
    object ( | 
InlinedResponses
The responses to the requests in the batch.
Output only. The responses to the requests in the batch.
| JSON representation | 
|---|
| {
  "inlinedResponses": [
    {
      object ( | 
InlinedResponse
The response to a single request in the batch.
Output only. The metadata associated with the request.
outputUnion type
output can be only one of the following:Output only. The error encountered while processing the request.
Output only. The response to the request.
| JSON representation | 
|---|
| { "metadata": { object }, // output "error": { object ( | 
BatchStats
Stats about the batch.
Output only. The number of requests in the batch.
Output only. The number of requests that were successfully processed.
Output only. The number of requests that failed to be processed.
Output only. The number of requests that are still pending processing.
| JSON representation | 
|---|
| { "requestCount": string, "successfulRequestCount": string, "failedRequestCount": string, "pendingRequestCount": string } | 
Method: batches.updateEmbedContentBatch
- Endpoint
- Path parameters
- Query parameters
- Request body
- Response body
- Authorization scopes
- EmbedContentRequest
Updates a batch of EmbedContent requests for batch processing.
Endpoint
patchhttps:
PATCH https://generativelanguage.googleapis.com/v1beta/{embedContentBatch.name=batches/*}:updateEmbedContentBatch
Path parameters
embedContentBatch.namestring
Output only. Identifier. Resource name of the batch.
Format: batches/{batchId}. It takes the form batches/{batches}.
Query parameters
Optional. The list of fields to update.
This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo".
Request body
The request body contains an instance of EmbedContentBatch.
modelstring
Required. The name of the Model to use for generating the completion.
Format: models/{model}.
displayNamestring
Required. The user-defined name of this batch.
Required. Input configuration of the instances on which batch processing are performed.
Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0.
Response body
If successful, the response body contains an instance of EmbedContentBatch.
EmbedContentRequest
Request containing the Content for the model to embed.
modelstring
Required. The model's resource name. This serves as an ID for the Model to use.
This name should match a model name returned by the ListModels method.
Format: models/{model}
Required. The content to embed. Only the parts.text fields will be counted.
Optional. Optional task type for which the embeddings will be used. Not supported on earlier models (models/embedding-001).
titlestring
Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT.
Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval.
outputDimensionalityinteger
Optional. Optional reduced dimension for the output embedding. If set, excessive values in the output embedding are truncated from the end. Supported by newer models since 2024 only. You cannot set this value if using the earlier model (models/embedding-001).
Method: batches.updateGenerateContentBatch
- Endpoint
- Path parameters
- Query parameters
- Request body
- Response body
- Authorization scopes
- GenerateContentRequest
Updates a batch of GenerateContent requests for batch processing.
Endpoint
patchhttps:
PATCH https://generativelanguage.googleapis.com/v1beta/{generateContentBatch.name=batches/*}:updateGenerateContentBatch
Path parameters
generateContentBatch.namestring
Output only. Identifier. Resource name of the batch.
Format: batches/{batchId}. It takes the form batches/{batches}.
Query parameters
Optional. The list of fields to update.
This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo".
Request body
The request body contains an instance of GenerateContentBatch.
modelstring
Required. The name of the Model to use for generating the completion.
Format: models/{model}.
displayNamestring
Required. The user-defined name of this batch.
Required. Input configuration of the instances on which batch processing are performed.
Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0.
Response body
If successful, the response body contains an instance of GenerateContentBatch.
GenerateContentRequest
Request to generate a completion from the model.
modelstring
Required. The name of the Model to use for generating the completion.
Format: models/{model}.
Optional. A list of Tools the Model may use to generate the next response.
A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the Model. Supported Tools are Function and codeExecution. Refer to the Function calling and the Code execution guides to learn more.
Optional. Tool configuration for any Tool specified in the request. Refer to the Function calling guide for a usage example.
Optional. A list of unique SafetySetting instances for blocking unsafe content.
This will be enforced on the GenerateContentRequest.contents and GenerateContentResponse.candidates. There should not be more than one setting for each SafetyCategory type. The API will block any contents and responses that fail to meet the thresholds set by these settings. This list overrides the default settings for each SafetyCategory specified in the safetySettings. If there is no SafetySetting for a given SafetyCategory provided in the list, the API will use the default safety setting for that category. Harm categories HARM_CATEGORY_HATE_SPEECH, HARM_CATEGORY_SEXUALLY_EXPLICIT, HARM_CATEGORY_DANGEROUS_CONTENT, HARM_CATEGORY_HARASSMENT, HARM_CATEGORY_CIVIC_INTEGRITY are supported. Refer to the guide for detailed information on available safety settings. Also refer to the Safety guidance to learn how to incorporate safety considerations in your AI applications.
Optional. Developer set system instruction(s). Currently, text only.
Optional. Configuration options for model generation and outputs.
cachedContentstring
Optional. The name of the content cached to use as context to serve the prediction. Format: cachedContents/{cachedContent}
| JSON representation | 
|---|
| { "model": string, "contents": [ { object ( | 
BatchState
The state of the batch.
| Enums | |
|---|---|
| BATCH_STATE_UNSPECIFIED | The batch state is unspecified. | 
| BATCH_STATE_PENDING | The service is preparing to run the batch. | 
| BATCH_STATE_RUNNING | The batch is in progress. | 
| BATCH_STATE_SUCCEEDED | The batch completed successfully. | 
| BATCH_STATE_FAILED | The batch failed. | 
| BATCH_STATE_CANCELLED | The batch has been cancelled. | 
| BATCH_STATE_EXPIRED | The batch has expired. | 
REST Resource: batches
Resource: Operation
This resource represents a long-running operation that is the result of a network API call.
namestring
The server-assigned name, which is only unique within the same service that originally returns it. If you use the default HTTP mapping, the name should be a resource name ending with operations/{unique_id}.
metadataobject
Service-specific metadata associated with the operation. It typically contains progress information and common metadata such as create time. Some services might not provide such metadata. Any method that returns a long-running operation should document the metadata type, if any.
An object containing fields of an arbitrary type. An additional field "@type" contains a URI identifying the type. Example: { "id": 1234, "@type": "types.example.com/standard/id" }.
doneboolean
If the value is false, it means the operation is still in progress. If true, the operation is completed, and either error or response is available.
resultUnion type
error or a valid response. If done == false, neither error nor response is set. If done == true, exactly one of error or response can be set. Some services might not provide the result. result can be only one of the following:The error result of the operation in case of failure or cancellation.
responseobject
The normal, successful response of the operation. If the original method returns no data on success, such as Delete, the response is google.protobuf.Empty. If the original method is standard Get/Create/Update, the response should be the resource. For other methods, the response should have the type XxxResponse, where Xxx is the original method name. For example, if the original method name is TakeSnapshot(), the inferred response type is TakeSnapshotResponse.
An object containing fields of an arbitrary type. An additional field "@type" contains a URI identifying the type. Example: { "id": 1234, "@type": "types.example.com/standard/id" }.
| JSON representation | 
|---|
| {
  "name": string,
  "metadata": {
    "@type": string,
    field1: ...,
    ...
  },
  "done": boolean,
  // result
  "error": {
    object ( |