A Chunk is a subpart of a Document that is treated as an independent unit for the purposes of vector representation and storage.
A Corpus can have a maximum of 1 million Chunks.
Attributes
name
str
Immutable. Identifier. The Chunk resource name. The ID
(name excluding the corpora/*/documents/*/chunks/ prefix)
can contain up to 40 characters that are lowercase
alphanumeric or dashes (-). The ID cannot start or end with
a dash. If the name is empty on create, a random
12-character unique ID will be generated. Example:
corpora/{corpus_id}/documents/{document_id}/chunks/123a456b789c
data
google.ai.generativelanguage.ChunkData
Required. The content for the Chunk, such as the text
string. The maximum number of tokens per chunk is 2043.
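
The ID rules given for name above can be checked on the client before a create request is sent. The following is a minimal sketch that uses only the Python standard library. The helper names is_valid_chunk_id and chunk_id_from_name are hypothetical and are not part of the google.ai.generativelanguage package. The sketch assumes "alphanumeric" means ASCII a-z and 0-9, and the service remains the authority on what it accepts.

```python
import re
from typing import Optional

# Assumption: "lowercase alphanumeric" means ASCII a-z and 0-9.
# 1 to 40 characters, dashes allowed only in the interior.
_CHUNK_ID_RE = re.compile(r"[a-z0-9](?:[a-z0-9-]{0,38}[a-z0-9])?")

# Resource name layout taken from the example in the name description.
_CHUNK_NAME_RE = re.compile(r"corpora/[^/]+/documents/[^/]+/chunks/([^/]*)")


def is_valid_chunk_id(chunk_id: str) -> bool:
    """Return True if chunk_id satisfies the documented ID rules.

    An empty ID returns False here; per the description above, an
    empty name on create leads to a generated ID instead.
    """
    return _CHUNK_ID_RE.fullmatch(chunk_id) is not None


def chunk_id_from_name(name: str) -> Optional[str]:
    """Return the ID part of a full Chunk resource name, or None."""
    match = _CHUNK_NAME_RE.fullmatch(name)
    return match.group(1) if match else None


if __name__ == "__main__":
    name = "corpora/my-corpus/documents/my-doc/chunks/123a456b789c"
    chunk_id = chunk_id_from_name(name)
    print(chunk_id, is_valid_chunk_id(chunk_id or ""))
```

The token limit on data is not checked here, because it depends on the service's tokenizer.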