Our first multimodal embedding model, providing efficient numerical mapping of
text, images, video, audio, and PDFs into a single unified embedding space. The
Gemini Embedding 2 model is best for cross-modal semantic search, document
retrieval, and recommendation systems that require fast, scalable similarity
calculations across large multimodal datasets.
Documentation
Visit the Embeddings page for full coverage
of features and capabilities.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2026-03-10 UTC."],[],[]]