Vertex AI is a Google Cloud platform for rapidly building and scaling machine learning projects without requiring in-house MLOps expertise. Vertex AI provides a console where you can work with a large selection of models and offers end-to-end MLOps capabilities and a serverless experience for streamlined development.
You can use Vertex AI as the downstream application that serves Gemma, which is available in Model Garden, a curated collection of models. For example, you could port weights from a Gemma implementation, and use Vertex AI to serve that version of Gemma to get predictions.
To learn more, refer to the following pages:
- Introduction to Vertex AI: Get started with Vertex AI.
- Gemma with Vertex AI: Use Gemma open models with Vertex AI.
- Fine-tune Gemma using KerasNLP and deploy to Vertex AI: End-to-end notebook to fine-tune Gemma using Keras.