Introducing Google AI Edge Portal: Benchmark Edge AI at scale. Sign-up to request access during private preview.

Convert PyTorch GenAI models for on-device inference

The LiteRT Torch Generative API is a high-performance library designed for authoring and converting transformer-based PyTorch models into the LiteRT/LiteRT-LM format. This enables developers to seamlessly deploy generative AI models, specifically Large Language Models (LLMs), for on-device text and image generation with ease.

The Torch Generative API supports model conversion for CPU and GPU execution, with NPU support in development. By pairing Torch Generative API with LiteRT-LM, you can build responsive, privacy-focused applications that run generative models entirely on-device.

For more information, see the Generative Torch API GitHub repo.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-01-28 UTC.