LiteRT, formerly known as TensorFlow Lite, is Google's high-performance runtime for on-device AI.

LiteRT in Google Play services Java (and Kotlin) API

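In addition to the native API, LiteRT in Google Play services can be accessed through the Java API, which can be used from Java or Kotlin code. Specifically, LiteRT in Google Play services is available through the LiteRT Interpreter API.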

Using the Interpreter API

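The LiteRT Interpreter API, provided by the TensorFlow runtime, is a general-purpose interface for building and running machine learning models. Follow the steps below to run inference with the Interpreter API using the LiteRT in Google Play services runtime.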

1. Add project dependencies

Add the following dependencies to your app project code to access the Play services API for LiteRT:

dependencies {
...
    // LiteRT dependencies for Google Play services
    implementation 'com.google.android.gms:play-services-tflite-java:16.1.0'
    // Optional: include LiteRT Support Library
    implementation 'com.google.android.gms:play-services-tflite-support:16.1.0'
...
}

2. Add initialization of LiteRT

Initialize the LiteRT component of the Google Play services API before using the LiteRT APIs:

Kotlin:

val initializeTask: Task<Void> by lazy { TfLite.initialize(this) }

Java:

Task<Void> initializeTask = TfLite.initialize(context);

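3. Create an interpreter and set the runtime option

Create an interpreter with InterpreterApi.create() and configure it to use the Google Play services runtime by calling InterpreterApi.Options.setRuntime(), as shown in the following example code: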

Kotlin:

import org.tensorflow.lite.InterpreterApi
import org.tensorflow.lite.InterpreterApi.Options.TfLiteRuntime
...
private lateinit var interpreter: InterpreterApi
...
initializeTask
  .addOnSuccessListener {
    // Runs once the Play services runtime has finished initializing.
    val interpreterOption =
      InterpreterApi.Options().setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY)
    interpreter = InterpreterApi.create(modelBuffer, interpreterOption)
  }
  .addOnFailureListener { e ->
    Log.e("Interpreter", "Cannot initialize interpreter", e)
  }

Java:

import org.tensorflow.lite.InterpreterApi;
import org.tensorflow.lite.InterpreterApi.Options.TfLiteRuntime;
...
private InterpreterApi interpreter;
...
initializeTask
  .addOnSuccessListener(a -> {
    // Runs once the Play services runtime has finished initializing.
    interpreter = InterpreterApi.create(modelBuffer,
      new InterpreterApi.Options().setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY));
  })
  .addOnFailureListener(e -> {
    Log.e("Interpreter", "Cannot initialize interpreter", e);
  });

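You should use the implementation above because it avoids blocking the Android user interface thread. If you need to manage thread execution more closely, you can add a Tasks.await() call to interpreter creation: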

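Kotlin:

import androidx.lifecycle.lifecycleScope
import kotlinx.coroutines.tasks.await
...
lifecycleScope.launchWhenStarted { // uses coroutine
  initializeTask.await()
}

Java:

import androidx.annotation.WorkerThread;
import com.google.android.gms.tasks.Tasks;
import java.util.concurrent.ExecutionException;
...
@WorkerThread // Tasks.await() blocks, so call this off the main thread.
InterpreterApi initializeInterpreter() throws ExecutionException, InterruptedException {
    Tasks.await(initializeTask);
    return InterpreterApi.create(...);
}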

4. Run inference

Using the interpreter object you created, call the run() method to generate an inference.

Kotlin:

interpreter.run(inputBuffer, outputBuffer)

Java:

interpreter.run(inputBuffer, outputBuffer);
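The inputBuffer and outputBuffer arguments are commonly direct ByteBuffer objects in native byte order. The following is a minimal sketch of how such buffers might be allocated, assuming a float32 model. The InferenceBuffers class, the newFloatTensorBuffer helper, and the tensor shapes (1x224x224x3 input, 1x1001 output) are hypothetical and chosen only for illustration; use the shapes and data type of your own model.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

/** Hypothetical helper for illustration; not part of the LiteRT API. */
final class InferenceBuffers {
    private static final int BYTES_PER_FLOAT = 4;

    /** Allocates a direct, native-order buffer sized for a float32 tensor of the given shape. */
    static ByteBuffer newFloatTensorBuffer(int... shape) {
        int elements = 1;
        for (int dim : shape) {
            if (dim <= 0) {
                throw new IllegalArgumentException("Dimensions must be positive: " + dim);
            }
            // multiplyExact fails loudly instead of silently overflowing.
            elements = Math.multiplyExact(elements, dim);
        }
        return ByteBuffer
            .allocateDirect(Math.multiplyExact(elements, BYTES_PER_FLOAT))
            .order(ByteOrder.nativeOrder());
    }

    public static void main(String[] args) {
        // Assumed shapes, for illustration only.
        ByteBuffer inputBuffer = newFloatTensorBuffer(1, 224, 224, 3);
        ByteBuffer outputBuffer = newFloatTensorBuffer(1, 1001);
        // In an app, you would fill inputBuffer and then call:
        // interpreter.run(inputBuffer, outputBuffer);
        System.out.println("input bytes: " + inputBuffer.capacity());
        System.out.println("output bytes: " + outputBuffer.capacity());
    }
}
```

Allocating the buffers once and reusing them across calls avoids repeated allocation; call rewind() on a buffer before refilling or reading it.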

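Hardware acceleration

LiteRT lets you accelerate the performance of your model using specialized hardware processors, such as graphics processing units (GPUs). You can take advantage of these specialized processors using hardware drivers called delegates.

The GPU delegate is provided through Google Play services and is dynamically loaded, just like the Play services version of the Interpreter API.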

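Checking device compatibility

Not all devices support GPU hardware acceleration with TFLite. To mitigate errors and potential crashes, use the TfLiteGpu.isGpuDelegateAvailable method to check whether a device is compatible with the GPU delegate.

Use this method to confirm whether a device supports the GPU, and fall back to the CPU when it does not.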

useGpuTask = TfLiteGpu.isGpuDelegateAvailable(context)

Once you have a variable such as useGpuTask, you can use it to decide whether the device should use the GPU delegate.

Kotlin:

val interpreterTask = useGpuTask.continueWith { task ->
  val interpreterOptions = InterpreterApi.Options()
      .setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY)
  // Add the GPU delegate only when the device supports it.
  if (task.result) {
      interpreterOptions.addDelegateFactory(GpuDelegateFactory())
  }
  InterpreterApi.create(FileUtil.loadMappedFile(context, MODEL_PATH), interpreterOptions)
}

Java:

Task<InterpreterApi.Options> interpreterOptionsTask = useGpuTask.continueWith(task -> {
  InterpreterApi.Options options =
      new InterpreterApi.Options().setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY);
  // Add the GPU delegate only when the device supports it.
  if (task.getResult()) {
    options.addDelegateFactory(new GpuDelegateFactory());
  }
  return options;
});
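The fallback decision can also be looked at in isolation. The sketch below is not LiteRT code: it swaps the Play services Task type for the standard library's CompletableFuture so that it runs on a plain JVM, and the DelegateFallbackSketch class, Backend enum, and chooseBackend method are hypothetical names. It additionally assumes that a failed availability check should be treated the same as "GPU not available".

```java
import java.util.concurrent.CompletableFuture;

/** Hypothetical stand-in for the delegate decision; uses CompletableFuture, not the Play services Task type. */
final class DelegateFallbackSketch {
    enum Backend { GPU, CPU }

    /** Maps the asynchronous availability result to a backend; a failed check falls back to CPU. */
    static CompletableFuture<Backend> chooseBackend(CompletableFuture<Boolean> gpuAvailable) {
        return gpuAvailable
            // Assumption: if the check itself fails, behave as if the GPU is unavailable.
            .exceptionally(error -> Boolean.FALSE)
            .thenApply(available -> Boolean.TRUE.equals(available) ? Backend.GPU : Backend.CPU);
    }

    public static void main(String[] args) {
        CompletableFuture<Boolean> check = CompletableFuture.completedFuture(false);
        System.out.println("Selected backend: " + chooseBackend(check).join());
    }
}
```

Keeping the decision in one place means the interpreter is always created with a valid configuration, whichever branch is taken.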

GPU with the Interpreter API

To use the GPU delegate with the Interpreter API:

Update the project dependencies to use the GPU delegate from Play services:

implementation 'com.google.android.gms:play-services-tflite-gpu:16.2.0'

Enable the GPU delegate option in the TFLite initialization:

Kotlin:

TfLite.initialize(context,
  TfLiteInitializationOptions.builder()
    .setEnableGpuDelegateSupport(true)
    .build())

Java:

TfLite.initialize(context,
  TfLiteInitializationOptions.builder()
    .setEnableGpuDelegateSupport(true)
    .build());

Enable the GPU delegate in the interpreter options: set the delegate factory to GpuDelegateFactory by calling addDelegateFactory() within InterpreterApi.Options():

Kotlin:

val interpreterOption = InterpreterApi.Options()
  .setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY)
  .addDelegateFactory(GpuDelegateFactory())

Java:

InterpreterApi.Options interpreterOption = new InterpreterApi.Options()
  .setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY)
  .addDelegateFactory(new GpuDelegateFactory());

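Migrating from stand-alone LiteRT

If you plan to migrate your app from stand-alone LiteRT to the Play services API, review the following additional guidance for updating your app project code: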

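1. Review the Limitations section of this page to make sure your use case is supported.
2. Before updating your code, we recommend running performance and accuracy checks on your models, particularly if you are using a version of LiteRT (TF Lite) earlier than 2.1, so that you have a baseline to compare against the new implementation.
3. If you have migrated all of your code to the Play services API for LiteRT, remove the existing LiteRT runtime library dependencies (entries with org.tensorflow:tensorflow-lite:*) from your build.gradle file to reduce your app size.
4. Identify all occurrences of new Interpreter object creation in your code, and modify each one to use the InterpreterApi.create() call. The new TfLite.initialize call is asynchronous, which means that in most cases it is not a drop-in replacement: you must register a listener that runs when the call completes. See the code snippet in step 3.
5. Add import org.tensorflow.lite.InterpreterApi; and import org.tensorflow.lite.InterpreterApi.Options.TfLiteRuntime; to any source file that uses the org.tensorflow.lite.Interpreter or org.tensorflow.lite.InterpreterApi classes.
6. If any of the resulting calls to InterpreterApi.create() have only a single argument, append new InterpreterApi.Options() to the argument list.
7. Append .setRuntime(TfLiteRuntime.FROM_SYSTEM_ONLY) to the last argument of every call to InterpreterApi.create().
8. Replace all other occurrences of the org.tensorflow.lite.Interpreter class with org.tensorflow.lite.InterpreterApi.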

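If you want to use stand-alone LiteRT and the Play services API side by side, you must use LiteRT (TF Lite) version 2.9 or later. LiteRT (TF Lite) version 2.8 and earlier are not compatible with the Play services API version.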
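If a build script or test needs to enforce this rule, note that version strings should be compared numerically: as plain text, "2.10" sorts before "2.9". The following is a minimal sketch of such a check. The StandaloneVersionCheck class and its method are hypothetical, and the sketch assumes a plain dotted numeric version string such as "2.9.0"; any other format causes a NumberFormatException.

```java
/** Hypothetical helper illustrating the version rule above; not part of LiteRT. */
final class StandaloneVersionCheck {
    private static final int MIN_MAJOR = 2;
    private static final int MIN_MINOR = 9;

    /** Returns true if the given stand-alone version is 2.9 or later. */
    static boolean worksAlongsidePlayServices(String version) {
        String[] parts = version.trim().split("\\.");
        int major = Integer.parseInt(parts[0]);
        // A missing minor component (for example "3") is treated as 0.
        int minor = parts.length > 1 ? Integer.parseInt(parts[1]) : 0;
        return major > MIN_MAJOR || (major == MIN_MAJOR && minor >= MIN_MINOR);
    }

    public static void main(String[] args) {
        for (String v : new String[] {"2.8.2", "2.9.0", "2.10.0"}) {
            System.out.println(v + " -> " + worksAlongsidePlayServices(v));
        }
    }
}
```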