LiteRT 運算子版本

本文件說明 LiteRT 的運算版本管理結構定義。運算版本管理可讓開發人員在現有的作業中新增功能和參數。此外，它也能保證下列事項：

回溯相容性：新的 LiteRT 實作應處理舊模型檔案
前瞻相容性：舊版 LiteRT 應會處理由新版轉換工具產生的新模型檔案，各項功能的使用方式
轉送相容性偵測：如果舊版 LiteRT 實作讀取的新模型，其中包含 op 的新版本，應回報錯誤。

範例：將深度卷積加入深度卷積

本文件的其餘部分會介紹如何修改 TFLite 的運算版本，將 dilation 參數新增至深度卷積運算。

本文件不需要知識化，也能瞭解本文件。請注意：

將新增 2 個新的整數參數：dilation_width_factor 和 dilation_height_factor。
舊版深度卷積核心 (不支援擴散核心) 為等效將擴散係數設為 1。

變更 FlatBuffer 結構定義

如要在運算中加入新參數，請變更 lite/schema/schema.fbs。

舉例來說，深度卷積選項表格如下所示：

table DepthwiseConv2DOptions {
  padding:Padding;
  stride_w:int;
  stride_h:int;
  depth_multiplier:int;
  fused_activation_function:ActivationFunctionType;
}

新增參數時：

新增註解，指出特定版本支援的參數。
當新的實作方式取得新加入項目的預設值時就會照常運作。

新增參數後，表格會如下所示：

table DepthwiseConv2DOptions {
  // Parameters for DepthwiseConv version 1 or above.
  padding:Padding;
  stride_w:int;
  stride_h:int;
  depth_multiplier:int;
  fused_activation_function:ActivationFunctionType;
  // Parameters for DepthwiseConv version 2 or above.
  dilation_w_factor:int = 1;
  dilation_h_factor:int = 1;
}

應該重新產生 lite/schema/schema_generated.h 檔案，結構定義。

變更 C 結構和核心實作

在 LiteRT 中，核心實作項目會與 FlatBuffer 分離定義核心會從 lite/c/builtin_op_data.h。

原始深度卷積參數如下：

typedef struct {
  TfLitePadding padding;
  int stride_width;
  int stride_height;
  int depth_multiplier;
  TfLiteFusedActivation activation;
} TfLiteDepthwiseConvParams;

與 FlatBuffer 結構定義相同，請新增註解，指出參數支援此功能。結果如下所示：

typedef struct {
  // Parameters for DepthwiseConv version 1 or above.
  TfLitePadding padding;
  int stride_width;
  int stride_height;
  int depth_multiplier;
  TfLiteFusedActivation activation;
  // Parameters for DepthwiseConv version 2 or above.
  int dilation_width_factor;
  int dilation_height_factor;
} TfLiteDepthwiseConvParams;

請一併變更核心實作，以讀取新增的參數從 C 結構開始定義這裡省略了詳細資料。

變更 FlatBuffer 讀取程式碼

讀取 FlatBuffer 和產生 C 結構的邏輯位於 lite/core/api/flatbuffer_conversions.cc。

更新檔案以處理新參數，如下所示：

TfLiteStatus ParseDepthwiseConv2D(const Operator* op,
                                  ErrorReporter* error_reporter,
                                  BuiltinDataAllocator* allocator,
                                  void** builtin_data) {
  CheckParsePointerParams(op, error_reporter, allocator, builtin_data);

  SafeBuiltinDataAllocator safe_allocator(allocator);

  std::unique_ptr<TfLiteDepthwiseConvParams,
                  SafeBuiltinDataAllocator::BuiltinDataDeleter>
      params = safe_allocator.Allocate<TfLiteDepthwiseConvParams>();
  TF_LITE_ENSURE(error_reporter, params != nullptr);

  const DepthwiseConv2DOptions* schema_params =
      op->builtin_options_as_DepthwiseConv2DOptions();

  if (schema_params != nullptr) {
    params->padding = ConvertPadding(schema_params->padding());
    params->stride_width = schema_params->stride_w();
    params->stride_height = schema_params->stride_h();
    params->depth_multiplier = schema_params->depth_multiplier();
    params->activation =
        ConvertActivation(schema_params->fused_activation_function());

    params->dilation_width_factor = schema_params->dilation_w_factor();
    params->dilation_height_factor = schema_params->dilation_h_factor();
  }

  *builtin_data = params.release();
  return kTfLiteOk;
}

不需要在此查看運算版本。新的導入項目時讀取缺少擴散因數的舊模型檔案，會採用 1 做為預設值，而新的核心會與舊核心一致。

變更核心註冊作業

MutableOpResolver (在 lite/mutable_op_resolver.h 中定義) 提供幾種註冊運算核心的函式。最小與最大版本為 1 預設：

void AddBuiltin(tflite::BuiltinOperator op, TfLiteRegistration* registration,
                int min_version = 1, int max_version = 1);
void AddCustom(const char* name, TfLiteRegistration* registration,
               int min_version = 1, int max_version = 1);

內建作業是在 lite/kernels/register.cc 中註冊。在這個例子中我們導入了新的運算核心，可處理 DepthwiseConv2D 第 1 版和第 1 版 2，所以我們需要變更這行程式碼：

AddBuiltin(BuiltinOperator_DEPTHWISE_CONV_2D, Register_DEPTHWISE_CONV_2D());

收件者：

AddBuiltin(BuiltinOperator_DEPTHWISE_CONV_2D, Register_DEPTHWISE_CONV_2D(),
             /* min_version = */ 1,
             /* max_version = */ 2);

變更 TFLite 運算版本

下一步是讓 TFLite 填入最低版本需求就可以執行運算這個範例的意義如下：

當延遲因數皆為 1 時，填入 version=1。
否則請填入 version=2。

在以下位置修改運算子的 GetBuiltinOperatorVersion 函式： lite/tools/versioning/op_version.cc，將新版本新增至 DepthwiseConv2D:

case BuiltinOperator_DEPTHWISE_CONV_2D:
  auto depthwise_conv_params =
      reinterpret_cast<TfLiteDepthwiseConvParams*>(op_sig.builtin_data);
  TFLITE_DCHECK(depthwise_conv_params != nullptr);
  if (depthwise_conv_params->dilation_width_factor != 1 ||
       depthwise_conv_params->dilation_height_factor != 1) {
    return 2;
  }
  return 1;

更新運算子版本對應

最後一個步驟是將新的版本資訊加到運算子版本對應中。這個因為我們須產生模型中下限產生 200 個執行階段版本的執行階段版本

您需要在 lite/tools/versioning/runtime_version.cc。

在這種情況下，您需要將下列項目新增至 op_version_map：

{ {BuiltinOperator_DEPTHWISE_CONV_2D, 2}, %CURRENT_RUNTIME_VERSION%}

其中 %CURRENT_RUNTIME_VERSION% 對應目前的執行階段版本定義於 tensorflow/core/public/version.h 中定義的

委派實作

LiteRT 提供委派 API，能將作業委派硬體後端在委派的 Prepare 函式中，檢查版本是否。

const int kMaxVersion = 1;
TfLiteNode* node;
TfLiteRegistration* registration = nullptr;
TF_LITE_ENSURE_STATUS(context->GetNodeAndRegistration(context, node_index, &node, &registration));

if (registration->version > kMaxVersion) {
  // Reject the node if the version isn't supported.
}

即使委派作業僅支援版本 1 作業，這是必要步驟，因此委派作業可以在收到較高的版本運算時，偵測不相容。