पेश है Google AI Edge Portal: बड़े पैमाने पर एज एआई का बेंचमार्क. निजी झलक के दौरान ऐक्सेस का अनुरोध करने के लिए, साइन-अप करें.

इस पेज का अनुवाद Cloud Translation API से किया गया है.

LiteRT की सहायता लाइब्रेरी की मदद से, इनपुट और आउटपुट डेटा को प्रोसेस करना

मोबाइल ऐप्लिकेशन डेवलपर आम तौर पर, टाइप किए गए ऑब्जेक्ट के साथ इंटरैक्ट करते हैं. जैसे, बिटमैप या पूर्णांक जैसे प्रिमिटिव. हालांकि, डिवाइस पर मशीन लर्निंग मॉडल चलाने वाला LiteRT इंटरप्रेटर एपीआई, ByteBuffer के तौर पर टेंसर का इस्तेमाल करता है. इसलिए, इन्हें डीबग करना और इनमें बदलाव करना मुश्किल हो सकता है. LiteRT Android Support Library को LiteRT मॉडल के इनपुट और आउटपुट को प्रोसेस करने के लिए डिज़ाइन किया गया है. साथ ही, इससे LiteRT इंटरप्रेटर को इस्तेमाल करना आसान हो जाता है.

शुरू करें

Gradle डिपेंडेंसी और अन्य सेटिंग इंपोर्ट करना

.tflite मॉडल फ़ाइल को Android मॉड्यूल की ऐसेट डायरेक्ट्री में कॉपी करें. मॉडल को इसी डायरेक्ट्री में चलाया जाएगा. यह तय करें कि फ़ाइल को कंप्रेस नहीं किया जाना चाहिए. साथ ही, मॉड्यूल की build.gradle फ़ाइल में LiteRT लाइब्रेरी जोड़ें:

android {
    // Other settings

    // Specify tflite file should not be compressed for the app apk
    aaptOptions {
        noCompress "tflite"
    }

}

dependencies {
    // Other dependencies

    // Import tflite dependencies
    implementation 'org.tensorflow:tensorflow-lite:0.0.0-nightly-SNAPSHOT'
    // The GPU delegate library is optional. Depend on it as needed.
    implementation 'com.google.ai.edge.litert:litert-gpu:0.0.0-nightly-SNAPSHOT'
    implementation 'com.google.ai.edge.litert:litert-support:0.0.0-nightly-SNAPSHOT'
}

Support Library के अलग-अलग वर्शन के लिए, MavenCentral पर होस्ट की गई LiteRT Support Library AAR देखें.

इमेज में बुनियादी बदलाव करना और उसे कन्वर्ट करना

LiteRT Support Library में, इमेज में बदलाव करने के बुनियादी तरीकों का एक सुइट होता है. जैसे, काटना और साइज़ बदलना. इसका इस्तेमाल करने के लिए, ImagePreprocessor बनाएं और ज़रूरी कार्रवाइयां जोड़ें. इमेज को LiteRT इंटरप्रेटर के लिए ज़रूरी टेंसर फ़ॉर्मैट में बदलने के लिए, इनपुट के तौर पर इस्तेमाल करने के लिए TensorImage बनाएं:

import org.tensorflow.lite.DataType;
import org.tensorflow.lite.support.image.ImageProcessor;
import org.tensorflow.lite.support.image.TensorImage;
import org.tensorflow.lite.support.image.ops.ResizeOp;

// Initialization code
// Create an ImageProcessor with all ops required. For more ops, please
// refer to the ImageProcessor Architecture section in this README.
ImageProcessor imageProcessor =
    new ImageProcessor.Builder()
        .add(new ResizeOp(224, 224, ResizeOp.ResizeMethod.BILINEAR))
        .build();

// Create a TensorImage object. This creates the tensor of the corresponding
// tensor type (uint8 in this case) that the LiteRT interpreter needs.
TensorImage tensorImage = new TensorImage(DataType.UINT8);

// Analysis code for every frame
// Preprocess the image
tensorImage.load(bitmap);
tensorImage = imageProcessor.process(tensorImage);

DataType को मेटाडेटा एक्सट्रैक्टर लाइब्रेरी के साथ-साथ मॉडल की अन्य जानकारी के ज़रिए भी पढ़ा जा सकता है.

ऑडियो डेटा की बुनियादी प्रोसेसिंग

LiteRT Support Library में, TensorAudio क्लास भी तय की जाती है. यह ऑडियो डेटा प्रोसेसिंग के कुछ बुनियादी तरीकों को रैप करती है. इसका इस्तेमाल ज़्यादातर AudioRecord के साथ किया जाता है. साथ ही, यह रिंग बफ़र में ऑडियो सैंपल कैप्चर करता है.

import android.media.AudioRecord;
import org.tensorflow.lite.support.audio.TensorAudio;

// Create an `AudioRecord` instance.
AudioRecord record = AudioRecord(...)

// Create a `TensorAudio` object from Android AudioFormat.
TensorAudio tensorAudio = new TensorAudio(record.getFormat(), size)

// Load all audio samples available in the AudioRecord without blocking.
tensorAudio.load(record)

// Get the `TensorBuffer` for inference.
TensorBuffer buffer = tensorAudio.getTensorBuffer()

आउटपुट ऑब्जेक्ट बनाना और मॉडल को चलाना

मॉडल को चलाने से पहले, हमें कंटेनर ऑब्जेक्ट बनाने होंगे. इनमें नतीजे सेव किए जाएंगे:

import org.tensorflow.lite.DataType;
import org.tensorflow.lite.support.tensorbuffer.TensorBuffer;

// Create a container for the result and specify that this is a quantized model.
// Hence, the 'DataType' is defined as UINT8 (8-bit unsigned integer)
TensorBuffer probabilityBuffer =
    TensorBuffer.createFixedSize(new int[]{1, 1001}, DataType.UINT8);

मॉडल लोड हो रहा है और अनुमान लगाया जा रहा है:

import java.nio.MappedByteBuffer;
import org.tensorflow.lite.InterpreterFactory;
import org.tensorflow.lite.InterpreterApi;

// Initialise the model
try{
    MappedByteBuffer tfliteModel
        = FileUtil.loadMappedFile(activity,
            "mobilenet_v1_1.0_224_quant.tflite");
    InterpreterApi tflite = new InterpreterFactory().create(
        tfliteModel, new InterpreterApi.Options());
} catch (IOException e){
    Log.e("tfliteSupport", "Error reading model", e);
}

// Running inference
if(null != tflite) {
    tflite.run(tImage.getBuffer(), probabilityBuffer.getBuffer());
}

नतीजे को ऐक्सेस करना

डेवलपर, आउटपुट को सीधे probabilityBuffer.getFloatArray() के ज़रिए ऐक्सेस कर सकते हैं. अगर मॉडल, क्वान्टाइज़ किया गया आउटपुट जनरेट करता है, तो नतीजे को बदलना न भूलें. MobileNet के क्वांटाइज़्ड मॉडल के लिए, डेवलपर को हर आउटपुट वैल्यू को 255 से भाग देना होगा. इससे, हर कैटगरी के लिए 0 (सबसे कम संभावना) से 1 (सबसे ज़्यादा संभावना) तक की संभावना मिलेगी.

ज़रूरी नहीं: नतीजों को लेबल से मैप करना

डेवलपर के पास, नतीजों को लेबल के साथ मैप करने का विकल्प भी होता है. सबसे पहले, लेबल वाली टेक्स्ट फ़ाइल को मॉड्यूल की ऐसेट डायरेक्ट्री में कॉपी करें. इसके बाद, इस कोड का इस्तेमाल करके लेबल फ़ाइल लोड करें:

import org.tensorflow.lite.support.common.FileUtil;

final String ASSOCIATED_AXIS_LABELS = "labels.txt";
List<String> associatedAxisLabels = null;

try {
    associatedAxisLabels = FileUtil.loadLabels(this, ASSOCIATED_AXIS_LABELS);
} catch (IOException e) {
    Log.e("tfliteSupport", "Error reading label file", e);
}

यहां दिए गए स्निपेट में, कैटगरी के लेबल के साथ संभावनाओं को जोड़ने का तरीका बताया गया है:

import java.util.Map;
import org.tensorflow.lite.support.common.TensorProcessor;
import org.tensorflow.lite.support.common.ops.NormalizeOp;
import org.tensorflow.lite.support.label.TensorLabel;

// Post-processor which dequantize the result
TensorProcessor probabilityProcessor =
    new TensorProcessor.Builder().add(new NormalizeOp(0, 255)).build();

if (null != associatedAxisLabels) {
    // Map of labels and their corresponding probability
    TensorLabel labels = new TensorLabel(associatedAxisLabels,
        probabilityProcessor.process(probabilityBuffer));

    // Create a map to access the result based on label
    Map<String, Float> floatMap = labels.getMapWithFloatValue();
}

इस्तेमाल के उदाहरणों की मौजूदा कवरेज

LiteRT Support Library के मौजूदा वर्शन में ये शामिल हैं:

सामान्य डेटा टाइप (फ़्लोट, uint8, इमेज, ऑडियो, और इन ऑब्जेक्ट का ऐरे) को tflite मॉडल के इनपुट और आउटपुट के तौर पर इस्तेमाल किया जा सकता है.
इमेज से जुड़ी बुनियादी कार्रवाइयां (इमेज काटना, उसका साइज़ बदलना, और उसे घुमाना).
नॉर्मलाइज़ेशन और क्वॉन्टाइज़ेशन
फ़ाइल यूटिलिटी

आने वाले वर्शन में, टेक्स्ट से जुड़े ऐप्लिकेशन के लिए बेहतर सपोर्ट मिलेगा.

ImageProcessor आर्किटेक्चर

ImageProcessor के डिज़ाइन की वजह से, इमेज में बदलाव करने की प्रोसेस को पहले से तय किया जा सकता है. साथ ही, इसे बनाने की प्रोसेस के दौरान ऑप्टिमाइज़ किया जा सकता है. ImageProcessor फ़ंक्शन, फ़िलहाल तीन बुनियादी प्रीप्रोसेसिंग ऑपरेशन के साथ काम करता है. इनके बारे में यहां बताया गया है. साथ ही, इनके बारे में नीचे दिए गए कोड स्निपेट में तीन टिप्पणियों में भी बताया गया है:

import org.tensorflow.lite.support.common.ops.NormalizeOp;
import org.tensorflow.lite.support.common.ops.QuantizeOp;
import org.tensorflow.lite.support.image.ops.ResizeOp;
import org.tensorflow.lite.support.image.ops.ResizeWithCropOrPadOp;
import org.tensorflow.lite.support.image.ops.Rot90Op;

int width = bitmap.getWidth();
int height = bitmap.getHeight();

int size = height > width ? width : height;

ImageProcessor imageProcessor =
    new ImageProcessor.Builder()
        // Center crop the image to the largest square possible
        .add(new ResizeWithCropOrPadOp(size, size))
        // Resize using Bilinear or Nearest neighbour
        .add(new ResizeOp(224, 224, ResizeOp.ResizeMethod.BILINEAR));
        // Rotation counter-clockwise in 90 degree increments
        .add(new Rot90Op(rotateDegrees / 90))
        .add(new NormalizeOp(127.5, 127.5))
        .add(new QuantizeOp(128.0, 1/128.0))
        .build();

नॉर्मलाइज़ेशन और क्वांटाइज़ेशन के बारे में ज़्यादा जानकारी के लिए, यहां क्लिक करें.

सहायता लाइब्रेरी का मुख्य लक्ष्य, सभी tf.image ट्रांसफ़ॉर्मेशन को सपोर्ट करना है. इसका मतलब है कि ट्रांसफ़ॉर्मेशन, TensorFlow जैसा ही होगा. साथ ही, इसे लागू करने के लिए ऑपरेटिंग सिस्टम की ज़रूरत नहीं होगी.

डेवलपर भी कस्टम प्रोसेसर बना सकते हैं. इन मामलों में, ट्रेनिंग प्रोसेस के साथ अलाइन होना ज़रूरी है. इसका मतलब है कि ट्रेनिंग और अनुमान लगाने, दोनों के लिए एक ही प्रीप्रोसेसिंग लागू होनी चाहिए, ताकि नतीजे दोहराए जा सकें.

क्वांटाइज़ेशन

TensorImage या TensorBuffer जैसे इनपुट या आउटपुट ऑब्जेक्ट शुरू करते समय, आपको उनके टाइप DataType.UINT8 या DataType.FLOAT32 के तौर पर तय करने होंगे.

TensorImage tensorImage = new TensorImage(DataType.UINT8);
TensorBuffer probabilityBuffer =
    TensorBuffer.createFixedSize(new int[]{1, 1001}, DataType.UINT8);

TensorProcessor का इस्तेमाल, इनपुट टेंसर को क्वांटाइज़ करने या आउटपुट टेंसर को डीक्वांटाइज़ करने के लिए किया जा सकता है. उदाहरण के लिए, क्वांटाइज़ किए गए आउटपुट TensorBuffer को प्रोसेस करते समय, डेवलपर DequantizeOp का इस्तेमाल करके, नतीजे को 0 से 1 के बीच की फ़्लोटिंग पॉइंट संभावना में बदल सकता है:

import org.tensorflow.lite.support.common.TensorProcessor;

// Post-processor which dequantize the result
TensorProcessor probabilityProcessor =
    new TensorProcessor.Builder().add(new DequantizeOp(0, 1/255.0)).build();
TensorBuffer dequantizedBuffer = probabilityProcessor.process(probabilityBuffer);

टेंसर के क्वांटाइज़ेशन पैरामीटर को मेटाडेटा एक्सट्रैक्टर लाइब्रेरी की मदद से पढ़ा जा सकता है.