Known Direct Subclasses |
Delegate options.
Nested Classes
enum | GpuDelegateFactory.Options.GpuBackend | Which GPU backend to select. |
Constants
int | INFERENCE_PREFERENCE_FAST_SINGLE_ANSWER | Delegate will be used only once, therefore, bootstrap/init time should be taken into account. |
int | INFERENCE_PREFERENCE_SUSTAINED_SPEED | Prefer maximizing the throughput. |
Public Constructors
Options()
|
Public Methods
boolean | |
GpuDelegateFactory.Options.GpuBackend | |
int | |
String | |
String | |
boolean | |
GpuDelegateFactory.Options | |
GpuDelegateFactory.Options |
setInferencePreference(int preference)
Sets the inference preference for precision/compilation/runtime tradeoffs.
|
GpuDelegateFactory.Options |
setPrecisionLossAllowed(boolean precisionLossAllowed)
Sets whether precision loss is allowed.
|
GpuDelegateFactory.Options |
setQuantizedModelsAllowed(boolean quantizedModelsAllowed)
Enables running quantized models with the delegate.
|
GpuDelegateFactory.Options |
setSerializationParams(String serializationDir, String modelToken)
Enables serialization on the delegate.
|
Inherited Methods
Constants
public static final int INFERENCE_PREFERENCE_FAST_SINGLE_ANSWER
Delegate will be used only once, therefore, bootstrap/init time should be taken into account.
public static final int INFERENCE_PREFERENCE_SUSTAINED_SPEED
Prefer maximizing the throughput. Same delegate will be used repeatedly on multiple inputs.
Public Constructors
public Options ()
Public Methods
public boolean areQuantizedModelsAllowed ()
public int getInferencePreference ()
public boolean isPrecisionLossAllowed ()
public GpuDelegateFactory.Options setForceBackend (GpuDelegateFactory.Options.GpuBackend forceBackend)
Sets the GPU Backend.
Parameters
forceBackend |
---|
public GpuDelegateFactory.Options setInferencePreference (int preference)
Sets the inference preference for precision/compilation/runtime tradeoffs.
Parameters
preference | One of `INFERENCE_PREFERENCE_FAST_SINGLE_ANSWER` (default), `INFERENCE_PREFERENCE_SUSTAINED_SPEED`. |
---|
public GpuDelegateFactory.Options setPrecisionLossAllowed (boolean precisionLossAllowed)
Sets whether precision loss is allowed.
Parameters
precisionLossAllowed | When `true` (default), the GPU may quantify tensors, downcast values, process in FP16. When `false`, computations are carried out in 32-bit floating point. |
---|
public GpuDelegateFactory.Options setQuantizedModelsAllowed (boolean quantizedModelsAllowed)
Enables running quantized models with the delegate.
WARNING: This is an experimental API and subject to change.
Parameters
quantizedModelsAllowed | When true (default), the GPU may run quantized models.
|
---|
public GpuDelegateFactory.Options setSerializationParams (String serializationDir, String modelToken)
Enables serialization on the delegate. Note non-null serializationDir
and modelToken
are required for serialization.
WARNING: This is an experimental API and subject to change.
Parameters
serializationDir | The directory to use for storing data. Caller is responsible to
ensure the model is not stored in a public directory. It's recommended to use Context.getCodeCacheDir() to provide a private location for the
application on Android. |
---|---|
modelToken | The token to be used to identify the model. Caller is responsible to ensure the token is unique to the model graph and data. |