MediaPipe Tasks provides the core programming interface of the MediaPipe Solutions suite, including a set of libraries for deploying ML solutions onto devices with a minimum of code. It supports multiple platforms, including Android, Web / JavaScript, and Python; support for iOS is coming soon.
Easy to use, well-defined cross-platform APIs
Run ML inference with just 5 lines of code. Use the solution APIs in
MediaPipe Tasks as building blocks for your own ML features.
Customizable solutions
You can use everything MediaPipe Tasks provides and customize it with models
trained on your own data via Model Maker. For example, you can create a model
that recognizes custom gestures you define using the Model Maker
GestureRecognizer API, and then deploy that model to your target platforms
using the Tasks GestureRecognizer API.
High performance ML pipelines
Typical on-device ML solutions combine multiple ML and non-ML blocks, which
can slow performance. MediaPipe Tasks provides optimized ML pipelines with
end-to-end acceleration on CPU, GPU, and TPU to meet the needs of real-time
on-device use cases.
Supported platforms
This section provides an overview of MediaPipe Tasks for each supported platform. For specific implementations, see the platform-specific development guides for each task. For help getting your development environment set up to use MediaPipe Tasks on a platform, check out the platform setup guides.
Android
The MediaPipe Tasks Java API for Android is divided into packages that perform ML tasks in major domains, including vision, natural language, and audio. The following is a list of dependencies you can add to your Android app development project to enable these APIs:
dependencies {
    implementation 'com.google.mediapipe:tasks-vision:latest.release'
    implementation 'com.google.mediapipe:tasks-text:latest.release'
    implementation 'com.google.mediapipe:tasks-audio:latest.release'
}
For specific implementation details, see the platform-specific development guides for each solution in MediaPipe Tasks.
Python
The MediaPipe Tasks Python API has a few main modules for solutions that perform ML tasks in major domains, including vision, natural language, and audio. The following shows you the install command and a list of imports you can add to your Python development project to enable these APIs:
$ python -m pip install mediapipe
import mediapipe as mp
from mediapipe.tasks import python
from mediapipe.tasks.python import vision
from mediapipe.tasks.python import text
from mediapipe.tasks.python import audio
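After installing, it can be useful to confirm that the task modules listed above actually import in your environment. The following is a minimal sketch, not part of MediaPipe itself: the helper name `check_task_modules` and the `TASK_MODULES` tuple are hypothetical names introduced here for illustration.

```python
import importlib

# Module paths taken from the imports listed above.
TASK_MODULES = (
    "mediapipe.tasks.python.vision",
    "mediapipe.tasks.python.text",
    "mediapipe.tasks.python.audio",
)


def check_task_modules(names=TASK_MODULES):
    """Map each module name to True if it imports cleanly, else False.

    Hypothetical helper for illustration; it is not a MediaPipe API.
    """
    status = {}
    for name in names:
        try:
            importlib.import_module(name)
            status[name] = True
        except ImportError:
            # Covers a missing package as well as a missing submodule.
            status[name] = False
    return status


for module_name, ok in check_task_modules().items():
    print(f"{module_name}: {'ok' if ok else 'missing'}")
```

If any module is reported as missing, rerunning the install command above in the same interpreter environment is a reasonable first step.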
For specific implementation details, see the platform-specific development guides for each solution in MediaPipe Tasks.
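As an illustration of how the vision module might be used, here is a minimal sketch of gesture recognition on a single image. It assumes you have a gesture recognizer model bundle and an input image on disk; the function names `recognize_gestures` and `top_gestures`, and any file paths you pass in, are hypothetical and chosen for this example. The stand-in result at the bottom only mimics the shape of a recognition result so the helper can be exercised without a model.

```python
from types import SimpleNamespace


def recognize_gestures(model_path, image_path):
    """Run a gesture recognizer on one image and return the raw result."""
    # Imported lazily so this file still loads where MediaPipe is not installed.
    import mediapipe as mp
    from mediapipe.tasks import python
    from mediapipe.tasks.python import vision

    base_options = python.BaseOptions(model_asset_path=model_path)
    options = vision.GestureRecognizerOptions(base_options=base_options)
    # The context manager releases the recognizer's resources on exit.
    with vision.GestureRecognizer.create_from_options(options) as recognizer:
        image = mp.Image.create_from_file(image_path)
        return recognizer.recognize(image)


def top_gestures(result):
    """Return (category_name, score) of the best gesture for each detected hand."""
    best = []
    for hand in result.gestures:  # one list of candidate categories per hand
        if hand:
            top = max(hand, key=lambda category: category.score)
            best.append((top.category_name, top.score))
    return best


# Stand-in result (an assumption for demonstration, not real model output).
demo_result = SimpleNamespace(
    gestures=[[SimpleNamespace(category_name="Thumb_Up", score=0.9)]]
)
print(top_gestures(demo_result))  # prints [('Thumb_Up', 0.9)]
```

With a real model and image, the two functions would be combined as `top_gestures(recognize_gestures(model_path, image_path))`. A custom model produced with Model Maker would be passed in the same way through `model_path`.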
Web and JavaScript
The MediaPipe Tasks Web JavaScript API is divided into packages that perform ML tasks in major domains, including vision, natural language, and audio. The following is a list of script imports you can add to your Web and JavaScript development project to enable these APIs:
<head>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/tasks-vision/vision_bundle.js"
    crossorigin="anonymous"></script>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/tasks-text/text_bundle.js"
    crossorigin="anonymous"></script>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/tasks-audio/audio_bundle.js"
    crossorigin="anonymous"></script>
</head>
For specific implementation details, see the platform-specific development guides for each solution in MediaPipe Tasks.