mediapipe/docs/solutions/pose.md

---
layout: default
title: Pose
parent: Solutions
has_children: true
has_toc: false
nav_order: 5
---

# MediaPipe Pose
{: .no_toc }

<details close markdown="block">
  <summary>
    Table of contents
  </summary>
  {: .text-delta }
1. TOC
{:toc}
</details>
---

**Attention:** *Thank you for your interest in MediaPipe Solutions.
As of March 1, 2023, this solution is planned to be upgraded to a new MediaPipe
Solution. For more information, see the new
[MediaPipe Solutions](https://developers.google.com/mediapipe/solutions/guide#legacy)
site.*

*This notice and web page will be removed on April 3, 2023.*

----

## Overview

Human pose estimation from video plays a critical role in various applications
such as [quantifying physical exercises](./pose_classification.md), sign
language recognition, and full-body gesture control. For example, it can form
the basis for yoga, dance, and fitness applications. It can also enable the
overlay of digital content and information on top of the physical world in
augmented reality.

MediaPipe Pose is a ML solution for high-fidelity body pose tracking, inferring
33 3D landmarks and background segmentation mask on the whole body from RGB
video frames utilizing our
[BlazePose](https://ai.googleblog.com/2020/08/on-device-real-time-body-pose-tracking.html)
research that also powers the
[ML Kit Pose Detection API](https://developers.google.com/ml-kit/vision/pose-detection).
Current state-of-the-art approaches rely primarily on powerful desktop
environments for inference, whereas our method achieves real-time performance on
most modern [mobile phones](#mobile), [desktops/laptops](#desktop), in
[python](#python-solution-api) and even on the [web](#javascript-solution-api).

![pose_tracking_example.gif](https://mediapipe.dev/images/mobile/pose_tracking_example.gif) |
:----------------------------------------------------------------------: |
*Fig 1. Example of MediaPipe Pose for pose tracking.*                    |

## ML Pipeline

The solution utilizes a two-step detector-tracker ML pipeline, proven to be
effective in our [MediaPipe Hands](./hands.md) and
[MediaPipe Face Mesh](./face_mesh.md) solutions. Using a detector, the pipeline
first locates the person/pose region-of-interest (ROI) within the frame. The
tracker subsequently predicts the pose landmarks and segmentation mask within
the ROI using the ROI-cropped frame as input. Note that for video use cases the
detector is invoked only as needed, i.e., for the very first frame and when the
tracker could no longer identify body pose presence in the previous frame. For
other frames the pipeline simply derives the ROI from the previous frame’s pose
landmarks.

The pipeline is implemented as a MediaPipe
[graph](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt)
that uses a
[pose landmark subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_landmark/pose_landmark_gpu.pbtxt)
from the
[pose landmark module](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_landmark)
and renders using a dedicated
[pose renderer subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/subgraphs/pose_renderer_gpu.pbtxt).
The
[pose landmark subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_landmark/pose_landmark_gpu.pbtxt)
internally uses a
[pose detection subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_detection/pose_detection_gpu.pbtxt)
from the
[pose detection module](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_detection).

Note: To visualize a graph, copy the graph and paste it into
[MediaPipe Visualizer](https://viz.mediapipe.dev/). For more information on how
to visualize its associated subgraphs, please see
[visualizer documentation](../tools/visualizer.md).

## Pose Estimation Quality

To evaluate the quality of our [models](./models.md#pose) against other
well-performing publicly available solutions, we use three different validation
datasets, representing different verticals: Yoga, Dance and HIIT. Each image
contains only a single person located 2-4 meters from the camera. To be
consistent with other solutions, we perform evaluation only for 17 keypoints
from [COCO topology](https://cocodataset.org/#keypoints-2020).

Method                                                                                                | Yoga <br/> [`mAP`] | Yoga <br/> [`PCK@0.2`] | Dance <br/> [`mAP`] | Dance <br/> [`PCK@0.2`] | HIIT <br/> [`mAP`] | HIIT <br/> [`PCK@0.2`]
----------------------------------------------------------------------------------------------------- | -----------------: | ---------------------: | ------------------: | ----------------------: | -----------------: | ---------------------:
BlazePose GHUM Heavy                                                                                  | 68.1               | **96.4**               | 73.0                | **97.2**                | 74.0               | **97.5**
BlazePose GHUM Full                                                                                   | 62.6               | **95.5**               | 67.4                | **96.3**                | 68.0               | **95.7**
BlazePose GHUM Lite                                                                                   | 45.0               | **90.2**               | 53.6                | **92.5**                | 53.8               | **93.5**
[AlphaPose ResNet50](https://github.com/MVIG-SJTU/AlphaPose)                                          | 63.4               | **96.0**               | 57.8                | **95.5**                | 63.4               | **96.0**
[Apple Vision](https://developer.apple.com/documentation/vision/detecting_human_body_poses_in_images) | 32.8               | **82.7**               | 36.4                | **91.4**                | 44.5               | **88.6**

![pose_tracking_pck_chart.png](https://mediapipe.dev/images/mobile/pose_tracking_pck_chart.png) |
:--------------------------------------------------------------------------: |
*Fig 2. Quality evaluation in [`PCK@0.2`].*                                  |

We designed our models specifically for live perception use cases, so all of
them work in real-time on the majority of modern devices.

Method               | Latency <br/> Pixel 3 [TFLite GPU](https://www.tensorflow.org/lite/performance/gpu_advanced) | Latency <br/> MacBook Pro (15-inch 2017)
-------------------- | -------------------------------------------------------------------------------------------: | ---------------------------------------:
BlazePose GHUM Heavy | 53 ms                                                                                        | 38 ms
BlazePose GHUM Full  | 25 ms                                                                                        | 27 ms
BlazePose GHUM Lite  | 20 ms                                                                                        | 25 ms

## Models

### Person/pose Detection Model (BlazePose Detector)

The detector is inspired by our own lightweight
[BlazeFace](https://arxiv.org/abs/1907.05047) model, used in
[MediaPipe Face Detection](./face_detection.md), as a proxy for a person
detector. It explicitly predicts two additional virtual keypoints that firmly
describe the human body center, rotation and scale as a circle. Inspired by
[Leonardo’s Vitruvian man](https://en.wikipedia.org/wiki/Vitruvian_Man), we
predict the midpoint of a person's hips, the radius of a circle circumscribing
the whole person, and the incline angle of the line connecting the shoulder and
hip midpoints.

![pose_tracking_detector_vitruvian_man.png](https://mediapipe.dev/images/mobile/pose_tracking_detector_vitruvian_man.png) |
:----------------------------------------------------------------------------------------------------: |
*Fig 3. Vitruvian man aligned via two virtual keypoints predicted by BlazePose detector in addition to the face bounding box.* |

### Pose Landmark Model (BlazePose [GHUM](https://github.com/google-research/google-research/tree/master/ghum) 3D)

The landmark model in MediaPipe Pose predicts the location of 33 pose landmarks
(see figure below).

![pose_tracking_full_body_landmarks.png](https://mediapipe.dev/images/mobile/pose_tracking_full_body_landmarks.png) |
:----------------------------------------------------------------------------------------------: |
*Fig 4. 33 pose landmarks.*                                                                      |

Optionally, MediaPipe Pose can predicts a full-body
[segmentation mask](#segmentation_mask) represented as a two-class segmentation
(human or background).

Please find more detail in the
[BlazePose Google AI Blog](https://ai.googleblog.com/2020/08/on-device-real-time-body-pose-tracking.html),
this [paper](https://arxiv.org/abs/2006.10204),
[the model card](./models.md#pose) and the [Output](#output) section below.

## Solution APIs

### Cross-platform Configuration Options

Naming style and availability may differ slightly across platforms/languages.

#### static_image_mode

If set to `false`, the solution treats the input images as a video stream. It
will try to detect the most prominent person in the very first images, and upon
a successful detection further localizes the pose landmarks. In subsequent
images, it then simply tracks those landmarks without invoking another detection
until it loses track, on reducing computation and latency. If set to `true`,
person detection runs every input image, ideal for processing a batch of static,
possibly unrelated, images. Default to `false`.

#### model_complexity

Complexity of the pose landmark model: `0`, `1` or `2`. Landmark accuracy as
well as inference latency generally go up with the model complexity. Default to
`1`.

#### smooth_landmarks

If set to `true`, the solution filters pose landmarks across different input
images to reduce jitter, but ignored if [static_image_mode](#static_image_mode)
is also set to `true`. Default to `true`.

#### enable_segmentation

If set to `true`, in addition to the pose landmarks the solution also generates
the segmentation mask. Default to `false`.

#### smooth_segmentation

If set to `true`, the solution filters segmentation masks across different input
images to reduce jitter. Ignored if [enable_segmentation](#enable_segmentation)
is `false` or [static_image_mode](#static_image_mode) is `true`. Default to
`true`.

#### min_detection_confidence

Minimum confidence value (`[0.0, 1.0]`) from the person-detection model for the
detection to be considered successful. Default to `0.5`.

#### min_tracking_confidence

Minimum confidence value (`[0.0, 1.0]`) from the landmark-tracking model for the
pose landmarks to be considered tracked successfully, or otherwise person
detection will be invoked automatically on the next input image. Setting it to a
higher value can increase robustness of the solution, at the expense of a higher
latency. Ignored if [static_image_mode](#static_image_mode) is `true`, where
person detection simply runs on every image. Default to `0.5`.

### Output

Naming style may differ slightly across platforms/languages.

#### pose_landmarks

A list of pose landmarks. Each landmark consists of the following:

*   `x` and `y`: Landmark coordinates normalized to `[0.0, 1.0]` by the image
    width and height respectively.
*   `z`: Represents the landmark depth with the depth at the midpoint of hips
    being the origin, and the smaller the value the closer the landmark is to
    the camera. The magnitude of `z` uses roughly the same scale as `x`.
*   `visibility`: A value in `[0.0, 1.0]` indicating the likelihood of the
    landmark being visible (present and not occluded) in the image.

#### pose_world_landmarks

*Fig 5. Example of MediaPipe Pose real-world 3D coordinates.* |
:-----------------------------------------------------------: |
<video autoplay muted loop preload style="height: auto; width: 480px"><source src="https://mediapipe.dev/images/mobile/pose_world_landmarks.mp4" type="video/mp4"></video> |

Another list of pose landmarks in world coordinates. Each landmark consists of
the following:

*   `x`, `y` and `z`: Real-world 3D coordinates in meters with the origin at the
    center between hips.
*   `visibility`: Identical to that defined in the corresponding
    [pose_landmarks](#pose_landmarks).

#### segmentation_mask

The output segmentation mask, predicted only when
[enable_segmentation](#enable_segmentation) is set to `true`. The mask has the
same width and height as the input image, and contains values in `[0.0, 1.0]`
where `1.0` and `0.0` indicate high certainty of a "human" and "background"
pixel respectively. Please refer to the platform-specific usage examples below
for usage details.

*Fig 6. Example of MediaPipe Pose segmentation mask.* |
:---------------------------------------------------: |
<video autoplay muted loop preload style="height: auto; width: 480px"><source src="https://mediapipe.dev/images/mobile/pose_segmentation.mp4" type="video/mp4"></video> |

### Python Solution API

Please first follow general [instructions](../getting_started/python.md) to
install MediaPipe Python package, then learn more in the companion
[Python Colab](#resources) and the usage example below.

Supported configuration options:

*   [static_image_mode](#static_image_mode)
*   [model_complexity](#model_complexity)
*   [smooth_landmarks](#smooth_landmarks)
*   [enable_segmentation](#enable_segmentation)
*   [smooth_segmentation](#smooth_segmentation)
*   [min_detection_confidence](#min_detection_confidence)
*   [min_tracking_confidence](#min_tracking_confidence)

```python
import cv2
import mediapipe as mp
mp_drawing = mp.solutions.drawing_utils
mp_drawing_styles = mp.solutions.drawing_styles
mp_pose = mp.solutions.pose

# For static images:
IMAGE_FILES = []
BG_COLOR = (192, 192, 192) # gray
with mp_pose.Pose(
    static_image_mode=True,
    model_complexity=2,
    enable_segmentation=True,
    min_detection_confidence=0.5) as pose:
  for idx, file in enumerate(IMAGE_FILES):
    image = cv2.imread(file)
    image_height, image_width, _ = image.shape
    # Convert the BGR image to RGB before processing.
    results = pose.process(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))

    if not results.pose_landmarks:
      continue
    print(
        f'Nose coordinates: ('
        f'{results.pose_landmarks.landmark[mp_pose.PoseLandmark.NOSE].x * image_width}, '
        f'{results.pose_landmarks.landmark[mp_pose.PoseLandmark.NOSE].y * image_height})'
    )

    annotated_image = image.copy()
    # Draw segmentation on the image.
    # To improve segmentation around boundaries, consider applying a joint
    # bilateral filter to "results.segmentation_mask" with "image".
    condition = np.stack((results.segmentation_mask,) * 3, axis=-1) > 0.1
    bg_image = np.zeros(image.shape, dtype=np.uint8)
    bg_image[:] = BG_COLOR
    annotated_image = np.where(condition, annotated_image, bg_image)
    # Draw pose landmarks on the image.
    mp_drawing.draw_landmarks(
        annotated_image,
        results.pose_landmarks,
        mp_pose.POSE_CONNECTIONS,
        landmark_drawing_spec=mp_drawing_styles.get_default_pose_landmarks_style())
    cv2.imwrite('/tmp/annotated_image' + str(idx) + '.png', annotated_image)
    # Plot pose world landmarks.
    mp_drawing.plot_landmarks(
        results.pose_world_landmarks, mp_pose.POSE_CONNECTIONS)

# For webcam input:
cap = cv2.VideoCapture(0)
with mp_pose.Pose(
    min_detection_confidence=0.5,
    min_tracking_confidence=0.5) as pose:
  while cap.isOpened():
    success, image = cap.read()
    if not success:
      print("Ignoring empty camera frame.")
      # If loading a video, use 'break' instead of 'continue'.
      continue

    # To improve performance, optionally mark the image as not writeable to
    # pass by reference.
    image.flags.writeable = False
    image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    results = pose.process(image)

    # Draw the pose annotation on the image.
    image.flags.writeable = True
    image = cv2.cvtColor(image, cv2.COLOR_RGB2BGR)
    mp_drawing.draw_landmarks(
        image,
        results.pose_landmarks,
        mp_pose.POSE_CONNECTIONS,
        landmark_drawing_spec=mp_drawing_styles.get_default_pose_landmarks_style())
    # Flip the image horizontally for a selfie-view display.
    cv2.imshow('MediaPipe Pose', cv2.flip(image, 1))
    if cv2.waitKey(5) & 0xFF == 27:
      break
cap.release()
```

### JavaScript Solution API

Please first see general [introduction](../getting_started/javascript.md) on
MediaPipe in JavaScript, then learn more in the companion [web demo](#resources)
and the following usage example.

Supported configuration options:

*   [modelComplexity](#model_complexity)
*   [smoothLandmarks](#smooth_landmarks)
*   [enableSegmentation](#enable_segmentation)
*   [smoothSegmentation](#smooth_segmentation)
*   [minDetectionConfidence](#min_detection_confidence)
*   [minTrackingConfidence](#min_tracking_confidence)

```html
<!DOCTYPE html>
<html>
<head>
  <meta charset="utf-8">
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/camera_utils/camera_utils.js" crossorigin="anonymous"></script>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/control_utils/control_utils.js" crossorigin="anonymous"></script>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/control_utils_3d/control_utils_3d.js" crossorigin="anonymous"></script>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/drawing_utils/drawing_utils.js" crossorigin="anonymous"></script>
  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/pose/pose.js" crossorigin="anonymous"></script>
</head>

<body>
  <div class="container">
    <video class="input_video"></video>
    <canvas class="output_canvas" width="1280px" height="720px"></canvas>
    <div class="landmark-grid-container"></div>
  </div>
</body>
</html>
```

```javascript
<script type="module">
const videoElement = document.getElementsByClassName('input_video')[0];
const canvasElement = document.getElementsByClassName('output_canvas')[0];
const canvasCtx = canvasElement.getContext('2d');
const landmarkContainer = document.getElementsByClassName('landmark-grid-container')[0];
const grid = new LandmarkGrid(landmarkContainer);

function onResults(results) {
  if (!results.poseLandmarks) {
    grid.updateLandmarks([]);
    return;
  }

  canvasCtx.save();
  canvasCtx.clearRect(0, 0, canvasElement.width, canvasElement.height);
  canvasCtx.drawImage(results.segmentationMask, 0, 0,
                      canvasElement.width, canvasElement.height);

  // Only overwrite existing pixels.
  canvasCtx.globalCompositeOperation = 'source-in';
  canvasCtx.fillStyle = '#00FF00';
  canvasCtx.fillRect(0, 0, canvasElement.width, canvasElement.height);

  // Only overwrite missing pixels.
  canvasCtx.globalCompositeOperation = 'destination-atop';
  canvasCtx.drawImage(
      results.image, 0, 0, canvasElement.width, canvasElement.height);

  canvasCtx.globalCompositeOperation = 'source-over';
  drawConnectors(canvasCtx, results.poseLandmarks, POSE_CONNECTIONS,
                 {color: '#00FF00', lineWidth: 4});
  drawLandmarks(canvasCtx, results.poseLandmarks,
                {color: '#FF0000', lineWidth: 2});
  canvasCtx.restore();

  grid.updateLandmarks(results.poseWorldLandmarks);
}

const pose = new Pose({locateFile: (file) => {
  return `https://cdn.jsdelivr.net/npm/@mediapipe/pose/${file}`;
}});
pose.setOptions({
  modelComplexity: 1,
  smoothLandmarks: true,
  enableSegmentation: true,
  smoothSegmentation: true,
  minDetectionConfidence: 0.5,
  minTrackingConfidence: 0.5
});
pose.onResults(onResults);

const camera = new Camera(videoElement, {
  onFrame: async () => {
    await pose.send({image: videoElement});
  },
  width: 1280,
  height: 720
});
camera.start();
</script>
```

## Example Apps

Please first see general instructions for
[Android](../getting_started/android.md), [iOS](../getting_started/ios.md), and
[desktop](../getting_started/cpp.md) on how to build MediaPipe examples.

Note: To visualize a graph, copy the graph and paste it into
[MediaPipe Visualizer](https://viz.mediapipe.dev/). For more information on how
to visualize its associated subgraphs, please see
[visualizer documentation](../tools/visualizer.md).

### Mobile

#### Main Example

*   Graph:
    [`mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt`](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt)
*   Android target:
    [(or download prebuilt ARM64 APK)](https://drive.google.com/file/d/17GFIrqEJS6W8UHKXlYevTtSCLxN9pWlY/view?usp=sharing)
    [`mediapipe/examples/android/src/java/com/google/mediapipe/apps/posetrackinggpu:posetrackinggpu`](https://github.com/google/mediapipe/tree/master/mediapipe/examples/android/src/java/com/google/mediapipe/apps/posetrackinggpu/BUILD)
*   iOS target:
    [`mediapipe/examples/ios/posetrackinggpu:PoseTrackingGpuApp`](http:/mediapipe/examples/ios/posetrackinggpu/BUILD)

### Desktop

Please first see general instructions for [desktop](../getting_started/cpp.md)
on how to build MediaPipe examples.

#### Main Example

*   Running on CPU
    *   Graph:
        [`mediapipe/graphs/pose_tracking/pose_tracking_cpu.pbtxt`](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_cpu.pbtxt)
    *   Target:
        [`mediapipe/examples/desktop/pose_tracking:pose_tracking_cpu`](https://github.com/google/mediapipe/tree/master/mediapipe/examples/desktop/pose_tracking/BUILD)
*   Running on GPU
    *   Graph:
        [`mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt`](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt)
    *   Target:
        [`mediapipe/examples/desktop/pose_tracking:pose_tracking_gpu`](https://github.com/google/mediapipe/tree/master/mediapipe/examples/desktop/pose_tracking/BUILD)

## Resources

*   Google AI Blog:
    [BlazePose - On-device Real-time Body Pose Tracking](https://ai.googleblog.com/2020/08/on-device-real-time-body-pose-tracking.html)
*   Paper:
    [BlazePose: On-device Real-time Body Pose Tracking](https://arxiv.org/abs/2006.10204)
    ([presentation](https://youtu.be/YPpUOTRn5tA))
*   [Models and model cards](./models.md#pose)
*   [GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models](https://github.com/google-research/google-research/tree/master/ghum)
*   [Web demo](https://code.mediapipe.dev/codepen/pose)
*   [Python Colab](https://mediapipe.page.link/pose_py_colab)

[`mAP`]: https://cocodataset.org/#keypoints-eval
[`PCK@0.2`]: https://github.com/cbsudux/Human-Pose-Estimation-101
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								---
 								layout: default
 								title: Pose
 								parent: Solutions
-												Project import generated by Copybara.

GitOrigin-RevId: 5b4c149782c086ebf9ef390195fb260ad0103217

											
										
										
											2021-02-27 22:09:58 +01:00
+								has_children: true
 								has_toc: false
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								nav_order: 5
 								---
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
+								# MediaPipe Pose
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								{: .no_toc }
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								<details close markdown="block">
 								  <summary>
 								    Table of contents
 								  </summary>
 								  {: .text-delta }
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+. TOC
 								{:toc}
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								</details>
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								---
-												Internal change

PiperOrigin-RevId: 513255798

											
										
										
											2023-03-01 18:19:12 +01:00
+								**Attention:** *Thank you for your interest in MediaPipe Solutions.
 								As of March 1, 2023, this solution is planned to be upgraded to a new MediaPipe
 								Solution. For more information, see the new
 								[MediaPipe Solutions](https://developers.google.com/mediapipe/solutions/guide#legacy)
 								site.*
 								*This notice and web page will be removed on April 3, 2023.*
 								----
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								## Overview
 								Human pose estimation from video plays a critical role in various applications
-												Project import generated by Copybara.

GitOrigin-RevId: 5b4c149782c086ebf9ef390195fb260ad0103217

											
										
										
											2021-02-27 22:09:58 +01:00
+								such as [quantifying physical exercises](./pose_classification.md), sign
 								language recognition, and full-body gesture control. For example, it can form
 								the basis for yoga, dance, and fitness applications. It can also enable the
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								overlay of digital content and information on top of the physical world in
 								augmented reality.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								MediaPipe Pose is a ML solution for high-fidelity body pose tracking, inferring
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+3D landmarks and background segmentation mask on the whole body from RGB
 								video frames utilizing our
-												Project import generated by Copybara.

GitOrigin-RevId: aaca5c37abcf8b7a6c3c28804739afdbad46e704

											
										
										
											2020-08-13 21:02:55 +02:00
+								[BlazePose](https://ai.googleblog.com/2020/08/on-device-real-time-body-pose-tracking.html)
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								research that also powers the
 								[ML Kit Pose Detection API](https://developers.google.com/ml-kit/vision/pose-detection).
 								Current state-of-the-art approaches rely primarily on powerful desktop
-												Project import generated by Copybara.

GitOrigin-RevId: aaca5c37abcf8b7a6c3c28804739afdbad46e704

											
										
										
											2020-08-13 21:02:55 +02:00
+								environments for inference, whereas our method achieves real-time performance on
 								most modern [mobile phones](#mobile), [desktops/laptops](#desktop), in
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								[python](#python-solution-api) and even on the [web](#javascript-solution-api).
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 1e13be30e2c6838d4a2ff768a39c414bc80534bb

											
										
										
											2022-09-06 23:29:51 +02:00
+								![pose_tracking_example.gif](https://mediapipe.dev/images/mobile/pose_tracking_example.gif) |
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								:----------------------------------------------------------------------: |
 								*Fig 1. Example of MediaPipe Pose for pose tracking.*                    |
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
 								## ML Pipeline
 								The solution utilizes a two-step detector-tracker ML pipeline, proven to be
 								effective in our [MediaPipe Hands](./hands.md) and
 								[MediaPipe Face Mesh](./face_mesh.md) solutions. Using a detector, the pipeline
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								first locates the person/pose region-of-interest (ROI) within the frame. The
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								tracker subsequently predicts the pose landmarks and segmentation mask within
 								the ROI using the ROI-cropped frame as input. Note that for video use cases the
 								detector is invoked only as needed, i.e., for the very first frame and when the
 								tracker could no longer identify body pose presence in the previous frame. For
 								other frames the pipeline simply derives the ROI from the previous frame’s pose
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								landmarks.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
 								The pipeline is implemented as a MediaPipe
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								[graph](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								that uses a
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								[pose landmark subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_landmark/pose_landmark_gpu.pbtxt)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								from the
 								[pose landmark module](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_landmark)
 								and renders using a dedicated
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								[pose renderer subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/subgraphs/pose_renderer_gpu.pbtxt).
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								The
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								[pose landmark subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_landmark/pose_landmark_gpu.pbtxt)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								internally uses a
 								[pose detection subgraph](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_detection/pose_detection_gpu.pbtxt)
 								from the
 								[pose detection module](https://github.com/google/mediapipe/tree/master/mediapipe/modules/pose_detection).
 								Note: To visualize a graph, copy the graph and paste it into
 								[MediaPipe Visualizer](https://viz.mediapipe.dev/). For more information on how
 								to visualize its associated subgraphs, please see
 								[visualizer documentation](../tools/visualizer.md).
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								## Pose Estimation Quality
 								To evaluate the quality of our [models](./models.md#pose) against other
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
+								well-performing publicly available solutions, we use three different validation
 								datasets, representing different verticals: Yoga, Dance and HIIT. Each image
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								contains only a single person located 2-4 meters from the camera. To be
 								consistent with other solutions, we perform evaluation only for 17 keypoints
 								from [COCO topology](https://cocodataset.org/#keypoints-2020).
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
+								Method                                                                                                | Yoga <br/> [`mAP`] | Yoga <br/> [`PCK@0.2`] | Dance <br/> [`mAP`] | Dance <br/> [`PCK@0.2`] | HIIT <br/> [`mAP`] | HIIT <br/> [`PCK@0.2`]
 								----------------------------------------------------------------------------------------------------- | -----------------: | ---------------------: | ------------------: | ----------------------: | -----------------: | ---------------------:
-												Project import generated by Copybara.

GitOrigin-RevId: 283c1a295de0a53e47d7a94996bda0c52dcfd677

											
										
										
											2021-09-14 01:56:21 +02:00
+								BlazePose GHUM Heavy                                                                                  | 68.1               | **96.4**               | 73.0                | **97.2**                | 74.0               | **97.5**
 								BlazePose GHUM Full                                                                                   | 62.6               | **95.5**               | 67.4                | **96.3**                | 68.0               | **95.7**
 								BlazePose GHUM Lite                                                                                   | 45.0               | **90.2**               | 53.6                | **92.5**                | 53.8               | **93.5**
 								[AlphaPose ResNet50](https://github.com/MVIG-SJTU/AlphaPose)                                          | 63.4               | **96.0**               | 57.8                | **95.5**                | 63.4               | **96.0**
 								[Apple Vision](https://developer.apple.com/documentation/vision/detecting_human_body_poses_in_images) | 32.8               | **82.7**               | 36.4                | **91.4**                | 44.5               | **88.6**
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 1e13be30e2c6838d4a2ff768a39c414bc80534bb

											
										
										
											2022-09-06 23:29:51 +02:00
+								![pose_tracking_pck_chart.png](https://mediapipe.dev/images/mobile/pose_tracking_pck_chart.png) |
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
+								:--------------------------------------------------------------------------: |
 								*Fig 2. Quality evaluation in [`PCK@0.2`].*                                  |
 								We designed our models specifically for live perception use cases, so all of
 								them work in real-time on the majority of modern devices.
-												Project import generated by Copybara.

GitOrigin-RevId: 283c1a295de0a53e47d7a94996bda0c52dcfd677

											
										
										
											2021-09-14 01:56:21 +02:00
+								Method               | Latency <br/> Pixel 3 [TFLite GPU](https://www.tensorflow.org/lite/performance/gpu_advanced) | Latency <br/> MacBook Pro (15-inch 2017)
 								-------------------- | -------------------------------------------------------------------------------------------: | ---------------------------------------:
 								BlazePose GHUM Heavy | 53 ms                                                                                        | 38 ms
 								BlazePose GHUM Full  | 25 ms                                                                                        | 27 ms
 								BlazePose GHUM Lite  | 20 ms                                                                                        | 25 ms
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								## Models
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								### Person/pose Detection Model (BlazePose Detector)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
 								The detector is inspired by our own lightweight
 								[BlazeFace](https://arxiv.org/abs/1907.05047) model, used in
 								[MediaPipe Face Detection](./face_detection.md), as a proxy for a person
 								detector. It explicitly predicts two additional virtual keypoints that firmly
 								describe the human body center, rotation and scale as a circle. Inspired by
 								[Leonardo’s Vitruvian man](https://en.wikipedia.org/wiki/Vitruvian_Man), we
 								predict the midpoint of a person's hips, the radius of a circle circumscribing
 								the whole person, and the incline angle of the line connecting the shoulder and
 								hip midpoints.
-												Project import generated by Copybara.

GitOrigin-RevId: 1e13be30e2c6838d4a2ff768a39c414bc80534bb

											
										
										
											2022-09-06 23:29:51 +02:00
+								![pose_tracking_detector_vitruvian_man.png](https://mediapipe.dev/images/mobile/pose_tracking_detector_vitruvian_man.png) |
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								:----------------------------------------------------------------------------------------------------: |
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
+								*Fig 3. Vitruvian man aligned via two virtual keypoints predicted by BlazePose detector in addition to the face bounding box.* |
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: bbbbcb4f5174dea33525729ede47c770069157cd

											
										
										
											2021-10-18 21:39:29 +02:00
+								### Pose Landmark Model (BlazePose [GHUM](https://github.com/google-research/google-research/tree/master/ghum) 3D)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								The landmark model in MediaPipe Pose predicts the location of 33 pose landmarks
 								(see figure below).
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 1e13be30e2c6838d4a2ff768a39c414bc80534bb

											
										
										
											2022-09-06 23:29:51 +02:00
+								![pose_tracking_full_body_landmarks.png](https://mediapipe.dev/images/mobile/pose_tracking_full_body_landmarks.png) |
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								:----------------------------------------------------------------------------------------------: |
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
+								*Fig 4. 33 pose landmarks.*                                                                      |
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								Optionally, MediaPipe Pose can predicts a full-body
 								[segmentation mask](#segmentation_mask) represented as a two-class segmentation
 								(human or background).
 								Please find more detail in the
 								[BlazePose Google AI Blog](https://ai.googleblog.com/2020/08/on-device-real-time-body-pose-tracking.html),
 								this [paper](https://arxiv.org/abs/2006.10204),
-												Project import generated by Copybara.

GitOrigin-RevId: f4b1fe3f15810450fb6539e733f6a260d3ee082c

											
										
										
											2021-09-01 22:49:12 +02:00
+								[the model card](./models.md#pose) and the [Output](#output) section below.
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								## Solution APIs
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								### Cross-platform Configuration Options
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								Naming style and availability may differ slightly across platforms/languages.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								#### static_image_mode
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								If set to `false`, the solution treats the input images as a video stream. It
 								will try to detect the most prominent person in the very first images, and upon
 								a successful detection further localizes the pose landmarks. In subsequent
 								images, it then simply tracks those landmarks without invoking another detection
 								until it loses track, on reducing computation and latency. If set to `true`,
 								person detection runs every input image, ideal for processing a batch of static,
 								possibly unrelated, images. Default to `false`.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								#### model_complexity
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								Complexity of the pose landmark model: `0`, `1` or `2`. Landmark accuracy as
 								well as inference latency generally go up with the model complexity. Default to
 								`1`.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								#### smooth_landmarks
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								If set to `true`, the solution filters pose landmarks across different input
 								images to reduce jitter, but ignored if [static_image_mode](#static_image_mode)
 								is also set to `true`. Default to `true`.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								#### enable_segmentation
 								If set to `true`, in addition to the pose landmarks the solution also generates
 								the segmentation mask. Default to `false`.
 								#### smooth_segmentation
 								If set to `true`, the solution filters segmentation masks across different input
 								images to reduce jitter. Ignored if [enable_segmentation](#enable_segmentation)
 								is `false` or [static_image_mode](#static_image_mode) is `true`. Default to
 								`true`.
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								#### min_detection_confidence
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								Minimum confidence value (`[0.0, 1.0]`) from the person-detection model for the
 								detection to be considered successful. Default to `0.5`.
-												Project import generated by Copybara.

GitOrigin-RevId: 612e50bb8db2ec3dc1c30049372d87a80c3848db

											
										
										
											2020-08-30 05:41:10 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								#### min_tracking_confidence
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								Minimum confidence value (`[0.0, 1.0]`) from the landmark-tracking model for the
 								pose landmarks to be considered tracked successfully, or otherwise person
 								detection will be invoked automatically on the next input image. Setting it to a
 								higher value can increase robustness of the solution, at the expense of a higher
 								latency. Ignored if [static_image_mode](#static_image_mode) is `true`, where
 								person detection simply runs on every image. Default to `0.5`.
 								### Output
 								Naming style may differ slightly across platforms/languages.
 								#### pose_landmarks
-												Project import generated by Copybara.

GitOrigin-RevId: 08c2016a4df5aef571b464a4d4491f38c6b2af10

											
										
										
											2021-06-03 22:13:30 +02:00
+								A list of pose landmarks. Each landmark consists of the following:
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
 								*   `x` and `y`: Landmark coordinates normalized to `[0.0, 1.0]` by the image
 								    width and height respectively.
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								*   `z`: Represents the landmark depth with the depth at the midpoint of hips
 								    being the origin, and the smaller the value the closer the landmark is to
 								    the camera. The magnitude of `z` uses roughly the same scale as `x`.
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								*   `visibility`: A value in `[0.0, 1.0]` indicating the likelihood of the
 								    landmark being visible (present and not occluded) in the image.
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: 33adfdf31f3a5cbf9edc07ee1ea583e95080bdc5

											
										
										
											2021-06-24 23:10:25 +02:00
+								#### pose_world_landmarks
 								*Fig 5. Example of MediaPipe Pose real-world 3D coordinates.* |
 								:-----------------------------------------------------------: |
-												Internal change

PiperOrigin-RevId: 477538515

											
										
										
											2022-09-28 22:35:30 +02:00
+								<video autoplay muted loop preload style="height: auto; width: 480px"><source src="https://mediapipe.dev/images/mobile/pose_world_landmarks.mp4" type="video/mp4"></video> |
-												Project import generated by Copybara.

GitOrigin-RevId: 33adfdf31f3a5cbf9edc07ee1ea583e95080bdc5

											
										
										
											2021-06-24 23:10:25 +02:00
 								Another list of pose landmarks in world coordinates. Each landmark consists of
 								the following:
 								*   `x`, `y` and `z`: Real-world 3D coordinates in meters with the origin at the
 								    center between hips.
 								*   `visibility`: Identical to that defined in the corresponding
 								    [pose_landmarks](#pose_landmarks).
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								#### segmentation_mask
 								The output segmentation mask, predicted only when
 								[enable_segmentation](#enable_segmentation) is set to `true`. The mask has the
 								same width and height as the input image, and contains values in `[0.0, 1.0]`
 								where `1.0` and `0.0` indicate high certainty of a "human" and "background"
 								pixel respectively. Please refer to the platform-specific usage examples below
 								for usage details.
 								*Fig 6. Example of MediaPipe Pose segmentation mask.* |
-												Project import generated by Copybara.

GitOrigin-RevId: 283c1a295de0a53e47d7a94996bda0c52dcfd677

											
										
										
											2021-09-14 01:56:21 +02:00
+								:---------------------------------------------------: |
-												Internal change

PiperOrigin-RevId: 477538515

											
										
										
											2022-09-28 22:35:30 +02:00
+								<video autoplay muted loop preload style="height: auto; width: 480px"><source src="https://mediapipe.dev/images/mobile/pose_segmentation.mp4" type="video/mp4"></video> |
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								### Python Solution API
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								Please first follow general [instructions](../getting_started/python.md) to
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								install MediaPipe Python package, then learn more in the companion
-												Project import generated by Copybara.

GitOrigin-RevId: 08c2016a4df5aef571b464a4d4491f38c6b2af10

											
										
										
											2021-06-03 22:13:30 +02:00
+								[Python Colab](#resources) and the usage example below.
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
 								Supported configuration options:
 								*   [static_image_mode](#static_image_mode)
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								*   [model_complexity](#model_complexity)
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								*   [smooth_landmarks](#smooth_landmarks)
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								*   [enable_segmentation](#enable_segmentation)
 								*   [smooth_segmentation](#smooth_segmentation)
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								*   [min_detection_confidence](#min_detection_confidence)
 								*   [min_tracking_confidence](#min_tracking_confidence)
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
 								```python
 								import cv2
 								import mediapipe as mp
 								mp_drawing = mp.solutions.drawing_utils
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								mp_drawing_styles = mp.solutions.drawing_styles
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
+								mp_pose = mp.solutions.pose
 								# For static images:
-												Project import generated by Copybara.

GitOrigin-RevId: 08c2016a4df5aef571b464a4d4491f38c6b2af10

											
										
										
											2021-06-03 22:13:30 +02:00
+								IMAGE_FILES = []
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								BG_COLOR = (192, 192, 192) # gray
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								with mp_pose.Pose(
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								    static_image_mode=True,
 								    model_complexity=2,
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								    enable_segmentation=True,
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								    min_detection_confidence=0.5) as pose:
-												Project import generated by Copybara.

GitOrigin-RevId: 08c2016a4df5aef571b464a4d4491f38c6b2af10

											
										
										
											2021-06-03 22:13:30 +02:00
+								  for idx, file in enumerate(IMAGE_FILES):
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    image = cv2.imread(file)
 								    image_height, image_width, _ = image.shape
 								    # Convert the BGR image to RGB before processing.
 								    results = pose.process(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))
 								    if not results.pose_landmarks:
 								      continue
 								    print(
 								        f'Nose coordinates: ('
-												Project import generated by Copybara.

GitOrigin-RevId: f4b1fe3f15810450fb6539e733f6a260d3ee082c

											
										
										
											2021-09-01 22:49:12 +02:00
+								        f'{results.pose_landmarks.landmark[mp_pose.PoseLandmark.NOSE].x * image_width}, '
 								        f'{results.pose_landmarks.landmark[mp_pose.PoseLandmark.NOSE].y * image_height})'
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    )
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    annotated_image = image.copy()
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								    # Draw segmentation on the image.
 								    # To improve segmentation around boundaries, consider applying a joint
 								    # bilateral filter to "results.segmentation_mask" with "image".
 								    condition = np.stack((results.segmentation_mask,) * 3, axis=-1) > 0.1
 								    bg_image = np.zeros(image.shape, dtype=np.uint8)
 								    bg_image[:] = BG_COLOR
 								    annotated_image = np.where(condition, annotated_image, bg_image)
 								    # Draw pose landmarks on the image.
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    mp_drawing.draw_landmarks(
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								        annotated_image,
 								        results.pose_landmarks,
 								        mp_pose.POSE_CONNECTIONS,
 								        landmark_drawing_spec=mp_drawing_styles.get_default_pose_landmarks_style())
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    cv2.imwrite('/tmp/annotated_image' + str(idx) + '.png', annotated_image)
-												Project import generated by Copybara.

GitOrigin-RevId: 33adfdf31f3a5cbf9edc07ee1ea583e95080bdc5

											
										
										
											2021-06-24 23:10:25 +02:00
+								    # Plot pose world landmarks.
 								    mp_drawing.plot_landmarks(
 								        results.pose_world_landmarks, mp_pose.POSE_CONNECTIONS)
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
 								# For webcam input:
 								cap = cv2.VideoCapture(0)
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								with mp_pose.Pose(
 								    min_detection_confidence=0.5,
 								    min_tracking_confidence=0.5) as pose:
 								  while cap.isOpened():
 								    success, image = cap.read()
 								    if not success:
 								      print("Ignoring empty camera frame.")
 								      # If loading a video, use 'break' instead of 'continue'.
 								      continue
 								    # To improve performance, optionally mark the image as not writeable to
 								    # pass by reference.
 								    image.flags.writeable = False
-												Project import generated by Copybara.

GitOrigin-RevId: 373e3ac1e5839befd95bf7d73ceff3c5f1171969

											
										
										
											2021-10-06 22:44:33 +02:00
+								    image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    results = pose.process(image)
 								    # Draw the pose annotation on the image.
 								    image.flags.writeable = True
 								    image = cv2.cvtColor(image, cv2.COLOR_RGB2BGR)
 								    mp_drawing.draw_landmarks(
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								        image,
 								        results.pose_landmarks,
 								        mp_pose.POSE_CONNECTIONS,
 								        landmark_drawing_spec=mp_drawing_styles.get_default_pose_landmarks_style())
-												Project import generated by Copybara.

GitOrigin-RevId: 373e3ac1e5839befd95bf7d73ceff3c5f1171969

											
										
										
											2021-10-06 22:44:33 +02:00
+								    # Flip the image horizontally for a selfie-view display.
 								    cv2.imshow('MediaPipe Pose', cv2.flip(image, 1))
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								    if cv2.waitKey(5) & 0xFF == 27:
 								      break
-												Project import generated by Copybara.

GitOrigin-RevId: f7d09ed033907b893638a8eb4148efa11c0f09a6

											
										
										
											2020-11-05 01:02:35 +01:00
+								cap.release()
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								```
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								### JavaScript Solution API
 								Please first see general [introduction](../getting_started/javascript.md) on
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								MediaPipe in JavaScript, then learn more in the companion [web demo](#resources)
 								and the following usage example.
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
 								Supported configuration options:
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								*   [modelComplexity](#model_complexity)
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								*   [smoothLandmarks](#smooth_landmarks)
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								*   [enableSegmentation](#enable_segmentation)
 								*   [smoothSegmentation](#smooth_segmentation)
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								*   [minDetectionConfidence](#min_detection_confidence)
 								*   [minTrackingConfidence](#min_tracking_confidence)
 								```html
 								<!DOCTYPE html>
 								<html>
 								<head>
 								  <meta charset="utf-8">
 								  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/camera_utils/camera_utils.js" crossorigin="anonymous"></script>
 								  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/control_utils/control_utils.js" crossorigin="anonymous"></script>
-												Project import generated by Copybara.

GitOrigin-RevId: 73d686c40057684f8bfaca285368bf1813f9fc26

											
										
										
											2022-03-21 20:07:37 +01:00
+								  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/control_utils_3d/control_utils_3d.js" crossorigin="anonymous"></script>
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/drawing_utils/drawing_utils.js" crossorigin="anonymous"></script>
 								  <script src="https://cdn.jsdelivr.net/npm/@mediapipe/pose/pose.js" crossorigin="anonymous"></script>
 								</head>
 								<body>
 								  <div class="container">
 								    <video class="input_video"></video>
 								    <canvas class="output_canvas" width="1280px" height="720px"></canvas>
-												Project import generated by Copybara.

GitOrigin-RevId: f4b1fe3f15810450fb6539e733f6a260d3ee082c

											
										
										
											2021-09-01 22:49:12 +02:00
+								    <div class="landmark-grid-container"></div>
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  </div>
 								</body>
 								</html>
 								```
-												Project import generated by Copybara.

GitOrigin-RevId: 612e50bb8db2ec3dc1c30049372d87a80c3848db

											
										
										
											2020-08-30 05:41:10 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								```javascript
 								<script type="module">
 								const videoElement = document.getElementsByClassName('input_video')[0];
 								const canvasElement = document.getElementsByClassName('output_canvas')[0];
 								const canvasCtx = canvasElement.getContext('2d');
-												Project import generated by Copybara.

GitOrigin-RevId: 33adfdf31f3a5cbf9edc07ee1ea583e95080bdc5

											
										
										
											2021-06-24 23:10:25 +02:00
+								const landmarkContainer = document.getElementsByClassName('landmark-grid-container')[0];
 								const grid = new LandmarkGrid(landmarkContainer);
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
 								function onResults(results) {
-												Project import generated by Copybara.

GitOrigin-RevId: 33adfdf31f3a5cbf9edc07ee1ea583e95080bdc5

											
										
										
											2021-06-24 23:10:25 +02:00
+								  if (!results.poseLandmarks) {
 								    grid.updateLandmarks([]);
 								    return;
 								  }
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  canvasCtx.save();
 								  canvasCtx.clearRect(0, 0, canvasElement.width, canvasElement.height);
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								  canvasCtx.drawImage(results.segmentationMask, 0, 0,
 								                      canvasElement.width, canvasElement.height);
 								  // Only overwrite existing pixels.
 								  canvasCtx.globalCompositeOperation = 'source-in';
 								  canvasCtx.fillStyle = '#00FF00';
 								  canvasCtx.fillRect(0, 0, canvasElement.width, canvasElement.height);
 								  // Only overwrite missing pixels.
 								  canvasCtx.globalCompositeOperation = 'destination-atop';
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  canvasCtx.drawImage(
 								      results.image, 0, 0, canvasElement.width, canvasElement.height);
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
 								  canvasCtx.globalCompositeOperation = 'source-over';
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  drawConnectors(canvasCtx, results.poseLandmarks, POSE_CONNECTIONS,
 								                 {color: '#00FF00', lineWidth: 4});
 								  drawLandmarks(canvasCtx, results.poseLandmarks,
 								                {color: '#FF0000', lineWidth: 2});
 								  canvasCtx.restore();
-												Project import generated by Copybara.

GitOrigin-RevId: 33adfdf31f3a5cbf9edc07ee1ea583e95080bdc5

											
										
										
											2021-06-24 23:10:25 +02:00
 								  grid.updateLandmarks(results.poseWorldLandmarks);
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								}
 								const pose = new Pose({locateFile: (file) => {
 								  return `https://cdn.jsdelivr.net/npm/@mediapipe/pose/${file}`;
 								}});
 								pose.setOptions({
-												Project import generated by Copybara.

GitOrigin-RevId: ff83882955f1a1e2a043ff4e71278be9d7217bbe

											
										
										
											2021-05-05 03:30:15 +02:00
+								  modelComplexity: 1,
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  smoothLandmarks: true,
-												Project import generated by Copybara.

GitOrigin-RevId: 1610e588e497817fae2d9a458093ab6a370e2972

											
										
										
											2021-08-19 00:18:12 +02:00
+								  enableSegmentation: true,
 								  smoothSegmentation: true,
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								  minDetectionConfidence: 0.5,
 								  minTrackingConfidence: 0.5
 								});
 								pose.onResults(onResults);
 								const camera = new Camera(videoElement, {
 								  onFrame: async () => {
 								    await pose.send({image: videoElement});
 								  },
 								  width: 1280,
 								  height: 720
 								});
 								camera.start();
 								</script>
 								```
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
-												Project import generated by Copybara.

GitOrigin-RevId: d8caa66de45839696f5bd0786ad3bfbcb9cff632

											
										
										
											2020-12-10 04:13:05 +01:00
+								## Example Apps
 								Please first see general instructions for
 								[Android](../getting_started/android.md), [iOS](../getting_started/ios.md), and
 								[desktop](../getting_started/cpp.md) on how to build MediaPipe examples.
 								Note: To visualize a graph, copy the graph and paste it into
 								[MediaPipe Visualizer](https://viz.mediapipe.dev/). For more information on how
 								to visualize its associated subgraphs, please see
 								[visualizer documentation](../tools/visualizer.md).
 								### Mobile
 								#### Main Example
 								*   Graph:
 								    [`mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt`](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt)
 								*   Android target:
 								    [(or download prebuilt ARM64 APK)](https://drive.google.com/file/d/17GFIrqEJS6W8UHKXlYevTtSCLxN9pWlY/view?usp=sharing)
 								    [`mediapipe/examples/android/src/java/com/google/mediapipe/apps/posetrackinggpu:posetrackinggpu`](https://github.com/google/mediapipe/tree/master/mediapipe/examples/android/src/java/com/google/mediapipe/apps/posetrackinggpu/BUILD)
 								*   iOS target:
 								    [`mediapipe/examples/ios/posetrackinggpu:PoseTrackingGpuApp`](http:/mediapipe/examples/ios/posetrackinggpu/BUILD)
 								### Desktop
 								Please first see general instructions for [desktop](../getting_started/cpp.md)
 								on how to build MediaPipe examples.
 								#### Main Example
 								*   Running on CPU
 								    *   Graph:
 								        [`mediapipe/graphs/pose_tracking/pose_tracking_cpu.pbtxt`](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_cpu.pbtxt)
 								    *   Target:
 								        [`mediapipe/examples/desktop/pose_tracking:pose_tracking_cpu`](https://github.com/google/mediapipe/tree/master/mediapipe/examples/desktop/pose_tracking/BUILD)
 								*   Running on GPU
 								    *   Graph:
 								        [`mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt`](https://github.com/google/mediapipe/tree/master/mediapipe/graphs/pose_tracking/pose_tracking_gpu.pbtxt)
 								    *   Target:
 								        [`mediapipe/examples/desktop/pose_tracking:pose_tracking_gpu`](https://github.com/google/mediapipe/tree/master/mediapipe/examples/desktop/pose_tracking/BUILD)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								## Resources
 								*   Google AI Blog:
-												Project import generated by Copybara.

GitOrigin-RevId: aaca5c37abcf8b7a6c3c28804739afdbad46e704

											
										
										
											2020-08-13 21:02:55 +02:00
+								    [BlazePose - On-device Real-time Body Pose Tracking](https://ai.googleblog.com/2020/08/on-device-real-time-body-pose-tracking.html)
-												Project import generated by Copybara.

GitOrigin-RevId: 9295f8ea2339edb71073695ed4fb3fded2f48c60

											
										
										
											2020-08-13 03:57:56 +02:00
+								*   Paper:
 								    [BlazePose: On-device Real-time Body Pose Tracking](https://arxiv.org/abs/2006.10204)
 								    ([presentation](https://youtu.be/YPpUOTRn5tA))
-												Project import generated by Copybara.

GitOrigin-RevId: 4cee4a2c2317fb190680c17e31ebbb03bb73b71c

											
										
										
											2020-09-16 03:31:50 +02:00
+								*   [Models and model cards](./models.md#pose)
-												Project import generated by Copybara.

GitOrigin-RevId: bbbbcb4f5174dea33525729ede47c770069157cd

											
										
										
											2021-10-18 21:39:29 +02:00
+								*   [GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models](https://github.com/google-research/google-research/tree/master/ghum)
-												Project import generated by Copybara.

GitOrigin-RevId: d073f8e21be2fcc0e503cb97c6695078b6b75310

											
										
										
											2021-02-27 09:21:16 +01:00
+								*   [Web demo](https://code.mediapipe.dev/codepen/pose)
 								*   [Python Colab](https://mediapipe.page.link/pose_py_colab)
-												Project import generated by Copybara.

GitOrigin-RevId: 2146b10f0a498f665f246e16033b686c7947b92d

											
										
										
											2021-05-10 21:19:00 +02:00
 								[`mAP`]: https://cocodataset.org/#keypoints-eval
-												Project import generated by Copybara.

GitOrigin-RevId: 016275ca4057540b2370ed4531dbc81eb92caae2

											
										
										
											2021-05-11 06:52:16 +02:00
+								[`PCK@0.2`]: https://github.com/cbsudux/Human-Pose-Estimation-101