Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition

Action recognition is a computer vision task that involves identifying and classifying the actions or activities being performed by one or more individuals in a video or sequence of images. This task typically involves analyzing the motion and posture of the human body to determine what action is being performed, such as walking, running, jumping, waving, or sitting.

No need for action recognition in our project.


VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera

If your goal is to detect keypoints from a mobile video, you might not necessarily need a model that deals with 3D images. However, there are several advantages to using 3D pose estimation:

  1. Depth Information: 3D pose estimation provides depth information, which can improve the accuracy of keypoint detection, especially in complex scenes with occlusions or varying distances.
  2. Robustness: 3D models can be more robust to changes in camera angles and perspectives, as they capture the spatial relationships between keypoints in three dimensions.
  3. Enhanced Applications: 3D pose estimation enables more advanced applications, such as augmented reality, motion capture, and detailed activity recognition, which require understanding the full spatial configuration of the body.

To provide 3D information for a model using a mobile camera, you can use several techniques:

  1. Stereo Cameras:

  2. Depth Sensors:

  3. Monocular Depth Estimation:

  4. Augmented Reality (AR) Frameworks:

Reconstructing the 3D image seems reasonable for the discussed reasons and it seems like their are enough tools for that with different levels of “developer involvement”.


Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose

Edge devices are computing devices that are located at the edge of a network, close to the source of data generation. These devices perform data processing and analysis locally, rather than sending all the data to a centralized cloud or data center. Edge devices are used in edge computing, which aims to reduce latency, improve response times, and decrease bandwidth usage by processing data near its source.

This paper/repo has over 2K stars so it seems dependable and its also lightweight which means it could be deployed seamlessly on mobiles.