Anchor Points

YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss

This paper introduces YOLO-Pose, a novel approach that enhances YOLO for multi-person pose estimation by integrating Object Keypoint Similarity (OKS) loss. By leveraging the popular YOLO object detection framework, YOLO-Pose enables end-to-end training, optimizing the OKS metric directly. This framework excels in joint detection and 2D pose estimation by eliminating the need for post-processing steps, achieving state-of-the-art results on COCO datasets without test time augmentation, surpassing existing bottom-up methods in a single inference pass.

Introduction

Multi-person 2D pose estimation involves detecting individuals in an image and localizing their body joints. This task is challenging due to varying factors like the number of people, scale variations, occlusions, and the flexibility of the human body.

Key Points:

Related Work