READING NOTE: Towards Accurate Multi-person Pose Estimation in the Wild

2020-06-21 04:40

TITLE: Towards Accurate Multi-person Pose Estimation in the Wild

AUTHOR: George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

ASSOCIATION: Google

FROM: arXiv:1701.01779

CONTRIBUTIONS

A method for multi-person detection and 2D keypoint localization in the wild is proposed.

METHOD

The multi-person pose estimation system is a two-step cascade, as illustrated in the following figure.

In the first stage, a person detector produces a bounding box around each person instance. In the second stage, a pose estimator is applied to the image crop extracted around each detected person in order to localize that person's keypoints.
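The two-stage flow can be sketched in a few lines of Python. This is only an illustration of the control flow: `detect_people` and `estimate_keypoints` are hypothetical callables standing in for the two networks, boxes are assumed to be integer `(x0, y0, x1, y1)` pixel coordinates, and the resizing of each crop is left out.

```python
import numpy as np

def estimate_poses(image, detect_people, estimate_keypoints):
    """Run a (hypothetical) detector, then a (hypothetical) pose estimator per crop.

    image:              H x W (x C) array.
    detect_people:      callable, image -> list of (x0, y0, x1, y1) integer boxes.
    estimate_keypoints: callable, crop -> list of (x, y) in crop coordinates.
    Returns one list of (x, y) keypoints per detected person, in image coordinates.
    """
    poses = []
    for (x0, y0, x1, y1) in detect_people(image):
        crop = image[y0:y1, x0:x1]            # stage 2 only sees this person
        local = estimate_keypoints(crop)
        # Shift crop coordinates back into the full image frame.
        poses.append([(x + x0, y + y0) for (x, y) in local])
    return poses
```

Because the second stage only ever sees one crop, any person missed by the detector cannot be recovered later, which is the main trade-off of a top-down design like this one.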

Person Box Detection

A Faster-RCNN system based on a ResNet-Inception architecture is used for person box detection. The detector is first trained on all 80 categories of the COCO dataset. The model is then fine-tuned on a dataset that contains only person bounding boxes.

Person Pose Estimation

A combined classification and regression approach is adopted. Each spatial position is first classified as lying in the vicinity of each of the K keypoint types or not, which yields a K-channel "heatmap". A 2-D local offset vector is then predicted at each position to give a more precise estimate of the corresponding keypoint location. The following figure illustrates the procedure.
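A minimal NumPy sketch of the heatmap-plus-offset idea for a single keypoint type is given here. It builds idealized targets (1 inside a disk around the keypoint, plus an offset vector pointing to the keypoint) and then decodes them. The `radius` parameter, the function names, and the argmax-plus-offset decoding rule are assumptions made for illustration; this note does not describe how the paper fuses the two outputs.

```python
import numpy as np

def make_targets(keypoint_xy, height, width, radius):
    """Idealized targets for one keypoint type (illustrative, not the paper's code).

    heatmap[y, x] is 1 where (x, y) lies within `radius` of the keypoint, else 0.
    offsets[y, x] is the 2-D vector (dx, dy) from (x, y) to the keypoint.
    """
    ys, xs = np.mgrid[0:height, 0:width]
    dx = keypoint_xy[0] - xs
    dy = keypoint_xy[1] - ys
    heatmap = (np.hypot(dx, dy) <= radius).astype(np.float32)
    offsets = np.stack([dx, dy], axis=-1).astype(np.float32)
    return heatmap, offsets

def decode_keypoint(heatmap, offsets):
    """Simplified decoding: take the strongest position and add its offset."""
    y, x = np.unravel_index(np.argmax(heatmap), heatmap.shape)
    dx, dy = offsets[y, x]
    return float(x + dx), float(y + dy)
```

The point of the offset channel is visible in the decoder: the heatmap alone only localizes the keypoint to an integer grid position inside the disk, while the offset recovers the sub-pixel location.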

The bounding box is first adjusted to a fixed aspect ratio (height/width = 1.37). The patch is then cropped from the image and resized to 353×257 (height × width). A ResNet with 101 layers is used to produce the heatmaps and offsets. The following figure shows an input and ground-truth output of the network.
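As a rough illustration of the box adjustment, the function below enlarges a detected box to the target height/width ratio of 353/257 (about 1.37). Growing the shorter side symmetrically around the box centre is an assumption of this sketch, and `fix_aspect_ratio` is a hypothetical helper name; clipping or padding at the image border is not handled.

```python
def fix_aspect_ratio(box, target_ratio=353 / 257):
    """Enlarge (x0, y0, x1, y1) so that height / width == target_ratio.

    Only one side is extended, so the original box is always contained
    in the result and the image content is never distorted.
    """
    x0, y0, x1, y1 = box
    w, h = x1 - x0, y1 - y0
    cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
    if h / w < target_ratio:
        h = w * target_ratio      # box too wide: extend the height
    else:
        w = h / target_ratio      # box too tall: extend the width
    return (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)
```

After this step every crop has the same shape up to scale, so resizing to 353×257 does not stretch the person.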

SOME IDEAS

The two-stage pipeline separates detection from pose estimation.
The relations between keypoints might be learned by the CNN, but this is not obvious.
