IEICE global.ieice.org Site

Author Search Result

[Author] Xianjun YANG(1hit)

1-1hit

Multi-Scale Contrastive Learning for Human Pose Estimation Open Access
Wenxia BAO An LIN Hua HUANG Xianjun YANG Hemu CHEN

PAPER-Image Recognition, Computer Vision

Pubricized:
2024/06/17
Vol:
E107-D No:10
Page(s):
1332-1341
Recent years have seen remarkable progress in human pose estimation. However, manual annotation of keypoints remains tedious and imprecise. To alleviate this problem, this paper proposes a novel method called Multi-Scale Contrastive Learning (MSCL). This method uses a siamese network structure with upper and lower branches that capture diffirent views of the same image. Each branch uses a backbone network to extract image representations, employing multi-scale feature vectors to capture information. These feature vectors are then passed through an enhanced feature pyramid for fusion, producing more robust feature representations. The feature vectors are then further encoded by mapping and prediction heads to predict the feature vector of another view. Using negative cosine similarity between vectors as a loss function, the backbone network is pre-trained on a large-scale unlabeled dataset, enhancing its capacity to extract visual representations. Finally, transfer learning is performed on a small amount of labelled data for the pose estimation task. Experiments on COCO datasets show significant improvements in Average Precision (AP) of 1.8%, 0.9%, and 1.2% with 1%, 5%, and 10% labelled data on COCO. In addition, the Percentage of Correct Keypoints (PCK) improves by 0.5% on MPII&AIC, outperforming mainstream contrastive learning methods.

Author Search Result

[Author] Xianjun YANG(1hit)

Multi-Scale Contrastive Learning for Human Pose Estimation Open Access

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles