Double Discrete Representation for 3D Human Pose Estimation from Head-mounted Camera

Hwang, Juheon; Kang, Jiwoo

doi:10.1109/ICCE59016.2024.10444241

상세 보기

Double Discrete Representation for 3D Human Pose Estimation from Head-mounted Camera

Hwang, Juheon;
Kang, Jiwoo

Citations

WEB OF SCIENCE

0

Citations

SCOPUS

3

초록

This work proposes a method to accurately estimate the 3D pose of humans from an egocentric image captured by a head-mounted camera. A third-person-view camera has a field of view, which limits many dynamic situations outside of a motion capture system. To solve the problem, several methods use egocentric views to overcome spatial constraints. However, in the egocentric view of the head-mounted camera, the lower body often appears smaller and is obscured by the upper body, leading to significantly unreliable and inaccurate pose estimation. To address the limitation, we propose an estimation pipeline using Vector Quantized-Variational AutoEncoder (VQ-VAE) to accurately predict the human pose from egocentric images and optimize the predicted pose. Thus, we introduce a novel pipeline for pose estimation and optimization using the codebook by learning egocentric image features and pose features from large human pose datasets with VQ-VAE. The proposed method with the vector quantizer of VQ-VAEs can help improve the generalization performance of the 3D pose estimation from the egocentric view. Through comparative experiments, our method is shown to achieve a significant performance improvement over state-of-the-art methods. © 2024 IEEE.

키워드

3D human pose estimation; egocentric images; head-mounted camera; vector quantized-variational autoencoder; virtual reality (VR)

제목: Double Discrete Representation for 3D Human Pose Estimation from Head-mounted Camera

저자: Hwang, Juheon; Kang, Jiwoo

DOI: 10.1109/ICCE59016.2024.10444241

발행일: 2024-01

유형: Conference paper

저널명: Digest of Technical Papers - IEEE International Conference on Consumer Electronics

페이지: 1 ~ 4

ScholarWorks@숙명여자대학교

상세 보기

초록

키워드