EMOVA: Emotion-driven neural volumetric avatar
Citations

WEB OF SCIENCE

2
Citations

SCOPUS

2

초록

3D facial reconstruction is essential for metaverse applications. Traditional mesh-based methods have difficulty rendering photorealistic faces and complex objects. Recent advancements in Neural Radiance Fields (NeRFs) have excelled in representing complex objects. However, they struggle with capturing subtle facial differences, particularly around the eyes and mouth, due to their reliance on RGB value comparisons. To address this, we propose an EMOtion-driven Volumetric Avatar (EMOVA) that utilizes emotional stimuli from images and voices to enhance facial precision. Visual emotional features ensure that the reconstructed face aligns with the input emotion, while auditory features enhance facial details from various viewpoints. Through an attention-based fusion of these features, EMOVA accurately captures and reconstructs faces, even in self-occlusion. EMOVA outperforms the state-of-the-art methods by more than 5.93% in terms of LPIPS for face reconstruction. © 2024 Elsevier B.V.

키워드

Dynamic scenesFace reconstructionMultimodal analysisNeural renderingNovel view synthesisFACIAL EXPRESSION RECOGNITIONMODELFACE
제목
EMOVA: Emotion-driven neural volumetric avatar
저자
Hwang, JuheonKim, Byung-gyuKim, TaewanOh, HeeseokKang, Jiwoo
DOI
10.1016/j.imavis.2024.105043
발행일
2024-06
유형
Article
저널명
Image and Vision Computing
146