Audio-Visual Tensor Fusion Network for Piano Player Posture Classification

Full metadata record
DC Field: Value
dc.contributor.author: Park, So-Hyun
dc.contributor.author: Park, Young-Ho
dc.date.available: 2021-02-22T05:21:46Z
dc.date.issued: 2020-10
dc.identifier.issn: 2076-3417
dc.identifier.uri: https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/1149
dc.description.abstract: Playing the piano in the correct position is important because it helps produce good sound and prevents injuries. Many studies on piano playing posture recognition combine various techniques, and most of them analyze only visual information. In piano education, however, it is essential to use audio information in addition to visual information because posture and sound are closely related. In this paper, we propose an audio-visual tensor fusion network (AV-TFN) for piano performance posture classification. Unlike existing studies that use only visual information, the proposed method also uses audio information to improve accuracy in classifying the postures of professional and amateur pianists. To this end, we first introduce a dataset called C3Pap (Classic piano performance postures of amateur and professionals) that contains actual piano performance videos recorded in diverse environments. We then propose a data structure, which we call an audio-visual tensor, that represents audio information on the color scale and visual information on the black-and-white scale in order to express the relationship between them. Finally, we compare the performance of the proposed method with state-of-the-art approaches: VN (Visual Network), AN (Audio Network), and AVN (Audio-Visual Network) with concatenation and attention techniques. The experimental results demonstrate that AV-TFN outperforms these existing approaches and can therefore be used effectively to classify piano playing postures.
dc.format.extent: 15
dc.language: English
dc.language.iso: ENG
dc.publisher: MDPI
dc.title: Audio-Visual Tensor Fusion Network for Piano Player Posture Classification
dc.type: Article
dc.publisher.location: Switzerland
dc.identifier.doi: 10.3390/app10196857
dc.identifier.scopusid: 2-s2.0-85092777509
dc.identifier.wosid: 000586586900001
dc.identifier.bibliographicCitation: APPLIED SCIENCES-BASEL, v.10, no.19, pp. 1-15
dc.citation.title: APPLIED SCIENCES-BASEL
dc.citation.volume: 10
dc.citation.number: 19
dc.citation.startPage: 1
dc.citation.endPage: 15
dc.description.isOpenAccess: N
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Chemistry
dc.relation.journalResearchArea: Engineering
dc.relation.journalResearchArea: Materials Science
dc.relation.journalResearchArea: Physics
dc.relation.journalWebOfScienceCategory: Chemistry, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Engineering, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Materials Science, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Physics, Applied
dc.subject.keywordPlus: KINEMATICS
dc.subject.keywordPlus: KINECT
dc.subject.keywordAuthor: piano playing posture
dc.subject.keywordAuthor: audio-visual tensor fusion
dc.subject.keywordAuthor: classification
dc.identifier.url: https://www.mdpi.com/2076-3417/10/19/6857
Appears in Collections
Division of ICT Convergence Engineering > IT Engineering Major > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Park, Young Ho
College of Engineering (Division of Artificial Intelligence Engineering)