IDEA: Integrating Divisive and Ensemble-Agglomerate hierarchical clustering framework for arbitrary shape data
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ahn, Hongryul | - |
dc.contributor.author | Jung, Inuk | - |
dc.contributor.author | Chae, Heejoon | - |
dc.contributor.author | Oh, Minsik | - |
dc.contributor.author | Kim, Inyoung | - |
dc.contributor.author | Kim, Sun | - |
dc.date.accessioned | 2022-04-20T08:40:03Z | - |
dc.date.available | 2022-04-20T08:40:03Z | - |
dc.date.issued | 2021-12 | - |
dc.identifier.issn | 0000-0000 | - |
dc.identifier.uri | https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/151273 | - |
dc.description.abstract | Hierarchical clustering, a traditional clustering method, has been attracting renewed attention. Among several reasons, credit goes to a 2016 paper by Dasgupta that proposed a cost function for quantitatively evaluating hierarchical clustering trees. An important question is how to combine this recent advance with existing successful clustering methods. In this paper, we propose a hierarchical clustering method that minimizes the cost function of the clustering tree by incorporating existing clustering techniques. First, we developed an ensemble tree-search method that finds an integrated tree with reduced cost by integrating multiple existing hierarchical clustering methods. Second, to operate on large and arbitrary shape data, we designed an efficient hierarchical clustering framework, called integrating divisive and ensemble-agglomerate (IDEA), by combining it with advanced clustering techniques such as nearest neighbor graph construction, divisive-agglomerate hybridization, and dynamic cut tree. In experiments using arbitrary shape datasets and complex biology-domain datasets, the IDEA clustering method outperformed existing cost-minimization-based and density-based hierarchical clustering methods in minimizing Dasgupta's cost and improving accuracy (adjusted Rand index). © 2021 IEEE. | - |
dc.format.extent | 10 | - |
dc.language | English | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | IDEA: Integrating Divisive and Ensemble-Agglomerate hierarchical clustering framework for arbitrary shape data | - |
dc.type | Article | - |
dc.publisher.location | United States | - |
dc.identifier.doi | 10.1109/BigData52589.2021.9671953 | - |
dc.identifier.scopusid | 2-s2.0-85125329865 | - |
dc.identifier.bibliographicCitation | Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021, pp 2791 - 2800 | - |
dc.citation.title | Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021 | - |
dc.citation.startPage | 2791 | - |
dc.citation.endPage | 2800 | - |
dc.type.docType | Conference Paper | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordAuthor | Divisive-agglomerate hybrid clustering | - |
dc.subject.keywordAuthor | Ensemble clustering | - |
dc.subject.keywordAuthor | Hierarchical clustering | - |
dc.subject.keywordAuthor | Tree cost minimization | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/9671953 | - |
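
The abstract refers to Dasgupta's cost function for evaluating a hierarchical clustering tree. As a rough illustration only, and not the authors' implementation, the sketch below computes that cost in its commonly stated form: each pair of points contributes its similarity multiplied by the number of leaves under the pair's lowest common ancestor. The nested-tuple tree encoding, the function names `leaves` and `dasgupta_cost`, and the toy similarity matrix are all assumptions made for this example.

```python
from itertools import combinations


def leaves(tree):
    """Return the leaf labels under a nested-tuple tree (hypothetical encoding)."""
    if not isinstance(tree, tuple):
        return [tree]
    out = []
    for child in tree:
        out.extend(leaves(child))
    return out


def dasgupta_cost(tree, sim):
    """Sum over leaf pairs of sim[i][j] * (leaf count under their lowest common ancestor)."""
    if not isinstance(tree, tuple):
        return 0.0
    child_leaves = [leaves(c) for c in tree]
    n_here = sum(len(cl) for cl in child_leaves)
    cost = 0.0
    # Pairs separated at this node have this node as their lowest common ancestor.
    for a, b in combinations(child_leaves, 2):
        for i in a:
            for j in b:
                cost += sim[i][j] * n_here
    # Pairs that stay together in one child are charged deeper in the tree.
    return cost + sum(dasgupta_cost(c, sim) for c in tree)


# Toy similarity matrix (assumed data): points 0 and 1 are similar, as are 2 and 3.
sim = [
    [0.0, 1.0, 0.1, 0.1],
    [1.0, 0.0, 0.1, 0.1],
    [0.1, 0.1, 0.0, 1.0],
    [0.1, 0.1, 1.0, 0.0],
]
good_tree = ((0, 1), (2, 3))  # keeps similar points together until the root
bad_tree = ((0, 2), (1, 3))   # separates similar points at the root

print(dasgupta_cost(good_tree, sim), dasgupta_cost(bad_tree, sim))
```

On this toy input the tree that groups similar points first scores about 5.6, while the tree that splits them at the root scores about 9.2. A lower cost indicates a tree that keeps highly similar points together in small subtrees, which is the quantity the abstract says the method aims to minimize.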