베스트셀러 도서 예측을 위한 머신러닝 알고리즘 성능평가

유지은; 조솔비; 유석종

doi:10.14801/jkiit.2023.21.7.1

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

베스트셀러 도서 예측을 위한 머신러닝 알고리즘 성능평가

Full metadata record

DC Field	Value	Language
dc.contributor.author	유지은	-
dc.contributor.author	조솔비	-
dc.contributor.author	유석종	-
dc.date.accessioned	2023-11-08T05:48:38Z	-
dc.date.available	2023-11-08T05:48:38Z	-
dc.date.issued	2023-07	-
dc.identifier.issn	1598-8619	-
dc.identifier.issn	2093-7571	-
dc.identifier.uri	https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/151720	-
dc.description.abstract	베스트셀러 도서는 독자들이 책을 선택하는 가장 보편적인 방법이며, 이러한 이유로 베스트셀러의 예측과 선정은 출판 시장에서 중요한 마케팅 전략 지표이다. 본 연구에서는 도서의 메타 데이터를 활용하여 베스트셀러 순위 200위 내 유지 여부와 판매 지수 구간을 예측하는 모델을 제안하고, 다양한 머신러닝 알고리즘의 성능을 비교 평가하고자 한다. 이를 위하여 Yes24 사이트의 월간 베스트셀러 데이터를 크롤링하여 수집하고, 각 데이터 속성에 대해 적절한 전처리를 수행하였다. 순위 유지 여부 예측을 위해 다양한 분류 알고리즘을 활용하였고, 최종적으로 각 알고리즘의 예측 성능을 평가한 결과, 다중 퍼셉트론, CatBoost, 랜덤 포레스트의 순서로 정확도가 높게 나타났다. 본 연구는 베스트셀러 순위 유지 여부 예측 문제에 대해 주요 분류 알고리즘의 수행 성능을 종합적으로 비교했다는데 의미가 있다. 그러나 한계점으로 리뷰 수, 평점 등에 의존하는 예측 방법에서는 데이터가 부족한 신간 도서에서 cold start 문제를 극복하기 어려웠으며, 이에 대한 후속 보완 연구의 필요성을 제안한다.	-
dc.description.abstract	Bestsellers are the most common way for readers to choose books, and for this reason, the prediction and selection of bestsellers is an important marketing strategy indicator in the publishing market. In this study, we propose a model that predicts whether or not to remain in the top 200 bestseller rankings and sales index sections using metadata from bestsellers, and compare and evaluate the performance of various machine learning algorithms. To this end, monthly bestseller data on the Yes24 site were crawled and collected, and appropriate preprocessing was performed for each data attribute. Various classification algorithms were used to predict whether to maintain the ranking, and as a result of finally evaluating the prediction performance of each algorithm, the accuracy of MLP, CatBoost, and random forest was high. This study is meaningful in that it comprehensively compared the performance performance of various classification algorithms for predicting whether to maintain the bestseller ranking. However, in models that rely on the number of reviews and ratings as limitations, it was difficult to overcome the cold start problem in new books that lacked data, and the need for follow-up supplementary research is proposed.	-
dc.format.extent	6	-
dc.language	한국어	-
dc.language.iso	KOR	-
dc.publisher	한국정보기술학회	-
dc.title	베스트셀러 도서 예측을 위한 머신러닝 알고리즘 성능평가	-
dc.title.alternative	Performance Evaluation of Machine-Learning Algorithms for Bestseller Book Prediction	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.14801/jkiit.2023.21.7.1	-
dc.identifier.bibliographicCitation	한국정보기술학회논문지, v.21, no.7, pp 1 - 6	-
dc.citation.title	한국정보기술학회논문지	-
dc.citation.volume	21	-
dc.citation.number	7	-
dc.citation.startPage	1	-
dc.citation.endPage	6	-
dc.identifier.kciid	ART002983501	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	kci	-
dc.subject.keywordAuthor	.	-
dc.subject.keywordAuthor	machine learning	-
dc.subject.keywordAuthor	prediction model	-
dc.subject.keywordAuthor	random forest model	-
dc.subject.keywordAuthor	ensemble model	-
dc.identifier.url	https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE11481223&language=ko_KR&hasTopBanner=true	-

Files in This Item: Go to Link

Appears in Collections: ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Yu, Seok Jong photo

Yu, Seok Jong: 공과대학 (소프트웨어학부(첨단))

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :6,704,688; Today View :4,461

RSS_1.0 RSS_2.0 ATOM_1.0

Sookmyung Women's University. Cheongpa-ro 47-gil 100 (Cheongpa-dong 2ga), Yongsan-gu, Seoul, 04310, Korea02-710-9127

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE