Big Data Processing on Single Board Computer Clusters: Exploring Challenges and Possibilities
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Eunseo | - |
dc.contributor.author | Oh, Hyunju | - |
dc.contributor.author | Park, Dongchul | - |
dc.date.accessioned | 2022-04-19T09:03:51Z | - |
dc.date.available | 2022-04-19T09:03:51Z | - |
dc.date.issued | 2021-10 | - |
dc.identifier.issn | 2169-3536 | - |
dc.identifier.uri | https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/146366 | - |
dc.description.abstract | For more than a decade, "big data" has been an industry and academia buzz phrase. Over this time, many companies adopted Apache Hadoop and Spark frameworks for their massive data storage and analysis efforts, using powerful, energy-hungry, general-purpose server as their big data processing platforms. But not all industry or academic fields want, or even need, such large systems. Moreover, capital costs aside, power consumption has also become a primary data center concern. Consequently, lower-cost, lower-power microservers have emerged as viable alternatives in many settings. Now, the latest generation Raspberry Pi (RPi), model 4B, exhibits significant computational performance improvements over its predecessors, and is presently considered a sufficiently powerful single board computer (SBC) to run many mainstream operating systems and accommodate heavy workloads. This paper reexamines SBC cluster big data processing possibilities by integrating the most powerful (presently) RPi model-the RPi 4B with 4 Gigabytes (GB) main memory. We examine external storage's performance impact on such an SBC cluster's big data processing performance by employing three different external storage solutions with measurably distinct performance characteristics. Moreover, we discuss challenges we encountered and identify further SBC cluster performance optimizations. We perform several representative big data application benchmarks and measure various key performance metrics such as execution time, power consumption, throughput, performance-per-dollars, etc. Our extensive experiments and comprehensive studies conclude this current, fourth-generation RPi has evolved to become the first generation to effectively run massive (i.e., more than 100GB) workloads in big data processing applications. | - |
dc.format.extent | 15 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Big Data Processing on Single Board Computer Clusters: Exploring Challenges and Possibilities | - |
dc.type | Article | - |
dc.publisher.location | 미국 | - |
dc.identifier.doi | 10.1109/ACCESS.2021.3120660 | - |
dc.identifier.scopusid | 2-s2.0-85117764856 | - |
dc.identifier.wosid | 000711706900001 | - |
dc.identifier.bibliographicCitation | IEEE ACCESS, v.9, pp 142551 - 142565 | - |
dc.citation.title | IEEE ACCESS | - |
dc.citation.volume | 9 | - |
dc.citation.startPage | 142551 | - |
dc.citation.endPage | 142565 | - |
dc.type.docType | Article | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordPlus | CLOUD | - |
dc.subject.keywordAuthor | Economic indicators | - |
dc.subject.keywordAuthor | Big Data | - |
dc.subject.keywordAuthor | Servers | - |
dc.subject.keywordAuthor | Media | - |
dc.subject.keywordAuthor | Sparks | - |
dc.subject.keywordAuthor | Universal Serial Bus | - |
dc.subject.keywordAuthor | Power demand | - |
dc.subject.keywordAuthor | Raspberry Pi | - |
dc.subject.keywordAuthor | big data | - |
dc.subject.keywordAuthor | Hadoop | - |
dc.subject.keywordAuthor | Spark | - |
dc.subject.keywordAuthor | UFS | - |
dc.subject.keywordAuthor | SBC | - |
dc.subject.keywordAuthor | single board computer | - |
dc.subject.keywordAuthor | cluster | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
Sookmyung Women's University. Cheongpa-ro 47-gil 100 (Cheongpa-dong 2ga), Yongsan-gu, Seoul, 04310, Korea02-710-9127
Copyright©Sookmyung Women's University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.