Detailed Information

Cited 0 time in webofscience Cited 1 time in scopus
Metadata Downloads

BiSpark: a Spark-based highly scalable aligner for bisulfite sequencing data

Full metadata record
DC FieldValueLanguage
dc.contributor.authorSoe, Seokjun-
dc.contributor.authorPark, Yoonjae-
dc.contributor.authorChae, Heejoon-
dc.date.available2021-02-22T07:46:22Z-
dc.date.issued2018-12-
dc.identifier.issn1471-2105-
dc.identifier.urihttps://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/4152-
dc.description.abstractBackgroundBisulfite sequencing is one of the major high-resolution DNA methylation measurement method. Due to the selective nucleotide conversion on unmethylated cytosines after treatment with sodium bisulfite, processing bisulfite-treated sequencing reads requires additional steps which need high computational demands. However, a dearth of efficient aligner that is designed for bisulfite-treated sequencing becomes a bottleneck of large-scale DNA methylome analyses.ResultsIn this study, we present a highly scalable, efficient, and load-balanced bisulfite aligner, BiSpark, which is designed for processing large volumes of bisulfite sequencing data. We implemented the BiSpark algorithm over the Apache Spark, a memory optimized distributed data processing framework, to achieve the maximum data parallel efficiency. The BiSpark algorithm is designed to support redistribution of imbalanced data to minimize delays on large-scale distributed environment.ConclusionsExperimental results on methylome datasets show that BiSpark significantly outperforms other state-of-the-art bisulfite sequencing aligners in terms of alignment speed and scalability with respect to dataset size and a number of computing nodes while providing highly consistent and comparable mapping results.AvailabilityThe implementation of BiSpark software package and source code is available at https://github.com/bhi-kimlab/BiSpark/.-
dc.language영어-
dc.language.isoENG-
dc.publisherBMC-
dc.titleBiSpark: a Spark-based highly scalable aligner for bisulfite sequencing data-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.1186/s12859-018-2498-2-
dc.identifier.scopusid2-s2.0-85058225965-
dc.identifier.wosid000452725300002-
dc.identifier.bibliographicCitationBMC BIOINFORMATICS, v.19, no.1-
dc.citation.titleBMC BIOINFORMATICS-
dc.citation.volume19-
dc.citation.number1-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaBiochemistry & Molecular Biology-
dc.relation.journalResearchAreaBiotechnology & Applied Microbiology-
dc.relation.journalResearchAreaMathematical & Computational Biology-
dc.relation.journalWebOfScienceCategoryBiochemical Research Methods-
dc.relation.journalWebOfScienceCategoryBiotechnology & Applied Microbiology-
dc.relation.journalWebOfScienceCategoryMathematical & Computational Biology-
dc.subject.keywordPlusALIGNMENT-
dc.subject.keywordPlusSEQ-
dc.subject.keywordAuthorDNA methylation-
dc.subject.keywordAuthorBisulfite sequencing-
dc.subject.keywordAuthorAlignment-
dc.subject.keywordAuthorApache Spark-
dc.identifier.urlhttps://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2498-2-
Files in This Item
Go to Link
Appears in
Collections
공과대학 > 소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chae, Hee Joon photo

Chae, Hee Joon
공과대학 (소프트웨어학부(첨단))
Read more

Altmetrics

Total Views & Downloads

BROWSE