상세 보기
- Choi, Joungmin;
- Park, Yoonjae;
- Kim, Sun;
- Chae, Heejoon
WEB OF SCIENCE
1SCOPUS
1초록
In recent years, there have been many studies utilizing DNA methylome data to answer fundamental biological questions. Bisulfite sequencing (BS-seq) has enabled measurement of a genome-wide absolute level of DNA methylation at single-nucleotide resolution. However, due to the ambiguity introduced by bisulfite-treatment, the aligning process especially in large-scale epigenetic research is still considered a huge burden. We present Cloud-BS, an efficient BS-seq aligner designed for parallel execution on a distributed environment. Utilizing Apache Hadoop framework, Cloud-BS splits sequencing reads into multiple blocks and transfers them to distributed nodes. By designing each aligning procedure into separate map and reducing tasks while an internal key-value structure is optimized based on the MapReduce programming model, the algorithm significantly improves alignment performance without sacrificing mapping accuracy. In addition, Cloud-BS minimizes the innate burden of configuring a distributed environment by providing a pre-configured cloud image. Cloud-BS shows significantly improved bisulfite alignment performance compared to other existing BS-seq aligners. We believe our algorithm facilitates large-scale methylome data analysis. The algorithm is freely available at https://paryoja.github.io/Cloud-BS/.
키워드
- 제목
- Cloud-BS: A MapReduce-based bisulfite sequencing aligner on cloud
- 저자
- Choi, Joungmin; Park, Yoonjae; Kim, Sun; Chae, Heejoon
- 발행일
- 2018-12
- 유형
- Article; Proceedings Paper
- 권
- 16
- 호
- 6