Developing cancer prediction model based on stepwise selection by AUC measure for proteomics data
- Authors
- Kim Y.; Lee S.; Kwon M.-S.; Na A.; Choi Y.; Yi S.G.; Namkung J.; Han S.; Kang M.; Kim S.W.; Jang J.-Y.; Kim Y.; Kim Y.; Park T.
- Issue Date
- Dec-2015
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- area under the curve (AUC); early diagnosis of cancer; multiple reaction monitoring (MRM); protein marker; Receiver operating characteristic (ROC) curve; stepwise selection; support vector machine
- Citation
- Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015, pp 1345 - 1350
- Pages
- 6
- Journal Title
- Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
- Start Page
- 1345
- End Page
- 1350
- URI
- https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/10162
- DOI
- 10.1109/BIBM.2015.7359874
- ISSN
- 0000-0000
- Abstract
- Since most of the cancer markers that have been reported are obtained directly from cancer tissues, it is difficult to use them for early diagnosis of cancer without surgery. Thus, development of markers that can be detected by blood is crucial for making early diagnosis of cancer easier. One of the most feasible types of markers that can be detected by blood is a protein marker. Here, we focus on building prediction methods using the protein markers for early diagnosis of cancer. To develop a prediction model with high prediction ability, it is critical to choose appropriate markers first. Here, we consider a stepwise selection method using area under the receiver operating characteristic curve (Step-AUC) in order to construct a multi-protein prediction model. We showed that the performance of Step-AUC highly depends on the tuning parameter. We compared our proposed Step-AUC method to stepwise selection using information criteria and support vector machine recursive feature extraction (SVM-RFE). We observed that Step-AUC and stepwise selection using Bayesian information criteria (Step-BIC) perform better than other methods. The importance of each marker can be chosen using a new stepwise selection consistency (SSC) measure. The final models include the markers with high SSC measures. We applied our stepwise procedure to pancreatic cancer data and found two markers of interest. © 2015 IEEE.
- Files in This Item
-
Go to Link
- Appears in
Collections - 이과대학 > 통계학과 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.