MIDAS: Model-Independent Training Data Selection Under Cost Constraintsopen access
- Authors
- Joo, Gyoungdon; Kim, Chulyun
- Issue Date
- Nov-2018
- Publisher
- IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- Keywords
- Training data selection; cost constraints; model-independent; deep learning; active learning; machine learning; artificial intelligence
- Citation
- IEEE ACCESS, v.6, pp 74462 - 74474
- Pages
- 13
- Journal Title
- IEEE ACCESS
- Volume
- 6
- Start Page
- 74462
- End Page
- 74474
- URI
- https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/4186
- DOI
- 10.1109/ACCESS.2018.2882269
- ISSN
- 2169-3536
- Abstract
- In general, as the amount of training data is increased, a deep learning model gains a higher training accuracy. To assign labels to training data for use in supervised learning, human resources are required, which incur temporal and economic costs. Therefore, if a sufficient amount of training data cannot be constructed owing to existing cost constraints, it becomes necessary to select the training data that can maximize the accuracy of the deep learning model with only a limited amount of training data. However, although conventional studies on such training data selections take into consideration the training data labeling cost, the selection cost required in the training data selection process is not taken into consideration, which is a problem. Therefore, with the consideration of the selection cost constraint in addition to the data labeling cost constraint, we introduce a training data selection problem and propose novel algorithms to solve it. The advantage of the proposed algorithms is that they can be applied to any network model or data model of deep learning. The performance was verified through experiments using various network models and data.
- Files in This Item
-
Go to Link
- Appears in
Collections - ICT융합공학부 > IT공학전공 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.