Abstract
Deep Learning (DL) models have proven to be very effective in solving many challenging problems, especially, those related to computer vision, text, and speech. However, the design of such models is challenging because of the vast search space and computational complexity that needs to be explored. Our goal in this paper is to reduce the human effort required to design architectures by using a system architecture development process that allows the exploration of large design space by automating certain model construction, alternative generation, and assessment. The proposed framework is generic and targeted at all deep learning architectures that can be expressed by logical models with certain numeric properties. The implementation of the proposed approach is presented, along with the test results achieved on CIFAR-10 dataset using a convolutional neural network (CNN). We show that the architecture generated by our approach achieves 5.23% error rate with only 1.2M parameters, which shows the capability to design high performing architectures.
Recommended Citation
R. D. Gottapu and C. H. Dagli, "System Architecting Approach for Designing Deep Learning Models," Procedia Computer Science, vol. 153, pp. 37 - 44, Elsevier B.V., Apr 2019.
The definitive version is available at https://doi.org/10.1016/j.procs.2019.05.053
Meeting Name
17th Annual Conference on Systems Engineering Research, CSER 2019 (2019: Apr. 3-4, Washington, DC)
Department(s)
Engineering Management and Systems Engineering
Keywords and Phrases
Convolutional Neural Network (CNN); Deep Learning (DL); System Architecting
International Standard Serial Number (ISSN)
1877-0509
Document Type
Article - Conference proceedings
Document Version
Final Version
File Type
text
Language(s)
English
Rights
© 2019 The Authors, All rights reserved.
Creative Commons Licensing
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.
Publication Date
01 Apr 2019