Neural network explanation using inversion
Abstract
An important drawback of many artificial neural networks (ANN) is their lack of explanation capability [Andrews, R., Diederich, J., & Tickle, A. B. (1996). A survey and critique of techniques for extracting rules from trained artificial neural networks. Knowledge-Based Systems, 8, 373-389]. This paper starts with a survey of algorithms which attempt to explain the ANN output. We then present HYPINV,1 a new explanation algorithm which relies on network inversion; i.e. calculating the ANN input which produces a desired output. HYPINV is a pedagogical algorithm, that extracts rules, in the form of hyperplanes. It is able to generate rules with arbitrarily desired fidelity, maintaining a fidelity-complexity tradeoff. To our knowledge, HYPINV is the only pedagogical rule extraction method, which extracts hyperplane rules from continuous or binary attribute neural networks. Different network inversion techniques, involving gradient descent as well as an evolutionary algorithm, are presented. An information theoretic treatment of rule extraction is presented. HYPINV is applied to example synthetic problems, to a real aerospace problem, and compared with similar algorithms using benchmark problems.
Recommended Citation
E. W. Saad and D. C. Wunsch, "Neural network explanation using inversion," Neural Networks, Elsevier Science Ltd., Oct 2006.
The definitive version is available at https://doi.org/10.1016/j.neunet.2006.07.005
Department(s)
Electrical and Computer Engineering
Keywords and Phrases
Evolutionary Algorithm; Explanation Capability of Neural Networks; Hyperplanes; Inversion; Neural Network Explanation; Pedagogical; Rule Extraction
International Standard Serial Number (ISSN)
0893-6080
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2006 Elsevier Science Ltd., All rights reserved.
Publication Date
01 Oct 2006