Doctoral Dissertations
Keywords and Phrases
Active Learning; Attention Mechanism; Biomedical Image Analysis; Computer Vision; Deep Learning; Person Re-Identification
Abstract
“Attention mechanism, which is one of the most important algorithms in the deep Learning community, was initially designed in the natural language processing for enhancing the feature representation of key sentence fragments over the context. In recent years, the attention mechanism has been widely adopted in solving computer vision tasks by guiding deep neural networks (DNNs) to focus on specific image features for better understanding the semantic information of the image. However, the attention mechanism is not only capable of helping DNNs understand semantics, but also useful for the feature fusion, visual cue discovering, and temporal information selection, which are seldom researched. In this study, we take the classic attention mechanism a step further by proposing the Semantic Attention Guidance Unit (SAGU) for multi-level feature fusion to tackle the challenging Biomedical Image Segmentation task. Furthermore, we propose a novel framework that consists of (1) Semantic Attention Unit (SAU), which is an advanced version of SAGU for adaptively bringing high-level semantics to mid-level features, (2) Two-level Spatial Attention Module (TSPAM) for discovering multiple visual cues within the image, and (3) Temporal Attention Module (TAM) for temporal information selection to solve the Videobased Person Re-identification task. To validate our newly proposed attention mechanisms, extensive experiments are conducted on challenging datasets. Our methods obtain competitive performance and outperform state-of-the-art methods. Selective publications are also presented in the Appendix”--Abstract, page iii.
Advisor(s)
Yin, Zhaozheng
Committee Member(s)
Leopold, Jennifer
Taylor, Patrick
Qin, Ruwen
Nadendla, V. Sriram Siddhardh
Department(s)
Computer Science
Degree Name
Ph. D. in Computer Science
Publisher
Missouri University of Science and Technology
Publication Date
Spring 2020
Pagination
ix, 126 pages
Note about bibliography
Includes bibliographic references (pages 115-125).
Rights
© 2020 Haohan Li, All rights reserved.
Document Type
Dissertation - Open Access
File Type
text
Language
English
Thesis Number
T 12087
Recommended Citation
Li, Haohan, "Attention mechanism in deep neural networks for computer vision tasks" (2020). Doctoral Dissertations. 3132.
https://scholarsmine.mst.edu/doctoral_dissertations/3132