CP-CLIP: Core-Periphery Feature Alignment CLIP For Zero-Shot Medical Image Analysis
Abstract
Multi-modality learning, exemplified by the language and image pair pre-trained CLIP model, has demonstrated remarkable performance in enhancing zero-shot capabilities and has gained significant attention in the field. However, simply applying language-image pretrained CLIP to medical image analysis encounters substantial domain shifts, resulting in significant performance degradation due to inherent disparities between natural (non-medical) and medical image characteristics. To address this challenge and uphold or even enhance CLIP's zero-shot capability in medical image analysis, we develop a novel framework, Core-Periphery feature alignment for CLIP (CP-CLIP), tailored for handling medical images and corresponding clinical reports. Leveraging the foundational core-periphery organization that has been widely observed in brain networks, we augment CLIP by integrating a novel core-periphery-guided neural network. This auxiliary CP network not only aligns text and image features into a unified latent space more efficiently but also ensures the alignment is driven by domain-specific core information, e.g., in medical images and clinical reports. In this way, our approach effectively mitigates and further enhances CLIP's zero-shot performance in medical image analysis. More importantly, our designed CP-CLIP exhibits excellent explanatory capability, enabling the automatic identification of critical regions in clinical analysis. Extensive experimentation and evaluation across five public datasets underscore the superiority of our CP-CLIP in zero-shot medical image prediction and critical area detection, showing its promising utility in multimodal feature alignment in current medical applications.
Recommended Citation
X. Yu et al., "CP-CLIP: Core-Periphery Feature Alignment CLIP For Zero-Shot Medical Image Analysis," Lecture Notes in Computer Science, vol. 15003 LNCS, pp. 88 - 97, Springer, Jan 2024.
The definitive version is available at https://doi.org/10.1007/978-3-031-72384-1_9
Department(s)
Computer Science
Keywords and Phrases
CP-CLIP; Feature Alignment; Zero-Shot
International Standard Book Number (ISBN)
978-303172383-4
International Standard Serial Number (ISSN)
1611-3349; 0302-9743
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2025 Springer, All rights reserved.
Publication Date
01 Jan 2024

Comments
National Institutes of Health, Grant R01AG075582