On the Theoretical Advantages of Bilinear Similarities in Dense Retrieval

Abstract

We present a theoretical and empirical study of bilinear similarity functions in neural IR, showing that they are strictly more expressive than dot-product and weighted dot-product (WDP) models under fixed embeddings. We prove this separation formally and illustrate it with the Structured Agreement Ranking Task, where a simple rank-2 bilinear model achieves 100% accuracy while all WDP models fail. This highlights the importance of modeling feature interactions for conditional relevance. On MS MARCO, low-rank bilinear models significantly outperform dot-product baselines: a rank-32 model triples performance (MRR@10: 0.090 vs. 0.031), and rank-128 approaches a 4x gain. These results offer a principled and practical case for using low-rank bilinear models in dense retrieval. Code:https://github.com/shubham526/bilinear-projection-theory.

Department(s)

Computer Science

International Standard Book Number (ISBN)

978-303206068-6

International Standard Serial Number (ISSN)

1611-3349; 0302-9743

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2026 Springer, All rights reserved.

Publication Date

01 Jan 2026

Share

 
COinS