OneSparse: A Unified System for Multi-index Vector Search

  • Yaoqi Chen ,
  • Ruicheng Zheng ,
  • ,
  • Shuotao Xu ,
  • ,
  • Xue Wu ,
  • Weihao Han ,
  • Hua Yuan ,
  • Mingqin Li ,
  • Yujing Wang ,
  • Jason Li ,
  • ,
  • Hao Sun ,
  • Weiwei Deng ,
  • Feng Sun ,
  • Qi Zhang ,
  • Mao Yang

2024 The Web Conference |

Multi-index vector search has become the cornerstone for many applications, such as recommendation systems. Efficient search in such a multi-modal hybrid vector space is challenging since no single index design performs well for all kinds of vector data. Existing approaches to processing multi-index hybrid queries either suffer from algorithmic limitations or processing inefficiency. In this paper, we propose OneSparse, a unified multi-vector index query system that incorporates multiple posting-based vector indices, which enables highly efficient retrieval of multi-modal data-sets. OneSparse introduces a novel multi-index query engine design of inter-index intersection push-down. It also optimizes the vector posting format to expedite multi-index queries. Our experiments show OneSparse achieves more than 6× search performance improvement while maintaining comparable accuracy. OneSparse has already been integrated into Microsoft online web search and advertising systems with 5 × + latency gain for Bing web search and 2.0% Revenue Per Mille (RPM) gain for Bing sponsored search.