Unveiling Milvus 2.4: Multi-vector Search, Sparse Vector, CAGRA Index, and More!

Published in

Vector Database for AI

3 min readMar 20, 2024

We are happy to announce the launch of Milvus 2.4, a major advancement in enhancing search capabilities for large-scale datasets. This latest release adds new features, such as support for the GPU-based CAGRA index, beta support for sparse embeddings, group search, and various other improvements in search capabilities. These developments reinforce our commitment to the community by offering developers like you a powerful and efficient tool for handling and querying vector data. Let’s jump into the key benefits of Milvus 2.4 together.

Enabled Multi-vector Search for Simplified Multimodal Searches

Milvus 2.4 provides multivector search capability, allowing simultaneous search and reranking of different vector types within the same Milvus system. This feature streamlines multimodal searches, significantly enhancing recall rates and enabling developers to effortlessly manage intricate AI applications with varied data types. Additionally, this functionality simplifies the integration and fine-tuning of custom reranking models, aiding in the creation of advanced search functions like precise recommender systems that utilize insights from multidimensional data.

Multivector support in Milvus has two components:

The ability to store/query multiple vectors for a single entity within a collection, which is a more natural way to organize data
The ability to build/optimize a reranking algorithm by leveraging the prebuilt reranking algorithms in Milvus

Besides being a highly requested feature, we built this capability because the industry is moving towards multimodal models with the release of GPT-4 and Claude 3. Reranking is a commonly used technique to further improve query performance in search. We aimed to make it easy for developers to build and optimize their rerankers within the Milvus ecosystem.

Grouping Search Support for Enhanced Compute Efficiency

Grouping Search is another often requested feature that we added to Milvus 2.4. This feature improves compute efficiency and developer productivity when handling grouped search queries. In particular, this functionality tackles the challenges associated with querying large datasets like documents or videos segmented into vectorized chunks or frames by enabling the aggregation of search results based on specific attributes. With this new feature, developers can obtain top results grouped by specified fields (BOOL, INT, or VARCHAR) with a simple query, thus removing any custom code to enable aggregation.

Beta Support for Sparse Vector Embeddings

We have expanded the Hybrid Search in Milvus to include sparse embeddings so developers can further refine their semantically rich approximate nearest neighbor (ANN) searches. This feature, compatible with neural models like SPLADEv2 and statistical models like BM25, enables hybrid search strategies that combine keyword and embedding approaches. It is ideal for users seeking enhanced search accuracy without extensive customization efforts.

We are labeling this feature as “Beta” to continue our performance testing of the feature and gather feedback from the community.

CAGRA Index Support for Advanced GPU-Accelerated Graph Indexing

Developed by NVIDIA, CAGRA (Cuda Anns GRAph-based) is a GPU-based graph indexing technology that significantly surpasses traditional CPU-based methods like the HNSW index in efficiency and performance, especially in high-throughput environments.

With the introduction of the CAGRA Index, Milvus 2.4 provides enhanced GPU-accelerated graph indexing capability. This enhancement is ideal for building similarity search applications requiring minimal latency. Additionally, Milvus 2.4 integrates a brute-force search with the CAGRA index to achieve maximum recall rates in applications. For detailed insights, explore the introduction blog on CAGRA.

Additional Enhancements and Features

Milvus 2.4 also includes other key enhancements, such as Regular Expression support for enhanced substring matching in metadata filtering, a new scalar inverted index for efficient scalar data type filtering, and a Change Data Capture tool for monitoring and replicating changes in Milvus collections. These updates collectively enhance Milvus’s performance and versatility, making it a comprehensive solution for complex data operations.

For more details, see Milvus 2.4 documentation.

Stay Connected!

Excited to learn more about Milvus 2.4? Join our upcoming webinar with James Luan, Zilliz’s VP of Engineering, for an in-depth discussion on the capabilities of this latest release. If you have questions or feedback, join our Discord channel to engage with our engineers and community members. Don’t forget to follow us on Twitter or LinkedIn for the latest news and updates about Milvus.