MongoDB Introduces Embedding and Reranking API on Atlas

MongoDB Introduces Embedding and Reranking API on Atlas

Abdul Rahman

Public Preview of Embedding and Reranking API

MongoDB has announced the public preview of its Embedding and Reranking API on MongoDB Atlas. This new interface provides developers with direct access to Voyage AI’s search models within the managed cloud database environment. The release aims to facilitate the creation of features such as semantic search and AI-powered assistants within a single integrated system, offering consolidated monitoring and billing capabilities.

Streamlining AI Retrieval Systems

According to MongoDB, the new API is database-agnostic and designed to integrate into various tech stacks. It targets teams building retrieval-powered AI systems, ranging from Retrieval-Augmented Generation (RAG) to AI agents. Thibaut Gourdel, senior technical product marketing manager, and Wen Phan, staff product manager at MongoDB, noted that building AI retrieval currently involves connecting databases, vector search, and retrieval model providers, which increases operational complexity. This API intends to address those challenges by centralizing components on the Atlas platform.

Voyage 4 Series Availability and Flexibility

The announcement coincides with the availability of the Voyage 4 series, which includes four distinct models: voyage-4-large, voyage-4, voyage-4-lite, and the open-weights voyage-4-nano. A key feature of this series is that the text embedding models operate within the same embedding space. This allows teams to store data using one model, such as voyage-4-large, and execute queries using a different Voyage 4 model, differing from previous generations that required identical models for both queries and documents.

Technical Specifications and Vector Search Updates

The embedding models support dimensions ranging from 256 to 2048 and include quantization capabilities, enabling developers to balance requirements for accuracy, cost, and speed. Beyond general models, Voyage offers options tailored for specific fields, whole-document analysis, multimodal data, and reranking in multi-step search systems. Additionally, automated embedding in vector search is now available in preview for the community edition, while Lexical Prefilters for MongoDB Vector Search have entered public preview to provide text and geo analysis alongside vector capabilities.

Shifts in Production Use Cases

Deepak Goyal commented on the trends in AI infrastructure, suggesting that if data is 24 hours old, RAG systems become mere indexed archives rather than intelligent systems. He observed a shift toward unifying data flows, noting that while specialized vector stores act as powerful external components, integrated solutions are increasingly favored for speed and simplicity in the majority of production use cases.

Read More