An AI-native database built for LLM applications, offering incredibly fast hybrid search across vectors, tensors, and full-text.
Infinity is an AI-native database engineered for LLM (Large Language Model) applications, providing incredibly fast hybrid search capabilities across dense vectors, sparse vectors, tensors, and full-text data. It solves the problem of efficiently retrieving and managing diverse data types essential for AI-driven tasks like search, recommendation, and RAG (Retrieval-augmented Generation).
AI developers, data engineers, and researchers building LLM applications such as search systems, recommenders, question-answering bots, conversational AI, and RAG pipelines who need high-performance, flexible data retrieval.
Developers choose Infinity for its exceptional speed (sub-millisecond latency), support for hybrid search across multiple data types, and ease of use with a Python-first API and simple deployment, making it a powerful, all-in-one database for modern AI workloads.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Achieves 0.1 milliseconds query latency and 15K+ QPS on million-scale vector datasets, as benchmarked in the README, making it ideal for high-throughput AI applications.
Supports hybrid search combining dense embeddings, sparse embeddings, tensors, and full text with rerankers like RRF and ColBERT, enabling complex, multi-modal queries essential for RAG.
Offers an intuitive Python API and can be embedded as a module, simplifying integration for AI developers, as shown in the example code for vector search.
Single-binary architecture with no dependencies allows for easy deployment via Docker or binary, reducing setup complexity, as highlighted in the deployment instructions.
Uses nightly Docker images and dev SDK versions (e.g., infinity-sdk==0.7.0.dev5), indicating potential instability, breaking changes, and lack of production readiness.
Requires x86_64 CPUs with AVX2 support and specific OS versions like Linux with glibc 2.17+, excluding older or non-compatible systems, as noted in prerequisites.
As a newer project, it lacks the extensive tooling, integrations, and community support of established databases like Elasticsearch or PostgreSQL with pgvector.