Open-Awesome
Information Retrieval

32 projects

Haystack (MDX)

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications in Python.

Tags: #semantic-search #ai #information-retrieval
Stars: 25.5k
Forks: 2.8k
Last commit: 3 days ago
Sentence Transformers (Python)

A Python framework for computing and training state-of-the-art text embeddings, rerankers, and sparse encoders.

Tags: #semantic-search #information-retrieval #python-library
Stars: 18.8k
Forks: 2.8k
Last commit: 3 days ago
DocsGPT (Python)

Open-source AI platform for building private agents, assistants, and enterprise search with document analysis and multi-model support.

Tags: #ai #information-retrieval #multi-model-support
Stars: 17.9k
Forks: 2.1k
Last commit: 1 day ago
gensim (Python)

A Python library for topic modeling, document indexing, and similarity retrieval with large corpora.

Tags: #word2vec #information-retrieval #python-library
Stars: 16.4k
Forks: 4.4k
Last commit: 7 months ago
Weaviate (Go)

An open-source, cloud-native vector database that combines semantic search with structured filtering for AI applications.

Tags: #semantic-search #ai #approximate-nearest-neighbor-search
Stars: 16.3k
Forks: 1.3k
Last commit: 2 days ago
tantivy (Rust)

A full-text search engine library written in Rust, inspired by Apache Lucene.

Tags: #information-retrieval #search-index #text-search
Stars: 15.3k
Forks: 919
Last commit: 2 days ago
bleve (Go)

A modern indexing and search library for Go supporting text, numeric, geo-spatial, and vector data.

Tags: #information-retrieval #go-library #cli-tool
Stars: 11.1k
Forks: 703
Last commit: 4 days ago
Knwl.js (JavaScript)

A JavaScript library for parsing text to extract dates, times, phone numbers, emails, places, and other structured information.

Tags: #plugin-system #information-retrieval #natural-language-processing
Stars: 5.3k
Forks: 212
Last commit: 2 years ago
DrQA (Python)

A PyTorch system for open-domain question answering by retrieving and reading documents, originally applied to Wikipedia.

Tags: #information-retrieval #neural-networks #question-answering
Stars: 4.5k
Forks: 885
Last commit: 2 years ago
StringZilla (C)

A high-performance string library leveraging SIMD and SWAR to accelerate search, hashing, sorting, and edit distances across C, C++, Python, Rust, and more.

Tags: #memory-mapping #substring #information-retrieval
Stars: 3.5k
Forks: 125
Last commit: 1 day ago
TensorFlow Ranking (Python)

A TensorFlow library for Learning-to-Rank (LTR) techniques, providing loss functions, metrics, and models for ranking tasks.

Tags: #information-retrieval #deep-learning #recommender-systems
Stars: 2.8k
Forks: 477
Last commit: 2 years ago
NumKong (C)

A portable mixed-precision math library with 2,000+ SIMD kernels for 15+ numeric types across x86, Arm, RISC-V, and WebAssembly.

Tags: #blas #assembly #information-retrieval
Stars: 1.8k
Forks: 121
Last commit: 16 days ago
WFGY (Jupyter Notebook)

An open-source AI troubleshooting atlas and avatar runtime for diagnosing and fixing RAG, agent, and real-world AI workflow failures.

Tags: #reasoning #ai-evaluation #information-retrieval
Stars: 1.8k
Forks: 162
Last commit: 1 day ago
Information Retrieval

A curated list of awesome information retrieval resources including books, courses, datasets, software, and conferences.

Tags: #research-datasets #information-retrieval #awesome-list
Stars: 1.2k
Forks: 142
Last commit: 3 years ago
awesome Information Retrieval

A curated list of awesome resources for information retrieval and web search, including books, courses, datasets, and software.

Tags: #research-datasets #information-retrieval #natural-language-processing
Stars: 1.2k
Forks: 142
Last commit: 3 years ago
Cornac (Python)

A comparative Python framework for building, evaluating, and deploying multimodal recommender systems with auxiliary data.

Tags: #recommendation-algorithms #multimodality #recommender-system
Stars: 1.0k
Forks: 166
Last commit: 4 days ago
allRank (Python)

A PyTorch framework for training neural learning-to-rank models with flexible loss functions and scoring architectures.

Tags: #transformer #ndcg #information-retrieval
Stars: 1.0k
Forks: 129
Last commit: 1 year ago
tf-idf-similarity (Ruby)

A Ruby gem for calculating text similarity using tf*idf and BM25 vector space models.

Tags: #information-retrieval #tf-idf #text-analysis
Stars: 781
Forks: 62
Last commit: 2 years ago
Question Answering

A curated list of resources for Question Answering (QA), covering machine learning, deep learning, datasets, and research.

Tags: #squad #nlp-resources #information-retrieval
Stars: 769
Forks: 104
Last commit: 4 years ago
MeTA (C++)

A modern C++ toolkit for text retrieval and analysis, featuring indexing, ranking, topic modeling, classification, and language models.

Tags: #information-retrieval #text-classification #graph-algorithms
Stars: 714
Forks: 237
Last commit: 3 years ago
resin (C#)

A vector space search engine, vector database, and key/value store for efficient string processing and vector operations.

Tags: #search #nlu-engine #resin
Stars: 575
Forks: 41
Last commit: 3 months ago
JavaFastPFOR (Java)

A high-performance Java library for compressing arrays of integers, optimized for databases and information retrieval.

Tags: #java-library #information-retrieval #high-performance
Stars: 567
Forks: 65
Last commit: 2 months ago
scrape (Elixir)

An Elixir library for structured data extraction from websites, articles, and RSS/Atom feeds using information-retrieval techniques.

Tags: #readability #elixir #information-retrieval
Stars: 337
Forks: 41
Last commit: 5 years ago
VeritasGraph (Python)

An enterprise-grade Graph RAG framework combining hierarchical tree navigation with knowledge graph reasoning for verifiable, on-premise AI.

Tags: #multi-hop-reasoning #information-retrieval #document-navigation
Stars: 282
Forks: 33
Last commit: 10 days ago
ferret (C)

An extensible information retrieval library for Ruby, similar to Apache Lucene.

Tags: #search-library #information-retrieval #ruby-bindings
Stars: 280
Forks: 57
Last commit: 3 years ago
awesome-hungarian-nlp

A curated list of free tools, datasets, models, and resources for Hungarian Natural Language Processing.

Tags: #computational-linguistics #hungarian #information-retrieval
Stars: 278
Forks: 19
Last commit: 1 month ago
stemmer (Elixir)

An English (Porter2) stemming implementation in Elixir for reducing words to their base forms.

Tags: #nlp-library #elixir #information-retrieval
Stars: 154
Forks: 10
Last commit: 2 years ago
RAKE.go (Go)

A Go implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm for extracting keywords from text.

Tags: #rake-algorithm #information-retrieval #text-analysis
Stars: 123
Forks: 19
Last commit: 1 year ago
porter-stemmer (JavaScript)

A Node.js implementation of Martin Porter's stemming algorithm for removing morphological endings from English words.

Tags: #commonjs #information-retrieval #natural-language-processing
Stars: 101
Forks: 12
Last commit: 5 years ago
Word Tokenizers (Julia)

A Julia package providing high-performance, configurable tokenizers and sentence splitters for natural language processing.

Tags: #julia #computational-linguistics #sentence-splitting
Stars: 100
Forks: 25
Last commit: 4 years ago
perlin (Rust)

An efficient and ergonomic document search engine library built on top of perlin-core.

Tags: #open-source #information-retrieval #document-search
Stars: 78
Forks: 4
Last commit: 8 years ago