Information Retrieval

Sentence TransformersPython

A Python framework for computing and training state-of-the-art text embeddings, rerankers, and sparse encoders.

#semantic-search#information-retrieval#python-library

Open-source AI platform for building private agents, assistants, and enterprise search with document analysis and multi-model support.

#ai#information-retrieval#multi-model-support

Stars18.0k

Forks2.1k

Last commit4 hours ago

WeaviateGo

An open-source, cloud-native vector database that combines semantic search with structured filtering for AI applications.

#semantic-search#ai#approximate-nearest-neighbor-search

Stars16.6k

Forks1.4k

#word2vec#information-retrieval#python-library

gensimPython

A Python library for topic modeling, document indexing, and similarity retrieval with large text corpora.

Stars16.5k

Forks4.4k

Last commit8 months ago

gensimPython

A Python library for topic modeling, document indexing, and similarity retrieval with large corpora.

#word2vec#information-retrieval#python-library

Stars16.5k

Forks4.4k

Last commit8 months ago

tantivyRust

A full-text search engine library written in Rust, inspired by Apache Lucene.

#information-retrieval#search-index#text-search

Stars15.6k

Forks949

Last commit8 hours ago

bleveGo

A modern indexing and search library for Go supporting text, numeric, geo-spatial, and vector data.

#information-retrieval#go-library#cli-tool

Stars11.2k

Forks711

Last commit10 hours ago

Knwl.jsJavaScript

A JavaScript library for parsing text to extract dates, times, phone numbers, emails, places, and other structured information.

#plugin-system#information-retrieval#natural-language-processing

Stars5.3k

Forks210

#information-retrieval#neural-networks#question-answering

DrQAPython

A PyTorch system for open-domain question answering by retrieving and reading documents, originally applied to Wikipedia.

Stars4.5k

Forks882

#information-retrieval#shell#bash

screenfetchShell

Fetches system/theme information in terminal for Linux desktop screenshots.

Stars4.1k

Forks446

Last commit4 months ago

StringZillaC

A high-performance string library leveraging SIMD and SWAR to accelerate search, hashing, sorting, and edit distances across C, C++, Python, Rust, and more.

#memory-mapping#substring#information-retrieval

Stars3.5k

Forks131

#information-retrieval#deep-learning#recommender-systems

TensorFlow RankingPython

A TensorFlow library for Learning-to-Rank (LTR) techniques, providing loss functions, metrics, and models for ranking tasks.

Stars2.8k

Forks477

#blas#assembly#information-retrieval

NumKongC

A portable mixed-precision math library with 2,000+ SIMD kernels for 15+ numeric types across x86, Arm, RISC-V, and WebAssembly.

Stars1.9k

Forks125

#reasoning#ai-evaluation#information-retrieval

WFGYJupyter Notebook

An open-source AI troubleshooting atlas and avatar runtime for diagnosing and fixing RAG, agent, and real-world AI workflow failures.

Stars1.8k

Forks162

#search#information-retrieval#lucene

Apache SolrJava

Apache Solr open-source search software

Stars1.6k

Forks845

awesome Information Retrieval

A curated list of awesome resources for information retrieval and web search, including books, courses, datasets, and software.

#research-datasets#information-retrieval#natural-language-processing

Stars1.2k

Forks142

#research-datasets#information-retrieval#awesome-list

Information Retrieval

A curated list of awesome information retrieval resources including books, courses, datasets, software, and conferences.

Stars1.2k

Forks142

#recommendation-algorithms#multimodality#recommender-system

CornacPython

A comparative Python framework for building, evaluating, and deploying multimodal recommender systems with auxiliary data.

Stars1.1k

Forks171

Last commit5 days ago

allRankPython

A PyTorch framework for training neural learning-to-rank models with flexible loss functions and scoring architectures.

#transformer#ndcg#information-retrieval

Stars1.0k

Forks129

Last commit1 year ago

tf-idf-similarityRuby

A Ruby gem for calculating text similarity using tf*idf and BM25 vector space models.

#information-retrieval#tf-idf#text-analysis

Stars783

Forks62

#squad#nlp-resources#information-retrieval

Question Answering

A curated list of resources for Question Answering (QA), covering machine learning, deep learning, datasets, and research.

Stars769

Forks104

Last commit4 years ago

MeTAC++

A modern C++ toolkit for text retrieval and analysis, featuring indexing, ranking, topic modeling, classification, and language models.

#information-retrieval#text-classification#graph-algorithms

Stars714

Forks239

#java-library#information-retrieval#high-performance

resinC#

A vector space search engine, vector database, and key/value store for efficient string processing and vector operations.

#search#nlu-engine#resin

Stars577

Forks41

Last commit1 month ago

JavaFastPFORJava

A high-performance Java library for compressing arrays of integers, optimized for databases and information retrieval.

Stars569

Forks66

Last commit29 days ago

scrapeElixir

An Elixir library for structured data extraction from websites, articles, and RSS/Atom feeds using information-retrieval techniques.

#readability#elixir#information-retrieval

Stars337

Forks41

Last commit6 years ago

VeritasGraphPython

An enterprise-grade Graph RAG framework combining hierarchical tree navigation with knowledge graph reasoning for verifiable, on-premise AI.

#multi-hop-reasoning#information-retrieval#knowledge-graphs

Stars303

Forks34

Last commit

awesome-hungarian-nlp

A curated list of free tools, datasets, models, and resources for Hungarian Natural Language Processing.

#computational-linguistics#hungarian#information-retrieval

Stars281

Forks19

Last commit3 months ago

ferretC

An extensible information retrieval library for Ruby, similar to Apache Lucene.

#search-library#information-retrieval#ruby-bindings

Stars280

Forks57

#nlp-library#elixir#information-retrieval

stemmerElixir

An English (Porter2) stemming implementation in Elixir for reducing words to their base forms.

Stars154

Forks10