Named Entity Recognition

#machine-translation#academic#dialogue

spacyPython

Industrial-strength Natural Language Processing library for Python, featuring pretrained pipelines for 70+ languages and production-ready training.

#nlp-library#ai#spacy

Stars33.8k

Forks4.7k

Last commit2 months ago

NLP-progressPython

A community-driven repository tracking datasets and state-of-the-art results for common NLP tasks across multiple languages.

Stars23.0k

Forks3.6k

#part-of-speech-tagging#biomedical-nlp#semantic-role-labeling

flairPython

A simple Python framework for state-of-the-art natural language processing (NLP) tasks like named entity recognition and sentiment analysis.

Stars14.4k

Forks2.1k

Flair embeddings from PubMedPython

A simple Python framework for state-of-the-art natural language processing (NLP) tasks like named entity recognition and sentiment analysis.

#sequence-tagging#biomedical-nlp#python-library

Stars14.4k

Forks2.1k

#part-of-speech-tagging#nlp-library#plugin-system

NLP CompromiseJavaScript

A lightweight JavaScript library for natural language processing that transforms text into structured data with a modest, pragmatic approach.

A Python NLP library from Stanford for tokenization, sentence segmentation, NER, and dependency parsing across 60+ languages.

#biomedical-nlp#python-library#deep-learning

Stars7.9k

Forks947

Last commit21 hours ago

DeepPavlovPython

An open-source NLP framework for building and deploying deep learning dialog systems and chatbots with PyTorch and transformers.

#chitchat#deep-learning#nlp-framework

Stars7.0k

Forks1.2k

Last commit11 months ago

spark-nlpScala

A state-of-the-art Natural Language Processing library built on Apache Spark, offering 100,000+ pretrained models and pipelines in 200+ languages.

#apache-spark#spark#transformer-models

Stars4.1k

Forks743

Last commit2 days ago

Snips NLUPython

Snips Python library to extract meaning from text

#text-classification#intent parser#slot-filling

Stars4.0k

Forks504

MIT Information Extraction ToolkitC++

A free, state-of-the-art library and toolkit for named entity extraction and binary relation detection from text.

#relation-extraction#java-library#python-library

Stars3.0k

Forks532

Last commit9 months ago

universal-data-toolJavaScript

A web/desktop application for collaborative labeling and annotation of images, text, audio, documents, and other data types.

#dataset-creation#desktop-app#web-app

Stars2.1k

Forks200

#biomedical-nlp#scientific-text#spacy

ScispaCyPython

A spaCy pipeline and models specifically designed for processing scientific and biomedical documents.

Stars2.0k

Forks258

Last commit7 months ago

NeuroNERPython

An easy-to-use, state-of-the-art named-entity recognition (NER) tool based on neural networks.

#text-analysis#deep-learning#neural-networks

Stars1.7k

Forks472

#relation-extraction#scientific-text#text-classification

SciBERTPython

A BERT language model pre-trained on a large corpus of scientific papers for natural language processing tasks in scientific domains.

Stars1.7k

Forks231

Last commit4 years ago

treatRuby

A comprehensive natural language processing framework for Ruby with support for text extraction, parsing, and machine learning.

#text-extraction#computational-linguistics#text-analysis

Stars1.4k

Forks124

#chatbots#text-classification#domain-specific-language

ChatitoTypeScript

Generate datasets for AI chatbots, NLP tasks, NER, and text classification using a simple domain-specific language.

Stars889

Forks147

#stock-price-prediction#financial-ai#instruction-tuning

PIXIUJupyter Notebook

An open-source suite featuring financial large language models (FinMA), instruction datasets (FIT), and evaluation benchmarks (FinBen) for financial AI.

Stars878

Forks121

#natural-language-understanding#ai#text-analysis

CatalystC#

Catalyst is a high-performance C# NLP library inspired by spaCy, offering pre-trained models, entity recognition, and embedding training.

Stars854

Forks85

Last commit2 days ago

DBPedia SpotlightScala

A tool for automatically annotating mentions of DBpedia resources in text, linking entities to their global identifiers.

#content-tagging#text-analysis#semantic-annotation

Stars759

Forks192

Last commit8 years ago

BioBERT

Pre-trained biomedical language representation model for biomedical text mining tasks like named entity recognition and relation extraction.

#relation-extraction#biomedical-nlp#transfer-learning

Stars706

Forks92

Last commit6 years ago

stanford-corenlp-pythonPython

A Python wrapper and JSON-RPC server for Stanford CoreNLP, providing NLP tools like parsing, tagging, and coreference resolution.

#parsing#json-rpc#coreference-resolution

Stars610

Forks226

#ikvm#pos-tagging#recompiled-packages

Stanford.NLP for .NETC#

A .NET wrapper for Stanford CoreNLP providing natural language processing capabilities including tokenization, parsing, and named entity recognition.

Stars609

Forks117

#relation-extraction#biomedical-nlp#transfer-learning

BlueBERTPython

A BERT model pre-trained on PubMed abstracts and clinical notes for biomedical natural language processing tasks.

Stars597

Forks80

#part-of-speech-tagging#cogcomp#java-library

CogCompNLPJava

A comprehensive suite of Java NLP libraries and tools for text annotation, feature extraction, and language processing tasks.

Stars479

Forks143

#spacy#clinical-text#metamap

medaCyPython

A medical text mining and information extraction framework built on spaCy for rapid prototyping and training of predictive NLP models.

Stars438

Forks92

#computational-linguistics#pos-tagging#machine-translation

Spanish

A curated collection of linguistic resources, tools, and datasets for Natural Language Processing and Computational Linguistics on Spanish.

Stars351

Forks42

#text-classification#transformer-models#machine-translation

CybertronGo

A pure Go package for running inference with pre-trained Transformer models from Hugging Face, enabling NLP tasks without external languages.

Stars330

Forks27

#spacy#nlp-resources#bokmal

Norwegian NLP resources

A curated collection of open-source libraries, models, datasets, and tools for Natural Language Processing (NLP) in Norwegian.

Stars182

Forks15

Last commit5 years ago

EDS_NLPPython

A modular NLP framework for extracting information from French clinical notes, compatible with spaCy and PyTorch.

#medical-text#spacy#fast

Stars164

Forks44

Last commit13 days ago

NLP4JJava

A natural language processing framework for JVM languages with comprehensive linguistic analysis tools.

#coreference-resolution#java-nlp#semantic-role-labeling

Stars154

Forks32

Last commit5 years ago

ruby-nlpRuby

Ruby bindings for Stanford NLP tools providing part-of-speech tagging and named entity recognition capabilities.

#part-of-speech-tagging#nlp-tools#natural-language-processing

Stars92

Forks14

Last commit12 years ago

open-nlpRuby

Ruby bindings to the OpenNLP Java toolkit for natural language processing tasks like tokenization, POS tagging, and named entity recognition.

#java bindings#jruby#pos-tagging

Stars91

Forks11

#c-plus-plus-library#computational-linguistics#memory-based-learning

frogC++

A tagger, lemmatizer, morphological analyzer, and dependency parser for Dutch using memory-based NLP modules.

Stars82

Forks12

Last commit1 month ago

ruby-spacyRuby

A Ruby wrapper for the spaCy NLP library via PyCall, enabling tokenization, POS tagging, NER, and OpenAI integration.

#parsing#nlp-library#spacy

Stars68

Forks6

Last commit3 days ago

max-named-entity-taggerPython

A named entity recognition model that locates and tags entities like persons, locations, and organizations in text using a neural network.

#neural-network#text-analysis#api-service

Stars25

Forks18