Showing 24 of 24 projects
A curated list of resources, tools, datasets, and learning materials for Chinese Natural Language Processing.
A multi-domain Chinese word segmentation toolkit offering higher accuracy and domain-specific models.
A curated list of 100 foundational and influential papers in natural language processing for students and researchers.
A Python NLP library built on spaCy for text preprocessing, feature extraction, and analysis tasks.
A Python library and CLI tool for converting text to phonetic transcriptions (phones) across multiple languages using various backends.
A comprehensive natural language processing framework for Ruby with support for text extraction, parsing, and machine learning.
A curated list of awesome resources, libraries, and tools for natural language processing (NLP) in Ruby.
A curated list of awesome resources, libraries, and tools for natural language processing (NLP) in Ruby.
A curated directory of academic institutions and principal investigators in computational neuroscience worldwide.
An R package for the quantitative analysis of textual data, providing comprehensive tools for natural language processing and text management.
A curated list of open-access resources and tools for Natural Language Processing (NLP) focused on the German language.
An AI system that incrementally generates scientific paper drafts by predicting links between concepts and generating text sections.
A high-performance Go library for calculating Levenshtein distance between strings, including Unicode support.
A curated list of resources, tools, datasets, and communities for linguistics and natural language processing.
A curated collection of linguistic resources, datasets, and tools for Natural Language Processing and Computational Linguistics on Spanish.
A curated collection of linguistic resources, tools, and datasets for Natural Language Processing and Computational Linguistics on Spanish.
A curated list of free tools, datasets, models, and resources for Hungarian Natural Language Processing.
A Java library for parsing and generating text using combinatory categorial grammar and hybrid logic dependency semantics.
A statistical natural language generator for spoken dialogue systems, supporting both A*-search and seq2seq algorithms.
A C++ and Python library for efficient extraction and analysis of n-grams, skipgrams, and flexgrams from large corpora.
A Julia package providing high-performance, configurable tokenizers and sentence splitters for natural language processing.
A natural language processing library for Uralic and other languages, offering morphological analysis, generation, lemmatization, and lexical information.
A tagger, lemmatizer, morphological analyzer, and dependency parser for Dutch using memory-based NLP modules.
A rule-based Unicode tokenizer that separates words from punctuation and splits sentences for NLP preprocessing.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.