Nlp Research

12 projects

Showing 12 of 12 projects

karthinkncode's Datasets for Natural Language Processing

A collaboratively maintained, reverse-chronological list of datasets and corpora for natural language processing tasks.

#ai-training-data#nlp-research#research-tools

Stars918

Forks249

Last commit6 years ago

The Schema-Guided Dialogue DatasetPython

A large-scale multi-domain dataset of over 20k annotated task-oriented dialogues for training and evaluating virtual assistants.

#zero-shot-learning#dialogue-dataset#nlp-research

An open-source benchmark toolkit for Natural Language Generation in spoken dialogue systems, featuring multiple RNN-based models and datasets.

#nltk#nlp-research#deep-learning

Stars491

Forks126

Last commit7 years ago

Neural machine translation between the writings of Shakespeare and modern English using TensorFlowPython

Neural machine translation between Shakespearean and modern English using TensorFlow.

#sequence-to-sequence#nlp-research#deep-learning

A dataset of NBA game summaries aligned with box- and line-scores for data-to-text generation research.

#nlp-research#data-to-text#nba

Stars115

Forks25

Last commit4 years ago

GraphQuestionsReScript

A characteristic-rich dataset for factoid question answering with explicit question specifications to enable fine-grained QA system evaluation.

#nlp-research#question-answering#natural-language-processing

An enriched dataset for Natural Language Generation research, providing intermediate representations for pipeline tasks like lexicalization and aggregation.

#pipeline-architecture#nlp-research#data-to-text

Stars71

Forks22

Last commit5 years ago

EasyCCGJava

A CCG parser implementing all combinators with parsing to logical form and parameter estimation for probabilistic CCG.

#probabilistic-models#computational-linguistics#nlp-research

Stars62

Forks20

Last commit8 years ago

Corpus LoadersJulia

A Julia package providing lazy-loading iterators for various NLP corpora with automatic data dependency management.

#corpora#julia#nlp-research

Stars32

Forks12

Last commit3 years ago

Alex Context NLG Dataset

A dataset for context-aware natural language generation in task-oriented spoken dialogue systems for public transport information.

#nlp-research#delexicalization#task-oriented

Stars23

Forks12

Last commit9 years ago

german-transformer-trainingPython

A repository for planning and training German transformer language models from scratch.

#german-language#transformer#language-model-training

Stars23

Forks2

Last commit5 years ago

Awesome New Languages in Machine Translation

A curated list of initiatives and projects for adding new or low-resource languages to open-source machine translation models.

#huggingface-models#translation-initiatives#language-diversity

Stars22

Forks1

Last commit6 months ago

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub