Question 1

How do I find a pre-trained German BERT model using German-NLP?

Accepted Answer

German-NLP lists several BERT models under the 'Deep learning models and transformers' section, such as dbmdz BERT and Deepset German BERT. Check the README for direct links to Hugging Face repositories and documentation to download and use them.
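Once you have picked a model from the list, loading it usually takes a few lines with the Hugging Face `transformers` library. The sketch below is a minimal illustration, not something taken from the German-NLP README: the two Hub IDs are assumptions about where the dbmdz and Deepset models are published, and `CANDIDATE_MODELS` and `load_german_bert` are hypothetical names. Confirm the IDs against the links in the README before relying on them.

```python
# Hub IDs below are assumptions; verify them via the links in the German-NLP README.
CANDIDATE_MODELS = {
    "dbmdz": "dbmdz/bert-base-german-cased",
    "deepset": "deepset/gbert-base",
}


def load_german_bert(model_id: str):
    """Return (tokenizer, model) for a Hub model ID, downloading on first use."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def encode(text: str, tokenizer, model):
    """Run one sentence through the encoder and return its token-level hidden states."""
    inputs = tokenizer(text, return_tensors="pt")  # requires PyTorch
    return model(**inputs).last_hidden_state
```

Calling `load_german_bert(CANDIDATE_MODELS["dbmdz"])` needs network access the first time; after that the files are read from the local cache.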

Question 2

What's the best German sentiment analysis dataset for social media?

Accepted Answer

The directory includes sentiment analysis datasets like Potsdam Twitter Sentiment Corpus (PotTS) and SpinningBytes Swiss German Sentiment Corpus under the 'Sentiment analysis datasets' section. Review these entries to compare data sources and licensing for your project.

Question 3

German-NLP vs Hugging Face for German language models?

Accepted Answer

German-NLP is a curated directory for discovering German-specific models, including ones hosted on Hugging Face. Hugging Face is the platform that hosts the models themselves and lets you load them directly through its libraries. Use German-NLP for exploration and Hugging Face for implementation and community features.

Question 4

How do I contribute a new German NLP tool to German-NLP?

Accepted Answer

Follow the contributing guidelines linked in the README. Typically, you submit a pull request that adds the resource to the appropriate category. Make sure it meets the usability and maintenance criteria the guidelines specify, such as being open-source and currently maintained.

Question 5

Are there tools for historical German text normalization in German-NLP?

Accepted Answer

Yes, the 'Historical' section under text corpora lists resources like Deutsches Textarchiv, and the 'Normalization' section includes tools such as CAB and transnormer, which are designed for processing and normalizing historical German texts.

Question 6

Can I use German-NLP resources for commercial projects?

Accepted Answer

German-NLP itself is CC-BY licensed, but each listed resource has its own license. You must check each resource's license terms individually, because some are freely available yet carry restrictions such as non-commercial use or attribution requirements.
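For resources hosted on the Hugging Face Hub, a first-pass license check can be scripted. The sketch below assumes the author declared a license in the model card, in which case the Hub exposes it as a `license:<id>` tag. `license_from_tags` and `hub_license` are hypothetical helper names. A missing tag does not mean the resource is free to use, and the full license text still needs to be read.

```python
from typing import Iterable, Optional


def license_from_tags(tags: Iterable[str]) -> Optional[str]:
    """Return the license ID from Hub-style tags, or None if none is declared."""
    for tag in tags:
        if tag.startswith("license:"):
            return tag.split(":", 1)[1]
    return None


def hub_license(model_id: str) -> Optional[str]:
    """Look up the declared license of a Hub model (requires network access)."""
    # Imported lazily so license_from_tags works without huggingface_hub installed.
    from huggingface_hub import model_info

    info = model_info(model_id)
    return license_from_tags(info.tags or [])
```

Datasets and tools hosted elsewhere (GitHub, university servers) need a manual look at their LICENSE file or terms of use.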
