A Python library providing German language support for TextBlob, enabling NLP tasks like tokenization, POS tagging, and sentiment analysis.
textblob-de is a Python library that extends TextBlob with German language support, enabling natural language processing tasks like tokenization, part-of-speech tagging, parsing, and sentiment analysis specifically for German text. It solves the problem of applying TextBlob's convenient API to German language content by providing specialized models and tools.
Python developers working with German text data who need NLP capabilities like sentiment analysis, text parsing, or linguistic feature extraction. Data scientists and researchers analyzing German language corpora.
Developers choose textblob-de because it provides a consistent TextBlob API experience for German language processing, eliminating the need to learn new libraries. It integrates established tools like pattern.de and NLTK while maintaining TextBlob's simplicity and ease of use.
German language support for TextBlob.
Maintains TextBlob's simple interface, allowing developers to use the same methods for German text as for English, reducing the learning curve, as shown in the usage examples.
Includes POS tagging with multiple tagsets (penn, universal, stts), parsing, lemmatization, and noun phrase extraction, leveraging established tools like pattern.de and NLTK.
Provides full access to pattern.text.de functionalities in Python 3, enabling advanced German language processing directly, such as attributive adjective inflection.
Uses NLTKPunktTokenizer for reliable German sentence boundary detection, handling complex cases like abbreviations and dates, as demonstrated in the example text.
Sentiment polarity detection is marked as experimental, lacks subjectivity scores, and uses an uninflected lexicon, making it unreliable for critical applications, per the README warning.
Relies on the pattern library and NLTK, which are older and may not match the performance or accuracy of modern NLP libraries like spaCy or transformers-based models.
Development releases can have issues on Windows, and setup requires downloading additional corpora, adding complexity compared to more seamless pip-installable packages.
general natural language facilities for node
Easy-to-use text annotation tool for teams with most comprehensive auto-annotation features. Supports NER, relations and document classification as well as OCR annotation for invoice labeling
Snowball is a language in which stemming algorithms can be easily represented
Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 50 interactive visualizations under a user-friendly interface
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.