A collection of libraries for parsing Tweet text, including character counting and entity linking.
twitter-text is a collection of libraries and conformance tests used to parse Tweet text according to Twitter's platform specifications. It handles character counting, URL detection, and entity linking for @usernames, #hashtags, and $cashtags. The project ensures consistent text processing across different programming languages and implementations.
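To make the idea of entity detection concrete, here is a minimal Python sketch. It does not use twitter-text, and its patterns are deliberately simplified assumptions rather than the library's real rules, which cover Unicode hashtags, URLs, and many edge cases. The names `PATTERNS` and `extract_entities` are hypothetical.

```python
import re

# Simplified patterns for illustration only. These are assumptions for this
# sketch, not the regular expressions used by twitter-text.
PATTERNS = {
    # "@" not preceded by a word character, then 1-15 word characters.
    "mention": re.compile(r"(?<![A-Za-z0-9_])@([A-Za-z0-9_]{1,15})"),
    # "#" followed by word characters, at least one of which is not a digit.
    "hashtag": re.compile(
        r"(?<![A-Za-z0-9_&])#([A-Za-z0-9_]*[A-Za-z_][A-Za-z0-9_]*)"
    ),
    # "$" followed by 1-6 letters standing alone as a token.
    "cashtag": re.compile(r"(?<![A-Za-z0-9_])\$([A-Za-z]{1,6})(?![A-Za-z0-9_])"),
}


def extract_entities(text):
    """Return entities sorted by start index, each with kind, value, and span."""
    entities = []
    for kind, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            entities.append({
                "kind": kind,
                "value": match.group(1),
                "indices": (match.start(), match.end()),
            })
    return sorted(entities, key=lambda entity: entity["indices"][0])


if __name__ == "__main__":
    for entity in extract_entities("Thanks @jack, $TWTR is up! #markets"):
        print(entity)
```

Returning character indices alongside each value is what makes linking possible: a caller can wrap each span in markup without searching the text a second time.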
It is intended for developers building applications that integrate with Twitter's API or that need to process Tweet text in compliance with Twitter's rules. This includes social media tools, analytics platforms, and content management systems.
It provides an official, standardized way to parse Tweet text, reducing errors and inconsistencies when interacting with Twitter's platform. The conformance tests are designed to ensure that all implementations meet the same parsing standards.
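The conformance-test idea can be sketched as a table of shared cases run against any implementation. The case fields (`description`, `text`, `expected`), the sample cases, and the function names below are assumptions made for this illustration, not the project's actual test data or API, and the hashtag rule is a simplified stand-in.

```python
import re

# Hypothetical shared cases: each pairs an input text with the expected output.
CASES = [
    {"description": "single hashtag", "text": "hello #world", "expected": ["world"]},
    {"description": "two hashtags", "text": "#one and #two", "expected": ["one", "two"]},
    {"description": "numeric only is not a hashtag", "text": "issue #123", "expected": []},
    {"description": "no hashtags", "text": "nothing to see", "expected": []},
]

# Simplified stand-in rule: "#" plus word characters, at least one non-digit.
HASHTAG = re.compile(r"(?<![A-Za-z0-9_&])#([A-Za-z0-9_]*[A-Za-z_][A-Za-z0-9_]*)")


def extract_hashtags(text):
    """One candidate implementation to be checked against the shared cases."""
    return HASHTAG.findall(text)


def run_conformance(implementation, cases):
    """Run every case and return (description, expected, actual) per failure."""
    failures = []
    for case in cases:
        actual = implementation(case["text"])
        if actual != case["expected"]:
            failures.append((case["description"], case["expected"], actual))
    return failures


if __name__ == "__main__":
    failures = run_conformance(extract_hashtags, CASES)
    print("all cases passed" if not failures else failures)
```

Because the cases are plain data, an implementation in any language can load the same table and report the same failures, which is how a single test suite can keep several ports consistent.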
Twitter Text Libraries. This code is used at Twitter to tokenize and parse text to meet the expectations for what can be used on the platform.
Provides maintained libraries in Java, Ruby, JavaScript, and Objective-C, ensuring reliability and alignment with Twitter's platform rules.
Adheres to Twitter's specific character counting and entity detection standards, reducing errors in API interactions and content display.
Includes comprehensive tests to validate that all implementations meet the same parsing standards, promoting cross-language consistency.
Lists unofficial implementations in languages like Swift and Rust, expanding usability beyond the core supported languages.
Designed solely for Twitter text parsing, making it unsuitable for other platforms without significant modification or additional parsers.
The most recent copyright notice reads 2012-2020, and Twitter's text processing rules may have changed since then, so the library may not reflect current standards.
Only four languages are officially supported; developers using other languages must rely on community versions, which may lack updates or thorough testing.