Showing 3 of 3 projects
An unsupervised text tokenizer and detokenizer for neural network-based text generation systems with subword units.
A multi-domain Chinese word segmentation toolkit offering higher accuracy and domain-specific models.
A Python library for processing simplified Chinese text, offering sentiment analysis, segmentation, and keyword extraction.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.