A collection of pre-trained BERT, DistilBERT, ELECTRA, GPT-2, and ConvBERT models for multiple languages, including German, Italian, Turkish, and historic texts.
dbmdz/berts is a collection of transformer-based language models developed by the MDZ Digital Library team. It provides pre-trained models for various languages, including German, Italian, Turkish, French, Ukrainian, and historic languages, enabling natural language processing tasks without training from scratch. The project focuses on under-resourced languages and specialized domains like historic texts.
NLP researchers and developers working with specific languages like German, Italian, Turkish, French, Ukrainian, or historic languages such as Historic Dutch, Finnish, and Swedish. It is also aimed at those needing models for specialized domains like digitized newspapers and archival texts.
Developers choose this project for its specialized models for under-resourced languages and historic texts, which are not widely covered by generic multilingual models. The models are community-driven, optimized for specific languages, and fully compatible with the Hugging Face Transformers library for easy integration.
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Provides pre-trained models for languages like Turkish, Ukrainian, and historic variants such as Historic Dutch, which are scarce in mainstream NLP libraries.
Includes BERT, DistilBERT, ELECTRA, ConvBERT, and GPT-2 variants for specific languages, allowing trade-offs between efficiency and performance.
All models are compatible with the Transformers library, enabling easy loading with AutoTokenizer and AutoModel as shown in the README examples.
Offers unique models trained on Europeana newspapers and archives, tailored for digital humanities and archival research.
The README explicitly states that only PyTorch weights are available, and TensorFlow checkpoints require raising an issue, limiting framework flexibility.
Some models, like the Italian XXL BERT, have known issues such as mismatched vocab sizes, which could affect fine-tuning or evaluation.
Results for downstream tasks are relegated to external repositories, making it harder to assess model effectiveness without additional research.