Pre-trained BERT models fine-tuned on clinical text from MIMIC for medical natural language processing tasks.
ClinicalBERT is a collection of pre-trained BERT models specifically fine-tuned on clinical text from the MIMIC database. It provides domain-specific embeddings that understand medical terminology and clinical context, enabling more accurate natural language processing for healthcare applications. The models are designed to reduce the data and computational requirements for building clinical NLP systems.
Researchers and developers working on medical natural language processing, clinical informatics, and healthcare AI applications who need domain-specific language models.
ClinicalBERT offers specialized embeddings trained on real clinical data, providing better performance on medical NLP tasks compared to general-purpose BERT models without requiring extensive domain-specific training from scratch.
repository for Publicly Available Clinical BERT Embeddings
Fine-tuned on MIMIC clinical notes, providing embeddings that outperform general BERT on medical NLP tasks like MedNLI and NER, as demonstrated in the associated paper.
Offers specialized models such as Bio+Clinical BERT and Discharge Summary BERT for different clinical documentation needs, allowing targeted use cases.
Available through the Transformers library with model pages on HuggingFace, enabling easy implementation without manual setup.
Includes scripts for pretraining and downstream tasks, such as format_mimic_for_BERT.py and finetune_lm_tf.sh, supporting research transparency.
Based on BERT from 2018, lacking improvements from newer models like RoBERTa or DeBERTa that may offer better efficiency and performance.
README notes issues like section splitting code needing improvement (issue #4), and scripts require manual path changes, making setup less user-friendly.
Fine-tuned specifically on MIMIC, which may not generalize well to other clinical datasets without additional fine-tuning or data adaptation.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.