A biomedical text corpus with 97 full-text articles annotated for concepts, coreferences, and structural elements.
The Colorado Richly Annotated Full-Text (CRAFT) Corpus is a collection of 97 biomedical articles from PubMed Central, each annotated across multiple dimensions including structural elements, coreferences, and biomedical concepts. It serves as a valuable resource for natural language processing research, biomedical text mining, and training machine learning models in the biomedical domain.
The CRAFT corpus is designed to provide comprehensive, high-quality annotations to support advanced biomedical text analysis and natural language processing research.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.