A domain-specific generative language model pre-trained on biomedical literature for text generation and mining tasks.
BioGPT is a generative pre-trained transformer model specifically designed for biomedical text generation and mining. It is trained on large-scale biomedical literature to understand and generate domain-specific text, enabling tasks like relation extraction, question answering, and document classification in the biomedical domain.
Researchers, data scientists, and developers working in biomedical natural language processing, healthcare AI, and life sciences who need domain-specific language models for text analysis and generation.
BioGPT offers a specialized model that outperforms general-purpose LLMs on biomedical tasks due to its domain-specific pre-training, and it is openly available with integration into popular frameworks like Hugging Face for easy adoption.
BioGPT is a generative pre-trained transformer model specifically designed for biomedical text generation and mining. It leverages large-scale biomedical literature to understand and generate domain-specific text, enabling advanced natural language processing applications in healthcare and life sciences.
BioGPT focuses on bridging the gap between general-purpose language models and domain-specific needs by providing a model that understands the nuances and terminology of biomedical literature.
Trained on PubMed abstracts and articles, BioGPT outperforms general models on biomedical tasks like relation extraction and QA, as evidenced by its fine-tuned checkpoints and demos.
Provides pre-fine-tuned models for key downstream tasks such as drug-target interaction extraction and document classification, reducing development time and effort.
Available through the transformers library with pipelines for easy text generation and feature extraction, as shown in the README with code examples for causal language modeling.
MIT-licensed with models hosted on GitHub and Hugging Face, promoting reproducibility and adoption in academic and industrial settings.
Requires manual setup of specific versions for PyTorch 1.12.0, fairseq 0.12.0, and tools like Moses and fastBPE, with environment variable configuration, increasing setup time and potential for errors.
Specialized training means it underperforms on non-biomedical text without additional fine-tuning, limiting its versatility for broader NLP applications.
Models like BioGPT-Large have significant computational and memory requirements, making them unsuitable for low-resource deployments without high-end GPUs.
Relies on older library versions (e.g., PyTorch 1.12.0), which may cause compatibility issues with newer systems and frameworks, requiring careful environment management.
🦖 ClawBio - The first bioinformatics-native AI agent skill library. Local-first. Reproducible. Built on OpenClaw.
Code and data for GeneGPT.
GenePT is a foundation model for single-cell biology that leverages ChatGPT embeddings of NCBI gene descriptions to perform gene-level and cell-level tasks. It offers an efficient alternative to traditional models that require extensive data curation and resource-intensive training from gene expression profiles. ## Key Features - **Gene Embeddings** — Uses GPT-3.5 embeddings of NCBI gene summary texts to represent genes. - **Cell Embeddings** — Generates single-cell embeddings by averaging gene embeddings weighted by expression or creating sentence embeddings from ordered gene names. - **Efficient Approach** — Eliminates the need for dataset curation and additional pre-training, making it user-friendly. - **Competitive Performance** — Achieves comparable or superior performance to existing single-cell foundation models in tasks like gene property classification and cell type annotation. - **Pre-computed Data** — Provides readily available datasets including extracted NCBI gene summaries and pre-computed OpenAI embeddings. ## Philosophy GenePT demonstrates that using large language model embeddings of scientific literature is a straightforward and effective approach for developing biological foundation models, complementing traditional expression-based methods.
Associated Repository for "Translation between Molecules and Natural Language"
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.