A vision-language foundation model for computational pathology, pretrained on 1.17M histopathology image-caption pairs for diverse AI tasks.
CONCH is a vision-language foundation model for computational pathology that learns from histopathology images paired with biomedical text captions. It solves the problem of label scarcity in medical AI by enabling powerful zero-shot and few-shot transfer to tasks like classification, segmentation, and retrieval without task-specific training. The model is pretrained on 1.17 million image-caption pairs, the largest dataset of its kind in histopathology.
Researchers and developers in computational pathology, medical AI, and digital pathology who need a versatile foundation model for building and evaluating diagnostic tools, slide analysis systems, or multimodal pathology workflows.
Developers choose CONCH because it offers state-of-the-art performance across a wider range of pathology tasks compared to vision-only models, supports non-H&E stains effectively, and minimizes benchmark data contamination risks due to its pretraining data curation.
Vision-Language Pathology Foundation Model - Nature Medicine
Processes both histopathology images and biomedical text, enabling cross-modal tasks like retrieval and captioning, as demonstrated by its pretraining on 1.17M image-caption pairs.
Achieves state-of-the-art performance on 14 diverse benchmarks including classification and segmentation, reducing reliance on extensive labeled data.
Produces performant representations for IHCs and special stains, unlike models trained only on H&E images, as highlighted in the README.
Pretrained without using large public slide collections like TCGA, making it safer for benchmarking on public or private datasets without leakage concerns.
Licensed under CC-BY-NC-ND, prohibiting commercial use without prior approval, which limits industry adoption and practical deployment.
Publicly released weights exclude the multimodal decoder, affecting full captioning capabilities as noted in the README, despite vision and text encoders being intact.
Requires Hugging Face access token, manual weight download, and environment setup, adding overhead compared to plug-and-play models.
While SOTA on many tasks, it underperforms on some benchmarks like EBRAINS-C compared to UNI, indicating task-specific strengths and weaknesses.
Deep probabilistic analysis of single-cell and spatial omics data
scGPT is a foundation model designed for single-cell multi-omics data analysis using generative AI. It leverages transformer architecture pretrained on millions of single-cell profiles to enable a wide range of downstream biological tasks, advancing computational biology by providing a powerful, unified model for cellular data. ## Key Features - **Pretrained Model Zoo** — Offers multiple organ-specific and whole-human models trained on millions of cells for various applications. - **Zero-Shot Applications** — Supports tasks like cell embedding and reference mapping without task-specific training. - **Reference Mapping** — Enables fast similarity search across millions of cells using efficient indexing with faiss. - **Multi-Task Fine-Tuning** — Can be adapted for scRNA-seq integration, cell type annotation, perturbation prediction, and GRN inference. - **Online Tools** — Provides accessible web applications for reference mapping, cell annotation, and GRN inference via cloud GPUs. ## Philosophy scGPT aims to build a foundational AI model for single-cell biology, democratizing access to advanced computational methods and accelerating discoveries in multi-omics research through open-source collaboration.
Pathology Foundation Model - Nature Medicine
Prov-GigaPath: A whole-slide foundation model for digital pathology from real-world data
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.