A Python library for deep probabilistic modeling and analysis of single-cell and spatial omics data.
scvi-tools is a Python library for deep probabilistic analysis of single-cell and spatial omics data. It provides a suite of models for tasks like dimensionality reduction, data integration, and automated annotation, built on modern machine learning frameworks. The library addresses the need for scalable, reproducible analysis methods in computational biology.
Bioinformaticians, computational biologists, and data scientists working with single-cell or spatial omics data who need robust, probabilistic analysis tools. It's also suitable for researchers developing novel analysis methods in this domain.
Developers choose scvi-tools for its comprehensive set of production-ready models, seamless integration with the Scanpy ecosystem, and GPU acceleration. Its modular design also allows for rapid development and deployment of custom probabilistic models.
Deep probabilistic analysis of single-cell and spatial omics data
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Built on PyTorch, scvi-tools leverages GPU acceleration to handle large-scale omics datasets efficiently, as emphasized in its focus on scalable, production-ready models.
Integrates tightly with Scanpy and AnnData, providing a high-level API that fits into standard single-cell analysis workflows, reducing adoption barriers for existing users.
Offers a wide range of pre-implemented models for tasks like dimensionality reduction and data integration, saving time and effort for common analytical needs in omics research.
Includes building blocks powered by PyTorch Lightning and Pyro, enabling rapid prototyping and deployment of novel probabilistic models, as highlighted in the skeleton repository for method development.
Requires careful setup of PyTorch with GPU compatibility, which can be error-prone and challenging for users unfamiliar with deep learning environments, as noted in the installation instructions.
Assumes proficiency in probabilistic modeling, deep learning, and single-cell biology, making it less accessible for beginners or researchers from non-computational backgrounds.
Primarily designed for omics data analysis, so it lacks general-purpose machine learning capabilities and is not suitable for other data types or broader biological applications.