Showing 36 of 157 projects
An open-source workflow management system for bioinformatics that scales from one-off use cases to massive production environments.
An open-source, Python-based data analysis tool with specialized data types and methods for genomic data at scale.
A genomics analysis platform that uses Apache Spark to parallelize genomic data processing across clusters, replacing traditional file-based workflows.
A validated, scalable, community-developed pipeline for variant calling, RNA-seq, and small RNA analysis in genomic sequencing.
WebGL-accelerated JavaScript library for interactive molecular visualization in web applications.
A batteries-included distribution of ImageJ for scientific image processing, focused on life sciences research.
A bioinformatics-native AI agent skill library for reproducible, local-first genomic analysis, built on OpenClaw.
An R package for visualizing and annotating phylogenetic trees and other tree-like structures using the grammar of graphics.
A C library for reading and writing high-throughput sequencing data formats like SAM, CRAM, and VCF.
A collection of transformer-based foundation models for genomics and transcriptomics, enabling tasks like sequence analysis, functional prediction, and conversational DNA exploration.
A curated list of awesome cheminformatics software, libraries, resources, and tools, primarily command-line based and open-source.
A comprehensive collection of notes, tools, and resources for analyzing ChIP-seq and related epigenomic data.
A Python package for applying graph neural networks to molecular graphs and biological networks in life science research.
An open-source image analysis software package for plant phenotyping using computer vision.
A long-range genomic foundation model that processes DNA sequences up to 1 million nucleotides at single nucleotide resolution.
R toolkit for inference, visualization and analysis of cell-cell communication from single-cell transcriptomics data.
A PyTorch and TorchDrug based deep learning library for drug pair scoring, predicting interactions, side effects, and synergy.
A biomedical knowledge graph integrating 20 resources to describe 17,080 diseases with over 4 million relationships across ten biological scales.
Public domain Java software for processing and analyzing scientific images across multiple platforms.
A pre-trained BERT model designed for DNA sequence analysis, enabling genome understanding tasks like classification and motif discovery.
A benchmark for evaluating protein language models through five biologically relevant semi-supervised learning tasks.
A Go package providing fast, reproducible computational tools for synthetic biology and organism engineering.
Open-source language models and tools for protein engineering and design using AI.
A deep learning library built on Chainer for molecular property prediction using graph convolutional neural networks.
A C++ library and command-line toolkit for parsing, manipulating, and analyzing VCF (Variant Call Format) files in bioinformatics.
A Dash component library for creating interactive and customizable network visualizations in Python and R, powered by Cytoscape.js.
A diffusion framework for controllable protein sequence and evolutionary alignment generation using discrete diffusion models.
Fast, sensitive, and accurate integration of single-cell RNA-seq data across multiple datasets, batches, or experimental conditions.
A free Google Colab-based toolbox with Jupyter notebooks and GUI for applying deep learning to microscopy data without coding expertise.
High-resolution de novo protein structure prediction from amino acid sequences using deep learning.
A lightweight, super fast C/C++ and Python library for sequence alignment using edit (Levenshtein) distance.
An open-source Java library for cheminformatics and bioinformatics, providing algorithms for molecular representation, analysis, and data processing.
A scalable Python toolkit for analyzing and visualizing spatial molecular data from tissue sections.
An R package that predicts doublets (multiple cells mistaken as one) in single-cell RNA sequencing data using artificial nearest neighbor analysis.
A Python package for analyzing protein structure, dynamics, and sequence evolution with a comprehensive API and GUI.
A Python library for molecular processing built on RDKit with a simple API and good defaults.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.