A bioinformatics-native AI agent skill library for reproducible, local-first genomic analysis, built on OpenClaw.
ClawBio is the first bioinformatics-native AI agent skill library. It provides a collection of executable, specification-constrained skills for genomic analysis—such as pharmacogenomic reporting, GWAS lookup, and polygenic risk scoring—that run locally and ensure reproducibility. The project solves the problem of irreproducible bioinformatics by encoding expert decisions into versioned contracts, so AI agents can orchestrate analyses correctly without improvising from training data.
Bioinformaticians, genomic researchers, and AI agent developers who need reproducible, local-first analysis pipelines that integrate seamlessly with AI coding assistants like Claude Code or Telegram bots.
Developers choose ClawBio because it guarantees correctness and reproducibility through specification-first skills, keeps sensitive genomic data local, and works agent-agnostically across any AI platform—all while being open-source and community-driven.
🦖 ClawBio - The first bioinformatics-native AI agent skill library. Local-first. Reproducible. Built on OpenClaw.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Skills encode expert bioinformatics decisions in versioned SKILL.md files, preventing LLM hallucination—for example, the PharmGx skill accurately applies CPIC guidelines to avoid misclassifying alleles like CYP2D6*4.
Genomic data never leaves the user's machine, as emphasized in the philosophy, ensuring privacy by avoiding cloud uploads or data exfiltration in sensitive health analyses.
Every analysis exports a reproducibility bundle with commands.sh, environment.yml, and SHA-256 checksums, enabling exact reproduction of results without relying on the original author, as detailed in the provenance section.
With 46+ skills spanning pharmacogenomics to scRNA-seq, plus integration with 8,000+ Galaxy tools via the Galaxy Bridge, it offers extensive bioinformatics functionality out of the box.
Systematic validation infrastructure includes ground truth benchmarks, mock APIs, and 74+ tests, such as the AD gene benchmark and swappable fine-mapping pipelines, ensuring skill reliability.
There is no pip package yet; users must clone the git repository and install dependencies manually, which adds setup time compared to one-command installs, as noted in the quick start.
Skills like metagenomics require external bioinformatics tools (Kraken2, RGI) that need separate installation, complicating setup and potentially limiting portability.
As version 0.5.0, the project is in active development with planned features and potential breaking changes, which might affect production stability for long-term projects.
It is solely focused on bioinformatics, lacking general-purpose data analysis features, making it less suitable for interdisciplinary projects without genomic components.