Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Computational Biology
  3. GuacaMol

GuacaMol

MITPython

A Python package for benchmarking generative models in de novo molecular design.

GitHubGitHub
525 stars99 forks0 contributors

What is GuacaMol?

GuacaMol is a Python package that provides benchmarks for evaluating generative models in de novo molecular design. It solves the problem of inconsistent evaluation in computational chemistry by offering standardized tests to measure how well models can generate novel, drug-like molecules. The package includes both distribution-learning and goal-directed benchmarks to assess different aspects of generative performance.

Target Audience

Computational chemists, machine learning researchers, and pharmaceutical scientists developing or using generative models for molecular design and drug discovery.

Value Proposition

Researchers choose GuacaMol because it provides rigorous, reproducible benchmarks that enable fair comparison of generative models, includes standardized datasets, and offers containerized environments for consistent evaluation across different systems.

Overview

Benchmarks for generative chemistry

Use Cases

Best For

  • Comparing different generative models for molecular design
  • Evaluating new algorithms for de novo drug discovery
  • Establishing performance baselines for generative chemistry research
  • Reproducible benchmarking in computational chemistry
  • Testing models against standardized chemical property optimization tasks
  • Academic research requiring transparent evaluation metrics

Not Ideal For

  • Projects requiring real-time molecular generation for interactive applications
  • Teams heavily invested in non-Python cheminformatics or machine learning frameworks
  • Small-scale academic projects lacking resources for Docker-based reproducible setups
  • Applications focused solely on molecular property prediction without generation tasks

Pros & Cons

Pros

Standardized Benchmarking Framework

Provides both distribution-learning and goal-directed benchmarks, enabling fair comparison of generative models as outlined in the benchmarking models section and the accompanying paper.

Reproducible Data and Environment

Includes pre-processed ChEMBL datasets and Docker support for consistent benchmarking, with detailed instructions in the Data and Docker sections to ensure reproducibility.

Baseline Model Implementations

Offers reference implementations of common generative models via a separate repository, helping establish performance baselines as mentioned in the Key Features and linked guacamol_baselines.

Community-Driven Leaderboard

Features a public leaderboard for transparent comparison of model performances, encouraging community engagement and progress tracking as highlighted in the README.

Cons

Complex Dependency Management

Requires specific versions of RDKit and FCD libraries, with pinned dependencies that can lead to installation conflicts, as noted in the installation section and change log updates.

Setup Overhead for Reproducibility

Docker is recommended for data generation and benchmarking, adding complexity for users unfamiliar with containerization, evident in the Docker commands and reproducibility warnings.

Limited Dataset Flexibility

Primarily relies on ChEMBL datasets, which may not cover all chemical spaces, and custom dataset integration requires handling forbidden symbols, as mentioned in the data generation and change log.

Frequently Asked Questions

Quick Stats

Stars525
Forks99
Contributors0
Open Issues10
Last commit2 years ago
CreatedSince 2018

Tags

#cheminformatics#open-science#rdkit#python#drug-discovery#benchmarking#machine-learning

Built With

R
RDKit
P
Python
D
Docker

Included in

Computational Biology122
Auto-fetched 2 hours ago

Related Projects

MOSESMOSES

Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models

Stars979
Forks280
Last commit2 years ago
TAPE (Tasks Assessing Protein Embeddings)TAPE (Tasks Assessing Protein Embeddings)

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.

Stars740
Forks135
Last commit3 years ago
ProteinGymProteinGym

Official repository for the ProteinGym benchmarks

Stars442
Forks58
Last commit3 months ago
scIB (Single-cell Integration Benchmarks)scIB (Single-cell Integration Benchmarks)

Benchmarking analysis of data integration tools

Stars423
Forks76
Last commit2 months ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub