How do I use Molecular Transformer to predict reactions for my own molecules?

Preprocess your SMILES data with the provided tokenizer, format it to match the dataset structure, and run the translate.py script with a trained model. For custom reactions, you may need to retrain or fine-tune using the training commands, which requires adapting the data pipeline.

Molecular Transformer vs IBM RXN: which is better for everyday use?

Molecular Transformer is open-source and customizable for research with public data, while IBM RXN offers a user-friendly GUI and models trained on more diverse, possibly proprietary data. Choose based on whether you need flexibility or ease of use without coding.

What hardware is needed to train Molecular Transformer from scratch?

The README specifies training on a single GPU for 48-72 hours, so a modern GPU with sufficient memory is essential. Inference can run on CPUs but will be slower, and specific CUDA versions are required for the outdated PyTorch setup.

Can I fine-tune the pre-trained models on my proprietary dataset?

Yes, but it requires adapting the data preprocessing scripts to your format and retraining with the provided commands. This process is complex and assumes familiarity with the codebase and machine learning workflows.

Is there a web API or demo for Molecular Transformer?

No, the repository only provides code for local deployment; for a web interface, use IBM RXN. Deploying as a service would require building a custom API wrapper, which isn't included in the project.

Molecular Transformer — Chemical Reaction Predictor

What is Molecular Transformer?

Molecular Transformer is a sequence-to-sequence neural network model that predicts chemical reaction outcomes and retrosynthetic pathways. It treats molecules as SMILES strings and uses transformer architecture to translate between reactants and products, helping chemists design synthesis routes faster. The model includes uncertainty estimation to indicate prediction confidence.

Target Audience

Computational chemists, researchers in cheminformatics, and organic chemists who need AI tools for reaction prediction and retrosynthesis planning.

Value Proposition

It provides an open-source, uncertainty-calibrated model trained on public reaction datasets, unlike proprietary tools. The integration with RDKit for data preprocessing and availability of pre-trained models lowers the barrier for academic and industrial adoption.

Overview

Molecular Transformer is a neural machine translation model adapted for chemistry that predicts chemical reaction outcomes and retrosynthetic pathways. It translates between molecular representations (SMILES strings) to forecast how molecules react or how target molecules can be synthesized, accelerating discovery in organic chemistry and drug development.

Key Features

Retrosynthesis Prediction — Predicts reactant molecules needed to synthesize a target product molecule.
Uncertainty Calibration — Provides confidence estimates for predictions, helping chemists assess reliability.
SMILES Tokenization — Uses custom tokenization of SMILES strings to treat molecules as sequences for transformer models.
Data Augmentation — Doubles training data by generating random equivalent SMILES representations via RDKit.
Pre-trained Models — Includes models trained on public datasets (USPTO_MIT, USPTO_STEREO) with mixed or separated reactant/reagent formats.

Philosophy

Molecular Transformer aims to make AI-assisted chemical reaction prediction accessible to organic chemists, with the goal of integrating these models into daily laboratory workflows to accelerate molecular discovery.

Use Cases

Best For

Predicting reactants for a target molecule in retrosynthesis analysis
Estimating confidence scores for chemical reaction predictions
Academic research on AI-driven reaction prediction models
Data augmentation for chemical reaction datasets using SMILES randomization
Benchmarking new machine learning approaches against published USPTO dataset results
Integrating reaction prediction into automated synthesis planning pipelines

Not Ideal For

Teams requiring real-time, high-throughput reaction prediction in production pipelines
Chemists seeking drag-and-drop interfaces without coding or ML expertise
Projects focused on non-organic or novel reaction types outside USPTO patent data

Pros & Cons

Pros

Uncertainty Calibration

Provides confidence estimates for predictions, explicitly mentioned in the README to help chemists assess reliability, which is rare in open-source models.

Pre-trained Models

Includes models trained on public datasets like USPTO_MIT and USPTO_STEREO, available for download, allowing immediate use without training from scratch.

Data Augmentation

Doubles training data by generating random equivalent SMILES via RDKit, as described in the README, improving model robustness and accuracy.

RDKit Integration

Utilizes RDKit for SMILES canonicalization and tokenization, ensuring accurate molecular representation and preprocessing, which is critical for chemistry applications.

Cons

Outdated Dependencies

Requires Python 3.5 and PyTorch 0.4.1, which are obsolete and may cause compatibility issues with modern systems or libraries, as noted in the installation steps.

Complex Setup and Workflow

Involves multi-step conda environment setup, data preprocessing, and model averaging (last 20 checkpoints), making it inaccessible for non-experts without deep ML or chemistry knowledge.

Limited Domain Generalization

Trained primarily on USPTO patent data, so predictions may falter for reactions outside this domain, as admitted in the README regarding the need for more diverse data on IBM RXN.

Molecular Transformer

What is Molecular Transformer?

Overview

Key Features

Philosophy

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?

Molecular Transformer

What is Molecular Transformer?

Overview

Key Features

Philosophy

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?