Question 1

How do I install descriptastorus with kyotocabinet?

Accepted Answer

kyotocabinet is optional but recommended, because it is what descriptastorus uses for indexing. Install it in your environment first, using system packages or conda. Then clone the repository from Bitbucket and run 'python setup.py install' as described in the README.
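Before or after installing, it can help to confirm which pieces are actually importable in the active environment. The snippet below is a minimal sketch using only the standard library; the helper name `has_module` is made up for illustration, and the module names assume the packages import as `rdkit`, `kyotocabinet`, and `descriptastorus`.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module called `name` can be found."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # Module names are assumptions about how each package is imported.
    for module in ("rdkit", "kyotocabinet", "descriptastorus"):
        status = "found" if has_module(module) else "missing"
        print(f"{module}: {status}")
```

If kyotocabinet is reported missing, the rest of the library may still be usable, but the indexing features that depend on it should be expected to fail.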

Question 2

What's the advantage of descriptastorus over using RDKit descriptors directly?

Accepted Answer

Descriptastorus adds a persistent storage layer with fast random access, so descriptors are computed once and then reused. That makes it a good fit for large-scale, reproducible ML workflows. Using RDKit directly is simpler for one-off computations that do not need storage.

Question 3

Can I add custom descriptors not based on RDKit to descriptastorus?

Accepted Answer

Yes, you can use the raw store creation tools to define custom columns and data types, allowing integration of proprietary or alternative descriptors, though the library is primarily designed for RDKit-based generators.
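To make the idea of "custom columns and data types" concrete, here is a minimal standard-library sketch of a fixed-width row store. It is not descriptastorus's raw store API or file format; the column names, the `COLUMNS` schema, and the helpers `write_rows` and `read_row` are all hypothetical. It only illustrates why typed, fixed-width rows allow random access by row number.

```python
import os
import struct
import tempfile

# Hypothetical schema: (column name, struct format code).
# "d" = 8-byte float, "i" = 4-byte int, "f" = 4-byte float.
COLUMNS = [("exact_mw", "d"), ("num_rotatable_bonds", "i"), ("vendor_score", "f")]

# "<" selects little-endian with no padding, so every row has the same size.
ROW = struct.Struct("<" + "".join(code for _, code in COLUMNS))


def write_rows(path, rows):
    """Write each row as one fixed-width binary record."""
    with open(path, "wb") as fh:
        for row in rows:
            fh.write(ROW.pack(*row))


def read_row(path, index):
    """Seek straight to row `index` and decode it into a dict."""
    with open(path, "rb") as fh:
        fh.seek(index * ROW.size)
        values = ROW.unpack(fh.read(ROW.size))
    return dict(zip((name for name, _ in COLUMNS), values))


if __name__ == "__main__":
    path = os.path.join(tempfile.mkdtemp(), "rows.bin")
    write_rows(path, [(180.063, 3, 0.5), (46.042, 0, 0.25)])
    print(read_row(path, 1))
```

Because every row occupies the same number of bytes, reading row N is a single seek rather than a scan, which is the property a descriptor store relies on for fast lookups. The values in the third column could come from any source, not only RDKit.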

Question 4

How does descriptastorus handle performance with huge chemical datasets?

Accepted Answer

Stores are optimized for fast iteration and lookup once they are built, but building a store can be slow. The README notes that it is suited to ML pipelines where precomputed descriptors are accessed repeatedly.

Question 5

Is descriptastorus compatible with Python 3.9 or newer versions?

Accepted Answer

The README states support for Python 2/3, but since it's maintained on Bitbucket and hasn't seen recent updates, compatibility with specific newer versions like 3.9 may require testing or community patches.

Question 6

How do I index molecules by InChIKey in descriptastorus?

Accepted Answer

Pass the '--index-inchikey' flag to the storus.py script when creating a store. The flag requires kyotocabinet and enables fast lookups by InChIKey, as detailed in the usage examples.
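Conceptually, an InChIKey index is a persistent key-value map from each key to the row numbers that hold its descriptors. The sketch below shows that idea with Python's standard `dbm` module standing in for kyotocabinet. It is an assumption-laden illustration, not how descriptastorus stores its index; `build_index` and `lookup` are hypothetical helpers, and the keys are sample values.

```python
import dbm
import os
import tempfile


def build_index(path, inchikeys):
    """Map each InChIKey to a comma-separated list of row numbers."""
    with dbm.open(path, "c") as db:
        for row, key in enumerate(inchikeys):
            existing = db.get(key.encode(), b"")
            separator = b"," if existing else b""
            db[key.encode()] = existing + separator + str(row).encode()


def lookup(path, inchikey):
    """Return the row numbers stored for `inchikey`, or an empty list."""
    with dbm.open(path, "r") as db:
        value = db.get(inchikey.encode())
    if value is None:
        return []
    return [int(row) for row in value.decode().split(",")]


if __name__ == "__main__":
    path = os.path.join(tempfile.mkdtemp(), "inchi_index")
    keys = [
        "BSYNRYMUTXBXSQ-UHFFFAOYSA-N",
        "LFQSCWFLJHTTHZ-UHFFFAOYSA-N",
        "BSYNRYMUTXBXSQ-UHFFFAOYSA-N",  # same key again on a later row
    ]
    build_index(path, keys)
    print(lookup(path, "BSYNRYMUTXBXSQ-UHFFFAOYSA-N"))
```

The sketch keeps a list of rows per key because the same InChIKey can appear on more than one input row, for example when a file contains duplicate structures.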
