Question 1

How to install distance library with C extensions on Windows?

Accepted Answer

You need Microsoft Visual C++ 2010 or a compatible compiler, then run 'python setup.py install --with-c'. However, due to its age, compatibility with modern Windows setups might require additional configuration or fallback to pure Python.

Question 2

Distance vs fuzzywuzzy for string matching in Python?

Accepted Answer

Distance provides low-level string distance metrics with C extensions for speed, ideal for custom similarity logic. Fuzzywuzzy offers higher-level fuzzy matching with ratio calculations and is more user-friendly for common tasks like record linkage.

Question 3

How to filter similar strings from a large list using distance?

Accepted Answer

Use the ifast_comp or ilevenshtein iterators. For example, 'sorted(distance.ifast_comp(reference, tokens))' returns tuples of distance and sequence, efficiently handling millions of tokens as shown in the README examples.

Question 4

Does distance support weighted edit distances?

Accepted Answer

No, Distance only implements standard Levenshtein and Hamming distances without customization for insertion, deletion, or substitution costs, limiting its use for nuanced similarity assessments.

Question 5

Is distance library still maintained in 2023?

Accepted Answer

The last update was in 2013, so it is not actively maintained. This may lead to compatibility issues, but the core functionality remains usable for basic string comparison tasks in supported Python versions.

Distance

What is Distance?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions