Showing 13 of 13 projects
A powerful Python library for data analysis and manipulation, providing fast, flexible data structures.
A powerful Python library for data manipulation and analysis, providing fast, flexible data structures.
An open-source data-centric AI library for automatically detecting and fixing data quality issues in machine learning datasets.
Fuzzy string matching library for Python that calculates similarity between strings using Levenshtein Distance.
A Python library using machine learning for accurate and scalable fuzzy matching, record deduplication, and entity resolution on structured data.
A sample MySQL database with integrated test suite for testing applications and database servers.
A flexible and expressive API for performing statistical data validation on dataframe-like objects.
A Python library for visualizing missing data in pandas DataFrames using matrix, bar, heatmap, and dendrogram plots.
A Python library that fixes mojibake and other Unicode text glitches by detecting and correcting encoding mix-ups.
A Python library that fixes mojibake and other Unicode text glitches by detecting and correcting encoding mix-ups.
An open-source Python library for low-code data preparation, offering fast EDA, data cleaning, and collection from APIs and databases.
A Python library for approximate and phonetic string matching, implementing algorithms like Levenshtein distance and Soundex.
A Python library for agile data preparation workflows that works with Pandas, Dask, cuDF, Dask-cuDF, Vaex, and PySpark.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.