Showing 36 of 278 projects
A pandas DataFrame wrapper for calculating over 70 stock market indicators and statistics with inline column access.
An R package for reshaping and tidying data into a consistent format for easier analysis.
An educational tutorial and working demonstration pipeline for RNA-seq analysis on cloud platforms.
A visual, low-code data preparation tool that generates Python code for ETL, reporting, and AI-assisted workflows.
Query pandas DataFrames using SQL syntax, similar to sqldf in R.
A free software AI accelerator that speeds up scikit-learn applications by 10-100x on CPUs and GPUs with no code changes.
A curated collection of 500+ resources for data analysis and data science, covering Python, SQL, ML, visualization, roadmaps, and interview prep.
A Java/Groovy/JavaFX data visualization tool for ETL, machine learning, and publishing web visualizations.
A graphical toolkit for exploring, analyzing, and modifying real-time data streams through a visual interface.
A Python framework for processing seismological data, providing parsers, data clients, and signal processing routines.
A lightweight and intuitive Go library for data manipulation, statistics, and machine learning using DataFrames.
A collection of Jupyter notebooks for financial economics, providing high-level APIs to retrieve, analyze, and visualize economic data from sources like FRED.
A Python toolbox for explainable AI, providing tools for data analysis, model evaluation, and bias mitigation in machine learning.
A lightweight Python library for anomaly detection and correlation in time series data, enabling root cause analysis.
An advanced open-source MPP database for data warehousing, large-scale analytics, and AI/ML workloads.
A flexible Python framework for fast network flow data analysis, offering encrypted application identification, statistical feature extraction, and extensibility via plugins.
A language and embedded JIT compiler for efficient dynamic expression evaluation, data storage, and analysis in C++ applications.
A Python library that adds dynamic data aggregation to Plotly figures for scalable visualization of large time series.
A dataset and npm module quantifying the performance impact of third-party scripts across the web, categorized by entity.
A comprehensive collection of notes, tutorials, and resources for RNA-seq data analysis, covering alignment, quantification, differential expression, and more.
An open-source, Python-based data analysis tool with specialized data types and methods for genomic data at scale.
A Ruby library for data analysis with DataFrame and Vector structures, offering storage, manipulation, and visualization.
A fast and friendly R package for reading rectangular data from delimited files like CSV and TSV.
A .NET library for data and time series manipulation with structured data frames, designed for scientific programming.
A meta gem that bundles scientific computing and visualization libraries for Ruby, enabling data analysis and plotting.
An R interface for Apache Spark that enables distributed data processing, machine learning, and SQL queries using familiar R syntax.
Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
An R package that extends ggplot2 with missing functionality for custom plot composition and advanced visualizations.
A JavaScript library for linear least-squares curve fitting and regression analysis.
A curated list of awesome tools, libraries, and resources for working with CSV files.
Learn statistics through Python with real-world examples like analyzing marijuana price data across US states.
A Ruby machine learning library with a Scikit-Learn-like interface for classification, regression, clustering, and dimensionality reduction.
A JavaScript library for scientific and statistical computing, offering R-like statistical methods and linear algebra.
A curated collection of resources for Go-based data analysis, visualization, machine learning, and data science.
A comprehensive collection of notes, tools, and resources for analyzing ChIP-seq and related epigenomic data.
A flexible command-line tool for generating graphs and charts from CSV data files.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.