Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Data Analysis

Data Analysis

282 projects

Showing 36 of 282 projects

voila
voilaPython

Voilà converts Jupyter notebooks into secure, standalone web applications with interactive widgets.

#jupyterlab-extension#notebook#data-science
Stars5.9k
Forks528
Last commit7 days ago
Curated list of Python tutorials for Data Science, NLP and Machine Learning
Curated list of Python tutorials for Data Science, NLP and Machine LearningPython

A curated collection of Python tutorials and resources for data science, machine learning, and natural language processing.

#python-tutorials#educational-resources#data-science
Stars5.8k
Forks1.5k
Last commit
OctoSQL
OctoSQLGo

A CLI tool and dataflow engine that lets you query and join data from multiple databases and file formats using SQL.

#stream-processing#plugin-system#redis
Stars5.3k
Forks212
Last commit2 years ago
awesome-geospatial
awesome-geospatial

A curated list of open-source geospatial analysis tools, libraries, and resources across multiple programming languages and domains.

#lidar#web-mapping#open-source
Stars5.1k
Forks732
Last commit4 days ago
dplyr <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
dplyr <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

A grammar of data manipulation for R, providing a consistent set of verbs to solve common data manipulation challenges.

#r-package#data-science#data-wrangling
Stars5.0k
Forks2.1k
Last commit
R Studio
R StudioJava

An integrated development environment (IDE) for the R programming language with a comprehensive workbench and server capabilities.

#data-science#r-language#research-tools
Stars5.0k
Forks1.2k
Last commit1 day ago
matplotplusplus
matplotplusplusC++

A C++ graphics library for data visualization with interactive plotting, high-quality export, and dozens of plot categories.

#scientific-visualization#scientific-computing#graphics
Stars4.9k
Forks378
Last commit2 months ago
dataset
datasetPython

A Python library for easy database interaction with automatic table creation, bulk loading, and transaction support.

#database#transaction-management#python
Stars4.9k
Forks299
Last commit1 month ago
OpenAgents
OpenAgentsPython

An open platform for deploying and using language agents for data analysis, plugin automation, and web browsing.

#hacktoberfest#web-browsing#flask
Stars4.8k
Forks531
Last commit1 year ago
Blazer
BlazerRuby

A Rails engine for business intelligence that lets you explore data with SQL, create charts and dashboards, and share insights with your team.

#business-intelligence#self-hosted-analytics#dashboard
Stars4.8k
Forks499
Last commit12 days ago
Visual-Insights
Visual-InsightsTypeScript

An open-source augmented analytics platform that automates exploratory data analysis and visualization with AI-powered insights.

#automated-insights#augmented-analytics#datamining
Stars4.7k
Forks381
Last commit
datascience
datascience

A curated collection of Python libraries, tutorials, and tools for data science, from data wrangling to machine learning and visualization.

#data-science#statistics#deep-learning
Stars4.6k
Forks711
Last commit3 days ago
plotnine
plotninePython

A Python implementation of a grammar of graphics for creating complex and beautiful statistical plots.

#scientific-visualization#graphics#matplotlib
Stars4.6k
Forks248
Last commit4 days ago
missingno
missingnoPython

A Python library for visualizing missing data in pandas DataFrames using matrix, bar, heatmap, and dendrogram plots.

#data-cleaning#missing-data#python-library
Stars4.2k
Forks525
Last commit2 years ago
zvt
zvtPython

A modular quantitative finance framework for data collection, analysis, strategy backtesting, and machine learning across multiple markets.

#trading-platform#technical-analysis#factor-investing
Stars4.2k
Forks988
Last commit1 month ago
xarray
xarrayPython

A Python package for working with labeled multi-dimensional arrays, inspired by pandas and tailored for scientific data.

#multi-dimensional-arrays#labeled-data#scientific-computing
Stars4.2k
Forks1.3k
Last commit3 days ago
Data Science Specialization
Data Science SpecializationHTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics
Stars4.1k
Forks31.0k
Last commit5 years ago
"most important thing in data science is the question"
"most important thing in data science is the question"HTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics
Stars4.1k
Forks31.0k
Last commit5 years ago
TorBot
TorBotPython

An open-source intelligence (OSINT) tool for crawling and analyzing websites on the dark web and beyond.

#python-web-crawler#spider#osint
Stars4.1k
Forks675
Last commit5 months ago
data.table <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
data.table <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

A high-performance R package for fast data manipulation of large datasets, extending data.frame with concise syntax and memory efficiency.

#parallel-computing#high-performance#r-package
Stars3.9k
Forks1.0k
Last commit
dsq
dsqGo

A command-line tool for running SQL queries against JSON, CSV, Excel, Parquet, and other structured data files.

#parquet#cli-tool#openoffice-calc
Stars3.9k
Forks160
Last commit2 years ago
Tablesaw
TablesawJava

A Java dataframe and visualization library for data loading, cleaning, transformation, and analysis.

#statistical-analysis#chart#data-science
Stars3.8k
Forks649
Last commit3 months ago
Math.NET Numerics
Math.NET NumericsC#

An open-source numerical library for .NET and Mono providing algorithms for scientific computing, linear algebra, statistics, and more.

#scientific-computing#fft#statistics
Stars3.7k
Forks934
Last commit1 year ago
QSV
QSVRust

A blazing-fast command-line toolkit for querying, slicing, analyzing, transforming, and validating tabular data (CSV, Excel, JSONL, etc.).

#ckan#parquet#luau
Stars3.7k
Forks104
Last commit1 day ago
simple-statistics
simple-statisticsJavaScript

A lightweight, dependency-free JavaScript library for descriptive, regression, and inference statistics.

#statistics#math#inference
Stars3.5k
Forks231
Last commit4 days ago
Data Science
Data Science

A curated list of Python software for data science, covering machine learning, deep learning, visualization, and data manipulation.

#data-science#deep-learning#awesome-list
Stars3.5k
Forks447
Last commit1 month ago
Breeze
BreezeScala

A numerical processing library for Scala, providing generic, clean, and powerful linear algebra and scientific computing capabilities.

#scientific-computing#netlib#matrix-operations
Stars3.5k
Forks690
Last commit8 months ago
PyBroker
PyBrokerPython

A Python framework for developing and backtesting algorithmic trading strategies with machine learning.

#ai#trading#backtesting
Stars3.4k
Forks438
Last commit28 days ago
gota
gotaGo

A Go library providing DataFrames, Series, and data wrangling operations for structured data manipulation.

#data-wrangling#go-library#structured-data
Stars3.3k
Forks290
Last commit2 years ago
gota
gotaGo

A Go library providing DataFrames, Series, and data wrangling operations for tabular data manipulation.

#dataframe#data-wrangling#series
Stars3.3k
Forks290
Last commit2 years ago
mimic-code
mimic-codeJupyter Notebook

A central hub for sharing, refining, and reusing code for analyzing the MIMIC family of critical care and hospital databases.

#medical-databases#sql-scripts#reproducible-research
Stars3.2k
Forks1.7k
Last commit1 month ago
ChartGPU
ChartGPUTypeScript

A WebGPU-accelerated TypeScript charting library for rendering millions of data points at 60 FPS with interactive dashboards.

#real-time-dashboards#open-source#high-performance
Stars3.1k
Forks95
Last commit1 month ago
SweetViz
SweetVizPython

A Python library for automated exploratory data analysis (EDA) with high-density visualizations and target analysis in two lines of code.

#statistical-analysis#data-science#automated-reporting
Stars3.1k
Forks287
Last commit1 month ago
xLearn
xLearnC++

A high-performance, easy-to-use, and scalable machine learning package for linear models, factorization machines, and field-aware factorization machines.

#ffm#high-performance#python-library
Stars3.1k
Forks515
Last commit2 years ago
stats
statsGo

A comprehensive, dependency-free statistics library for Go with extensive mathematical functions and thorough testing.

#regression-analysis#mathematics#statistics
Stars3.0k
Forks174
Last commit1 month ago
DataStation
DataStationTypeScript

An open-source data IDE for developers to query, script, and visualize data from databases, files, and APIs.

#data-ide#desktop-app#data-integration
Stars3.0k
Forks112
Last commit2 years ago
PreviousPage 2 of 8

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
2 years ago
6 days ago
3 months ago
1 day ago
Next
#Python89
#Data Science78
#Data Visualization74
#Machine Learning63
#Statistics44
#R Package42
#R36
#Scientific Computing32
#Pandas30
#Sql26
#Data Manipulation25
#Dataframe21