Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Data Analysis

Data Analysis

283 projects

Showing 36 of 283 projects

Suite2p
Suite2pPython

A complete pipeline for processing two-photon calcium imaging data, including registration, ROI detection, signal extraction, and spike deconvolution.

#neuroscience#roi-detection#two-photon
Stars452
Forks274
Last commit6 days ago
Mixed Models
Mixed ModelsJulia

A Julia package for fitting linear and generalized linear mixed-effects models with maximum likelihood estimation.

#statistical-models#maximum-likelihood#mixed-models
Stars447
Forks51
Last commit3 days ago
wavelib
wavelibC

C implementation of 1D/2D wavelet transforms including DWT, SWT, MODWT, wavelet packet transforms, and continuous wavelet transforms.

#c-library#scientific-computing#wavelet-packet-trees
Stars436
Forks137
Last commit5 months ago
sql
sqlTypeScript

A SQL GUI extension for JupyterLab that enables point-and-click database exploration and query execution.

#jupyterlab-extension#database#sql-gui
Stars433
Forks51
Last commit3 years ago
ESEUR
ESEURR

Code and data repository for reproducing examples from 'Evidence-based Software Engineering' book using publicly available data.

#statistical-analysis#cognitive-capitalism#r-language
Stars422
Forks48
Last commit3 months ago
connectordb
connectordbGo

A self-hosted, extensible personal data aggregator and analysis engine for quantified self.

#iot#personal-data-aggregator#plugin-system
Stars419
Forks32
Last commit4 years ago
odbc
odbcC++

A DBI-compliant R package for connecting to ODBC databases, offering a fast and standardized interface to SQL Server, Oracle, Databricks, Snowflake, and more.

#database#nanodbc#r-package
Stars414
Forks116
Last commit2 days ago
warcdb
warcdbPython

WarcDB is an SQLite-based file format that makes web crawl data easier to share and query.

#data-querying#database#warc
Stars405
Forks10
Last commit1 year ago
scikit-tensor
scikit-tensorPython

Python library for multilinear algebra operations and tensor factorizations with support for dense and sparse tensors.

#tensor-factorization#scientific-computing#scipy
Stars405
Forks111
Last commit7 years ago
MultivariateStats
MultivariateStatsJulia

A Julia package for multivariate statistics and data analysis, including dimension reduction techniques like PCA and LDA.

#linear-discriminant-analysis#julia#mds
Stars387
Forks84
Last commit1 month ago
TDSP-Utilities
TDSP-UtilitiesHTML

A collection of utilities and scripts for interactive data exploration, analysis, and automated modeling within Microsoft's Team Data Science Process.

#team-data-science-process#microsoft-r-server#reporting
Stars378
Forks266
Last commit
Bayadera - Bayesian Data Analysis on the GPU
Bayadera - Bayesian Data Analysis on the GPUClojure

A Clojure library for high-performance Bayesian data analysis and machine learning on the GPU.

#high-performance-computing#bayesian-statistics#bayesian-data-analysis
Stars370
Forks24
Last commit
Weave
WeaveActionScript

A web-based platform for data analysis and visualization with support for multiple data sources and interactive dashboards.

#geospatial-analysis#data-integration#business-intelligence
Stars368
Forks68
Last commit7 years ago
kixistats
kixistatsClojure

A Clojure/ClojureScript library for statistical distribution sampling and transducing functions.

#transducers#clojurescript#statistics
Stars368
Forks20
Last commit7 months ago
Tablecloth
TableclothClojure

A Clojure dataset manipulation library providing a dplyr-like API on top of tech.ml.dataset.

#columnar-data#dataframe#dataset-api
Stars363
Forks29
Last commit1 month ago
SparklingPandas
SparklingPandasPython

A Python library that provides a Pandas-like API on top of Apache Spark DataFrames for distributed data analysis.

#apache-spark#dataframe#python
Stars361
Forks79
Last commit2 years ago
ehrapy
ehrapyPython

A modular Python framework for exploratory analysis of heterogeneous epidemiological and electronic health record (EHR) data.

#epidemiology#statistical-analysis#scverse
Stars352
Forks47
Last commit5 days ago
anomalize
anomalizeR

A tidy R package for detecting anomalies in time series data using decomposition and statistical methods.

#iqr#detect anomalies#anomaly
Stars339
Forks61
Last commit2 years ago
RPostgres
RPostgresR

A DBI-compliant R interface to PostgreSQL, rewritten in C++ for improved performance and reliability.

#database#postgres#r-package
Stars338
Forks81
Last commit15 days ago
DBI
DBIR

A database interface (DBI) definition for communication between R and relational database management systems (RDBMS).

#interface#database#r-package
Stars318
Forks82
Last commit15 days ago
Hypothesis Tests
Hypothesis TestsJulia

A comprehensive Julia package implementing a wide range of statistical hypothesis tests for data analysis.

#hacktoberfest#julia#statistical-inference
Stars317
Forks87
Last commit11 days ago
visavail
visavailJavaScript

A D3.js library for visualizing time data availability with Gantt-like charts to identify gaps in datasets.

#chart#gantt#open-source
Stars312
Forks56
Last commit1 year ago
slackr
slackrR

An R package for sending messages, data, alerts, and plots from R directly to Slack channels.

#workflow#r-package#slack
Stars308
Forks84
Last commit1 year ago
Ookla internet speed data
Ookla internet speed dataJupyter Notebook

Global open dataset of aggregated fixed and mobile network performance metrics (download/upload/latency) in geospatial tiles.

#parquet#geospatial-data#gis
Stars305
Forks58
Last commit1 month ago
XPlot
XPlotF#

A collection of older F# plotting libraries using Plotly and Google Charts as backends for data visualization.

#google-charts#dotnet#data-visualization
Stars289
Forks70
Last commit1 year ago
mongolite
mongoliteC

A high-performance MongoDB client for R, built on libmongoc and jsonlite, supporting aggregation, indexing, and streaming.

#database-driver#r-package#aggregation
Stars288
Forks65
Last commit1 year ago
mongolite
mongoliteC

A high-performance MongoDB client for R, built on libmongoc and jsonlite, supporting aggregation, indexing, and streaming.

#database-driver#r-package#mongodb-client
Stars288
Forks65
Last commit1 year ago
datakit
datakitJavaScript

A lightweight JavaScript library for data analysis with CSV reading, statistical methods, and chart plotting.

#statistics#lightweight#csv-parsing
Stars287
Forks11
Last commit9 years ago
nless
nlessPython

A TUI pager for exploring and analyzing tabular data from logs, CSV, JSON, and streams with vi-like keybindings.

#terminal-pager#spreadsheet#streaming-data
Stars260
Forks6
Last commit1 month ago
go-hep
go-hepGo

A comprehensive Go toolkit for High Energy Physics (HEP) data analysis, simulation, and visualization.

#lhc#scientific-computing#xrootd
Stars252
Forks38
Last commit7 months ago
elastic
elasticR

R client for the Elasticsearch HTTP API, enabling data indexing, search, and analysis from R.

#data-indexing#database#r-package
Stars245
Forks59
Last commit6 months ago
Morpheus
MorpheusJava

A high-performance, type-safe DataFrame library for the JVM enabling large-scale data analysis with parallel processing capabilities.

#scientific-computing#parallel-computing#finance
Stars245
Forks24
Last commit2 years ago
R Type Provider
R Type ProviderF#

An F# type provider that enables seamless interoperability with R packages, offering type-safe access to R functions from .NET.

#type-provider#type-providers#data-science
Stars244
Forks69
Last commit24 days ago
Urban Informatics & Visualization-Berkeley
Urban Informatics & Visualization-BerkeleyJupyter Notebook

A UC Berkeley course teaching urban data analysis, visualization, and mapping using Python and open-source tools for city planning.

#academic-course#city-planning#gis-mapping
Stars242
Forks121
Last commit
tucan
tucanElixir

An Elixir plotting library built on VegaLite, offering a clean functional API for creating interactive visualizations.

#hacktoberfest#elixir#vega-lite
Stars224
Forks8
Last commit12 days ago
reshape2  <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
reshape2 <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

An R package for flexibly rearranging, reshaping, and aggregating data, now superseded by tidyr.

#r-package#data-science#r-programming
Stars214
Forks56
Last commit
PreviousPage 6 of 8

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
7 years ago
5 years ago
7 years ago
6 months ago
Next
#Python89
#Data Science79
#Data Visualization74
#Machine Learning63
#Statistics44
#R Package42
#R36
#Scientific Computing32
#Pandas30
#Data Manipulation26
#Sql26
#Dataframe22