A Ruby library for data analysis with DataFrame and Vector structures, offering storage, manipulation, and visualization.
daru is a Ruby library for data analysis that provides DataFrame and Vector data structures for storing, manipulating, and visualizing data. It makes data analysis practical in Ruby by offering an intuitive API and compatibility with various data formats and plotting tools.
daru is aimed at Ruby developers and data analysts who need to perform data analysis, manipulation, and visualization within Ruby applications or IRuby notebooks.
Developers choose daru for its pure Ruby implementation, flexible API, and integration with other Ruby gems such as statsample and plotting libraries, which together cover most of a data analysis workflow in Ruby.
Data Analysis in RUby
Works across all Ruby versions without external dependencies, as stated in the README, ensuring easy adoption in any Ruby environment.
Supports singly and hierarchically indexed data structures, which enables complex manipulation of time series and categorical data, as detailed in the project's notebooks.
Seamlessly works with IRuby notebooks, statsample for statistics, and plotting libraries like Nyaplot, facilitating interactive analysis and visualization.
Offers intuitive APIs for splitting, aggregation, grouping, and pivot tables, as shown in examples with where clauses and column operations.
Pure Ruby implementation can be slower for large datasets; optional optimizations with NMatrix require extra setup and aren't default.
Plotting depends on external gems like Nyaplot or GnuplotRB, which have smaller communities and fewer features than their Python counterparts.
Smaller user base than pandas means fewer tutorials, third-party tools, and slower response to issues, despite Google Summer of Code backing.