Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Structured Data

Structured Data

29 projects

Showing 29 of 29 projects

ProtoBuf
ProtoBufC++

A language-neutral, platform-neutral, extensible mechanism for serializing structured data developed by Google.

#protoc#data-serialization#schema-definition
Stars71.3k
Forks16.2k
Last commit21 hours ago
docling
doclingPython

A Python library for parsing diverse document formats into structured data, optimized for integration with generative AI applications.

#ai#tables#documents
Stars61.2k
Forks4.3k
Last commit1 day ago
PowerShell
PowerShellC#

A cross-platform automation and configuration tool/framework optimized for structured data, REST APIs, and object models.

#hacktoberfest#devops#rest-api
Stars53.8k
Forks8.3k
Last commit6 days ago
nushell
nushellRust

A modern, cross-platform shell that treats data as structured tables instead of plain text.

#productivity#nushell#pipeline
Stars39.7k
Forks2.2k
Last commit1 day ago
colly
collyGo

A fast and elegant scraping and crawling framework for Go, designed for extracting structured data from websites.

#spider#crawler#scraper
Stars25.3k
Forks1.9k
Last commit14 days ago
AutoGluon
AutoGluonPython

An automated machine learning library that trains and deploys high-accuracy models for tabular, text, image, and time series data with minimal code.

#ensemble-learning#python-library#data-science
Stars10.5k
Forks1.2k
Last commit3 days ago
Next SEO
Next SEOTypeScript

A Next.js plugin for adding structured data (JSON-LD) components to improve SEO and search appearance.

#hacktoberfest#json-ld#nextjs
Stars8.5k
Forks448
Last commit3 months ago
BAML
BAMLRust

A prompting language for building reliable AI workflows and agents with type-safe, structured outputs across multiple programming languages.

#multi-language#boundaryml#ai-framework
Stars8.3k
Forks427
Last commit1 day ago
structured-text-tools
structured-text-tools

A curated list of command-line tools for manipulating structured text data like CSV, JSON, XML, YAML, and more.

#delimited-files#command-line-tools#yaml
Stars7.1k
Forks249
Last commit4 months ago
Dedupe
DedupePython

A Python library using machine learning for accurate and scalable fuzzy matching, record deduplication, and entity resolution on structured data.

#data-cleaning#de duplicating#python-library
Stars4.5k
Forks569
Last commit10 months ago
SEOTools
SEOToolsPHP

A Laravel package providing helpers for common SEO techniques including meta tags, OpenGraph, Twitter Cards, and JSON-LD.

#lumen#json-ld#laravel
Stars3.4k
Forks516
Last commit2 months ago
gota
gotaGo

A Go library providing DataFrames, Series, and data wrangling operations for structured data manipulation.

#data-wrangling#go-library#structured-data
Stars3.3k
Forks290
Last commit2 years ago
TensorFlow Fold
TensorFlow FoldPython

A library for creating TensorFlow models that handle structured data with dynamic computation graphs using dynamic batching.

#deep-learning#neural-networks#natural-language-processing
Stars1.8k
Forks263
Last commit5 years ago
ngs
ngsC

A modern programming language designed specifically for DevOps tasks, offering structured data handling and cloud integration.

#programming-language#devops#shell-scripting
Stars1.5k
Forks49
Last commit1 month ago
amphi-etl
amphi-etlTypeScript

A visual, low-code data preparation tool that generates Python code for ETL, reporting, and AI-assisted workflows.

#jupyterlab-extension#analytics-automation#datatransformation
Stars1.4k
Forks105
Last commit1 day ago
Schema.NET
Schema.NETC#

Strongly typed C# classes for Schema.org structured data, serializable to JSON-LD and XML for .NET applications.

#json-ld#csharp#schema
Stars683
Forks87
Last commit6 days ago
jsongrep
jsongrepRust

A command-line tool and Rust library for fast querying of JSON, YAML, TOML, and other documents using regular path expressions.

#search#developer-tools#yaml
Stars646
Forks11
Last commit1 month ago
marshmallow
marshmallowGo

A high-performance Go library for JSON unmarshalling that handles both known and unknown fields without data loss.

#json-unmarshalling#unstructured-data#go-library
Stars392
Forks11
Last commit2 years ago
nvim-jqx
nvim-jqxLua

A Neovim plugin that populates the quickfix window with JSON/YAML entries for easy browsing and querying.

#json-browser#editor-tool#neovim-plugin
Stars338
Forks7
Last commit2 years ago
Instructor for PHP
Instructor for PHPPHP

A PHP library for structured data extraction from LLMs, unified LLM API access, and building AI agents.

#ai#laravel#gemini
Stars317
Forks26
Last commit1 month ago
gocrud
gocrudGo

A Go framework to simplify CRUD operations for arbitrarily deep structured data using graph concepts.

#backend-development#graph-operations#data-versioning
Stars307
Forks23
Last commit7 years ago
Summary Generation From Structured Data
Summary Generation From Structured DataPython

A TensorFlow implementation of neural text generation from structured data, converting tabular information into natural language summaries.

#data-to-text#neural-networks#text-generation
Stars186
Forks56
Last commit7 years ago
graph-2-text
graph-2-textPython

A PyTorch implementation combining Graph Convolutional Networks with OpenNMT-py for structured data to text generation.

#graph-neural-networks#opennmt#data-to-text
Stars153
Forks28
Last commit7 years ago
videre.nvim
videre.nvimLua

A JSON explorer plugin for Neovim that renders structured data as an interactive graph in the terminal interface.

#graph#yaml#terminal
Stars140
Forks3
Last commit22 hours ago
MagePlaza Seo
MagePlaza SeoPHP

A free Magento 2 extension that automatically optimizes SEO with duplicate content prevention, structured data, sitemaps, and meta tag management.

#magento#open-source#e-commerce
Stars138
Forks51
Last commit6 days ago
go-vcard
go-vcardGo

A Go library for parsing and formatting vCard data according to RFC 6350.

#go-modules#contacts#go-library
Stars126
Forks38
Last commit1 year ago
Wagtail SEO
Wagtail SEOPython

A comprehensive SEO package for Wagtail CMS that handles search engine and social media optimization.

#social-media#content-management#search-engine-optimization
Stars94
Forks29
Last commit11 months ago
feeder_ex
feeder_exElixir

An Elixir wrapper for the feeder RSS parser library, providing structured feed and entry parsing.

#elixir#wrapper-library#rss-parser
Stars71
Forks12
Last commit4 years ago
DynamicDataObjects
DynamicDataObjectsPascal

A Delphi library for modeling structured data with serialization to/from multiple formats like JSON, CBOR, BSON, and MessagePack.

#bson#data-serialization#cbor
Stars69
Forks16
Last commit6 months ago

Related Tags

#Seo5#Json5#Machine Learning5#Data Processing4#Python Library3#Toml3#Json Ld3#Serialization3#Search Engine Optimization3#Go3#Shell3#Yaml3
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub