Showing 27 of 27 projects
A polyglot document intelligence framework with a Rust core for extracting text, metadata, and structured data from 91+ file formats.
A Python library and CLI tool for web crawling, scraping, and extracting main text, metadata, and comments from web pages.
A Windows tool for extracting metadata and hidden information from documents found on web pages and local files.
A PHP library to extract metadata, embed codes, and structured data from any web page using multiple protocols.
A Ruby wrapper for FFmpeg that simplifies reading video metadata and transcoding movies.
A Swift library for generating link previews (title, description, images) from URLs on Apple platforms.
A Ruby gem that fetches image size and type from a URI by downloading only the minimal necessary data.
A fast, multi-threaded file system indexer and search tool with a web interface, supporting text/metadata extraction, thumbnails, OCR, and incremental scanning.
A C++ library and command-line utility to read, write, delete, and modify Exif, IPTC, XMP, and ICC image metadata.
A .NET library for extracting Exif, IPTC, XMP, ICC, and other metadata from image, video, and audio files.
A Ruby gem for web scraping that extracts titles, meta tags, links, images, and structured data from URLs.
A Swift library for extracting article previews including title, description, images, and metadata from web pages.
A lightweight, dependency-free Python library for reading metadata and images from various audio file formats.
A library for extracting and displaying technical metadata and tags from video and audio files.
A scalable, modular object scanner and intrusion detection system that extracts, flags, and enriches files with metadata.
A Markdown parser for Clojure and ClojureScript with extensive customization options.
Ruby gem that fetches images and metadata from URLs to generate link previews, similar to social media previews.
A Ruby library for extracting text and metadata from various document formats using Apache Tika.
An Android library that generates link previews by extracting titles, descriptions, and images from URLs.
A simple C library for reading, writing, and filtering PDF files with support for encryption and metadata.
A Windows GUI tool that reconstructs directory trees and analyzes FILE records from NTFS Master File Table ($MFT) files.
An Elixir library for extracting and curating the primary readable content from webpages.
An open-source platform for storing, visualizing, and sharing geospatial data like orthophotos, point clouds, and 3D models.
A CLI tool that uses AI to automatically categorize, rename, and organize files based on their content.
An NSURL extension for Swift that fetches webpage preview metadata (title, description, image) like Facebook link previews.
A fast Elixir library to parse image binaries and extract dimensions, mime-type, and validity for 13+ formats.
A Node.js library to generate HTML embeds from URLs using oEmbed, Open Graph, and meta tags.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.