Showing 20 of 56 projects
A robust HTML to Markdown converter with plugin support, usable as a Go library, CLI tool, or via hosted API.
A Python module to bypass Cloudflare's anti-bot page by solving JavaScript challenges using Node.js.
A Go library for cross-browser automation, controlling Chromium, Firefox, and WebKit with a single API.
A Go library to automate Chromium, Firefox, and WebKit browsers with a single API for cross-browser web automation.
A PHP and Symfony library for browser testing and web scraping using real browsers via the WebDriver protocol.
A scriptable browser based on Firefox's Gecko engine, compatible with PhantomJS API for web automation and testing.
Official .NET library for cross-browser web automation and testing with Chromium, Firefox, and WebKit.
A TensorFlow-based CNN solution for recognizing character-based CAPTCHAs, providing training, validation, and API modules.
A robust Go library for parsing RSS, Atom, and JSON feeds with support for extensions and invalid feed handling.
A PHP library to control headless Chrome/Chromium instances for browser automation, screenshots, and PDF generation.
A Rust library for parsing HTML and querying elements using CSS selectors.
A simple and fast HTML and XML parser for PHP with CSS selector and XPath support.
A Python library and CLI tool that converts HTML into clean, readable Markdown-formatted plain text.
A PHP library to extract metadata, embed codes, and structured data from any web page using multiple protocols.
A simple HTML parser for Elixir that enables search for nodes using CSS selectors.
A high-level Ruby API for controlling Chrome/Chromium browsers directly via the Chrome DevTools Protocol.
A high-level Ruby API for controlling Chrome/Chromium via the Chrome DevTools Protocol without Selenium dependencies.
An async Python web scraping micro-framework built on asyncio and aiohttp for fast, extensible crawling.
A batteries-included Ruby framework for easy web-scraping with built-in debug mode and rate limiting.
A tidyverse package for web scraping in R, inspired by Beautiful Soup and designed for data extraction workflows.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.