A PHP library to extract metadata, embed codes, and structured data from any web page using multiple protocols.
Embed is a PHP library that extracts structured metadata and embeddable content from web pages. It solves the problem of consistently retrieving information like titles, descriptions, images, and embed codes across various services (e.g., YouTube, Twitter, Instagram) by supporting multiple protocols such as oEmbed, OpenGraph, and HTML scraping.
PHP developers building applications that need to parse or display content from external URLs, such as social media aggregators, content preview generators, or CMS integrations.
Developers choose Embed for its comprehensive protocol support, extensible adapter/detector system, and PSR-standard compliance, which simplify fetching reliable metadata from diverse sources without writing custom parsers for each service.
Get info from any web service or page
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Extracts data using oEmbed, OpenGraph, Twitter Cards, and HTML scraping, ensuring compatibility with platforms like YouTube, Vimeo, and Twitter without custom parsers for each.
Allows creating custom adapters for specific sites (e.g., Wikipedia, Archive.org) and detectors for new data points, enabling handling of niche or evolving web services.
Built on PSR-7, PSR-17, and PSR-18 standards with automatic detection of popular implementations like Guzzle and Laminas, promoting interoperability in PHP ecosystems.
Supports asynchronous fetching of multiple URLs simultaneously via getMulti(), improving performance for batch operations like generating multiple link previews.
Requires Curl library and a PSR-17 implementation, adding setup complexity and potential version conflicts, especially in constrained or legacy environments.
Cannot execute JavaScript, so it fails to extract metadata from dynamic content or single-page applications that rely on client-side rendering.
Full functionality for services like Facebook, Instagram, and Twitter requires API tokens, necessitating additional configuration and maintenance.
Each metadata fetch involves HTTP requests, which can be slow and unreliable, impacting performance in high-volume or latency-sensitive applications.