Parse The Web At Lightning Speed
Page Munch is a simple API that allows you to turn webpages into rich, structured JSON.
Easily extract photos, videos, event, author and other metadata from any page on the internet in milliseconds.
{
"url": "http:\/\/www.imdb.com\/title\/tt0111161\/",
"type": "Movie",
"schema": "http:\/\/schema.org\/Movie",
"title": "The Shawshank Redemption (1994)",
"description": "Directed by Frank Darabont. With Tim Robbins, Morgan Freeman, Bob Gunton, William Sadler. Two imprisoned men bond over a number of years, finding solace and eventual redemption through acts of common decency.",
"director": "Frank Darabont",
"image": {
"format": "jpeg",
"url": "http:\/\/ia.media-imdb.com\/images\/M\/MV5BMTM2NjEyNzk2OF5BMl5BanBnXkFtZTcwNjcxNjUyMQ@@._V1._SX93_SY140_.jpg",
"width": 92,
"height": 140
},
"actor": [
...
📄 Any Webpage
Convert any HTML page on the internet into predictable, structured JSON data, all we need is the URL.
Microformats & HTML5
PageMunch understands the semantics of HTML5, microformats and many other semantic formats to collect the right information.
We ♥ JSON
API responses contain flexible and portable JSON data, for easy use in any application, framework and language.
Rich Previews
Create Facebook style page previews
It's easy to create Facebook-style page previews for links shared in your app or website. One request will return the correct title, description, image and other data such as video embed urls.
Classify Links
Find out what your users share
Want to know what kind of content your users are sharing? Use the PageMunch API to easily categorise URL's with a single request.
Data Mining
Extract accurate information
Millions of webpages contain microformats with dates, locations, prices, author attribution and much more data.
PageMunch understands the many different semantic standards to extract accurate a detailed data independent of the source.