Show HN: Page.REST – An API to fetch details from a web page as JSON
For absolutely no cost you can use http://proc.link, which returns oEmbed info for ANY URL.
General example: http://api.proc.link/oembed?url=http%3A%2F%2Fpage.rest
Youtube: http://api.proc.link/oembed?url=https%3A%2F%2Fwww.youtube.co...
Facebook: http://api.proc.link/oembed?url=https%3A%2F%2Fwww.facebook.c...
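To illustrate the pattern in those example links: the target URL is percent-encoded and passed as a `url` query parameter. A minimal sketch, assuming the endpoint returns JSON as described (the function names here are made up for illustration):

```python
import json
import urllib.parse
import urllib.request

def build_oembed_url(target_url, endpoint="http://api.proc.link/oembed"):
    """Percent-encode the target URL into the endpoint's `url` parameter."""
    return endpoint + "?" + urllib.parse.urlencode({"url": target_url})

def fetch_oembed(target_url):
    """Fetch and decode the oEmbed JSON for a page (assumes a JSON response)."""
    with urllib.request.urlopen(build_oembed_url(target_url)) as resp:
        return json.load(resp)
```

For example, `build_oembed_url("http://page.rest")` reproduces the "General example" link above.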
If I have to know the elements' selectors, why should I prefer this service over using an HTML parser?
I would make the $5 price and token validity much larger, like "4rem" or something. I was looking at the CC input field and thinking, "Seriously? How much will you charge?"
Not unlike YQL - https://developer.yahoo.com/yql/
Hmm, this is interesting. Reminds me of: https://wrapapi.com
Looks interesting. I wonder what kind of market this app might serve. For larger apps, I would worry about support. 5 dollars per year tells me that the developer is doing this as a hobby. For small side projects, I can see tinkerers building this themselves.
I did something similar a loooong time ago. Granted not as sexy.
I wrote a follow-up blog post about what I learnt from shipping Page.REST: http://www.laktek.com/what-i-learned-from-building-pagerest/
Added support for OpenGraph extraction https://www.page.rest/#open-graph
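For anyone curious what OpenGraph extraction involves under the hood: it amounts to collecting the `og:*` `<meta>` tags from the page head. A rough sketch with Python's stdlib parser (not the service's actual implementation):

```python
from html.parser import HTMLParser

class OpenGraphParser(HTMLParser):
    """Collect og:* properties from <meta property="og:..." content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.og = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        prop = attrs.get("property", "")
        if prop.startswith("og:") and "content" in attrs:
            self.og[prop] = attrs["content"]

def extract_open_graph(html):
    parser = OpenGraphParser()
    parser.feed(html)
    return parser.og
```

Non-OpenGraph `<meta>` tags (e.g. `name="description"`) are simply ignored.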
Is there a way to await JS frontends (i.e. Angular/Ember etc.) initialising before the scrape via a selector?
Someone could really abuse this service; I don't see any mention of API limits.
Zapier it.
How do you handle sites that have scraper prevention? Such as captcha, IP throttling, etc.
I'd pay $100-$500 per month for a service that could reliably scrape some particularly difficult sites. That said, I'd need the service to be able to handle ~100 req/s in bursts and 2-4 req/s on average.
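That traffic profile (low sustained rate, large bursts) is exactly what a token bucket models. A generic sketch using the commenter's numbers, with an injectable clock so it can be tested deterministically; none of this reflects how the actual service limits requests:

```python
import time

class TokenBucket:
    """Allow short bursts up to `capacity` while refilling at `rate` tokens/sec."""

    def __init__(self, rate, capacity, now=time.monotonic):
        self.rate = rate          # sustained requests per second (e.g. 4)
        self.capacity = capacity  # maximum burst size (e.g. 100)
        self.tokens = capacity    # start full so an initial burst is allowed
        self.now = now
        self.last = now()

    def allow(self):
        """Consume one token if available; return whether the request may proceed."""
        t = self.now()
        self.tokens = min(self.capacity, self.tokens + (t - self.last) * self.rate)
        self.last = t
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

With `rate=4, capacity=100`, a cold bucket admits a 100-request burst, then throttles to 4 requests per second.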