What open source projects can be used for extracting relevant content from various webpages? [closed]
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
So, for example, the YouTube video ID from a YouTube page, or a tweet ID from a Twitter page, or a Facebook UID from a Facebook profile...
You don't need an open source project for that. Lifting the ID from the page is usually a matter of parsing the URL that got you there. In YouTube's case, the "v" query string parameter holds the video ID. The other examples have similar answers.
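As a rough illustration of the URL-parsing approach, here is a minimal Python sketch. The function names are made up for this example, and the tweet URL layout (/<user>/status/<id>) is an assumption about how such URLs usually look, not something guaranteed by the site.

```python
from urllib.parse import urlparse, parse_qs


def youtube_video_id(url):
    """Return the value of the "v" query string parameter, or None if absent."""
    query = parse_qs(urlparse(url).query)
    values = query.get("v")
    return values[0] if values else None


def tweet_id(url):
    """Return the numeric ID from a tweet URL, or None if the path does not match.

    Assumption: the path looks like /<user>/status/<id>.
    """
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) >= 3 and parts[1] == "status" and parts[2].isdigit():
        return parts[2]
    return None


if __name__ == "__main__":
    print(youtube_video_id("https://www.youtube.com/watch?v=dQw4w9WgXcQ&t=10"))
    print(tweet_id("https://twitter.com/someuser/status/1234567890"))
```

Both helpers return None rather than raising when the URL does not have the expected shape, so the caller can decide how to handle pages that are not video or tweet pages.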
The oEmbed protocol specifies a way to retrieve structured data about a page given its URL. embed.ly is a company that provides an API based on that standard.
- http://www.oembed.com/
- http://embed.ly/