Web scraping using ruby

2023-03-19 07:09 问答作者：

I am new to programming and I have a project where I have to write a Ruby script to retrieve info on a specified repository from github, parsing the data 开发者_运维问答from JSON format, and printing it in a usable format on the command line.

I have checked out mechanize guide. Any documentation that I can check in order to complete this?

Use Github's Repositories API. Everything you want is done there, without scraping or weird hacks. JSON formatted responses by default.

Following on to @Douglas' response. What you want to do is easy using the GitHub API and the HTTParty gem:

require 'httparty'
class Repository
  include HTTParty
  base_uri 'www.github.com'
end
response = Repository.get('/api/v2/json/repos/show/joncooper/beanstalkd')

require 'awesome_print'
>> ap response.parsed_response
{
    "repository" => {
                 "name" => "beanstalkd",
                 "size" => 128,
           "created_at" => "2011/04/29 09:43:43 -0700",
             "has_wiki" => true,
               "parent" => "kr/beanstalkd",
              "private" => false,
             "watchers" => 1,
                 "fork" => true,
             "language" => "C",
                  "url" => "https://github.com/joncooper/beanstalkd",
            "pushed_at" => "2011/07/05 22:10:53 -0700",
          "open_issues" => 0,
        "has_downloads" => true,
           "has_issues" => false,
             "homepage" => "http://kr.github.com/beanstalkd/",
                "forks" => 0,
          "description" => "Beanstalk is a simple, fast work queue.",
               "source" => "kr/beanstalkd",
                "owner" => "joncooper"
    }
}

See http://httparty.rubyforge.org/ for more.

继续阅读：json ruby

Web scraping using ruby

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？