Downloading PDF content from a website -

2023-03-02 13:28 问答作者：

I'm trying to download a PDF to my desktop - The PDF upda开发者_如何学Pythontes about every couple days with new content, and I was trying to see if there is a way to have the PDF automatically update its self when it has fresh content without having to go to the actual link.

-- http://www.uakron.edu/dotAsset/1265971.pdf

Assuming this is even remotely a programming question, you could try a HTTP HEAD query (ideally sending a If-Modified-Since header in your request), and inspect the response headers - if the server is friendly, it'll tell you whether it hasn't been updated via a 304 response code.

If you don't get a 304, then issue a GET request and save the response stream.

You could also just try issuing a GET with last-modified (skipping the HEAD); but a HEAD request might save some bandwidth if the server isn't entirely happy with just a GET / 304.

Not tested extensively, but:

using System;
using System.IO;
using System.Net;

static class Program
{
    static void Main()
    {
        string url = "http://www.uakron.edu/dotAsset/1265971.pdf", localPath = "1265971.pdf";

        var req = (HttpWebRequest)WebRequest.Create(url);
        req.AutomaticDecompression = DecompressionMethods.Deflate | DecompressionMethods.GZip;
        req.Headers.Add("Accept-Encoding","gzip,deflate");
        if(File.Exists(localPath))
            req.IfModifiedSince = File.GetLastWriteTimeUtc(localPath);
        try
        {
            using (var resp = req.GetResponse())
            {
                int len;
                checked
                {
                    len = (int)resp.ContentLength;
                }
                using (var file = File.Create(localPath))
                using (var data = resp.GetResponseStream())
                {
                    byte[] buffer = new byte[4 * 1024];
                    int bytesRead;
                    while (len > 0 && (bytesRead = data.Read(buffer, 0, Math.Min(len, buffer.Length))) > 0)
                    {
                        len -= bytesRead;
                        file.Write(buffer, 0, bytesRead);
                    }
                }
            }
            Console.WriteLine("New version downloaded");
        }
        catch (WebException ex)
        {
            if (ex.Response == null || ex.Status != WebExceptionStatus.ProtocolError)
                throw;
            Console.WriteLine("Not updated");
        }
    }
}

Downloading PDF content from a website -

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？