httplib: incomplete read

2023-01-15 22:36 问答作者：

I have some python code on both the client and server side. I am getting an IncompleteRead exception thrown for what seems to be no good reason. I can navigate to the URL with Firefox without any error message and also WGET it without any odd results.

The server code is:

import random
import hashlib
print "Content-Type: text/html"     
print                              

m = hashlib.md5()
m.update(str(random.random()))
print m.hexdigest()
print

On the client site, I use a relatively straightforward POST approach:

    data = urllib.urlencode({"username": username,
                     "password" : password})
    #POST in the data.
    req = urllib2.Request(url, data)

    response = urllib2.urlopen(req)
    string =  response.read()

And the response.read() throws the error.

Edit: Further information - Adding explicit CRLF emissions does not alter the change. Checking the error log

[Wed Sep 08 10:36:43 2010] [error] [client 192.168.80.1] (104)Connection reset by peer: ap_content_length_filter: apr_bucket_read() failed

The SSL access log shows(mildly r开发者_StackOverflowedacted):

192.168.80.1 - - [08/Sep/2010:10:38:02 -0700] "POST /serverfile.py HTTP/1.1" 200 1357 "-" "Python-urllib/2.7"

Does terminating the lines with \r\n make any difference? Something like this:

import random
import hashlib
import sys

sys.stdout.write("Content-Type: text/html\r\n\r\n")

m = hashlib.md5()
m.update(str(random.random()))
print m.hexdigest()
print

The problem is a bug in Apache.

Apache throws this particular kind of error when the receiving script does not consume all of the POST request.

Apache developers consider this to be an "As-designed" design.

The fix is to have something like this as soon as possible:

workaround = cgi.FieldStorage()

I got this error when I had failed to completely read the previous response, e.g.:

# This is using an opener from urllib2, but I am guessing similar...
response1 = opener.open(url1)
for line in response1:
    m = re.match("href='(.*)'", line):
    if m:
        url2 = m.group(1) # Grab the URL from line, that's all I want.
        break             # Oops.  Apache is mad because I suck.

response2 = opener.open(url2)
for line in response2:
    print line

The server gave me "200 OK" on the first request, followed by the data up to the link I was looking for, then waited five minutes on the second open, then gave me "200 OK" on the second request, followed by all the data for the second request, then gave me IncompleteRead on the first request!

I am reading between the lines that the Paul's original script logged into two sites and got the problem on the second site.

I can see how reading two pages in parallel might be a nice feature. So what can I do to gracefully tell the server "No more, thanks?" I solved this by reading through and ignoring the rest of the first request (only 200K in this case).

If I were allowed to comment rather than answer, I'd ask Paul Nathan,

What is

workaround = cgi.FieldStorage()

, what do you mean by as soon as possible, and how does it help here? Have pity on a beginner.

I'm guessing the original poster was actually running the request twice, succeeding the first time and failing on the second.

I got IncompleteRead (from Apache) when I had failed to completely read the previous response, e.g.:

# This is using an opener from urllib2, but I am guessing similar...
response1 = opener.open(url1)
for line in response1:
    m = re.match("href='(.*)'", line):
    if m:
        url2 = m.group(1) # Grab the URL from line, that's all I want.
        break             # Oops.  Apache is mad because I suck.

response2 = opener.open(url2)
for line in response2:
    print line

I can imagine wanting to have two responses open simultaneously for reading. So the question is, how do I finish with a response? Do I have to read all the data even though I don't need it? No, (urllib.urlopen documentation) the response is like a file, just close it, so for my example,

for line in response1:
    m = re.match("href='(.*)'", line):
    if m:
        url2 = m.group(1) # Grab the URL from line, that's all I want.
        break

response1.close()
response2 = opener.open(url2)
...

继续阅读：python

httplib: incomplete read

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？