Fwd: Re: Fwd: [WEB4LIB] "spiderability" problem

Nick Arnett listbot at mccmedia.com
Thu May 6 14:22:23 EDT 1999


 From the robot wizards... a bit more detail.

>Date:         Thu, 6 May 1999 12:58:34 -0500
>Reply-To:     "Michael A. Grady" <m-grady at uiuc.edu>
>Sender:       robots at MCCMEDIA.COM
>From:         "Michael A. Grady" <m-grady at UIUC.EDU>
>Subject:      Re: Fwd: [WEB4LIB] "spiderability" problem
>Comments: To: robots at MCCMEDIA.COM
>To:           robots at MCCMEDIA.COM
>
>Well, if you 'telnet aabc.bc.ca 80', and then, once the connection is
>established, you enter 'GET / HTTP/1.0', you'll note several problems
>with what their web server returns:
>
>1) the server should WAIT to return a page until I enter a null line,
>    so that I have the option of entering additional info such as
>    HOST: etc., but it does not -- it returns the page immediately
>
>    However, I'm sure the robot doesn't care too much about that.
>
>2) the ONLY header the server returns is 'HTTP/1.0 200 OK'. It does
>    not return a 'Content-Type:', 'Content-Length:', etc.
>
>    I could very well see a robot complaining about not getting one
>    or more of these headers returned.



More information about the Web4lib mailing list