[WEB4LIB] Re: Google indexing the invisible Web
Avi Rappoport
avirr at LanMinds.Com
Wed Feb 7 13:18:45 EST 2001
Not only are they indexing PDFs (sometimes), they're also indexing some
dynamic pages with a ? in the URL.
I think this is a good thing -- there is really valuable information
in those documents that is missing when you just search simple static
HTML pages.
Avi
At 7:39 AM -0800 2/7/01, Walt_Crawford at notes.rlg.org wrote:
>On a wildly different topic: has anyone else run into Google's new and
>slightly peculiar feature--namely, indexing PDFs?
>
>I encountered it accidentally, doing a vanity search (for "Cites &
>Insights," because it took Google so long to show it--at least 6-8 weeks
>after it was introduced). Suddenly, the current issue turns up, with a
>portion of the text--and a click on a "text version" icon yields...well, a
>pretty poor rendition, although most of the text is there somewhere.
>
>This appears to be an effort to "index the invisible Web." I'm not entirely
>thrilled about the idea (but not yet ready to set spider repellers). Is
>this going to improve retrieval in general?
--
________________________________________________________________
Avi Rappoport, Search Tools Maven: <mailto:avirr at lanminds.com>
Guide to Site, Intranet, and Portal Search Engines:
<http://www.searchtools.com>