[Web4lib] Vendors Deep Indexed in Google
Binkley, Peter
Peter.Binkley at ualberta.ca
Fri Dec 21 18:33:29 EST 2007
It would be interesting to pull a list of target domains of interest out
of our link resolvers, google them using the site: prefix, and tabulate
the number of hits week by week. You could also search separately for
pdfs using the file format option. If you saw a sudden increase for a
particular domain week over week, you'd at least know that new stuff had
been added. For targets that structure their urls the right way, you
could even search for particular issns in the url using the
"Occurrences" field. In this way you could start to build a map of the
coverage of the main scholarly resources, and then refine it by manual
searching.
At least, you could do all this until Google noticed and blocked you.
Let's let Roy do it: not even Google would mess with OCLC!
Happy solstice, all.
Peter
-----Original Message-----
From: web4lib-bounces at webjunction.org
[mailto:web4lib-bounces at webjunction.org] On Behalf Of Roy Tennant
Sent: Friday, December 21, 2007 3:54 PM
To: web4lib at webjunction.org
Subject: Re: [Web4lib] Vendors Deep Indexed in Google
In the past, repeated requests to know the breadth and depth of Google's
indexing of scholarly content have fallen on deaf ears. They have
suggested that the way to find out is by running experiments (basically
throwing spaghetti against a wall to see what sticks) in order to
determine this for ourselves. If anyone has heard any different, I would
love to know about it.
Roy
On 12/21/07 2:41 PM, "Sara Amato" <samato at bowdoin.edu> wrote:
> Is anyone keeping a list that they would be willing to share of
library
> database and index vendors who are deep indexed in Google? E.g. it
> appears most if not all of JSTOR is indexed in google (just limit a
> search to site:jstor.org), and I'm wondering what other vendors are
> out there.
> _______________________________________________
> Web4lib mailing list
> Web4lib at webjunction.org
> http://lists.webjunction.org/web4lib/
--
_______________________________________________
Web4lib mailing list
Web4lib at webjunction.org
http://lists.webjunction.org/web4lib/
More information about the Web4lib
mailing list