[Web4lib] Which databases can Google Scholar crawl?

Bill Drew dreww at tc3.edu
Tue Feb 19 21:02:01 EST 2008


Noncommittal answer if I ever heard one.  How hard would it be to just
say here is a list of database providers who let us crawl their sites? 
That is a simple answer.
-----------------------------------------
Wilfred (Bill) Drew
Interim Library Director
Librarian, Systems and Tech Services
Tompkins Cortland Community College  (TC3) Library:
http://www.tc3.edu/library/
Dryden, N.Y. 13053-0139
E-mail: dreww at tc3.edu
Phone: 607-844-8222 ext.4406
AOL Instant Messenger:BillDrew4
StrengthsQuest: Ideation, Input, Learner, Activator, Communication
>>> Corey Murata <murata at u.washington.edu> 02/19/08 6:03 PM >>>
Here's Acharya's answer from a 2006 interview 
(http://www.google.com/librariancenter/articles/0612_01.html):

*************
TH: Why don't you provide a list of journals and/or publishers included 
in Google Scholar? Without such information, it's hard for librarians to

provide guidance to users about how or when to use Google Scholar.

AA: Since we automatically extract citations from articles, we cover a 
wide range of journals and publishers, including even articles that are 
not yet online. While this approach allows us to include popular 
articles from all sources, it makes it difficult to create a succinct 
description of coverage. For example, while we include Einstein's 
articles from 1905 (the "miracle year" in which he published seminal 
articles on special relativity, matter and energy equivalence, Brownian 
motion and the photoelectric effect), we don't yet include all articles 
published in that year.

That said, I'm not quite sure that a coverage description, if available,

would help provide guidance about how or when to use Google Scholar. In 
general, this is hard to do when considering large search indices with 
broad coverage. For example, the notes and comparisons I have seen about

other large scholarly search indices (for which detailed coverage 
information is already available) provide little guidance about when to 
use each of them, and instead recommend searching all of them.
**********

Cm
-- 
Corey Murata
Collection Assessment Projects Librarian
Business Computer-based Services Librarian
University of Washington
Box 353224
Seattle, WA 98195
(206) 543-4360
murata at u.washington.edu


Roy Tennant wrote:
> Yes. I have personally and directly asked Anurag Acharya and got
nowhere. He
> suggested that we try to determine coverage by throwing searches at
it.
> Roy
>
>
> On 2/19/08 2:32 PM, "Bill Drew" <dreww at tc3.edu> wrote:
>
>   
>> Has anyone approached Google asking for this information?
>>
>> Bill Drew
>> _______________________________________________
>> Web4lib mailing list
>> Web4lib at webjunction.org
>> http://lists.webjunction.org/web4lib/
>>     
>
>   
_______________________________________________
Web4lib mailing list
Web4lib at webjunction.org
http://lists.webjunction.org/web4lib/



More information about the Web4lib mailing list