[Web4lib] Re: Rollyo

Patricia F Anderson pfa at umich.edu
Tue Jul 25 15:05:34 EDT 2006


I agree completely about limiting to subdirectories. That has been a major 
problem for the rollyo searches I've constructed, and the greatest portion 
of the poor searches or irrelevant hits can be attributed to that factor.

  -- Patricia Anderson, pfa at umich.edu

On Tue, 25 Jul 2006, Heather Christenson wrote:

> I think Rollyo is a great, simple web-based alternative to running your
> own crawler.  However, compared to what you might be able to do with
> your own crawler, the results aren't great.   Rollyo would be much more
> powerful if you could limit or filter to specific directories within a
> given URL.   Also, there are a few idiosyncrasies that arise because the
> underlying crawler (Yahoo) is broad and aiming to get to as many sites
> as possible, rather than getting as deep into specific sites as
> possible.  The crawl may or may not have gotten all the way to every
> page underneath your chosen URLs, so the corpus that is searched within
> your roll may be a subset of what you intended.   Also, I believe the
> results ranking shows how your results rank in the broader context, not
> just against the smaller group within your roll.   These problems come
> up with the Google API as well.
>
>
>
> All that being said, Rollyo is so easy to use and share (given the
> challenges of running your own crawl -- which I haven't even begun to
> enumerate!), living with the results isn't too hard
>
> --Heather
>
>
>
> __________________________
>
>
>
> Heather Christenson
>
> California Digital Library
>
> University of California
>
> 415 20th St, 4th Floor
>
> Oakland, CA  94612-3550
>
> phone: (510) 987-0525
>
> fax: (510) 893-5212
>
>
>
> _______________________________________________
> Web4lib mailing list
> Web4lib at webjunction.org
> http://lists.webjunction.org/web4lib/
>
>
>


More information about the Web4lib mailing list