Site Search Tool

Mark Vega vegamf at UCI.EDU
Mon May 12 11:56:13 EDT 2014

We are in the process of dropping our Google CSEs and building our own search tool using the free, open-source Apache Nutch and Solr modules (Nutch for crawling, Solr for indexing the crawled content), overlaid with a PHP search interface. We are using them to crawl and index all of our websites and databases and to provide a unified search across all sources from a single search box. We've only just started our first public beta, and I expect to be making adjustments for at least the next six months to a year to get the search tool we want, but we've been dissatisfied with the Google CSE for some time and were not willing to pay for the Google Search Appliance. Once you learn how to configure and use these two modules, they are a powerful combination. Be advised, though, that while the basics are pretty simple, there is a very steep learning curve to tuning the crawling, indexing and searching so that the results come out exactly as you want and as your users expect.
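
To give a rough idea of what the search-interface layer does, here is a minimal sketch in Python (used only for illustration; our interface is PHP). It builds a request against Solr's standard /select handler using the eDisMax query parser and pulls title/URL pairs out of the JSON response. The base URL, the core name "collection1", the field names "title", "content" and "url", and the boost value are all assumptions for the example, not our actual configuration.

```python
from urllib.parse import urlencode


def build_search_url(user_query,
                     solr_base="http://localhost:8983/solr/collection1",
                     rows=10):
    """Build a Solr /select URL for a single-search-box query.

    solr_base, the field names and the boost are hypothetical; adjust
    them to match your own Solr core and schema.
    """
    params = {
        "q": user_query,
        "defType": "edismax",       # Extended DisMax query parser
        "qf": "title^3 content",    # search both fields, weight title higher
        "fl": "url,title",          # fields to return for each hit
        "rows": rows,               # number of results per page
        "wt": "json",               # ask for a JSON response
    }
    return f"{solr_base}/select?{urlencode(params)}"


def extract_hits(solr_response):
    """Return (title, url) pairs from a parsed Solr JSON response."""
    docs = solr_response.get("response", {}).get("docs", [])
    return [(d.get("title", ""), d.get("url", "")) for d in docs]


if __name__ == "__main__":
    print(build_search_url("library hours"))
```

Most of the tuning effort mentioned above ends up in parameters like "qf" and its boosts, plus the crawl and schema settings behind them, rather than in the interface code itself.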
Mark Vega
University of California, Irvine Libraries - Web Services

