lycos relevancy scoring - question

vaughnk1 at westatpo.westat.com vaughnk1 at westatpo.westat.com
Wed May 15 09:15:06 EDT 1996


Barb

I've run into such inconsistencies before at Lycos.  I wrote to 
webmaster at lycos.com, and they answered pretty quickly.  Here is the reply I
got, which contains enough of my message to make sense of it.  It _may_ apply to
your term "music", but at least it will hint at some of the weaknesses in search 
engines that have yet to be ironed out.
Forgive the formatting of the message.  Too many mailers, too little time.


~~~~~~~~~~~FROM LYCOS WEBMASTER~~~~~~~~~~~~~~~~~~~~

At 02:17 PM 4/2/96 -0800, you wrote:
>1.  SEARCH:  "westat research"
>retrieves only 1 irrelevant document.
>2.  SURF TO:  www.westat.com and you will find "westat" and "research" both
>used in plain text near the top of the page (twice).
>3.  Now search Lycos:  "westat."  and www.westat.com comes up along with child
>pages.  URL #1 is www.westat.com mentioned above.
>  Westat is fairly unconcerned about publicity but I am a librarian and Lycos
>has been my #1 tool.  How is Lycos' engine doing this?

I was pretty curious about this myself, so I had a long talk with our catalog 
developer about it.

His best guess is that, due to the size of the catalog, "common" words can 
sometimes be underrepresented.  So even though the word "research" does appear 
in your file, it might not be searchable because so many files contain the word 
"research".  But few files contain the word "westat", so a search just on 
"westat" should always lead you to your page.

This is a problem we're working on.  It might have something to do
with size limits buried deep in the search algorithm.  We apologize for any 
inconvenience.


**  Laurie D. T. Mann    ****    Lycos Webmaster Triage  **
Links to Lycos Services - just copy these links to your pages:
<a href="http://www.lycos.com">Lycos catalog</a><br>
<a href="http://a2z.lycos.com">a2z directory</a><br>
<a href="http://www.pointcom.com">Point reviews</a><br>

~~~~~~~~~~~~~~END~~~~~~~~~~
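
As a rough illustration of the webmaster's guess above, here is a toy inverted
index with a per-word size limit.  It is only a sketch under stated assumptions,
not Lycos's actual algorithm: the reply says no more than that size limits
"might" be involved.  The names build_index, search_any and max_postings, and
the a.example and b.example pages, are hypothetical.

```python
from collections import defaultdict


def build_index(pages, max_postings):
    """Build an inverted index that keeps at most max_postings pages per word.

    ASSUMPTION: a hard per-word cap is one possible form of "size limit";
    the real Lycos catalog may work quite differently.
    """
    index = defaultdict(list)
    for url, text in pages.items():
        for word in set(text.lower().split()):
            if len(index[word]) < max_postings:
                index[word].append(url)
            # Otherwise the word is silently dropped for this page.
    return index


def search_any(index, query):
    """Match-any search: count how many query terms the index links to each page."""
    counts = defaultdict(int)
    for term in query.lower().split():
        for url in index.get(term, []):
            counts[url] += 1
    # Highest term count first, then by URL for a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical catalog: "research" is common, "westat" is rare.
pages = {
    "a.example/lab": "research lab news",
    "b.example/uni": "university research grants",
    "www.westat.com": "westat research services",
}

capped = build_index(pages, max_postings=2)
print(search_any(capped, "westat"))
# [('www.westat.com', 1)]
print(search_any(capped, "westat research"))
# [('a.example/lab', 1), ('b.example/uni', 1), ('www.westat.com', 1)]
```

In this toy catalog the Westat page contains both words, yet it scores only
1 of 2 because "research" had already reached its limit before that page was
indexed.  With a larger limit (say max_postings=10) the same query puts
www.westat.com first with 2 of 2.  The same effect could explain a page that
visibly contains all three query words but is reported as "2 of 3 found".
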
______________________________ Reply Separator _________________________________
Subject: lycos relevancy scoring - question
Author:  BARB at DAYTON.LIB.OH.US at internet-e-mail
Date:    5/14/96 4:57 PM



 ... I did a search for old time music in Lycos in its default 
form - (match any term, display 10, loose match, standard 
output).  The first four have in their scoring 3 of 3 found.  
After that, all the hits have 2 of 3 found.  However, in hits 
21-30, there are some hits where all three words are noticeably 
present, even as far as being adjacent in the title.  

Can anyone help me with why this occurs?  I was wondering if a clue 
was in "Words Matched on Page:" that shows up in the detailed 
display.

I have looked in "Web Search Strategies" (book) and in the web4lib 
archive and didn't find an answer.  Thanks in advance for any 
information about these search results.

Barb

Barbara Kuhns
Dayton & Montgomery County Public Library 
Dayton, OH
barb at dayton.lib.oh.us

   * Kenneth Vaughn ~~~~~~~~~~  LIBRARIAN K  ~~~~~~~~~~ Tech Srvcs Libr *
   *       Westat, Inc. 1650 Research Blvd. Rockville MD 20850          *
   * vaughnk1 at westat.com  =  voice: 301-294-2881  =  fax: 301-294-2034  *
