[WEB4LIB] Re: Even stranger AltaVista result

Charles P. Hobbs transit at primenet.com
Wed Nov 17 12:02:37 EST 1999


On Wed, 17 Nov 1999, Tara Calishain wrote:

> 
> >One more question - how does AltaVista decide which language?
> >Is there some sort of automatic analysis and comparison to
> >frequently used words on the site from some sort of word list?
> >This was asked from one of the participants - at the time I just
> >wanted to step down and laugh hysterically.
> 
> I asked AltaVista this last April. Bear in mind that they might have changed
> their method, but last April they said that when their spiders index a page, it
> sets a language tag on it as soon as it encounters a sufficient number of words
> in a given language.

It might default to English, until a sufficient number of foreign language
words have been read. Often, I've seen multi-lingual pages, or pages in
languages that AV doesn't know (for example, Arabic), reported as English.
. .



More information about the Web4lib mailing list