[WEB4LIB] Re: another tangent to Re: Inline forms in CSS
Vicki Falkland
library at cryptic.rch.unimelb.edu.au
Sun Mar 3 17:35:02 EST 2002
thanks for all the responses to this.
i really thought Atomz was the way to go until i saw that comment about the
<noindex> thing. obviously i cannot use it now. i'll be taking a look at
swish-e instead.
when my boss asks me "why the change?" i'll tell her "cos the WEB4LIBbers
told me its a bad bad thing" :)
many thanks,
Vicki
At 06:54 AM 28/02/02 -0800, you wrote:
>At 05:20 PM 2/27/2002, Vicki Falkland wrote:
>> >
>> >[Second things second: Who invented <NOINDEX>...</NOINDEX>
>> >elements? Proprietary/made-up stuff like that gets more and more
likely to
>> >screw things up as browsers start expecting you to abide by your doctype
>> >declaration.]
>> >
>>
>>I am in the process of implementing a search feature on our site using
>>Atomz (www.atomz.com)
>>While testing, I noticed that if I searched on a word which happened to be
>>used in various bits of navigation text, the search results listed every
>>page in the site and the descriptive text was simply a rehash of the
>>navigation text.
>>Their help files suggest using <noindex></noindex> for any portions of text
>>I may wish to exclude from being indexed (like navigation!) to correct this
>>problem.
>
>HTML is not just an arbitrary bunch of tags, to which vendors can add their
>own creations willy-nilly. It is a standard derived by a consensus of the
>W3C membership. Anyone writing software that does something with HTML can
>look at the standard; they won't look at the Atomz help documents.
>
>By adding a bogus NOINDEX element, you break any program that looks for
>valid markup. That includes, obviously, validators, which will never pass
>your pages--so you may not be able to use them to see what else is wrong
>with them.
>
>It also increases the risk that newer browsers, some of which take your
>doctype declaration seriously, will choke to some extent on seeing this
>unknown element. Will that cause problems with HTML and/or CSS
>rendering? Answer: you can't know for sure, so stick to the spec.
>
>And the most likely problem: an HTML editor, upon opening your page, may
>discard the bogus elements it finds, perhaps without warning you, so that
>when you save it again there will be changes you're not aware of.
>
>It's a pity the Atomz didn't take the obvious step of delimiting
>non-indexed parts of the document with comments. It seems obvious to me
>that you could look for "<!-- atomz indexing off -->...<!-- atomz indexing
>on -->" without affecting the validity of the document. If they have
>responsive developers, you might suggest something like that.
>
>
>If you were running a search engine on your own server that looked at the
>source HTML files rather than getting them through your server, you could
>use server-side includes for the navigation bars. Then they wouldn't even
>be in the files that get indexed.
>
>
>Thomas Dowling
>OhioLINK - Ohio Library and Information Network
>tdowling at ohiolink.edu
>
>
>
More information about the Web4lib
mailing list