[WEB4LIB] Cleaning up WORD HTML
Edward Spodick, HKUST Library, 2358-6743
lbspodic at ust.hk
Tue Feb 5 21:43:22 EST 2002
At 5:46 PM -0800 5/2/2002 [their time], Drew, Bill wrote:
>Any suggestions for cleaning up Word documents saved as HTML? I use the
>Commands within Dreamweaver but the code still looks very dirty but better
>than it was. Any suggestions other than getting them to use something else?
>We are mounting handouts written originally in Word onto our web.
I have heard several people say that the HTMLTidy program works pretty well. I have not used it myself, but it sounds worth trying. It's also free and available for lots of platforms. The original web site (http://www.w3.org/People/Raggett/tidy/) says: "Tidy can now perform wonders on HTML saved from Microsoft Word 2000! Word bulks out HTML files with stuff for round-tripping presentation between HTML and Word. If you are more concerned about using HTML on the Web, check out Tidy's 'Word-2000' config option! Of course Tidy does a good job on Word'97 files as well!"
Current web site: http://tidy.sourceforge.net/
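
For anyone who wants to try the "Word-2000" option on a batch of handouts, here is a minimal sketch in Python that shells out to the Tidy command line. It assumes a reasonably recent tidy build is installed and on the PATH, and that it accepts the -quiet and -output flags and the word-2000 and clean config options; the function names and file names are hypothetical and only for illustration. I have not tuned the options against real Word output, so treat it as a starting point.

```python
import shutil
import subprocess


def build_tidy_command(src, dest, executable="tidy"):
    """Return the argument list for one Tidy run over a Word-exported file.

    Assumption: the installed tidy accepts these flags and config options.
    """
    return [
        executable,
        "-quiet",              # suppress the banner and summary text
        "--word-2000", "yes",  # strip the extra markup Word 2000 adds
        "--clean", "yes",      # replace presentational tags with style rules
        "-output", str(dest),  # write the cleaned file here
        str(src),
    ]


def clean_word_html(src, dest, executable="tidy"):
    """Run Tidy on src and write the result to dest (hypothetical helper).

    Tidy exits with 0 for success, 1 for warnings and 2 for errors, so
    only an exit status of 2 or more is treated as a failure.
    """
    if shutil.which(executable) is None:
        raise FileNotFoundError(f"{executable} was not found on the PATH")
    result = subprocess.run(
        build_tidy_command(src, dest, executable),
        capture_output=True,
        text=True,
    )
    if result.returncode >= 2:
        raise RuntimeError(f"tidy reported errors:\n{result.stderr}")
    return result.returncode
```

Typical use would be something like clean_word_html("handout.html", "handout-clean.html"), run once per exported file. Warnings (exit status 1) are common with Word output and do not stop the cleaned file from being written.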
-Spode
- - - - -
Edward F Spodick, Systems Librarian - lbspodic at ust.hk
Hong Kong University of Science & Technology Library
tel: 852-2358-6743 fax: 852-2358-1043