[WEB4LIB] Cleaning up WORD HTML

Edward Spodick, HKUST Library, 2358-6743 lbspodic at ust.hk
Tue Feb 5 21:43:22 EST 2002


At 5:46 PM -0800 5/2/2002 [their time], Drew, Bill wrote:
>Any suggestions for cleaning up Word documents saved as HTML?  I use the
>Commands within Dreamweaver but the code still looks very dirty but better
>than it was.  Any suggestions other than getting them to use something else?
>We are mounting handouts written originally in Word onto our web.

I have heard several people say that the HTMLTidy program works pretty well.  I have not used it, but it sounds like it's worth trying.  Its' also available for lots of platforms, and is free.  The original web site (http://www.w3.org/People/Raggett/tidy/) says "Tidy can now perform wonders on HTML saved from Microsoft Word 2000! Word bulks out HTML files with stuff for round-tripping presentation between HTML and Word. If you are more concerned about using HTML on the Web, check out Tidy's "Word-2000" config option! Of course Tidy does a good job on Word'97 files as well!"

Current web site:  http://tidy.sourceforge.net/

-Spode

- - - - -
Edward F Spodick, Systems Librarian - lbspodic at ust.hk
Hong Kong University of Science & Technology Library
tel:  852-2358-6743     fax:  852-2358-1043



More information about the Web4lib mailing list