ARCHIVED: When I use Microsoft Word to make web pages, why is the resulting code so long?

This content has been archived, and is no longer maintained by Indiana University. Information here may no longer be accurate, and links may no longer be available or reliable.

Microsoft Word's Save as Web Page option does not output pure HTML, but instead adds special tags designed for Microsoft Office, as well as XML (Extensible Markup Language) and CSS (Cascading Style Sheet) tags. These can almost double the length of your document.

In Word for Windows, Microsoft provides an HTML filter to strip Office-specific code from HTML documents, creating much cleaner coding. To use it, from the File menu (most versions of Word) or Office Button menu (Word 2007 only), select Save As... to save your document. Then, under "Save as type:", select Web Page, Filtered.

After you have installed the filter, whenever you use Word to convert your file to HTML, use Export to HTML instead of Save as Web Page. You can also use Export to HTML to convert any existing files containing Office-specific code you wish to clean up.

This is document aiai in the Knowledge Base.
Last modified on 2018-01-18 12:37:15.