Tip: Convert from HTML to XML with HTML Tidy
By Benoit Marchal2003-12-16
Conclusion
HTML Tidy is one of those neat little utilities that all webmasters should have in their toolbox. It is particularly helpful for XML/XSL webmasters because it can output XHTML. Any other vocabulary is only a stylesheet away.
Resources:
- Download the source code used in this article, including the author's photo gallery.
- Download HTML Tidy from
SourceForge. It runs on Windows, Linux, MacOS, and other platforms. Graphical interfaces and a library (useful for embedding in a workflow) are available on the same site. A Java-language version, JTidy, is also available. - Visit Dave Raggett's
Web site, the original home of HTML Tidy, for a wealth of information. - Read Fundamentals of Web publishing with XML (developerWorks, July 2003) by Benoît Marchal for step-by-step instructions on Web publishing with XML and XSL.
- Use stylesheets to publish online galleries with Divide and conquer large XML documents (developerWorks, June 2003), also by the author.
- Learn more about XHTML on the W3C's HTML home page.
- For more insights into XHTML, read The Web's future: XHTML 2.0 (developerWorks, September 2002) by Nicolas Chase.
- Find more XML resources on the developerWorks
XML zone. For a complete list of XML tips to date, check out the
tips summary page. - IBM's DB2 database provides not only relational database storage, but also XML-related tools such as the DB2 XML Extender , which provides a bridge between XML and relational systems. Visit the DB2 Developer Domain to learn more about DB2.
- Find out how you can become an IBM Certified Developer in XML and related technologies.
Tutorial Pages:
» Preserve Legacy Web Sites With This Handy Utility
» Tool Of The Trade
» Listing 1. index.html (an excerpt)
» Tidying Up
» Listing 2. index.xml (an excerpt)
» Further Processing
» Listing 3. index-transform.xml (an excerpt)
» Listing 4. cleanup.xsl
» Conclusion
First published by IBM developerWorks
