Jsoup 1.2.3 processes HTML 5

Aug 04, 2010

Jsoup, a free Java library for processing HTML, is available in version 1.2.3 with enhanced HTML 5 support.

Although the parser has always implicitly accepted HTML 5 tags, it now knows the element definitions of the new standard. The tool can also generate an HTML 5 standards-compliant parse tree of a page for further processing.
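As a rough illustration of working with such a parse tree, the following minimal sketch parses a string containing HTML 5 elements and reads a node back out. It assumes a current jsoup release on the classpath, whose API may differ in detail from version 1.2.3; the class name Html5ParseSketch and the helper firstText are made up for this example.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class Html5ParseSketch {

    // Hypothetical helper: parses the markup and returns the text of the
    // first element matching the CSS query, or "" if nothing matches.
    static String firstText(String html, String cssQuery) {
        Document doc = Jsoup.parse(html);              // build the parse tree
        Element match = doc.select(cssQuery).first();  // null when no match
        return match == null ? "" : match.text();
    }

    public static void main(String[] args) {
        String html = "<article><header><h1>News</h1></header>"
                    + "<p>Body text</p></article>";
        // HTML 5 elements such as article and header become ordinary tree nodes.
        System.out.println(firstText(html, "article > header > h1"));
    }
}
```

The same Document object can be traversed or modified further; the selector is only one way to reach a node.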

The second important new feature is that Jsoup now automatically detects the character set of a fetched document and decodes the input before parsing. There are also new selectors as well as small fixes and improvements.
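The character set detection might be exercised roughly as follows. This is a sketch against a current jsoup release, not necessarily the 1.2.3 API: passing null as the charset name to Jsoup.parse(InputStream, String, String) asks the library to look for a charset declaration in the document itself. The class name CharsetSketch and the helper titleOf are assumptions made for this example.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

public class CharsetSketch {

    // Hypothetical helper: lets jsoup pick the charset (null charset name)
    // and returns the decoded document title.
    static String titleOf(byte[] rawHtml) {
        try {
            Document doc = Jsoup.parse(new ByteArrayInputStream(rawHtml), null, "");
            return doc.title();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Bytes encoded as ISO-8859-1; the meta tag declares the encoding.
        byte[] latin1 = ("<html><head><meta charset=\"ISO-8859-1\">"
                + "<title>Caf\u00e9</title></head><body></body></html>")
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(titleOf(latin1));
    }
}
```

If the document carries no charset declaration, current jsoup versions fall back to UTF-8, so explicitly supplying a charset name remains the safer choice when the encoding is known in advance.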

Jsoup runs on Java 1.5 and is released under the MIT/X license. The Jsoup homepage offers JAR files for download, cookbook-style instructions, and the API reference.
