Jsoup 1.2.3 processes HTML 5
Jsoup, a free Java library for processing HTML, is available in version 1.2.3 with enhanced HTML 5 support.
Jsoup, a free Java library for processing HTML, is available in version 1.2.3 with enhanced HTML 5 support.
As the parser has always implicitly supported HTML 5 tags, it now knows element definitions of the new standards. The tool can also generate an HTML-5-standards compliant page parse tree for further processing.
The second important innovation in Jsoup automatically detects the character set of a scanned document and decodes the input before parsing. There are also new selectors as well as small fixes and improvements.
Jsoup runs on Java version 1.5 and is under MIT / X license. On the Jsoup homepage there are Jar files for download and instructions in the Cookbook-style and the API reference.