JTidy, either by processing the stream to XHTML then using your favourite DOM implementation to re-parse, or using parseDOM if the limited DOM imp that gives you is enough.
Alternatively Neko.
More Related Contents:
- HTML/XML Parser for Java [closed]
- Remove HTML tags from a String
- What are the pros and cons of the leading Java HTML parsers? [closed]
- How can I efficiently parse HTML with Java?
- Java HTML Parsing [closed]
- What HTML parsing libraries do you recommend in Java [closed]
- Parse JSON from HttpURLConnection object
- Simplest way to correctly load html from web page into a string in Java
- How can I parse a HTML string in Java?
- How to convert jsonString to JSONObject in Java
- Which is the best library for XML parsing in java [closed]
- Parsing query strings on Android
- Best way to compare 2 XML documents in Java
- Safe String to BigDecimal conversion
- Regex date format validation on Java
- How to format LocalDate object to MM/dd/yyyy and have format persist
- Java XML Parser for huge files
- JSP – What is wrong with scriptlets, and what to use instead [duplicate]
- Is there a built in package to parse html into dom?
- Parse CSV with double quote in some cases
- How can I parse this JSON in Android?
- ANDROID: email client receiver email id empty in android-parse
- How can I parse a String to BigDecimal? [duplicate]
- Text Extraction from HTML Java
- How do I put html in a JLabel in java?
- Parse XML TO JAVA POJO in efficient way
- Library and examples of parsing multipart/form-data from inputstream
- Using gson to deserialize specific JSON field of an object
- Dynamic column cell width
- setting JTextPane to content type HTML and using string builders