NekoHTML, TagSoup, and JTidy will allow you to parse HTML and then process with XML tools, like XPath.
More Related Contents:
- Remove HTML tags from a String
- What are the pros and cons of the leading Java HTML parsers? [closed]
- How can I efficiently parse HTML with Java?
- Java HTML Parsing [closed]
- HTML/XML Parser for Java [closed]
- Reading HTML file to DOM tree using Java
- Parse JSON from HttpURLConnection object
- Text Extraction from HTML Java
- Simplest way to correctly load html from web page into a string in Java
- How can I parse a HTML string in Java?
- What is the best way to parse html in C#? [closed]
- What is the recommended way to escape HTML symbols in plain Java?
- How to parse a mathematical expression given as a string and return a number? [duplicate]
- SQL parser library for Java [closed]
- Inconsistent performance applying ForegroundActions in a JEditorPane when reading HTML
- What is the easiest way to parse an INI file in Java?
- Convert HTML Character Back to Text Using Java Standard Library
- How to convert/parse from String to char in java?
- Stripping HTML tags in Java [duplicate]
- Scanner only reads file name and nothing else
- Is there a Java XML API that can parse a document without resolving character entities?
- Fast CSV parsing
- JSON parsing to Java – Android application
- Parsing nested JSON data using GSON
- How do I parse EDIFACT in Java? [closed]
- How do I return a video with Spring MVC so that it can be navigated using the html5 tag?
- Advanced PDF parser for Java
- Using gson to deserialize specific JSON field of an object
- Dynamic column cell width
- setting JTextPane to content type HTML and using string builders