Parsing XML file containing HTML entities in Java without changing the XML
I would use a library like Jsoup for this purpose. I tested the following below and it works. I don’t know if this helps. It can be located here: http://jsoup.org/download public static void main(String args[]){ String html = “<?xml version=\”1.0\” encoding=\”UTF-8\”?><foo>” + “<bar>Some text — invalid!</bar></foo>”; Document doc = Jsoup.parse(html, “”, Parser.xmlParser()); for (Element e : … Read more