Why doesn't XPath work when processing an XHTML document with lxml (in Python)?

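The problem is namespaces. When the document is parsed as XML, the img element is in the http://www.w3.org/1999/xhtml namespace, because that is the default namespace in effect for the element. An unprefixed name in an XPath 1.0 expression always means "no namespace", so a query for img asks for an img element in no namespace and matches nothing.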

Bind a prefix to the XHTML namespace and use that prefix in the query:

>>> tree.getroot().xpath(
...     "//xhtml:img",
...     namespaces={'xhtml': 'http://www.w3.org/1999/xhtml'}
... )
[<Element {http://www.w3.org/1999/xhtml}img at 11a29e0>]
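For a version that can be run end to end, here is a minimal sketch. The sample document, the file name logo.png and the variable names are assumptions made for illustration, since the original XHTML file is not shown. It compares the unprefixed query, the prefixed query from above, and a namespace-agnostic alternative based on local-name().

```python
from io import BytesIO

from lxml import etree

# Hypothetical sample document; the XHTML file from the question is not shown.
XHTML = b"""<?xml version="1.0" encoding="UTF-8"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Sample</title></head>
  <body><p><img src="logo.png" alt="logo"/></p></body>
</html>"""

XHTML_NS = "http://www.w3.org/1999/xhtml"

tree = etree.parse(BytesIO(XHTML))
root = tree.getroot()

# Unprefixed name: matches only elements in no namespace, so nothing is found.
no_ns = root.xpath("//img")

# Prefix bound to the XHTML namespace: finds the element.
with_ns = root.xpath("//xhtml:img", namespaces={"xhtml": XHTML_NS})

# Alternative that ignores the namespace and compares only the local name.
by_local_name = root.xpath("//*[local-name() = 'img']")

print(len(no_ns), len(with_ns), len(by_local_name))  # 0 1 1
print(with_ns[0].tag)  # {http://www.w3.org/1999/xhtml}img
```

The prefix itself is arbitrary. Only the namespace URI it is bound to has to match the one declared in the document. The local-name() form is handy for quick exploration, but it also matches img elements from any other namespace.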
