matching unicode characters in python regular expressions

You need to specify the re.UNICODE flag, and input your string as a Unicode string by using the u prefix:

>>> re.match(r'^/by_tag/(?P<tag>\w+)/(?P<filename>(\w|[.,!#%{}()@])+)$', u'/by_tag/påske/øyfjell.jpg', re.UNICODE).groupdict()
{'tag': u'p\xe5ske', 'filename': u'\xf8yfjell.jpg'}

This is in Python 2; in Python 3 you must leave out the u because all strings are Unicode, and you can leave off the re.UNICODE flag.

More Related Contents:

Python regex matching Unicode properties
Matching only a unicode letter in Python re
Regex and unicode
Python and regular expression with Unicode
How to fetch a non-ascii url with urlopen?
remove unicode emoji using re in python
How do I specify a range of unicode characters
python-re: How do I match an alpha character
Python unicode regular expression matching failing with some unicode characters -bug or mistake?
Match any unicode letter?
Remove non-ASCII characters from a string using python / django
Searching for IP addresses in a file
What is the best way to remove accents (normalize) in a Python unicode string?
Escaping regex string
Fast punctuation removal with pandas
How do you validate a URL with a regular expression in Python?
Removing unicode \u2026 like characters in a string in python2.7 [duplicate]
Python __str__ versus __unicode__
Regex matching between two strings?
Find USA phone numbers in python script
Find out how many times a regex matches in a string in Python
How to unquote a urlencoded unicode string in python?
Regular Expression to match cross platform newline characters
replacing only single instances of a character with python regexp
Python Regular Expression Match All 5 Digit Numbers but None Larger
python: unicode in Windows terminal, encoding used?
Type of compiled regex object in python
Python returns length of 2 for single Unicode character string
match dates using python regular expressions
How to use regex with optional characters in python?

More Related Contents:

Leave a Comment Cancel reply