How to remove xa0 from string in Python?

\xa0 is actually non-breaking space in Latin1 (ISO 8859-1), also chr(160). You should replace it with a space.

string = string.replace(u'\xa0', u' ')

When .encode(‘utf-8’), it will encode the unicode to utf-8, that means every unicode could be represented by 1 to 4 bytes. For this case, \xa0 is represented by 2 bytes \xc2\xa0.

Read up on http://docs.python.org/howto/unicode.html.

Please note: this answer in from 2012, Python has moved on, you should be able to use unicodedata.normalize now

More Related Contents:

Python and BeautifulSoup encoding issues [duplicate]
How to convert a string to utf-8 in Python
How to correctly parse UTF-8 encoded HTML to Unicode strings with BeautifulSoup? [duplicate]
UnicodeEncodeError: ‘ascii’ codec can’t encode character u’\xa0′ in position 20: ordinal not in range(128)
Saving utf-8 texts with json.dumps as UTF8, not as \u escape sequence
Unicode (UTF-8) reading and writing to files in Python
UnicodeDecodeError: ‘ascii’ codec can’t decode byte 0xef in position 1
Python & MySql: Unicode and Encoding
Python – ‘ascii’ codec can’t decode byte
Writing UTF-8 String to MySQL with Python
bs4.FeatureNotFound: Couldn’t find a tree builder with the features you requested: lxml. Do you need to install a parser library?
General Unicode/UTF-8 support for csv files in Python 2.6
MySQL “incorrect string value” error when save unicode string in Django
UnicodeEncodeError: ‘ascii’ codec can’t encode character at special name [duplicate]
Python reading from a file and saving to utf-8
BeautifulSoup returns empty list when searching by compound class names
Python ascii utf unicode
Get unicode code point of a character using Python
ElementTree and unicode
UnicodeEncodeError: ‘ascii’ codec can’t encode characters in position 0-5: ordinal not in range(128) [duplicate]
Why does ENcoding a string result in a DEcoding error (UnicodeDecodeError)?
Convert UTF-16 to UTF-8 and remove BOM?
How can I use io.StringIO() with the csv module?
python encoding utf-8
Install Beautiful Soup using pip [duplicate]
BeautifulSoup getText from between , not picking up subsequent paragraphs
SQLite, python, unicode, and non-utf data
UnicodeDecodeError when performing os.walk
How to handle IncompleteRead: in python
Bytes in a unicode Python string

How to remove \xa0 from string in Python?

Leave a Comment Cancel reply