‘utf-8’ codec can’t decode byte 0x92 in position 18: invalid start byte

The data is indeed not encoded as UTF-8; everything is ASCII except for that single 0x92 byte: b’Korea, Dem. People\x92s Rep.’ Decode it as Windows codepage 1252 instead, where 0x92 is a fancy quote, ’: df1 = pd.read_csv(“https://raw.githubusercontent.com/tuyenhavan/Statistics/Dataset/World_Life_Expectancy.csv”, sep=”;”, encoding=’cp1252′) Demo: >>> import pandas as pd >>> df1 = pd.read_csv(“https://raw.githubusercontent.com/tuyenhavan/Statistics/Dataset/World_Life_Expectancy.csv”, … sep=”;”, encoding=’cp1252′) >>> df1.head() … Read more