Remove duplicate rows from Pandas dataframe where only some columns have the same value

Use drop_duplicates with parameter subset, for keeping only last duplicated rows add keep='last':

df1 = df.drop_duplicates(subset=['A','B'])
#same as
#df1 = df.drop_duplicates(subset=['A','B'], keep='first')
print (df1)
   A  B  C
0  1  2  x
2  3  4  z
3  3  5  x

df2 = df.drop_duplicates(subset=['A','B'], keep='last')
print (df2)
   A  B  C
1  1  2  y
2  3  4  z
3  3  5  x

More Related Contents:

Remove pandas rows with duplicate indices
Remove duplicates from dataframe, based on two columns A,B, keeping row with max value in another column C
Group duplicate column IDs in pandas dataframe
How to repeat a Pandas DataFrame?
How to “select distinct” across multiple data frame columns in pandas?
Why does the following code cant get value of n_fold = 1?
How to change the order of DataFrame columns?
Add missing dates to pandas dataframe
Pandas: sum DataFrame rows for given columns
datetime dtypes in pandas read_csv
Pandas DataFrame: replace all values in a column, based on condition
What is the fastest way to upload a big csv file in notebook to work with python pandas?
Calculate average of every x rows in a table and create new table
How to check if any value is NaN in a Pandas DataFrame
Removing prefix from column names in Pandas
How to concatenate two dataframes without duplicates?
How to remove square bracket from pandas dataframe
Python Pandas update a dataframe value from another dataframe
Mapping columns from one dataframe to another to create a new column [duplicate]
How can I replicate rows in Pandas?
Action with pandas SettingWithCopyWarning
python dataframe pandas drop column using int
How to create a DataFrame of random integers with Pandas?
How to plot two columns of a pandas data frame using points
How to read a list of parquet files from S3 as a pandas dataframe using pyarrow?
append dictionary to data frame
Calculate summary statistics of columns in dataframe
Python: Adding hours to pandas timestamp
Pandas: Join dataframe with condition
UnicodeDecodeError when reading CSV file in Pandas

More Related Contents:

Leave a Comment Cancel reply