Replace duplicate values across columns in Pandas

You can use the duplicated method to get a boolean mask that marks each element repeating an earlier one (the first occurrence is not flagged):

In [214]: pd.Series(['M', '0', 'M', '0']).duplicated()
Out[214]:
0    False
1    False
2     True
3     True
dtype: bool

Then you can build a mask by applying this to each row of your dataframe (axis=1), and use where to perform the substitution, keeping the non-duplicates and replacing everything else with 0:

is_duplicate = df.apply(pd.Series.duplicated, axis=1)
df.where(~is_duplicate, 0)

  col1 col2 col3 col4
0    A    B    C    0
1    M    0    0    0
2    B    0    0    0
3    X    0    Y    0
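
For a version you can run end to end, here is a minimal sketch. The input frame is hypothetical (the question's original data is not shown here); its values were chosen only so that the result matches the output above.

```python
import pandas as pd

# Hypothetical input, made up for illustration.
df = pd.DataFrame({
    "col1": ["A", "M", "B", "X"],
    "col2": ["B", "M", "B", "X"],
    "col3": ["C", "M", "B", "Y"],
    "col4": ["A", "M", "B", "X"],
})

# For each row, flag values that repeat an earlier value in that same row.
is_duplicate = df.apply(pd.Series.duplicated, axis=1)

# where keeps cells where the condition is True and substitutes 0 elsewhere,
# so the mask is inverted to keep the non-duplicates.
result = df.where(~is_duplicate, 0)
print(result)
```

Note that where returns a new dataframe and leaves df unchanged, so assign the result if you want to keep it.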
