Find symmetric pairs quickly in numpy

Option 1: Sort the values within each row, so that (x, y) and (y, x) end up identical, then groupby:

import numpy as np

a = np.sort(df.to_numpy(), axis=1)  # sort each row so both orientations of a pair match
df.groupby([a[:, 0], a[:, 1]], as_index=False, sort=False).first()
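To make the idea concrete, here is a minimal sketch on made-up data. The frame and its column names c1 and c2 are assumptions for illustration only; the real df from the question may look different.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: (1, 2)/(2, 1) and (3, 4)/(4, 3) are symmetric pairs.
df = pd.DataFrame({"c1": [1, 2, 3, 4, 5], "c2": [2, 1, 4, 3, 6]})

# Sort within each row so both orientations of a pair share the same key.
a = np.sort(df.to_numpy(), axis=1)

# Group on the sorted key; sort=False keeps groups in order of first appearance.
result = df.groupby([a[:, 0], a[:, 1]], as_index=False, sort=False).first()
print(result[["c1", "c2"]])
```

Note that first() takes the first non-null value per column within each group, so on data with NaN it may not return an intact original row.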

Option 2: If you have a lot of distinct (c1, c2) pairs, groupby can be slow. In that case, assign the sorted values as temporary columns and filter with drop_duplicates:

a = np.sort(df.to_numpy(), axis=1)

(df.assign(one=a[:, 0], two=a[:, 1])   # temporary key columns; any unused names work
   .drop_duplicates(['one', 'two'])    # keep the first row of each sorted pair
   .reindex(df.columns, axis=1)        # drop the temporary columns again
)
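A small runnable sketch of Option 2, again on made-up data (the c1/c2 frame is an assumption). As a cross-check it also shows a NumPy-only variant using np.unique with return_index; that variant is not part of the answer above, just one other way to get the first row of each pair, and it assumes a numeric (non-object) array.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data with two symmetric pairs.
df = pd.DataFrame({"c1": [1, 2, 3, 4, 5], "c2": [2, 1, 4, 3, 6]})
a = np.sort(df.to_numpy(), axis=1)

# Option 2: temporary key columns, drop duplicates, restore original columns.
deduped = (
    df.assign(one=a[:, 0], two=a[:, 1])
      .drop_duplicates(["one", "two"])
      .reindex(df.columns, axis=1)
)

# Alternative (assumption, not from the answer): np.unique on the sorted rows
# returns the index of the first occurrence of each unique row.
_, first_idx = np.unique(a, axis=0, return_index=True)
via_unique = df.iloc[np.sort(first_idx)]

print(deduped)
print(via_unique.equals(deduped))
```

Unlike Option 1, both variants keep the original row labels, so you can still see which rows survived.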
