PySpark: match the values of a DataFrame column against another DataFrame column

This kind of operation is called a left semi join in Spark. It returns the rows of df_B whose col1 value also appears in df_A, keeping only df_B's columns:

df_B.join(df_A, ['col1'], 'leftsemi')
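To make the semantics concrete without needing a Spark session, here is a rough plain-Python sketch of what a left semi join returns. The helper `left_semi_join` and the sample rows are hypothetical and only model the behaviour on lists of dicts; this is not how Spark implements the join.

```python
def left_semi_join(left_rows, right_rows, key):
    """Keep each left row whose key value appears in right_rows.

    Hypothetical helper that mimics left semi join semantics on
    lists of dicts; it is not part of PySpark.
    """
    # Collect the distinct key values present on the right side.
    # None is skipped because a null key never matches in an SQL equi-join.
    right_keys = {row[key] for row in right_rows if row[key] is not None}
    # Each left row is emitted at most once, with its own columns only.
    return [row for row in left_rows if row[key] in right_keys]


# Sample data standing in for df_B (left) and df_A (right).
rows_B = [
    {"col1": 1, "name": "a"},
    {"col1": 2, "name": "b"},
    {"col1": 3, "name": "c"},
]
rows_A = [{"col1": 2}, {"col1": 3}, {"col1": 3}, {"col1": 4}]

matched = left_semi_join(rows_B, rows_A, "col1")
print(matched)
```

Note that the duplicate value 3 on the right side does not duplicate the matching left row, which is the main difference from an inner join.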
