Pandas get topmost n records within each group

Did you try:

df.groupby('id').head(2)

Output generated:

       id  value
id             
1  0   1      1
   1   1      2 
2  3   2      1
   4   2      2
3  7   3      1
4  8   4      1

(Keep in mind that head(2) returns the first two rows of each group in their current order, so you might need to sort beforehand, depending on your data.)
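If "topmost" means the largest values rather than the first rows, a sort before the groupby should do it. Here is a minimal sketch; the DataFrame below is an assumption, reconstructed to be consistent with the output above, and the name top2 is only illustrative:

```python
import pandas as pd

# Hypothetical input, reconstructed to be consistent with the output shown above
df = pd.DataFrame({
    "id":    [1, 1, 1, 2, 2, 2, 2, 3, 4],
    "value": [1, 2, 3, 1, 2, 3, 4, 1, 1],
})

# Sort so the largest value in each id comes first,
# then keep the first two rows of every group.
top2 = (
    df.sort_values(["id", "value"], ascending=[True, False])
      .groupby("id")
      .head(2)
      .reset_index(drop=True)  # renumber the rows 0..n-1
)

print(top2)
```

Groups with fewer than two rows (ids 3 and 4 here) simply return the rows they have.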

EDIT: As mentioned by the questioner, use

df.groupby('id').head(2).reset_index(drop=True)

to remove the MultiIndex and flatten the results:

    id  value
0   1      1
1   1      2
2   2      1
3   2      2
4   3      1
5   4      1
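Note that in recent pandas versions, as far as I can tell, groupby(...).head(n) no longer returns a MultiIndex: it keeps the original row labels, so reset_index(drop=True) just renumbers the rows. A small sketch to check this on your own install, again using a hypothetical DataFrame reconstructed from the output above:

```python
import pandas as pd

# Hypothetical input, reconstructed to be consistent with the output shown above
df = pd.DataFrame({
    "id":    [1, 1, 1, 2, 2, 2, 2, 3, 4],
    "value": [1, 2, 3, 1, 2, 3, 4, 1, 1],
})

# First two rows per group; the index holds the original row labels
first2 = df.groupby("id").head(2)
print(first2.index.tolist())

# Same rows, renumbered from 0
flat = first2.reset_index(drop=True)
print(flat)
```

If the first print shows a flat list of the original labels, the reset_index call is only needed when you want consecutive row numbers.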
