pandas-groupby - w3toppers.com

Split pandas dataframe based on groupby

gb = df.groupby(‘ZZ’) [gb.get_group(x) for x in gb.groups]

Groupby value counts on the dataframe pandas

I use groupby and size df.groupby([‘id’, ‘group’, ‘term’]).size().unstack(fill_value=0) Timing 1,000,000 rows df = pd.DataFrame(dict(id=np.random.choice(100, 1000000), group=np.random.choice(20, 1000000), term=np.random.choice(10, 1000000)))

Pandas groupby with delimiter join

Alternatively you can do it this way: In [48]: df.groupby(‘col’)[‘val’].agg(‘-‘.join) Out[48]: col A Cat-Tiger B Ball-Bat Name: val, dtype: object UPDATE: answering question from the comment: In [2]: df Out[2]: col val 0 A Cat 1 A Tiger 2 A Panda 3 B Ball 4 B Bat 5 B Mouse 6 B Egg In [3]: … Read more

Aggregation in Pandas

Question 1 How can I perform aggregation with Pandas? Expanded aggregation documentation. Aggregating functions are the ones that reduce the dimension of the returned objects. It means output Series/DataFrame have less or same rows like original. Some common aggregating functions are tabulated below: Function Description mean() Compute mean of groups sum() Compute sum of group … Read more

Concatenate strings from several rows using Pandas groupby

You can groupby the ‘name’ and ‘month’ columns, then call transform which will return data aligned to the original df and apply a lambda where we join the text entries: In [119]: df[‘text’] = df[[‘name’,’text’,’month’]].groupby([‘name’,’month’])[‘text’].transform(lambda x: ‘,’.join(x)) df[[‘name’,’text’,’month’]].drop_duplicates() Out[119]: name text month 0 name1 hej,du 11 2 name1 aj,oj 12 4 name2 fin,katt 11 6 … Read more

Get the row(s) which have the max value in groups using groupby

In [1]: df Out[1]: Sp Mt Value count 0 MM1 S1 a 3 1 MM1 S1 n 2 2 MM1 S3 cb 5 3 MM2 S3 mk 8 4 MM2 S4 bg 10 5 MM2 S4 dgd 1 6 MM4 S2 rd 2 7 MM4 S2 cb 2 8 MM4 S2 uyi 7 In [2]: … Read more