Re: value_counts after group_by

2022-08-26 Thread Weston Pace
I'm happy to spread the word. The thanks here go to Eduardo Ponce, Aldrin Montana, and the various reviewers who have all worked hard to create this doc. On Fri, Aug 26, 2022 at 6:05 AM Suresh V wrote: > > Hi Weston > > Thanks a lot for the response. I tried the list approach a while back to

Re: value_counts after group_by

2022-08-26 Thread Suresh V
Hi Weston Thanks a lot for the response. I tried the list approach a while back to get the group keys in this fashion and run parallel computation at group level and the performance penalty for the dataset of 50m rows was way too high(2s vs 8s). Thanks a lot for the awesome initiative of

Re: value_counts after group_by

2022-08-25 Thread Weston Pace
> Is there a way to get value_counts of a given column after doing table > group_by? Is your goal to group by some key and then get the value counts of an entirely different non-key column? If so, then no, not today, at least not directly. The only group by node we have is a hash-group-by and

value_counts after group_by

2022-08-25 Thread Suresh V
Hi, Is there a way to get value_counts of a given column after doing table group_by? If its not possible, can you please point me the relevant cpp/python files I need to modify for this to work? Thanks