[ https://issues.apache.org/jira/browse/ARROW-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kouhei Sutou updated ARROW-8199: -------------------------------- Summary: [C++] Add support for multi-column sort on Table (was: [C++] Add support for multi-column sort on Table example?) > [C++] Add support for multi-column sort on Table > ------------------------------------------------ > > Key: ARROW-8199 > URL: https://issues.apache.org/jira/browse/ARROW-8199 > Project: Apache Arrow > Issue Type: Wish > Components: C++ > Affects Versions: 0.16.0 > Reporter: Scott Wilson > Assignee: Kouhei Sutou > Priority: Minor > Labels: pull-request-available > Attachments: ArrowCsv.cpp, DataFrame.h > > Time Spent: 5h 10m > Remaining Estimate: 0h > > I'm just coming up to speed with Arrow and am noticing a dearth of examples > ... maybe I can help here. > I'd like to implement multi-column sorting for Tables and just want to ensure > that I'm not duplicating existing work or proposing a bad design. > My thought was to create a Table-specific version of SortToIndices() where > you can specify the columns and sort order. > Then I'd create Array "views" that use the Indices to remap from the original > Array values to the values in sorted order. (Original data is not sorted, but > could be as a second step.) I noticed some of the array list variants keep > offsets, but didn't see anything that supports remapping per a list of > indices, but this may just be my oversight? > Thanks in advance, Scott -- This message was sent by Atlassian Jira (v8.3.4#803005)