GitHub user NamanRastogi opened a pull request:

    https://github.com/apache/carbondata/pull/3029

    [CARBONDATA-3200] No-Sort compaction

    When the data is loaded with SORT_SCOPE as NO_SORT, and done compaction 
upon, the data still remains unsorted. This does not affect much in query. The 
major purpose of compaction, is better pack the data and improve query 
performance.
    
    Now, the expected behaviour of compaction is sort to the data, so that 
after compaction, query performance becomes better. The columns to sort upon 
are provided by SORT_COLUMNS.
    
    
     - [ ] Any interfaces changed?  -->  No
     - [ ] Any backward compatibility impacted?   -->  No
     - [ ] Document update required?  ->  No
     - [ ] Testing done
     - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/NamanRastogi/carbondata nosort_compaction

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/carbondata/pull/3029.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3029
    
----
commit f9e0142149ccd83a48f828bf032842b2a18ce90d
Author: namanrastogi <naman.rastogi.52@...>
Date:   2018-12-27T13:26:18Z

    Added HybridSortProcessor

commit d406a9f595558f2f027a56425b0f432b534e47c8
Author: namanrastogi <naman.rastogi.52@...>
Date:   2018-12-21T16:48:15Z

    Added flow for HybridSorterProcessor.
    
    TODO: Implement HybridSorterProcessor itself.

----


---

Reply via email to