xuchuanyin created CARBONDATA-2288:
--------------------------------------

             Summary: Compaction should be able to run concurrently with data 
loading
                 Key: CARBONDATA-2288
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2288
             Project: CarbonData
          Issue Type: Improvement
          Components: data-load
            Reporter: xuchuanyin
            Assignee: xuchuanyin


Currently in carbondata, compaction can be triggered in two ways:
1. Manually trigger compaction using ALTER statement.
2. Atomically trigger compaction when doing data loading.

In both ways, compaction and data loading cannot run concurrently. In way 1, 
compation will fail if data load is processing. In way 2, the compaction will 
only start after the main data loading finished and the user has to wait until 
the compaction is finished.

In my option, data loading will work on a new segment, whereas compaction works 
on the existed segments, so we can let them run concurrently.

For the 1st way, compaction will succeed even data loading is processing;
For the 2nd way, compaction will run concurrently with the data loading, or 
after the data loading (we can configure it). And user will not have to wait 
the compaction finished.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to