[DISCUSSION] Forceful minor Compaction

2017-04-12 Thread Kunal Kapoor
Hi all,
I was looking into compaction and had a query regarding the same.
If you have auto compaction turned on and the threshold level is 4,3
Now try loading data 7 times which will make 7 segments. Now as the auto
compaction was on therefore 4 segments will be merged to 0.1.
The segments visible will be 0.1, 4, 5, 6.

When i try to run the compaction command nothing happens because the
threshold level is not reached for compaction to happen.
What if i want to merge the 3 segments i.e (4, 5, 6) and make a level-1
compacted segment.

The proposed solution would be to add a new option to the compaction
command which contains what level of compaction the user would like to do

Example:- alter table carbon_table compact 'minor' level '1'.
This would forcefully combine the segments(4, 5, 6) in to a level 1
compacted segment called 4.1 giving me 2 level-1 compacted segment (0.1 and
4.1)
Similar operation can be done with level-1 compacted segments.


Re: [DISCUSSION] Forceful minor Compaction

2017-04-19 Thread Kunal Kapoor
Hi Liang,
Currently the user cannot choose which segments should be merged or not.
We should give a DDL which supports both forceful minor and major
compaction separately.

Another point would be if the user can specify the segments which he needs
merged. Obviously the segments that can be given should meet some criteria.
for example:-- no major compacted or 2nd Level compacted segments can be
provided during forced minor compaction.


--
Regards

*Kunal Kapoor*
Software Consultant

*Knoldus Software LLP <http://www.knoldus.com/home.knol>*
Twitter <http://twitter/kunal642> | LinkedIn
<https://in.linkedin.com/in/kunal642> | FB
<https://www.facebook.com/kunalkapoor642>
Mobile: +91-9711013430

On Wed, Apr 19, 2017 at 2:33 PM, Liang Chen  wrote:

> Hi Kunal
>
> Thank you for taking the good topic for discussion.
> First , let us think about : why users want to do forceful minor
> compaction,
> which cases?
> Current "MAJOR compaction" whether can cover "forceful MINOR compaction"
> scenarios ?
>
> As we know, compaction is mainly for optimizing index effectiveness by
> merging for multiple times data loading(segments), so current system
> provides two options for users to choose :"soft compaction(minor) and
> strong
> compaction(major)"。
>
> So i could not find the typical cases to add a new strong
> compaction(forceful minor)。
>
> Regards
> Liang
>
>
> Kunal Kapoor wrote
> > Hi all,
> > I was looking into compaction and had a query regarding the same.
> > If you have auto compaction turned on and the threshold level is 4,3
> > Now try loading data 7 times which will make 7 segments. Now as the auto
> > compaction was on therefore 4 segments will be merged to 0.1.
> > The segments visible will be 0.1, 4, 5, 6.
> >
> > When i try to run the compaction command nothing happens because the
> > threshold level is not reached for compaction to happen.
> > What if i want to merge the 3 segments i.e (4, 5, 6) and make a level-1
> > compacted segment.
> >
> > The proposed solution would be to add a new option to the compaction
> > command which contains what level of compaction the user would like to do
> >
> > Example:- alter table carbon_table compact 'minor' level '1'.
> > This would forcefully combine the segments(4, 5, 6) in to a level 1
> > compacted segment called 4.1 giving me 2 level-1 compacted segment (0.1
> > and
> > 4.1)
> > Similar operation can be done with level-1 compacted segments.
>
>
>
>
>
> --
> View this message in context: http://apache-carbondata-
> mailing-list-archive.1130556.n5.nabble.com/DISCUSSION-
> Forceful-minor-Compaction-tp10924p11319.html
> Sent from the Apache CarbonData Mailing List archive mailing list archive
> at Nabble.com.
>


[jira] [Created] (CARBONDATA-346) Update unit test for core module

2016-10-28 Thread Kunal Kapoor (JIRA)
Kunal Kapoor created CARBONDATA-346:
---

 Summary: Update unit test for core module
 Key: CARBONDATA-346
 URL: https://issues.apache.org/jira/browse/CARBONDATA-346
 Project: CarbonData
  Issue Type: Test
Reporter: Kunal Kapoor
Priority: Trivial


Improve code coverage for util package.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-773) During parallel load multiple instances of DictionaryServer are being created.

2017-03-15 Thread Kunal Kapoor (JIRA)
Kunal Kapoor created CARBONDATA-773:
---

 Summary: During parallel load multiple instances of 
DictionaryServer are being created.
 Key: CARBONDATA-773
 URL: https://issues.apache.org/jira/browse/CARBONDATA-773
 Project: CarbonData
  Issue Type: Improvement
Reporter: Kunal Kapoor
Priority: Minor


During parallel load multiple instances of DictionaryServer are being created.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-839) Table lock file is not getting deleted after table rename is successful

2017-03-30 Thread Kunal Kapoor (JIRA)
Kunal Kapoor created CARBONDATA-839:
---

 Summary: Table lock file is not getting deleted after table rename 
is successful
 Key: CARBONDATA-839
 URL: https://issues.apache.org/jira/browse/CARBONDATA-839
 Project: CarbonData
  Issue Type: Bug
Reporter: Kunal Kapoor
Assignee: Kunal Kapoor
Priority: Minor


Table lock file is not getting deleted after table rename is successful



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-846) Add support to revert changes to alter table commands if there is a failure while executing the changes on hive.

2017-04-03 Thread Kunal Kapoor (JIRA)
Kunal Kapoor created CARBONDATA-846:
---

 Summary: Add support to revert changes to alter table commands if 
there is a failure while executing the changes on hive.
 Key: CARBONDATA-846
 URL: https://issues.apache.org/jira/browse/CARBONDATA-846
 Project: CarbonData
  Issue Type: Improvement
Reporter: Kunal Kapoor
Assignee: Kunal Kapoor






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-926) Higher MAXCOLUMNS value in load DML options is leading to out of memory error

2017-04-14 Thread Kunal Kapoor (JIRA)
Kunal Kapoor created CARBONDATA-926:
---

 Summary: Higher MAXCOLUMNS value in load DML options is leading to 
out of memory error
 Key: CARBONDATA-926
 URL: https://issues.apache.org/jira/browse/CARBONDATA-926
 Project: CarbonData
  Issue Type: Improvement
Reporter: Kunal Kapoor
Assignee: Kunal Kapoor






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-957) Table not found exception in rename table after lock acquire failure

2017-04-18 Thread Kunal Kapoor (JIRA)
Kunal Kapoor created CARBONDATA-957:
---

 Summary: Table not found exception in rename table after lock 
acquire failure
 Key: CARBONDATA-957
 URL: https://issues.apache.org/jira/browse/CARBONDATA-957
 Project: CarbonData
  Issue Type: Bug
Reporter: Kunal Kapoor
Assignee: Kunal Kapoor
Priority: Minor


During rename table if an exception is thrown during acquiring locks then table 
not exists exception was thrown while reverting the changes.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)