[GitHub] carbondata pull request #3064: [CARBONDATA-3243] Updated DOC for No-Sort Com...

2019-01-11 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3064#discussion_r247097474 --- Diff: integration/spark-common/src/main/scala/org/apache/spark/sql/catalyst/CarbonDDLSqlParser.scala --- @@ -1201,6 +1202,17 @@ abstract

[GitHub] carbondata pull request #3064: [CARBONDATA-3243] Updated DOC for No-Sort Com...

2019-01-10 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3064#discussion_r247011216 --- Diff: docs/dml-of-carbondata.md --- @@ -106,6 +107,13 @@ CarbonData DML statements are documented here,which includes: OPTIONS

[GitHub] carbondata pull request #3064: [CARBONDATA-3243] Updated DOC for No-Sort Com...

2019-01-10 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3064#discussion_r247010700 --- Diff: docs/dml-of-carbondata.md --- @@ -49,6 +49,7 @@ CarbonData DML statements are documented here,which includes: | [COMMENTCHAR

[GitHub] carbondata pull request #3064: [CARBONDATA-3243] Updated DOC for No-Sort Com...

2019-01-10 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3064#discussion_r247010601 --- Diff: docs/configuration-parameters.md --- @@ -208,6 +208,7 @@ RESET | carbon.options.date.format | Specifies the data

[GitHub] carbondata pull request #3064: [CARBONDATA-3243] Updated DOC for No-Sort Com...

2019-01-10 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3064#discussion_r246780308 --- Diff: integration/spark2/src/main/scala/org/apache/spark/sql/hive/execution/command/CarbonHiveCommands.scala --- @@ -127,6 +127,9 @@ object

[GitHub] carbondata pull request #3064: [WIP] Updated DOC for No-Sort Compaction and ...

2019-01-10 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/3064 [WIP] Updated DOC for No-Sort Compaction and a few Fixes 1. Updated Doc 2. Checking SORT_SCOPE in session property CARBON.TABLE.LOAD.SORT.SCOPE in CarbonTable.getSortScope() 3

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2019-01-09 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3029#discussion_r246329602 --- Diff: processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionUtil.java --- @@ -400,24 +417,53 @@ private static int

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2019-01-09 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3029#discussion_r246329142 --- Diff: processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionExecutor.java --- @@ -105,10 +105,15 @@ public

[GitHub] carbondata pull request #3014: [CARBONDATA-3201] Added load level SORT_SCOPE

2019-01-08 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3014#discussion_r246274857 --- Diff: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/management/CarbonLoadDataCommand.scala --- @@ -191,10 +191,17

[GitHub] carbondata pull request #2996: [CARBONDATA-3235] Fix Rename-Fail & Datamap-c...

2019-01-08 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/2996#discussion_r245925395 --- Diff: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/schema/CarbonAlterTableRenameCommand.scala --- @@ -165,15

[GitHub] carbondata pull request #3014: [CARBONDATA-3201] Added load level SORT_SCOPE

2019-01-07 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3014#discussion_r245896341 --- Diff: core/src/main/java/org/apache/carbondata/core/util/SessionParams.java --- @@ -229,6 +229,12 @@ private boolean validateKeyValue(String

[GitHub] carbondata pull request #3014: [CARBONDATA-3201] Added load level SORT_SCOPE

2019-01-07 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3014#discussion_r245895212 --- Diff: integration/spark2/src/test/scala/org/apache/spark/carbondata/commands/SetCommandTestCase.scala --- @@ -128,6 +128,34 @@ class

[GitHub] carbondata issue #3014: [CARBONDATA-3201] Added load level SORT_SCOPE

2019-01-07 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/3014 retest this please ---

[GitHub] carbondata pull request #2996: [WIP] Fix Rename-Fail & Datamap-creation-Fail

2019-01-07 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/2996#discussion_r245669672 --- Diff: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/schema/CarbonAlterTableRenameCommand.scala --- @@ -165,15

[GitHub] carbondata pull request #3014: [CARBONDATA-3201] Added load level SORT_SCOPE

2019-01-03 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3014#discussion_r244947621 --- Diff: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/management/CarbonLoadDataCommand.scala --- @@ -191,10 +191,17

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2019-01-02 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3029#discussion_r244766765 --- Diff: processing/src/main/java/org/apache/carbondata/processing/merger/CompactionResultSortProcessor.java --- @@ -156,21 +159,22 @@ public

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2019-01-02 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3029#discussion_r244766044 --- Diff: integration/spark-common/src/main/scala/org/apache/carbondata/spark/rdd/StreamHandoffRDD.scala --- @@ -118,7 +118,7 @@ class

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2019-01-02 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3029#discussion_r244766555 --- Diff: processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionUtil.java --- @@ -400,24 +417,53 @@ private static int

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2019-01-02 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/3029#discussion_r244713711 --- Diff: processing/src/main/java/org/apache/carbondata/processing/sort/sortdata/InMemorySortTempChunkHolder.java --- @@ -0,0 +1,109

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

2018-12-27 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/3029 @qiuchenjian We dont need a parameter to control to control whether to sort the data or not while compacting. The sole purpose of compaction is to improve subsequent query performance

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

2018-12-27 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/3029 [CARBONDATA-3200] No-Sort compaction When the data is loaded with SORT_SCOPE as NO_SORT, and done compaction upon, the data still remains unsorted. This does not affect much in query

[GitHub] carbondata pull request #3014: [WIP] Added load level SORT_SCOPE

2018-12-20 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/3014 [WIP] Added load level SORT_SCOPE ### Load level SORT_SCOPE ```sql LOAD DATA INPATH 'path/to/data.csv' INTO TABLE my_table OPTIONS ( 'sort_scope'='no_sort

[GitHub] carbondata pull request #2996: [WIP] Fix Rename-Fail & Datamap-creation-Fail

2018-12-18 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/2996 [WIP] Fix Rename-Fail & Datamap-creation-Fail Fixed negative scenarios: 1. Alter Table Rename Table Fail 2. Create Preagregate-Datamap Fail ### Alter Table Rename T

[GitHub] carbondata issue #2942: [CARBONDATA-3121] Improvement of CarbonReader build ...

2018-11-26 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/2942 retest this please ---

[GitHub] carbondata pull request #2942: [CARBONDATA-3121] Improvement of CarbonReader...

2018-11-23 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/2942#discussion_r235901284 --- Diff: core/src/main/java/org/apache/carbondata/core/scan/result/iterator/ChunkRowIterator.java --- @@ -52,17 +49,11 @@ public

[GitHub] carbondata pull request #2942: [CARBONDATA-3121] Improvement of CarbonReader...

2018-11-23 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/2942#discussion_r235892844 --- Diff: core/src/main/java/org/apache/carbondata/core/scan/result/iterator/ChunkRowIterator.java --- @@ -52,17 +49,11 @@ public

[GitHub] carbondata pull request #2942: [CARBONDATA-3121] Improvement of CarbonReader...

2018-11-22 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/2942 [CARBONDATA-3121] Improvement of CarbonReader build time CarbonReader builder is taking huge time. **Reason** Initialization of ChunkRowIterator is triggring actual I/O

[GitHub] carbondata pull request #2921: [CARBONDATA-3104] Removed unnecessary configu...

2018-11-19 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/2921#discussion_r234875057 --- Diff: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/table/CarbonDropTableCommand.scala --- @@ -143,7 +143,6

[GitHub] carbondata issue #2921: [CARBONDATA-3104] Removed unnecessary configuration ...

2018-11-19 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/2921 @kunal642 Please review ---

[GitHub] carbondata pull request #2850: [CARBONDATA-3056] Added concurrent reading th...

2018-10-30 Thread NamanRastogi
Github user NamanRastogi commented on a diff in the pull request: https://github.com/apache/carbondata/pull/2850#discussion_r229299196 --- Diff: store/sdk/src/main/java/org/apache/carbondata/sdk/file/CarbonReader.java --- @@ -114,6 +117,43 @@ public static CarbonReaderBuilder

[GitHub] carbondata issue #2850: [WIP] Added concurrent reading through SDK

2018-10-27 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/2850 Please check the split method, it splits the list of `CarbonRecordReader` into multiple `CarbonReader`s. It does not jumble the order of `CarbonRecordReader`, it still keeps them

[GitHub] carbondata issue #2850: [WIP] Added concurrent reading through SDK

2018-10-27 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/2850 Yes, data coming from one file will always be in order. Please check the `split` method, it splits the list of CarbonRecordReader into multiple CarbonReader s. Suppose

[GitHub] carbondata issue #2850: [WIP] Added concurrent reading through SDK

2018-10-26 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/2850 @xuchuanyin Using this API the overhead of reading concurrently is on the consumer of CarbonReader. So yes, what you said what you said is right. But this API was made to consider

[GitHub] carbondata issue #2850: [WIP] Added concurrent reading through SDK

2018-10-24 Thread NamanRastogi
Github user NamanRastogi commented on the issue: https://github.com/apache/carbondata/pull/2850 @xuchuanyin Since `CarbonReader` is iterator based, we can only read a line when user wants. So, even if we keep the parallelism internally before the build method, it will still

[GitHub] carbondata pull request #2850: Added concurrent reading through SDK

2018-10-24 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/2850 Added concurrent reading through SDK Added another API for _CarbonReader.split_ to enable concurrent reading of carbondata files through SDK. ```java List multipleReaders

[GitHub] carbondata pull request #2748: [CARBONDATA-2959] Added validations for TABLE...

2018-09-26 Thread NamanRastogi
Github user NamanRastogi closed the pull request at: https://github.com/apache/carbondata/pull/2748 ---

[GitHub] carbondata pull request #2748: [CARBONDATA-2959] Added validations for TABLE...

2018-09-21 Thread NamanRastogi
GitHub user NamanRastogi opened a pull request: https://github.com/apache/carbondata/pull/2748 [CARBONDATA-2959] Added validations for TABLE OPTIONS create table validations are added for table OPTIONS while table is created using: ``` CREATE TABLE ( ) USING carbon