[GitHub] carbondata pull request #1050: [CARBONDATA -1186]resolved bug for class cast...

2017-11-01 Thread anubhav100
Github user anubhav100 closed the pull request at:

https://github.com/apache/carbondata/pull/1050


---


[GitHub] carbondata issue #1050: [CARBONDATA -1186]resolved bug for class cast except...

2017-11-01 Thread anubhav100
Github user anubhav100 commented on the issue:

https://github.com/apache/carbondata/pull/1050
  
@sraghunandan I am unable to reproduce this bug, so I am closing the ticket.


---


[GitHub] carbondata issue #1419: [CARBONDATA-1601] Add carbondata-store module

2017-11-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/1419
  
SDV Build Fail, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/1425/



---


[GitHub] carbondata issue #1419: [CARBONDATA-1601] Add carbondata-store module

2017-11-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/1419
  
Build Success with Spark 2.1.0, Please check CI 
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/792/



---


[GitHub] carbondata issue #1460: [Docs] Fix partition-guide.md docs NUM_PARTITIONS wr...

2017-11-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/1460
  
Can one of the admins verify this patch?


---


[GitHub] carbondata issue #1460: [Docs] Fix partition-guide.md docs NUM_PARTITIONS wr...

2017-11-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/1460
  
Can one of the admins verify this patch?


---


[GitHub] carbondata pull request #1460: [Docs] Fix partition-guide.md docs NUM_PARTIT...

2017-11-01 Thread LiShuMing
GitHub user LiShuMing opened a pull request:

https://github.com/apache/carbondata/pull/1460

[Docs] Fix partition-guide.md docs NUM_PARTITIONS wrong keyword

It's an obvious keyword spelling mistake: `PARTITION_NUM` should be 
`NUM_PARTITIONS`.

No Tests.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/LiShuMing/carbondata fix-docs-partition-guide

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/1460.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1460


commit 517a40e0707709ebf4b9812e4208717222ef2e06
Author: lishuming 
Date:   2017-11-02T03:55:02Z

Fix partition-guide.md docs NUM_PARTITIONS wrong keyword




---


[GitHub] carbondata issue #1419: [CARBONDATA-1601] Add carbondata-store module

2017-11-01 Thread jackylk
Github user jackylk commented on the issue:

https://github.com/apache/carbondata/pull/1419
  
retest this please


---


[GitHub] carbondata issue #1455: [CARBONDATA-1624]Set the default value of 'carbon.nu...

2017-11-01 Thread zzcclp
Github user zzcclp commented on the issue:

https://github.com/apache/carbondata/pull/1455
  
@ravipesala @jackylk Why was 2 used as the default value of 
'carbon.number.of.cores.while.loading' before?


---


[GitHub] carbondata pull request #1452: [CARBONDATA-1593] Add partition to table caus...

2017-11-01 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/carbondata/pull/1452


---


[GitHub] carbondata issue #1417: [CARBONDATA-1594] Add precision and scale to Decimal...

2017-11-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/1417
  
SDV Build Fail, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/1424/



---


[GitHub] carbondata issue #1104: [CARBONDATA-1239] Add validation for set command par...

2017-11-01 Thread jackylk
Github user jackylk commented on the issue:

https://github.com/apache/carbondata/pull/1104
  
Please rebase.


---


[jira] [Resolved] (CARBONDATA-1617) Merging carbonindex files for each segment.

2017-11-01 Thread Jacky Li (JIRA)

 [ 
https://issues.apache.org/jira/browse/CARBONDATA-1617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jacky Li resolved CARBONDATA-1617.
--
   Resolution: Fixed
Fix Version/s: 1.3.0

> Merging carbonindex files for each segment.
> ---
>
> Key: CARBONDATA-1617
> URL: https://issues.apache.org/jira/browse/CARBONDATA-1617
> Project: CarbonData
>  Issue Type: New Feature
>Reporter: Ravindra Pesala
>Priority: Major
> Fix For: 1.3.0
>
>  Time Spent: 10h 20m
>  Remaining Estimate: 0h
>
> Hi,
> Problem:
> The first query on a carbon table is very slow, because the driver has to 
> read many small carbonindex files and cache them the first time.
> Many carbonindex files are created in the following case:
> Loading data in a large cluster.
> For example, if the cluster size is 100 nodes, then for each load 100 index 
> files are created per segment. So after 100 loads, the number of carbonindex 
> files becomes 10,000.
> Reading all of these files from the driver is slow because it takes a lot of 
> namenode calls and IO operations.
> Solution:
> Merge the carbonindex files in two levels, so that we can reduce the IO calls 
> to the namenode and improve read performance.
> Merge within a segment.
> Merge the carbonindex files into a single file within the segment immediately 
> after the load completes. It would be named as a .carbonindexmerge file. It is 
> not a true data merge but a simple file merge, so the current structure of the 
> carbonindex files does not change. While reading, we read just one file 
> instead of many carbonindex files within the segment.
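
The arithmetic and the "simple file merge" described in the issue can be sketched roughly as below. This is only an illustration and not CarbonData's implementation: the function names, the default merged file name, and the length-prefixed layout of the merged file are all assumptions made for the example.

```python
import struct
from pathlib import Path


def index_file_count(nodes: int, loads: int) -> int:
    """Number of carbonindex files if every load writes one file per node."""
    return nodes * loads


def merge_index_files(segment_dir: Path,
                      merged_name: str = "segment.carbonindexmerge") -> Path:
    """Concatenate every *.carbonindex file of a segment into one file.

    Hypothetical layout: for each source file, a length-prefixed file name
    followed by a length-prefixed payload. The payload bytes are copied
    unchanged, so the structure of each index file is preserved.
    """
    merged = segment_dir / merged_name
    with merged.open("wb") as out:
        for src in sorted(segment_dir.glob("*.carbonindex")):
            name = src.name.encode("utf-8")
            data = src.read_bytes()
            out.write(struct.pack(">I", len(name)) + name)
            out.write(struct.pack(">Q", len(data)) + data)
    return merged


def read_merged(merged: Path) -> dict:
    """Read the merged file back into {original file name: payload}."""
    raw = merged.read_bytes()
    entries = {}
    pos = 0
    while pos < len(raw):
        (name_len,) = struct.unpack_from(">I", raw, pos)
        pos += 4
        name = raw[pos:pos + name_len].decode("utf-8")
        pos += name_len
        (data_len,) = struct.unpack_from(">Q", raw, pos)
        pos += 8
        entries[name] = raw[pos:pos + data_len]
        pos += data_len
    return entries
```

With a layout like this, a reader opens one file per segment instead of one file per node, which is the reduction in namenode calls the issue is after; `index_file_count(100, 100)` reproduces the 100 nodes times 100 loads figure from the description.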



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] carbondata pull request #1436: [CARBONDATA-1617] Merging carbonindex files w...

2017-11-01 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/carbondata/pull/1436


---


[GitHub] carbondata issue #1417: [CARBONDATA-1594] Add precision and scale to Decimal...

2017-11-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/1417
  
Build Success with Spark 2.1.0, Please check CI 
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/791/



---


[GitHub] carbondata issue #1436: [CARBONDATA-1617] Merging carbonindex files within s...

2017-11-01 Thread jackylk
Github user jackylk commented on the issue:

https://github.com/apache/carbondata/pull/1436
  
LGTM


---


[GitHub] carbondata issue #1417: [CARBONDATA-1594] Add precision and scale to Decimal...

2017-11-01 Thread jackylk
Github user jackylk commented on the issue:

https://github.com/apache/carbondata/pull/1417
  
retest this please


---


[GitHub] carbondata issue #1437: [CARBONDATA-1618] Fix issue of not support table com...

2017-11-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/1437
  
SDV Build Success, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/1423/



---


[GitHub] carbondata issue #1452: [CARBONDATA-1593] Add partition to table cause NoSuc...

2017-11-01 Thread sounakr
Github user sounakr commented on the issue:

https://github.com/apache/carbondata/pull/1452
  
LGTM


---


[GitHub] carbondata issue #1437: [CARBONDATA-1618] Fix issue of not support table com...

2017-11-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/1437
  
Build Success with Spark 2.1.0, Please check CI 
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/790/



---


[GitHub] carbondata issue #1437: [CARBONDATA-1618] Fix issue of not support table com...

2017-11-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/1437
  
SDV Build Success, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/1422/



---


[GitHub] carbondata issue #1437: [CARBONDATA-1618] Fix issue of not support table com...

2017-11-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/1437
  
Build Success with Spark 2.1.0, Please check CI 
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/789/



---


[GitHub] carbondata issue #1437: [CARBONDATA-1618] Fix issue of not support table com...

2017-11-01 Thread chenerlu
Github user chenerlu commented on the issue:

https://github.com/apache/carbondata/pull/1437
  
retest this please


---


[GitHub] carbondata issue #1459: [CARBONDATA-1661] Fixed bug related to display of ti...

2017-11-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/1459
  
SDV Build Success, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/1421/



---


[GitHub] carbondata issue #1164: [CARBONDATA-1303] Update CarbonContext.scala

2017-11-01 Thread sraghunandan
Github user sraghunandan commented on the issue:

https://github.com/apache/carbondata/pull/1164
  
Can you please add a test case to reproduce this scenario?


---


[GitHub] carbondata issue #1459: [CARBONDATA-1661] Fixed bug related to display of ti...

2017-11-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/1459
  
Build Success with Spark 2.1.0, Please check CI 
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/788/



---


[GitHub] carbondata issue #1452: [CARBONDATA-1593] Add partition to table cause NoSuc...

2017-11-01 Thread lionelcao
Github user lionelcao commented on the issue:

https://github.com/apache/carbondata/pull/1452
  
LGTM


---


[jira] [Commented] (CARBONDATA-1657) Partition column is empty when insert from a hive table

2017-11-01 Thread cen yuhai (JIRA)

[ 
https://issues.apache.org/jira/browse/CARBONDATA-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233951#comment-16233951
 ] 

cen yuhai commented on CARBONDATA-1657:
---

dt's datatype is string

> Partition column is empty when insert from a hive table
> ---
>
> Key: CARBONDATA-1657
> URL: https://issues.apache.org/jira/browse/CARBONDATA-1657
> Project: CarbonData
>  Issue Type: Bug
>  Components: data-load
>Affects Versions: 1.2.0
> Environment: carbondata 1.2.0, spark 2.1.1
>Reporter: cen yuhai
>Priority: Critical
>
> I created a carbon table whose schema is like that of a hive table (dt is the 
> partition column).
> And then:
> {code}
> insert overwrite table dm_test.dm_trd_wide_carbondata select * from 
> hive_table where dt='2017-10-10';
> insert overwrite table dm_test.dm_trd_wide_parquet select * from hive_table 
> where dt='2017-10-10';
> {code}
> {code}
> spark-sql> select dt from dm_test.dm_trd_wide_parquet limit 10;
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> 2017-10-10
> Time taken: 1.259 seconds, Fetched 10 row(s)
> spark-sql> select dt from dm_test.dm_trd_wide_carbondata limit 10;
> NULL
> NULL
> NULL
> NULL
> NULL
> NULL
> NULL
> NULL
> NULL
> NULL
> {code}





[jira] [Comment Edited] (CARBONDATA-1657) Partition column is empty when insert from a hive table

2017-11-01 Thread cen yuhai (JIRA)

[ 
https://issues.apache.org/jira/browse/CARBONDATA-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233951#comment-16233951
 ] 

cen yuhai edited comment on CARBONDATA-1657 at 11/1/17 11:36 AM:
-

the datatype of dt is string


was (Author: cenyuhai):
dt's datatype is string



[jira] [Commented] (CARBONDATA-1657) Partition column is empty when insert from a hive table

2017-11-01 Thread Babulal (JIRA)

[ 
https://issues.apache.org/jira/browse/CARBONDATA-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233948#comment-16233948
 ] 

Babulal commented on CARBONDATA-1657:
-

Hi,
If possible, please provide the create statements for hive_table and 
dm_trd_wide_carbondata.
Also, please check whether the date format in carbon.properties is in 
yyyy-MM-dd format.
Thanks
Babu
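
The date-format check suggested above concerns values that do not match the configured pattern. As a rough illustration only, in plain Python rather than CarbonData code, a strict parser yields nothing when the pattern and the value disagree. The helper `parse_or_none` is hypothetical, and this thread does not establish that a format mismatch is the cause of the NULLs (the reporter states that dt is a string column).

```python
from datetime import date, datetime
from typing import Optional


def parse_or_none(value: str, pattern: str) -> Optional[date]:
    """Parse value with a strptime pattern; return None on a mismatch."""
    try:
        return datetime.strptime(value, pattern).date()
    except ValueError:
        return None


# '%Y-%m-%d' is the strptime counterpart of the Java-style yyyy-MM-dd.
print(parse_or_none("2017-10-10", "%Y-%m-%d"))  # value matches the pattern
print(parse_or_none("2017-10-10", "%d/%m/%Y"))  # value does not match
```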



[GitHub] carbondata issue #1050: [CARBONDATA -1186]resolved bug for class cast except...

2017-11-01 Thread sraghunandan
Github user sraghunandan commented on the issue:

https://github.com/apache/carbondata/pull/1050
  
Can you please add a test case to reproduce this scenario? Also, please rebase.


---


[GitHub] carbondata pull request #1459: [CARBONDATA-1661] Fixed bug related to displa...

2017-11-01 Thread geetikagupta16
GitHub user geetikagupta16 opened a pull request:

https://github.com/apache/carbondata/pull/1459

[CARBONDATA-1661] Fixed bug related to display of timestamp values in presto



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/geetikagupta16/incubator-carbondata 
CARBONDATA-1661

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/1459.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1459


commit fccfdc6be7be70467c31297ef318581d575c0a87
Author: Geetika Gupta 
Date:   2017-11-01T11:12:20Z

Fixed bug related to display of timestamp values in presto




---


[jira] [Assigned] (CARBONDATA-1661) Incorrect output of select query with timestamp data type on presto CLI

2017-11-01 Thread anubhav tarar (JIRA)

 [ 
https://issues.apache.org/jira/browse/CARBONDATA-1661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

anubhav tarar reassigned CARBONDATA-1661:
-

Assignee: anubhav tarar

> Incorrect output of select query with timestamp data type on presto CLI
> ---
>
> Key: CARBONDATA-1661
> URL: https://issues.apache.org/jira/browse/CARBONDATA-1661
> Project: CarbonData
>  Issue Type: Bug
>  Components: data-query
>Affects Versions: 1.3.0
> Environment: spark 2.1
>Reporter: Vandana Yadav
>Assignee: anubhav tarar
>Priority: Minor
> Attachments: 2000_UniqData.csv
>
>
> Incorrect output of select query with timestamp data type on presto CLI
> Steps to Reproduce: 
> On Beeline:
> 1) Create Table:
> CREATE TABLE uniqdata (CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION 
> string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 
> bigint,DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2 
> decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 
> int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES 
> ("TABLE_BLOCKSIZE"= "256 MB")
> 2) Load Data:
> LOAD DATA INPATH 'HDFS_URL/BabuStore/Data/uniqdata/2000_UniqData.csv' into 
> table uniqdata OPTIONS('DELIMITER'=',' , 
> 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')
> 3) Start presto server:
> bin/launcher run
> 4) run presto CLI:
> ./presto --server localhost:9000 --catalog carbondata --schema newpresto
> On presto CLI
> 1) Execute select Query:
> select cust_name from uniqdata where dob= cast('1970-01-11 01:00:03.000' as 
> timestamp);
> 2) Expected Result: it should display the correct output, as on beeline:
> +------------------+--+
> |    cust_name     |
> +------------------+--+
> | CUST_NAME_00010  |
> +------------------+--+
> 3) Actual Result:
> cust_name 
> ---
> (0 rows)
> Query 20171031_084306_00030_k9q68, FINISHED, 1 node
> Splits: 17 total, 17 done (100.00%)
> 0:00 [0 rows, 0B] [0 rows/s, 0B/s]





[jira] [Assigned] (CARBONDATA-1660) Incorrect result displays while executing select query with where clause for decimal data type

2017-11-01 Thread anubhav tarar (JIRA)

 [ 
https://issues.apache.org/jira/browse/CARBONDATA-1660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

anubhav tarar reassigned CARBONDATA-1660:
-

Assignee: anubhav tarar

> Incorrect result displays while executing select query with where clause for 
> decimal data type
> --
>
> Key: CARBONDATA-1660
> URL: https://issues.apache.org/jira/browse/CARBONDATA-1660
> Project: CarbonData
>  Issue Type: Bug
>  Components: data-query
>Affects Versions: 1.3.0
> Environment: spark 2.1
>Reporter: Vandana Yadav
>Assignee: anubhav tarar
>Priority: Minor
> Attachments: 2000_UniqData.csv
>
>
> Incorrect result displays while executing select query with where clause for 
> decimal data type
> Steps to reproduce:
> On Beeline:
> 1) Create Table:
> CREATE TABLE uniqdata (CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION 
> string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 
> bigint,DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2 
> decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 
> int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES 
> ("TABLE_BLOCKSIZE"= "256 MB")
> 2) Load Data:
> LOAD DATA INPATH 'HDFS_URL/BabuStore/Data/uniqdata/2000_UniqData.csv' into 
> table uniqdata OPTIONS('DELIMITER'=',' , 
> 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')
> 3) Start presto server:
> bin/launcher run
> 4) run presto CLI:
> ./presto --server localhost:9000 --catalog carbondata --schema newpresto
> On presto CLI
> 1) Execute select Query:
> select cust_name from uniqdata where decimal_column1=12345678902.123400;
> Expected Result: it should display the cust_name as on beeline
> +--------------+--+
> |  cust_name   |
> +--------------+--+
> | CUST_NAME_1  |
> +--------------+--+
> Actual Result:
> It throws an error: "Error while setting filter expression to Job".
> presto:newpresto> select cust_name from uniqdata where 
> decimal_column1=12345678902.123400;
> Query 20171031_074909_00013_k9q68 failed: Error while setting filter 
> expression to Job
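
When reproducing this, it may help to note that the literal in the query has six fractional digits, while DECIMAL_COLUMN1 is declared as decimal(30,10). The sketch below uses Python's `decimal` module only to show that the two representations denote the same number, so an equality filter would be expected to match. The `stored` value is an assumption (the literal rescaled to scale 10), and the sketch says nothing about how Presto or CarbonData build the filter expression.

```python
from decimal import Decimal

# Literal exactly as typed in the query: six fractional digits.
literal = Decimal("12345678902.123400")

# Assumed stored form: the same number rescaled to ten fractional digits,
# matching the decimal(30,10) column declaration.
stored = literal.quantize(Decimal("1e-10"))

print(stored)             # 12345678902.1234000000
print(literal == stored)  # True
```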


