[jira] [Created] (CARBONDATA-941) 7. Compaction of Partition Table

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-941:
---

 Summary: 7. Compaction of Partition Table
 Key: CARBONDATA-941
 URL: https://issues.apache.org/jira/browse/CARBONDATA-941
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai


compaction same partition of segments



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-940) 6. Alter table add/drop partition

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-940:
---

 Summary: 6. Alter table add/drop partition 
 Key: CARBONDATA-940
 URL: https://issues.apache.org/jira/browse/CARBONDATA-940
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-939) 5. Partition tables join on partition column

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-939:
---

 Summary: 5. Partition tables join on partition column
 Key: CARBONDATA-939
 URL: https://issues.apache.org/jira/browse/CARBONDATA-939
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-938) 4. Detail filter query on partition column

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-938:
---

 Summary: 4. Detail filter query on partition column 
 Key: CARBONDATA-938
 URL: https://issues.apache.org/jira/browse/CARBONDATA-938
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai


use filter(equal,range, in etc.) to get partition id list, use this partition 
id list to filter BTree. 




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-937) 3. Data loading of partition table

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-937:
---

 Summary: 3. Data loading of partition table 
 Key: CARBONDATA-937
 URL: https://issues.apache.org/jira/browse/CARBONDATA-937
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai


use PartitionInfo to generate Partitioner (hash, list, range) 
use Partitioner to repartition input data file, reuse loadDataFrame flow 
use partition id to replace task no in carbondata/index file name 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-936) 2. Create Table with Partition

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-936:
---

 Summary: 2. Create Table with Partition
 Key: CARBONDATA-936
 URL: https://issues.apache.org/jira/browse/CARBONDATA-936
 Project: CarbonData
  Issue Type: Sub-task
 Environment: CarbonSparkSqlParser parse partition part to generate 
PartitionInfo, add PartitionInfo to TableModel. 

CreateTable add PartitionInfo to TableInfo,  store PartitionInfo in TableSchema 

support spark 2.1 at first.
Reporter: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-935) 1. Define PartitionInfo model

2017-04-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-935:
---

 Summary: 1. Define PartitionInfo model
 Key: CARBONDATA-935
 URL: https://issues.apache.org/jira/browse/CARBONDATA-935
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai


modify schema.thrift to define PartitionInfo, add PartitionInfo to TableSchema 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-923) InserInto read from one row not working

2017-04-13 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-923:
---

 Summary: InserInto read from one row not working
 Key: CARBONDATA-923
 URL: https://issues.apache.org/jira/browse/CARBONDATA-923
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Assignee: QiangCai


Reproduce:
create table OneRowTable(col1 string, col2 string, col3 int, col4 double) 
stored by 'carbondata'
insert into OneRowTable select '0.1', 'a.b', 1, 1.2

Exception:
org.apache.spark.sql.AnalysisException: cannot resolve '`0.1`' given input 
columns: [0.1, a.b, 1, 1.2];;
'Project ['0.1, 'a.b]
+- Project [0.1 AS 0.1#11, a.b AS a.b#12, 1 AS 1#13, 1.2 AS 1.2#14]
   +- OneRowRelation$



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-887) lazy rdd iterator for InsertInto

2017-04-07 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-887:
---

 Summary: lazy rdd iterator for InsertInto
 Key: CARBONDATA-887
 URL: https://issues.apache.org/jira/browse/CARBONDATA-887
 Project: CarbonData
  Issue Type: Improvement
Reporter: QiangCai
Assignee: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-886) remove all redundant local variable

2017-04-07 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-886:
---

 Summary: remove all redundant local variable
 Key: CARBONDATA-886
 URL: https://issues.apache.org/jira/browse/CARBONDATA-886
 Project: CarbonData
  Issue Type: Improvement
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-842) when SORT_COLUMN is empty, no need to sort data.

2017-03-31 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-842:
---

 Summary: when SORT_COLUMN is empty, no need to sort data.
 Key: CARBONDATA-842
 URL: https://issues.apache.org/jira/browse/CARBONDATA-842
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-841) improve the compress encoding for numeric type column to give good performance

2017-03-31 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-841:
---

 Summary: improve the compress encoding for numeric type column to 
give good performance
 Key: CARBONDATA-841
 URL: https://issues.apache.org/jira/browse/CARBONDATA-841
 Project: CarbonData
  Issue Type: Sub-task
Reporter: QiangCai


Now no-dictionary column use LV(length-value) encoding. It isn't the best 
choice for numeric type column.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-782) Support SORT_COLUMNS

2017-03-15 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-782:
---

 Summary: Support SORT_COLUMNS
 Key: CARBONDATA-782
 URL: https://issues.apache.org/jira/browse/CARBONDATA-782
 Project: CarbonData
  Issue Type: New Feature
Reporter: QiangCai
Assignee: QiangCai


The tasks of SORT_COLUMNS:

1.Support create table with sort_columns property.
e.g. tblproperties('sort_columns' = 'col7,col3')
The table with SORT_COLUMNS property will be sorted by SORT_COLUMNS. The order 
of columns is decided by SORT_COLUMNS.

2.Change the encoding rule of SORT_COLUMNS
Firstly, the rule of column encoding will keep consistent with previous.
Secondly, if a column of SORT_COLUMNS is a measure before, now this column will 
be created as a dimension. And this dimension is a no-dicitonary column(Better 
to use other direct-dictionary).
Thirdly, the dimension of SORT_COLUMNS have RLE and ROWID page, other dimension 
have only RLE(not sorted).

3.The start/end key should be composed of SORT_COLUMNS.
Using SORT_COLUMNS to build start/end key during data loading and select query.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-765) dataframe wirter need to first drop table unless loading said table not found

2017-03-12 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-765:
---

 Summary: dataframe wirter need to first drop table unless loading 
said table not found
 Key: CARBONDATA-765
 URL: https://issues.apache.org/jira/browse/CARBONDATA-765
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Assignee: QiangCai


dataframe wirter need to first drop table unless loading said table not found



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-764) Improving Non-dictionary storage & performance

2017-03-12 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-764:
---

 Summary: Improving Non-dictionary storage & performance
 Key: CARBONDATA-764
 URL: https://issues.apache.org/jira/browse/CARBONDATA-764
 Project: CarbonData
  Issue Type: Improvement
Reporter: QiangCai


mail list:
http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/Improving-Non-dictionary-storage-amp-performance-td8146.html



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-762) modify all schemaName->databaseName, cubeName->tableName

2017-03-12 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-762:
---

 Summary: modify all schemaName->databaseName, cubeName->tableName
 Key: CARBONDATA-762
 URL: https://issues.apache.org/jira/browse/CARBONDATA-762
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor


modify all schemaName->databaseName, cubeName->tableName



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-763) Add L5 loading support, global sorting like HBase

2017-03-12 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-763:
---

 Summary: Add L5 loading support, global sorting like HBase
 Key: CARBONDATA-763
 URL: https://issues.apache.org/jira/browse/CARBONDATA-763
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai


Add L5 loading support, global sorting like HBase



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-761) Dictionary server should not be shutdown after loading

2017-03-12 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-761:
---

 Summary: Dictionary server should not be shutdown after loading
 Key: CARBONDATA-761
 URL: https://issues.apache.org/jira/browse/CARBONDATA-761
 Project: CarbonData
  Issue Type: Bug
  Components: data-load
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor


Code:
CarbonTableSchema/LoadTable



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-760) Should to avoid ERROR log for successful select query

2017-03-12 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-760:
---

 Summary: Should to avoid ERROR log for successful select query
 Key: CARBONDATA-760
 URL: https://issues.apache.org/jira/browse/CARBONDATA-760
 Project: CarbonData
  Issue Type: Bug
  Components: data-query
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor


Some table without delete or update operator maybe not have delta files. Select 
query shouldn't record error log.

Code:
SegmentUpdateStatusManager.getDeltaFiles

Log detail:
 ERROR 06-03 19:21:37,531 - pool-475-thread-1 Invalid tuple id 
arbonstore/default/comparetest_carbon/Fact/0/0/0-0-0-1488799238178/0
ERROR 06-03 19:21:37,948 - pool-475-thread-1 Invalid tuple id 
arbonstore/default/comparetest_carbon/Fact/0/0/0-0-0-1488799238178/1
ERROR 06-03 19:21:38,517 - pool-475-thread-1 Invalid tuple id 
arbonstore/default/comparetest_carbon/Fact/0/0/0-0-0-1488799238178/2
ERROR 06-03 19:21:38,909 - pool-475-thread-1 Invalid tuple id 
arbonstore/default/comparetest_carbon/Fact/0/0/0-0-0-1488799238178/3
ERROR 06-03 19:21:39,292 - pool-475-thread-1 Invalid tuple id 
arbonstore/default/comparetest_carbon/Fact/0/0/0-0-0-1488799238178/4



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-701) There is a memory leak issue in no kettle loading flow

2017-02-10 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-701:
---

 Summary: There is a memory leak issue in no kettle loading flow
 Key: CARBONDATA-701
 URL: https://issues.apache.org/jira/browse/CARBONDATA-701
 Project: CarbonData
  Issue Type: Improvement
  Components: data-load
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.1-incubating


When loading more data, throw OOM exception.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (CARBONDATA-659) Should add WhitespaceAround and ParenPad to javastyle

2017-01-18 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-659:
---

 Summary: Should add WhitespaceAround and ParenPad to javastyle
 Key: CARBONDATA-659
 URL: https://issues.apache.org/jira/browse/CARBONDATA-659
 Project: CarbonData
  Issue Type: Improvement
Reporter: QiangCai
Assignee: QiangCai
Priority: Trivial






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-651) The license header of java file should be same with scala's

2017-01-17 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-651:
---

 Summary: The license header of java file should be same with 
scala's
 Key: CARBONDATA-651
 URL: https://issues.apache.org/jira/browse/CARBONDATA-651
 Project: CarbonData
  Issue Type: Improvement
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-627) Fix Union unit test case for spark2

2017-01-11 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-627:
---

 Summary: Fix Union unit test case for spark2
 Key: CARBONDATA-627
 URL: https://issues.apache.org/jira/browse/CARBONDATA-627
 Project: CarbonData
  Issue Type: Bug
  Components: data-query
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 1.0.0-incubating


UnionTestCase failed in spark2, We should fix it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-622) Should use the same fileheader reader for dict generation and data loading

2017-01-10 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-622:
---

 Summary: Should use the same fileheader reader for dict generation 
and data loading
 Key: CARBONDATA-622
 URL: https://issues.apache.org/jira/browse/CARBONDATA-622
 Project: CarbonData
  Issue Type: Bug
  Components: data-load
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 1.0.0-incubating


We can get file header from DDL command and CSV file. 

1. If the file header comes from DDL command, separate this file header by 
comma ","

2. if the file header comes from CSV file, sparate this file header by specify 
delimiter in DDL command.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-614) Should fix dictionary locked issue

2017-01-09 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-614:
---

 Summary: Should fix dictionary locked issue
 Key: CARBONDATA-614
 URL: https://issues.apache.org/jira/browse/CARBONDATA-614
 Project: CarbonData
  Issue Type: Bug
  Components: data-load
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.0-incubating


When carbon.properties.filepath is configured exactly,  still show the 
following exception.

Error: org.apache.spark.SparkException: Job aborted due to stage failure: 
Task 0 in stage 2.0 failed 4 times, most recent failure: Lost task 0.3 in 
stage 2.0 (TID 8, hadoop-slave-2): java.lang.RuntimeException: Dictionary 
file name is locked for updation. Please try after some time 
at scala.sys.package$.error(package.scala:27) 
at 
org.apache.carbondata.spark.rdd.CarbonGlobalDictionaryGenerateRDD$$anon$1.(CarbonGlobalDictionaryRDD.scala:364)
 
at 
org.apache.carbondata.spark.rdd.CarbonGlobalDictionaryGenerateRDD.compute(CarbonGlobalDictionaryRDD.scala:302)
 
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306) 
at org.apache.spark.rdd.RDD.iterator(RDD.scala:270) 
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:66) 
at org.apache.spark.scheduler.Task.run(Task.scala:89) 
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214) 
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) 
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
at java.lang.Thread.run(Thread.java:745) 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-600) Should reuse unit test case for integration module

2017-01-05 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-600:
---

 Summary: Should reuse unit test case for integration module
 Key: CARBONDATA-600
 URL: https://issues.apache.org/jira/browse/CARBONDATA-600
 Project: CarbonData
  Issue Type: Bug
  Components: spark-integration
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 1.0.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-601) Should reuse unit test case for integration module

2017-01-05 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-601:
---

 Summary: Should reuse unit test case for integration module
 Key: CARBONDATA-601
 URL: https://issues.apache.org/jira/browse/CARBONDATA-601
 Project: CarbonData
  Issue Type: Test
  Components: spark-integration
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 1.0.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-540) Support inserInto without kettle for spark2

2016-12-18 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-540:
---

 Summary: Support inserInto without kettle for spark2
 Key: CARBONDATA-540
 URL: https://issues.apache.org/jira/browse/CARBONDATA-540
 Project: CarbonData
  Issue Type: Improvement
  Components: data-load
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.0-incubating


Support inserInto without kettle for spark2



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-536) For spark2, GlobalDictionaryUtil.updateTableMetadataFunc should been initialized

2016-12-15 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-536:
---

 Summary: For spark2, GlobalDictionaryUtil.updateTableMetadataFunc 
should been initialized
 Key: CARBONDATA-536
 URL: https://issues.apache.org/jira/browse/CARBONDATA-536
 Project: CarbonData
  Issue Type: Bug
  Components: data-load
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.0-incubating


For spark2, GlobalDictionaryUtil.updateTableMetadataFunc should been initialized



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-535) carbondata should support datatype: Date and Char

2016-12-15 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-535:
---

 Summary: carbondata should support datatype: Date and Char
 Key: CARBONDATA-535
 URL: https://issues.apache.org/jira/browse/CARBONDATA-535
 Project: CarbonData
  Issue Type: Improvement
  Components: file-format
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.0-incubating


carbondata should support datatype: Date and Char



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-516) [SPARK2]fix union issue in CarbonLateDecoderRule

2016-12-08 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-516:
---

 Summary: [SPARK2]fix union issue in CarbonLateDecoderRule
 Key: CARBONDATA-516
 URL: https://issues.apache.org/jira/browse/CARBONDATA-516
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Assignee: QiangCai


In spark2, Union class is no longer the sub-class of BinaryNode. We need fix 
union issue in CarbonLateDecoderRule for spark2.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-497) [Spark2] fix datatype issue of CarbonLateDecoderRule

2016-12-06 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-497:
---

 Summary: [Spark2] fix datatype issue of CarbonLateDecoderRule
 Key: CARBONDATA-497
 URL: https://issues.apache.org/jira/browse/CARBONDATA-497
 Project: CarbonData
  Issue Type: Bug
  Components: data-query
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.0-incubating


In spark2, LogicalPlan resolve method need to check input data type. If the 
data type is wrong, the logical plan will be unresolved. 
CarbonLateDecoderRule should correct the datatype of dictionary dimension to 
resolve the logical plan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-493) Insertinto sql can not select from a empty table

2016-12-04 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-493:
---

 Summary: Insertinto sql can not select from a empty table
 Key: CARBONDATA-493
 URL: https://issues.apache.org/jira/browse/CARBONDATA-493
 Project: CarbonData
  Issue Type: Bug
Affects Versions: 1.0.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 1.0.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-492) When profile spark-2.0 is avtive , CarbonExample have error in intellij idea

2016-12-03 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-492:
---

 Summary:  When profile spark-2.0 is avtive , CarbonExample have 
error in intellij idea
 Key: CARBONDATA-492
 URL: https://issues.apache.org/jira/browse/CARBONDATA-492
 Project: CarbonData
  Issue Type: Bug
Affects Versions: 0.3.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Critical
 Fix For: 0.3.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-488) add InsertInto feature for spark2

2016-12-02 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-488:
---

 Summary: add InsertInto feature for spark2
 Key: CARBONDATA-488
 URL: https://issues.apache.org/jira/browse/CARBONDATA-488
 Project: CarbonData
  Issue Type: New Feature
  Components: data-load
Affects Versions: 0.3.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 0.3.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-486) Rreading dataframe concurrently will lead to wrong data

2016-12-02 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-486:
---

 Summary: Rreading dataframe concurrently will lead to wrong data
 Key: CARBONDATA-486
 URL: https://issues.apache.org/jira/browse/CARBONDATA-486
 Project: CarbonData
  Issue Type: Bug
  Components: data-load
Affects Versions: 0.3.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 0.3.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-481) [SPARK2]fix late decoder and support whole stage code gen

2016-12-01 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-481:
---

 Summary: [SPARK2]fix late decoder and support whole stage code gen
 Key: CARBONDATA-481
 URL: https://issues.apache.org/jira/browse/CARBONDATA-481
 Project: CarbonData
  Issue Type: Bug
  Components: data-query
Affects Versions: 0.2.0-incubating
Reporter: QiangCai
 Fix For: 0.3.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-478) Spark2 module should have different SparkRowReadSupportImpl with spark1

2016-12-01 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-478:
---

 Summary: Spark2 module should have different 
SparkRowReadSupportImpl with spark1
 Key: CARBONDATA-478
 URL: https://issues.apache.org/jira/browse/CARBONDATA-478
 Project: CarbonData
  Issue Type: Bug
  Components: data-query
Affects Versions: 0.2.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 0.3.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-443) Implement nosort dataloading

2016-11-23 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-443:
---

 Summary: Implement nosort dataloading
 Key: CARBONDATA-443
 URL: https://issues.apache.org/jira/browse/CARBONDATA-443
 Project: CarbonData
  Issue Type: Improvement
  Components: data-load
Affects Versions: 0.3.0-incubating
Reporter: QiangCai
Assignee: QiangCai
 Fix For: 0.3.0-incubating


Improve dataloading performance




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-368) Should improve performance of DataFrame loading

2016-11-03 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-368:
---

 Summary: Should improve performance of DataFrame loading 
 Key: CARBONDATA-368
 URL: https://issues.apache.org/jira/browse/CARBONDATA-368
 Project: CarbonData
  Issue Type: Improvement
  Components: data-load
Affects Versions: 0.3.0-incubating
Reporter: QiangCai
Assignee: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-287) Save the sorted temp files to multi local dirs to improve dataloading perfomance

2016-10-08 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-287:
---

 Summary: Save the sorted temp files to multi local dirs to improve 
dataloading perfomance 
 Key: CARBONDATA-287
 URL: https://issues.apache.org/jira/browse/CARBONDATA-287
 Project: CarbonData
  Issue Type: Improvement
  Components: data-load
Affects Versions: 0.2.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 0.2.0-incubating


Now for each dataloading, we use only a different local dir to save  the sorted 
temp files. I think it is neccessary to use multi local dirs for each 
dataloading to improve dataloading performance.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-279) [DataLoading]Save a DataFrame to CarbonData file without writing CSV file

2016-09-27 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-279:
---

 Summary: [DataLoading]Save a DataFrame to CarbonData file without 
writing CSV file
 Key: CARBONDATA-279
 URL: https://issues.apache.org/jira/browse/CARBONDATA-279
 Project: CarbonData
  Issue Type: Improvement
Affects Versions: 0.1.0-incubating
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 0.2.0-incubating


Directly save a DataFrame to CarbonData file without writing CSV file



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-268) CarbonOptimizer has performance problem

2016-09-22 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-268:
---

 Summary: CarbonOptimizer has performance problem
 Key: CARBONDATA-268
 URL: https://issues.apache.org/jira/browse/CARBONDATA-268
 Project: CarbonData
  Issue Type: Bug
Affects Versions: 0.1.0-incubating
Reporter: QiangCai






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-238) CarbonOptimizer shouldn't add CarbonDictionaryCatalystDecoder for HiveTable

2016-09-14 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-238:
---

 Summary: CarbonOptimizer shouldn't add 
CarbonDictionaryCatalystDecoder for HiveTable
 Key: CARBONDATA-238
 URL: https://issues.apache.org/jira/browse/CARBONDATA-238
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor
 Fix For: 0.2.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-218) Remove Dependency: spark-csv and Unify CSV Reader for dataloading

2016-09-07 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-218:
---

 Summary: Remove Dependency: spark-csv and Unify CSV Reader for 
dataloading
 Key: CARBONDATA-218
 URL: https://issues.apache.org/jira/browse/CARBONDATA-218
 Project: CarbonData
  Issue Type: Improvement
Reporter: QiangCai
Priority: Minor
 Fix For: 0.2.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-213) Remove thrift complier dependency

2016-09-05 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-213:
---

 Summary: Remove thrift complier dependency
 Key: CARBONDATA-213
 URL: https://issues.apache.org/jira/browse/CARBONDATA-213
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Priority: Minor
 Fix For: 0.2.0-incubating






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CARBONDATA-180) give proper error message when dataloading with wrong delimiter value

2016-08-25 Thread QiangCai (JIRA)
QiangCai created CARBONDATA-180:
---

 Summary: give proper error message when dataloading with wrong 
delimiter value
 Key: CARBONDATA-180
 URL: https://issues.apache.org/jira/browse/CARBONDATA-180
 Project: CarbonData
  Issue Type: Bug
Reporter: QiangCai
Assignee: QiangCai
Priority: Minor


Reproduce:
CREATE TABLE IF NOT EXISTS t3
(ID Int, date Timestamp, country String, name String, phonetype String, 
serialname String, salary Int)
STORED BY 'carbondata'

LOAD DATA LOCAL INPATH 'data.csv' into table t3 options('DELIMITER'='#', 
'FILEHEADER'='ID,date,country,name,phonetype,serialname,salary')

File name:data.csv
1,2015/7/23,china,aaa1,phone197,ASD69643,15000
2,2015/7/24,china,aaa2,phone756,ASD42892,15001
3,2015/7/25,china,aaa3,phone1904,ASD37014,15002

Exception:
ERROR 26-08 11:44:12,881 - generate global dictionary failed
java.lang.ArrayIndexOutOfBoundsException: 2
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$$anonfun$pruneDimensions$1$$anonfun$apply$1$$anonfun$apply$mcV$sp$1.apply(GlobalDictionaryUtil.scala:79)
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$$anonfun$pruneDimensions$1$$anonfun$apply$1$$anonfun$apply$mcV$sp$1.apply(GlobalDictionaryUtil.scala:76)
 at 
scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
 at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:108)
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$$anonfun$pruneDimensions$1$$anonfun$apply$1.apply$mcV$sp(GlobalDictionaryUtil.scala:76)
 at scala.util.control.Breaks.breakable(Breaks.scala:37)
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$$anonfun$pruneDimensions$1.apply(GlobalDictionaryUtil.scala:75)
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$$anonfun$pruneDimensions$1.apply(GlobalDictionaryUtil.scala:74)
 at 
scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
 at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:108)
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$.pruneDimensions(GlobalDictionaryUtil.scala:74)
 at 
org.apache.carbondata.spark.util.GlobalDictionaryUtil$.generateGlobalDictionary(GlobalDictionaryUtil.scala:697)
 at 
org.apache.spark.sql.execution.command.LoadTable.run(carbonTableSchema.scala:1159)
 at 
org.apache.spark.sql.execution.ExecutedCommand.sideEffectResult$lzycompute(commands.scala:57)
 at 
org.apache.spark.sql.execution.ExecutedCommand.sideEffectResult(commands.scala:57)
 at 
org.apache.spark.sql.execution.ExecutedCommand.doExecute(commands.scala:69)
 at 
org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$5.apply(SparkPlan.scala:140)
 at 
org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$5.apply(SparkPlan.scala:138)
 at 
org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:147)
 at 
org.apache.spark.sql.execution.SparkPlan.execute(SparkPlan.scala:138)
 at 
org.apache.spark.sql.SQLContext$QueryExecution.toRdd$lzycompute(SQLContext.scala:933)
 at 
org.apache.spark.sql.SQLContext$QueryExecution.toRdd(SQLContext.scala:933)
 at org.apache.spark.sql.DataFrame.(DataFrame.scala:144)
 at org.apache.spark.sql.DataFrame.(DataFrame.scala:129)
 at 
org.apache.carbondata.spark.rdd.CarbonDataFrameRDD.(CarbonDataFrameRDD.scala:23)
 at org.apache.spark.sql.CarbonContext.sql(CarbonContext.scala:130)
 at 
org.apache.carbondata.examples.CarbonExample$.main(CarbonExample.scala:42)
 at 
org.apache.carbondata.examples.CarbonExample.main(CarbonExample.scala)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (CARBONDATA-1) CarbonData supports docker for simplifying deployment

2016-06-23 Thread QiangCai (JIRA)

 [ 
https://issues.apache.org/jira/browse/CARBONDATA-1?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

QiangCai reassigned CARBONDATA-1:
-

Assignee: QiangCai

> CarbonData supports docker for simplifying deployment
> -
>
> Key: CARBONDATA-1
> URL: https://issues.apache.org/jira/browse/CARBONDATA-1
> Project: CarbonData
>  Issue Type: New Feature
> Environment: the latest docker stable version
>Reporter: ChenLiang
>Assignee: QiangCai
>
> CarbonData supports docker for simplifying deployment



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)