答复: insert carbondata table failed

2017-09-18 Thread 刘feng
Thank you , I have tried to resolve this issue by making changes in the spark configuration and use two fields as DICTIONARY_INCLUDE. test data(30G) load 8 times, each time about 1.5 minutes to complete Is currently testing another larger data, hope to be successful, thank you very much for t

Re: insert carbondata table failed

2017-09-18 Thread manishgupta88
Hi Feng, You can also refer the below links wherein the spark users have tried to resolve this issue by making changes in the configuration. This might help you. https://stackoverflow.com/questions/28901123/why-do-spark-jobs-fail-with-org-apache-spark-shuffle-metadatafetchfailedexceptio https://

答复: insert carbondata table failed

2017-09-18 Thread 刘feng
Sorry, A total of 4 nodes . of which 3 as datanode and snn on one of the datanodes. Version: Carbondata 1.1.0 Spark 1.6.0 Hadoop :2.7.2 Thank you for your help , I'm trying again = Liu feng -邮件原件- 发件人: ravipesala [mailto:ravi.pes...@gmail.com] 发送时间: 2017年9月19日

Re: insert carbondata table failed

2017-09-18 Thread ravipesala
Hello, I don't get much from the logs but the error seems related to memory issue from Spark. From your old emails I get that you are using 3 node cluster. Is that all 3 node has nodemanager and datanodes? So better give only less number of executors and provide more memory to it like below. While

Re: [VOTE] Apache CarbonData 1.2.0(RC2) release

2017-09-18 Thread Liang Chen
Hi 1.Source code can be compiled successfully with script "mvn clean -DskipTests -Pspark-2.1 -Pbuild-with-format package" 2.Can query carbondata file properly in Spark-shell. 3.License file looks good. 4.Signature file looks good 5.Hash checksum files look good 6.NOTICE file looks good My vote :

insert carbondata table failed

2017-09-18 Thread 刘feng
Hi,community: It inserts records from a source table into a target CarbonData table(kc22_ca). The source table can be a Hive table(‘kc22_p1’). kc22_p1 records : 102200946 51.5 G Stage: spark-shell --master yarn-client --driver-memory 20G --executor-cores 1 --num-executors 12 --executor-mem

Re: carbondata 加载数据问题咨询

2017-09-18 Thread Liang Chen
Hi I have the same comments as cenyuhai, please provide more detail info, which version you used? Please refer to https://github.com/apache/carbondata/blob/master/docs/useful-tips-on-carbondata.md, for high cardinality columns, you can use script like TBLPROPERTIES ('DICTIONARY_EXCLUDE'='MSISD

Re: [VOTE] Apache CarbonData 1.2.0(RC2) release

2017-09-18 Thread Liang Chen
Hi I think you may input the wrong description "apache-carbondata-1.2.0-rc1"? 2. The tag to be voted upon : apache-carbondata-1.2.0-rc1(commit: ede03f5c963b13cc640feba799a22466246951c6) *https://github.com/apache/carbondata/relea

[VOTE] Apache CarbonData 1.2.0(RC2) release

2017-09-18 Thread Ravindra Pesala
Hi I submit the Apache CarbonData 1.2.0 (RC2) to your vote. 1.Release Notes: *https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12320220&version=12340260 * Some key improvement in this