Hive on Tez local debug

2018-01-18 Thread Jia, Ke A
Hi all, How can I debug the execution code in Hive on Tez? The "hive --debug" command only debugs the explain-level code and does not debug the execution code. Does Hive on Tez (HoT) have a usage similar to Hive on Spark (HoS), where the command "set spark.master=local;" is used? Thanks for your help. Regards, Jia Ke

RE: Hive +Tez+LLAP does not have obvious performance improvement than HIVE + Tez

2017-11-28 Thread Jia, Ke A
Hi Gopal, > I have upgraded the Hive version to 3.0, and the somaxconn value of the shuffle > port (15551) is now 16384, not 50. Thank you very much. > But I encounter the following problem when running LLAP, and it is the same as > https://issues.apache.org/jira/browse/HIVE-10693 . Is it a bug of > l

RE: Hive +Tez+LLAP does not have obvious performance improvement than HIVE + Tez

2017-11-26 Thread Jia, Ke A
Hi Gopal, After applying the FD, somaxconn, and DNS UDP packet loss settings you provided, the results did not change. Please help me check whether I set the configuration incorrectly. Thanks very much. >For FDs, we set the related value to 65536 in /etc/security/limits.conf * - nofile 6553

RE: Hive +Tez+LLAP does not have obvious performance improvement than HIVE + Tez

2017-11-25 Thread Jia, Ke A
Hi Gopal, >I still do not understand the "only" mode. In "only" mode, where do the query >fragments run: in the LLAP daemon or in a Tez container? >I changed the execution mode from "all" to "only" and disabled the >HybridGraceHashJoin. The execution time of q1 went from 76s to 456s. Now, in "all" mode, we set t

RE: Hive +Tez+LLAP does not have obvious performance improvement than HIVE + Tez

2017-11-22 Thread Jia, Ke A
Hi Gopal, Thanks for your reply. >For the Hadoop version, we will upgrade it to 2.8 later. >In our test, we found that the shuffle stage of LLAP is very slow. Do we need to >configure some shuffle-related values? We get the following log >from the LLAP daemon in the shuffle stage: 2017-11-23

RE: Hive +Tez+LLAP does not have obvious performance improvement than HIVE + Tez

2017-11-21 Thread Jia, Ke A
Hi Gopal, Thanks for your reply. > A first step would be to check if LLAP cache is actually being used (the LLAP > IO in the explain), vectorization is being used (llap, vectorized for tasks), > that the column stats show as COMPLETE (instead of NONE). 1. For the LLAP cache, we have enabled the LL

Hive +Tez+LLAP does not have obvious performance improvement than HIVE + Tez

2017-11-21 Thread Jia, Ke A
Hi all, We are currently running a benchmark of Hive+Tez+LLAP and Apache Tez on TPC-DS with 3 TB of ORC data. However, the results of Hive+Tez+LLAP are almost the same as those of Hive+Tez, and some queries perform worse than on Hive+Tez. The following is our cluster and LLAP configuration. Cluster: 1 master + 7 slave

Request write access to the Hive wiki

2015-08-24 Thread Jia, Ke A
Hi, I'd like to have write access to the Hive wiki. My Confluence username is ke.a@intel.com with Full Name "Jia Ke". Please help me deal with it. Thank you! Regards, Jia Ke

Request write access to the Hive wiki

2015-08-24 Thread Jia, Ke A
Hi, I'd like to have write access to the Hive wiki. My Confluence username is jia.a...@intel.com with Full Name "Jia Ke". Please help me deal with it. Thank you! Regards, Jia Ke