Re: Getting dot files for DAGs

2015-10-01 Thread Hitesh Shah
I don’t believe the binary needs to change at all unless you need 
enhancements from recent commits. It should just be a matter of setting up 
the UI and configuring Tez to use YARN Timeline.

The instructions that you can follow:
  http://tez.apache.org/tez-ui.html 
  http://tez.apache.org/tez_yarn_timeline.html

thanks
— Hitesh

On Oct 1, 2015, at 11:07 AM, James Pirz  wrote:

> Thanks for the suggestion. I had never used the Tez UI before, and only 
> learned about it yesterday.
> I am trying to find out how I can enable/use it. Apparently it needs some 
> changes in the binary that I am using (I built the binary for Tez 0.7 
> almost 2 months ago).
> 
> On Wed, Sep 30, 2015 at 10:27 PM, Jörn Franke  wrote:
> Why not use the Tez UI?
> 
> On Thu, Oct 1, 2015 at 2:29 AM, James Pirz  wrote:
> I am using Tez 0.7.0 on Hadoop 2.6 to run Hive queries.
> I am interested in checking the DAGs for my queries visually, and I realized 
> that I can do that with Graphviz once I have the "dot" files of my DAGs. My 
> issue is that I cannot find those files; they are not in the log directory 
> of YARN or Hadoop, or under /tmp.
> 
> Any hint as to where I can find those files would be great. Do I need to add 
> any settings to my tez-site.xml in order to enable generating them?
> 
> Thanks. 
> 



Re: Getting dot files for DAGs

2015-09-30 Thread Hitesh Shah
The .dot file is generated into the Tez Application Master’s container log dir. 
First, you need to figure out the YARN application in which the query/Tez DAG 
ran. Once you have the applicationId, you can use one of these 2 approaches: 

1) Go to the YARN ResourceManager UI, find the application and click through to 
the Application Master logs. The .dot file for the dag should be visible there.
2) Using the applicationId (if the application has completed), get the YARN 
logs using “bin/yarn logs -applicationId <applicationId>”. Once you have the 
logs, you will be able to find the contents of the .dot file within them. This 
approach only works if you have YARN log aggregation enabled.
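
As a rough illustration of approach 2, here is a minimal Python sketch (not 
part of Tez or YARN) that pulls the DOT graph back out of the text printed by 
the yarn logs command. It assumes the .dot contents appear verbatim in that 
output as a "digraph ... { ... }" block. The helper names extract_digraphs and 
application_id_from_tez_id are hypothetical, and the ID layout 
(vertex_<clusterTimestamp>_<appSequence>_...) is an assumption based on IDs 
such as vertex_1435307579867_0041_1_00 that show up in Tez diagnostics.

```python
import re


def application_id_from_tez_id(tez_id):
    """Derive a YARN application ID from a Tez DAG/vertex/task ID.

    Assumes the ID starts with <kind>_<clusterTimestamp>_<appSequence>_.
    """
    m = re.match(r"(?:dag|vertex|task|attempt)_(\d+)_(\d+)_", tez_id)
    if m is None:
        raise ValueError("unrecognized Tez ID: %r" % tez_id)
    return "application_{}_{}".format(*m.groups())


def extract_digraphs(log_text):
    """Return every balanced 'digraph ... { ... }' block found in log_text."""
    graphs = []
    # The opening brace is expected on the same line as the 'digraph' keyword.
    for match in re.finditer(r"\bdigraph\b[^{\n]*\{", log_text):
        depth = 0
        in_string = False
        i = match.end() - 1  # index of the opening brace
        while i < len(log_text):
            ch = log_text[i]
            if in_string:
                if ch == "\\":
                    i += 1  # skip the escaped character
                elif ch == '"':
                    in_string = False
            elif ch == '"':
                in_string = True  # braces inside quoted labels are ignored
            elif ch == "{":
                depth += 1
            elif ch == "}":
                depth -= 1
                if depth == 0:
                    graphs.append(log_text[match.start():i + 1])
                    break
            i += 1
    return graphs
```

Saving the output of the yarn logs command to a file, passing its text to 
extract_digraphs, and writing each result to its own .dot file should give 
something Graphviz can render, for example with "dot -Tpng dag.dot -o dag.png".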

thanks
— Hitesh


On Sep 30, 2015, at 5:29 PM, James Pirz  wrote:

> I am using Tez 0.7.0 on Hadoop 2.6 to run Hive queries.
> I am interested in checking the DAGs for my queries visually, and I realized 
> that I can do that with Graphviz once I have the "dot" files of my DAGs. My 
> issue is that I cannot find those files; they are not in the log directory 
> of YARN or Hadoop, or under /tmp.
> 
> Any hint as to where I can find those files would be great. Do I need to add 
> any settings to my tez-site.xml in order to enable generating them?
> 
> Thanks. 



Re: Hive Tez support matrix

2015-07-07 Thread Hitesh Shah
From a Tez perspective, there was a major compatibility change between Tez 0.4 
and Tez 0.5. However, Tez-0.7.x and Tez-0.6.x are compatible with Tez-0.5.x. 

I believe Hive 0.13 is compatible only with Tez 0.4. 
Hive 0.14 onwards (including the Hive-1.x releases) should work with 
anything in the range of Tez versions: 0.5.2 <= x <= 0.7.x.
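
The rule of thumb above can be written down as a small check. This is only a 
sketch of what this message states, not an official support matrix. The names 
parse_version and is_compatible are hypothetical, and versions are assumed to 
be plain dotted numbers such as 0.5.2.

```python
def parse_version(text):
    """Turn '0.5.2' into (0, 5, 2) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def is_compatible(hive_version, tez_version):
    """Apply the ranges given in this thread.

    Returns True/False when the thread covers the Hive version,
    and None when it says nothing about it.
    """
    hive = parse_version(hive_version)
    tez = parse_version(tez_version)
    if hive[:2] == (0, 13):
        # Hive 0.13 is believed to work only with Tez 0.4.
        return tez[:2] == (0, 4)
    if hive >= (0, 14):
        # Hive 0.14 and later (including 1.x): 0.5.2 <= Tez <= 0.7.x.
        return (0, 5, 2) <= tez and tez[:2] <= (0, 7)
    return None  # older Hive releases are not discussed here


print(is_compatible("1.2", "0.5.3"))
print(is_compatible("0.13", "0.5.0"))
```

A False result only means the pair falls outside the ranges quoted here; it 
says nothing about releases made after this message was written.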

thanks
— Hitesh

On Jul 7, 2015, at 10:12 AM, Jim Green  wrote:

> Hi Team,
> 
> Is there any Hive <-> Tez support matrix?
> For example, Hive 1.2 should be on Tez which version?
> Tez 0.5.3 only supports which versions of Hive?
> etc…
> 
> My understanding is that it does not matter which version of Hive is 
> combined with which version of Tez.
> 
> -- 
> Thanks,
> www.openkb.info 
> (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)



Re: fails to alter table concatenate

2015-06-30 Thread Hitesh Shah
Moving to user@hive. BCC’ed user@tez.

— Hitesh 

On Jun 30, 2015, at 1:44 AM, patcharee  wrote:

> Hi,
> 
> I am using Hive 0.14 + Tez 0.5. Alter table concatenate fails 
> occasionally (see the exception below). Strangely, the failures are 
> intermittent and not predictable. However, it works on MR.
> Is there any suggestion/clue?
> 
> hive> alter table 4dim partition(zone=2,z=15,year=2005,month=4) CONCATENATE;
> 
>  
> VERTICES     STATUS  TOTAL  COMPLETED  RUNNING  PENDING  FAILED  KILLED
> File Merge   FAILED     -1          0        0       -1       0       0
> VERTICES: 00/01  [>>--] 0%    ELAPSED TIME: 1435651968.00 s
> 
> Status: Failed
> Vertex failed, vertexName=File Merge, 
> vertexId=vertex_1435307579867_0041_1_00, diagnostics=[Vertex 
> vertex_1435307579867_0041_1_00 [File Merge] killed/failed due 
> to:ROOT_INPUT_INIT_FAILURE, Vertex Input: 
> [hdfs://service-10-0.local:8020/apps/hive/warehouse/wrf_tables/4dim/zone=2/z=15/year=2005/month=4]
>  initializer failed, vertex=vertex_1435307579867_0041_1_00 [File Merge], 
> java.lang.NullPointerException
>     at org.apache.hadoop.hive.ql.io.HiveInputFormat.init(HiveInputFormat.java:265)
>     at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:452)
>     at org.apache.tez.mapreduce.hadoop.MRInputHelpers.generateOldSplits(MRInputHelpers.java:441)
>     at org.apache.tez.mapreduce.hadoop.MRInputHelpers.generateInputSplitsToMem(MRInputHelpers.java:295)
>     at org.apache.tez.mapreduce.common.MRInputAMSplitGenerator.initialize(MRInputAMSplitGenerator.java:124)
>     at org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:245)
>     at org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:239)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
>     at org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable.call(RootInputInitializerManager.java:239)
>     at org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable.call(RootInputInitializerManager.java:226)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:745)
> ]
> DAG failed due to vertex failure. failedVertices:1 killedVertices:0
> FAILED: Execution Error, return code 2 from 
> org.apache.hadoop.hive.ql.exec.DDLTask
> 
> BR,
> Patcharee



Re: error on hive insert query

2015-06-16 Thread Hitesh Shah
Unless you can pinpoint the problem to something Tez specific, hive-specific 
questions might be better off being asked on user@hive initially as there is a 
larger group there that understands Hive as compared to the Tez community. 

FWIW, TezTask error 1 means “something in the Hive layer using Tez or somewhere 
within Tez itself” had a failure. Without additional logs/stack trace, this is 
usually impossible to debug/pinpoint where the problem is.  

thanks
— Hitesh

On Jun 16, 2015, at 10:19 AM, Sateesh Karuturi  wrote:

> I am using Hive 1.0.0 and Tez 0.5.2. When I set hive.execution.engine in 
> hive-site.xml to tez, select queries work well, but an insert fails with an 
> error. The query is:
> 
> insert into table tablename values(intvalue,'string value');
> 
> and the error is:
> 
> FAILED: Execution Error, return code 1 from 
> org.apache.hadoop.hive.ql.exec.tez.TezTask
> 
> 



Re: Enabling Tez sessions on HiveServer2

2014-12-02 Thread Hitesh Shah
BCC’ed user@tez.

This question belongs to either the hive user list or the Hortonworks user 
forums. 

thanks
— Hitesh

On Dec 2, 2014, at 1:28 PM, Pala M Muthaia  wrote:

> Hi,
> 
> I am trying to get Tez sessions enabled with HS2. I start the HiveServer2 
> instance with the flag "-hiveconf hive.execution.engine=tez" and then try to 
> submit multiple queries one after another, as the same user, to the HS2 
> instance. 
> 
> When I check the YARN UI, I find that each of my queries is launched as a new 
> YARN application. While the new Tez application is running, the old Tez 
> applications are still alive. This is different from a Tez session in the Hive 
> CLI, where multiple queries are submitted to the same Tez application (if 
> launched within the Tez session timeout).
> 
> 
> I followed the config instructions at 
> http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.2/bk_installing_manually_book/content/rpm-chap-tez-configure_hive_for_tez.html
>  so far.
> 
> Is there a separate config flag that I need to turn on for Tez sessions on 
> HS2? How should I enable Tez sessions with HiveServer2?
> 
> 
> -pala



Re: Cannot submit Tez application

2014-05-11 Thread Hitesh Shah
Re-sending. 

> BCC’ing dev@tez.
> 
> Re-directing to hive list.
> 
> @Azury Yu, would you mind providing the query details to the Hive folks? 
> This seems like an issue in the DAG plan generation in Hive. 
> 
> thanks
> ― Hitesh
> 
> On May 10, 2014, at 8:26 PM, AzuryYu  wrote:
> 
>> Hi,
>> 
>> I built Tez-0.5 against hadoop-2.4.0, put all the jars on HDFS, and 
>> configured tez-site.xml correctly.
>> 
>> The Hive version is 0.13.0, and I've set hive.execution.engine to tez in 
>> hive-site.xml.
>> 
>> When I then submit the Hive query, I get the following exception:
>> 
>> 2014-05-11 11:13:08,533 ERROR [main]: exec.Task (TezTask.java:execute(185)) 
>> - Failed to execute tez graph.
>> java.lang.IllegalStateException: Vertex: Reducer 4 already has group input 
>> with name:Union 3
>>   at org.apache.tez.dag.api.Vertex.addGroupInput(Vertex.java:250)
>>   at org.apache.tez.dag.api.DAG.processEdgesAndGroups(DAG.java:223)
>>   at org.apache.tez.dag.api.DAG.verify(DAG.java:284)
>>   at org.apache.tez.dag.api.DAG.createDag(DAG.java:462)
>>   at org.apache.tez.client.TezSession.submitDAG(TezSession.java:216)
>>   at org.apache.tez.client.TezSession.submitDAG(TezSession.java:155)
>>   at org.apache.hadoop.hive.ql.exec.tez.TezTask.submit(TezTask.java:320)
>>   at org.apache.hadoop.hive.ql.exec.tez.TezTask.execute(TezTask.java:165)
>>   at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:153)
>>   at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
>>   at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1503)
>>   at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1270)
>>   at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1088)
>>   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:911)
>>   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:901)
>>   at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:268)
>>   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:220)
>>   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:423)
>>   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:359)
>>   at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:742)
>>   at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:686)
>>   at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:625)
>>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>>   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>   at java.lang.reflect.Method.invoke(Method.java:606)
>>   at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
>> 2014-05-11 11:13:08,554 ERROR [main]: ql.Driver 
>> (SessionState.java:printError(545)) - FAILED: Execution Error, return code 1 
>> from org.apache.hadoop.hive.ql.exec.tez.TezTask
>>
> 

