Re: Hive shell not using manually set tez container size

2016-12-04 Thread Gopal Vijayaraghavan
> even that setting is not being applied after the hive shell is started and a > query is executed.  Are you increasing it or decreasing it? Tez will reuse existing larger containers, instead of releasing them - reducing the parameter has almost no effect without a session restart. Also that

Re: Trace Key-Value pairs

2016-12-04 Thread Rajesh Balamohan
Hi Robert, Tez deals with bytes and does not understand if the data is coming from Hive/Pig/Cascading etc. So in case you print the content from Hive, you would get mostly binary data. For hive, org.apache.hadoop.hive.ql.io.HiveKey, and value would be org.apache.hadoop.io.BytesWritable. Printing

Slow Response Time for Webhcat DDLs

2016-12-04 Thread Juan Delard de Rigoulières
Hi, We're planning to use WebHCat REST API to render JavaScript based Web console. As a result we'd like reasonable response times from WebHCat API. The responses we measure in our cluster are slow to deliver a user friendly UI. We are looking for ways to improve these response times without cachi

Trace Key-Value pairs

2016-12-04 Thread Robert Grandl
Hi guys, I am running Hive atop Tez and run several TPC-DS / TPC-H queries. I am trying to print the Key/Value pairs received as input by each vertex and generated as output accordingly. However, looking at Hive / Tez code, it seems they are converted to Object type and use their serialized fo