[jira] [Created] (HIVE-16946) Information Schema Improvements

2017-06-22 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-16946:
-

 Summary: Information Schema Improvements
 Key: HIVE-16946
 URL: https://issues.apache.org/jira/browse/HIVE-16946
 Project: Hive
  Issue Type: Improvement
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Collection of requested enhancements and fixes for the info schema.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-16657) Replace conf variable with checkDatabaseProduct-like in JDBCStorageHandler

2017-05-11 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-16657:
-

 Summary: Replace conf variable with checkDatabaseProduct-like in 
JDBCStorageHandler
 Key: HIVE-16657
 URL: https://issues.apache.org/jira/browse/HIVE-16657
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (HIVE-16631) Non-native tables should not enter stats state "accurate"

2017-05-09 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-16631:
-

 Summary: Non-native tables should not enter stats state "accurate"
 Key: HIVE-16631
 URL: https://issues.apache.org/jira/browse/HIVE-16631
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner


With non-native tables, Hive is usually not the only system managing records. 
It's quite common to use an XYZStorageHandler to point to an existing data 
source. In these cases we can erroneously set the state of the statistics to 
accurate, which leads us to answer queries incorrectly.





[jira] [Created] (HIVE-16630) Make connection pool pluggable in JdbcStorageHandler

2017-05-09 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-16630:
-

 Summary: Make connection pool pluggable in JdbcStorageHandler
 Key: HIVE-16630
 URL: https://issues.apache.org/jira/browse/HIVE-16630
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner


Currently JdbcStorageHandler can only use DBCP. It would be nice to make that 
pluggable.





[jira] [Created] (HIVE-15873) Remove Windows-specific code

2017-02-09 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-15873:
-

 Summary: Remove Windows-specific code
 Key: HIVE-15873
 URL: https://issues.apache.org/jira/browse/HIVE-15873
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner


I know a lot of work initially went into supporting UT, runtime, etc. on 
Windows, but this code seems to have been rotting. 

There have been no updates to the Windows-specific test files, nor any new code 
to keep the new features compatible.

We're also not running the tests or builds on Windows. That is really an 
impediment to keeping that code healthy.

The code is sprinkled all over the codebase, which makes it hard to maintain. 
I think we're better off removing it.





[jira] [Created] (HIVE-15791) Remove unused ant files

2017-02-01 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-15791:
-

 Summary: Remove unused ant files
 Key: HIVE-15791
 URL: https://issues.apache.org/jira/browse/HIVE-15791
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-15790) Remove unused beeline golden files

2017-02-01 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-15790:
-

 Summary: Remove unused beeline golden files
 Key: HIVE-15790
 URL: https://issues.apache.org/jira/browse/HIVE-15790
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-15549) Better naming of Tez edges

2017-01-05 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-15549:
-

 Summary: Better naming of Tez edges
 Key: HIVE-15549
 URL: https://issues.apache.org/jira/browse/HIVE-15549
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Do the following renames:

CUSTOM_EDGE -> CO_PARTITION_EDGE
CUSTOM_SIMPLE_EDGE -> PARTITION_EDGE
SIMPLE_EDGE -> SORT_PARTITION_EDGE

Because that's what those edges actually do.

Also rename Map/Reduce  to just Vertex . These vertices haven't mapped or 
reduced in a long time. The names are leftover items from MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-14117) HS2 UI: List of recent queries shows most recent query last

2016-06-28 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-14117:
-

 Summary: HS2 UI: List of recent queries shows most recent query 
last
 Key: HIVE-14117
 URL: https://issues.apache.org/jira/browse/HIVE-14117
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


It's more useful to see the latest one first in your "last n queries" view.





[jira] [Created] (HIVE-13823) Remove unnecessary log line in common join operator

2016-05-23 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-13823:
-

 Summary: Remove unnecessary log line in common join operator
 Key: HIVE-13823
 URL: https://issues.apache.org/jira/browse/HIVE-13823
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Fix For: 2.1.0








[jira] [Created] (HIVE-13467) Show llap info on hs2 ui when available

2016-04-08 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-13467:
-

 Summary: Show llap info on hs2 ui when available
 Key: HIVE-13467
 URL: https://issues.apache.org/jira/browse/HIVE-13467
 Project: Hive
  Issue Type: Bug
  Components: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


When llap is on and hs2 is configured with access to an llap cluster, HS2 UI 
should show some status of the daemons and provide a mechanism to click through 
to their respective UIs.





[jira] [Created] (HIVE-13417) Some vector operators return "OP" as name

2016-04-04 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-13417:
-

 Summary: Some vector operators return "OP" as name
 Key: HIVE-13417
 URL: https://issues.apache.org/jira/browse/HIVE-13417
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Select/Group by/Filter/etc. need to return the same name whether they are the 
regular or the vector operators. If they don't, the regular path matching in our 
optimizer code doesn't work on them.

From the code it looks like an attempt was made to follow this. Unfortunately 
getOperatorName is static and polymorphism doesn't work on these functions.

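A minimal Java sketch of the pitfall described in HIVE-13417. The class names 
(BaseOp, VectorSelectOp, FixedSelectOp) are hypothetical and are not Hive's 
actual operator classes. It shows that a static method in a subclass hides the 
parent's method rather than overriding it, so a call made from the base class 
still returns the base name.

```java
// Hypothetical classes for illustration; not Hive's actual operator hierarchy.
class BaseOp {
    // Static methods are bound at compile time and cannot be overridden.
    static String getOperatorName() { return "OP"; }

    // This call always binds to BaseOp.getOperatorName(), whatever the runtime type.
    String getName() { return getOperatorName(); }
}

class VectorSelectOp extends BaseOp {
    // Hides BaseOp.getOperatorName(); it does not override it.
    static String getOperatorName() { return "SEL"; }
}

class FixedSelectOp extends BaseOp {
    // Overriding the instance method restores polymorphic dispatch.
    @Override
    String getName() { return "SEL"; }
}
```

Under these assumptions, one possible fix is for each operator to override the 
instance method (as FixedSelectOp does) instead of relying on the static one.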




[jira] [Created] (HIVE-13162) Fixes for LlapDump and FileSinkoperator

2016-02-25 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-13162:
-

 Summary: Fixes for LlapDump and FileSinkoperator
 Key: HIVE-13162
 URL: https://issues.apache.org/jira/browse/HIVE-13162
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Fix For: llap








[jira] [Created] (HIVE-13133) Create initial InputFormat + record readers/writers

2016-02-23 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-13133:
-

 Summary: Create initial InputFormat + record readers/writers
 Key: HIVE-13133
 URL: https://issues.apache.org/jira/browse/HIVE-13133
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner








[jira] [Created] (HIVE-12357) Allow user to set tez job name

2015-11-05 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-12357:
-

 Summary: Allow user to set tez job name
 Key: HIVE-12357
 URL: https://issues.apache.org/jira/browse/HIVE-12357
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Need something like mapred.job.name.





[jira] [Created] (HIVE-11727) Hive on Tez through Oozie: Some queries fail with fnf exception

2015-09-03 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-11727:
-

 Summary: Hive on Tez through Oozie: Some queries fail with fnf 
exception
 Key: HIVE-11727
 URL: https://issues.apache.org/jira/browse/HIVE-11727
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


When we read back row containers from disk, a misconfiguration causes us to 
look for a non-existing file.

{noformat}
Caused by: java.io.FileNotFoundException: File 
file:/grid/0/hadoop/yarn/local/usercache/appcache/application_1440685000561_0028/container_e26_1440685000561_0028_01_05/container_tokens
 does not exist
at 
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:608)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:821)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:598)
at 
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:414)
at 
org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.<init>(ChecksumFileSystem.java:140)
at 
org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:341)
at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:766)
at 
org.apache.hadoop.security.Credentials.readTokenStorageFile(Credentials.java:169)
... 31 more
{noformat}





[jira] [Created] (HIVE-10957) QueryPlan's start time is incorrect in certain cases

2015-06-05 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10957:
-

 Summary: QueryPlan's start time is incorrect in certain cases
 Key: HIVE-10957
 URL: https://issues.apache.org/jira/browse/HIVE-10957
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Wei Zheng








[jira] [Created] (HIVE-10853) Create ExplainTask in ATS hook through ExplainWork

2015-05-28 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10853:
-

 Summary: Create ExplainTask in ATS hook through ExplainWork
 Key: HIVE-10853
 URL: https://issues.apache.org/jira/browse/HIVE-10853
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner


Right now ExplainTask is created directly. That's fragile and can lead to stuff 
like: HIVE-10829





[jira] [Created] (HIVE-10284) enable container reuse for grace hash join

2015-04-09 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10284:
-

 Summary: enable container reuse for grace hash join 
 Key: HIVE-10284
 URL: https://issues.apache.org/jira/browse/HIVE-10284
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-10274) Send context and description to tez via dag info

2015-04-08 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10274:
-

 Summary: Send context and description to tez via dag info
 Key: HIVE-10274
 URL: https://issues.apache.org/jira/browse/HIVE-10274
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-10274.1.patch

Tez has a way to specify context and description (which is shown in the UI) for 
each DAG.





[jira] [Created] (HIVE-10203) Move fragment execution onto a thread pool

2015-04-02 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10203:
-

 Summary: Move fragment execution onto a thread pool
 Key: HIVE-10203
 URL: https://issues.apache.org/jira/browse/HIVE-10203
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner


Currently we use a lot of thread local magic to make things work in llap. 
MRLegacyInput makes sure we use the same thread in input and processor. The 
execution spawns a new thread per fragment and kills it afterwards.

That's being done because of Tez/MapredContext, ExecContext and IOContext. 
These need to be shared within a single fragment but not across fragments.

A way to clean this up is probably to create a HiveInput that can handle the 
config + state sharing more efficiently than generic Tez code.
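
As a rough illustration of the direction in HIVE-10203, the sketch below runs 
fragments on a reusable thread pool and hands each task its own context object 
instead of relying on thread locals. FragmentContext and FragmentExecutor are 
hypothetical names invented for this example; they are not part of Hive or Tez.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical per-fragment state, passed explicitly instead of via thread locals.
class FragmentContext {
    final String fragmentId;
    FragmentContext(String fragmentId) { this.fragmentId = fragmentId; }
}

// Hypothetical executor: pool threads are reused rather than spawned and killed per fragment.
class FragmentExecutor implements AutoCloseable {
    private final ExecutorService pool;

    FragmentExecutor(int threads) {
        this.pool = Executors.newFixedThreadPool(threads);
    }

    // Each submitted task captures its own context, so nothing leaks across fragments.
    Future<String> submit(String fragmentId) {
        FragmentContext ctx = new FragmentContext(fragmentId);
        Callable<String> task = () -> "done:" + ctx.fragmentId;
        return pool.submit(task);
    }

    @Override
    public void close() {
        pool.shutdown();
    }
}
```

Because the context travels with the task, it no longer matters which pool 
thread picks up the input and which the processor.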

cc [~sseth] [~gopalv] [~vikram.dixit]





[jira] [Created] (HIVE-10078) Optionally allow logging of records processed in fixed intervals

2015-03-24 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10078:
-

 Summary: Optionally allow logging of records processed in fixed 
intervals
 Key: HIVE-10078
 URL: https://issues.apache.org/jira/browse/HIVE-10078
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Tasks today log progress (records in/records out) on an exponential scale (1, 
10, 100, ...). Sometimes it's helpful to be able to switch to fixed interval. 
That can help debugging certain issues that look like a hang, etc.
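
The two logging schedules in HIVE-10078 can be sketched as follows. 
ProgressLogPolicy and its fixedInterval parameter are hypothetical names for 
this example and do not correspond to an actual Hive class or config variable. 
The sketch assumes shouldLog is called once per record with the running count.

```java
// Hypothetical helper: decides whether a running record count should be logged.
class ProgressLogPolicy {
    private final long fixedInterval; // <= 0 selects the exponential scale
    private long nextThreshold = 1;

    ProgressLogPolicy(long fixedInterval) {
        this.fixedInterval = fixedInterval;
    }

    // Assumes it is called once per record with the running count.
    boolean shouldLog(long recordCount) {
        if (fixedInterval > 0) {
            // Fixed interval: log every fixedInterval records.
            return recordCount % fixedInterval == 0;
        }
        if (recordCount >= nextThreshold) {
            // Exponential scale: 1, 10, 100, ... saturating instead of overflowing.
            nextThreshold = (nextThreshold <= Long.MAX_VALUE / 10)
                    ? nextThreshold * 10
                    : Long.MAX_VALUE;
            return true;
        }
        return false;
    }
}
```

With a fixed interval the gap between log lines stays constant, which makes a 
stalled task easier to tell apart from one that is merely between thresholds.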





[jira] [Created] (HIVE-10005) remove some unnecessary branches from the inner loop

2015-03-18 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10005:
-

 Summary: remove some unnecessary branches from the inner loop
 Key: HIVE-10005
 URL: https://issues.apache.org/jira/browse/HIVE-10005
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-10005.1.patch

Operator.forward is doing too much. There's no reason to do the done checking 
per row and update it inline. It's much more efficient to just do that when the 
event that completes an operator happens.





[jira] [Created] (HIVE-10002) fix yarn service registry not found in ut problem

2015-03-18 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10002:
-

 Summary: fix yarn service registry not found in ut problem
 Key: HIVE-10002
 URL: https://issues.apache.org/jira/browse/HIVE-10002
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-10003) MiniTez ut fail with missing configs

2015-03-18 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10003:
-

 Summary: MiniTez ut fail with missing configs
 Key: HIVE-10003
 URL: https://issues.apache.org/jira/browse/HIVE-10003
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gopal V








[jira] [Created] (HIVE-10004) yarn service registry should be shim'd

2015-03-18 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10004:
-

 Summary: yarn service registry should be shim'd
 Key: HIVE-10004
 URL: https://issues.apache.org/jira/browse/HIVE-10004
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gopal V








[jira] [Created] (HIVE-10000) 10000 whoooohooo

2015-03-17 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-10000:
-

 Summary: 10000 whoooohooo
 Key: HIVE-10000
 URL: https://issues.apache.org/jira/browse/HIVE-10000
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner








[jira] [Created] (HIVE-9984) JoinReorder's getOutputSize is exponential

2015-03-16 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9984:


 Summary: JoinReorder's getOutputSize is exponential
 Key: HIVE-9984
 URL: https://issues.apache.org/jira/browse/HIVE-9984
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Found by [~mmokhtar]. Causes major issues in large plans (50+ joins). Simple 
fix would be to memoize the recursion. There should also be a flag to switch 
this opt off.
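
A minimal sketch of the memoization suggested in HIVE-9984. PlanNode and 
OutputSizeEstimator are hypothetical stand-ins and do not reproduce 
JoinReorder's actual code or its size formula. The point is only that caching 
the result per node turns a recursion that revisits shared inputs 
exponentially often into one evaluation per node.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical plan node; stands in for an operator with parent operators.
class PlanNode {
    final List<PlanNode> parents = new ArrayList<>();
    final long ownSize;
    PlanNode(long ownSize) { this.ownSize = ownSize; }
}

// Hypothetical estimator: sums sizes over all inputs, caching the result per node.
class OutputSizeEstimator {
    // PlanNode does not override equals/hashCode, so keys compare by identity.
    private final Map<PlanNode, Long> cache = new HashMap<>();
    int evaluations = 0; // counts non-cached evaluations, for demonstration only

    long outputSize(PlanNode node) {
        Long cached = cache.get(node);
        if (cached != null) {
            return cached; // already computed: no further recursion
        }
        evaluations++;
        long size = node.ownSize;
        for (PlanNode parent : node.parents) {
            size += outputSize(parent);
        }
        cache.put(node, size);
        return size;
    }
}
```

In a plan where every level shares both inputs of the level below, the 
uncached recursion doubles its work per level, while the cached version 
evaluates each node once.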





[jira] [Created] (HIVE-9971) Clean up operator class

2015-03-15 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9971:


 Summary: Clean up operator class
 Key: HIVE-9971
 URL: https://issues.apache.org/jira/browse/HIVE-9971
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


This is mostly cleanup, although it does enhance the pipeline in one respect. It 
introduces async init for operators and uses it for hash table loading where 
desired.

There's a bunch of weird code associated with the operator class:

- initialize isn't recursive, rather initializeOp is supposed to call 
initializeChildren. That has led to bugs in the past.

- setExecContext and passExecContext. Both are recursive, but passExecContext 
calls setExecContext and then recurses again. Boo.

- lots of (getChildren() != null) although that can't happen anymore

- TezCacheAccess is a hack. We should just leave init of inputs up to the 
operator that needs it.

- Need some sanity checks that make sure that operators were all initialized.







[jira] [Created] (HIVE-9915) Allow specifying file format for managed tables

2015-03-10 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9915:


 Summary: Allow specifying file format for managed tables
 Key: HIVE-9915
 URL: https://issues.apache.org/jira/browse/HIVE-9915
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


We already allow setting a system-wide default format. In some cases, though, 
it's useful to specify this only for managed tables, or to distinguish external 
and managed tables via two variables. You might want to set a more efficient 
format than text for managed tables, but leave external tables as text (as 
they often are log files etc.).





[jira] [Created] (HIVE-9891) LLAP: disable plan caching

2015-03-06 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9891:


 Summary: LLAP: disable plan caching
 Key: HIVE-9891
 URL: https://issues.apache.org/jira/browse/HIVE-9891
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Can't share the same plan objects in LLAP as they are used concurrently.





[jira] [Created] (HIVE-9850) documentation for llap

2015-03-03 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9850:


 Summary: documentation for llap
 Key: HIVE-9850
 URL: https://issues.apache.org/jira/browse/HIVE-9850
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Live long and write documentation. This is the jira for all doc related things 
in llap.





[jira] [Created] (HIVE-9849) LLAP: add multi threaded object registry

2015-03-03 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9849:


 Summary: LLAP: add multi threaded object registry
 Key: HIVE-9849
 URL: https://issues.apache.org/jira/browse/HIVE-9849
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-9837) LLAP: Decision to use llap or uber is being lost in some reducers

2015-03-02 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9837:


 Summary: LLAP: Decision to use llap or uber is being lost in some 
reducers
 Key: HIVE-9837
 URL: https://issues.apache.org/jira/browse/HIVE-9837
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-9810) LLAP: prep object registry for multi threading

2015-02-27 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9810:


 Summary: LLAP: prep object registry for multi threading
 Key: HIVE-9810
 URL: https://issues.apache.org/jira/browse/HIVE-9810
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


The object registry relies on the fact that only one thread at a time is active 
in a container. With llap that's not the case. There are multiple threads that 
will try to generate the same cache object at the same time, etc.
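
One way to handle the contention described in HIVE-9810 is sketched below. 
SharedObjectCache and getOrCreate are hypothetical names; the real object 
registry API is not shown in this thread. The sketch relies on 
ConcurrentHashMap.computeIfAbsent, which applies the factory at most once per 
key even when several threads ask for the same key concurrently.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.Supplier;

// Hypothetical registry; not the actual Hive/Tez object registry.
class SharedObjectCache {
    private final ConcurrentMap<String, Object> entries = new ConcurrentHashMap<>();

    // The factory runs at most once per key; other callers block briefly and
    // then receive the same instance.
    @SuppressWarnings("unchecked")
    <T> T getOrCreate(String key, Supplier<T> factory) {
        return (T) entries.computeIfAbsent(key, k -> factory.get());
    }
}
```

The trade-off is that an expensive factory (loading a hash table, say) holds up 
other callers of the same key until it finishes, which is usually what is 
wanted for a shared cache object.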





[jira] [Created] (HIVE-9801) LLAP: need counter for cache hit ratio

2015-02-26 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9801:


 Summary: LLAP: need counter for cache hit ratio
 Key: HIVE-9801
 URL: https://issues.apache.org/jira/browse/HIVE-9801
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner








[jira] [Created] (HIVE-9776) LLAP: add simple way to determine whether you're running in daemon

2015-02-24 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9776:


 Summary: LLAP: add simple way to determine whether you're running 
in daemon
 Key: HIVE-9776
 URL: https://issues.apache.org/jira/browse/HIVE-9776
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-9778) LLAP: add config var for uber mode

2015-02-24 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9778:


 Summary: LLAP: add config var for uber mode
 Key: HIVE-9778
 URL: https://issues.apache.org/jira/browse/HIVE-9778
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-9756) LLAP: use log4j 2 for llap

2015-02-23 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9756:


 Summary: LLAP: use log4j 2 for llap
 Key: HIVE-9756
 URL: https://issues.apache.org/jira/browse/HIVE-9756
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gopal V


For the INFO logging, we'll need to use the log4j-jcl 2.x upgrade-path to get 
throughput friendly logging.
http://logging.apache.org/log4j/2.0/manual/async.html#Performance





[jira] [Created] (HIVE-9750) LLAP: avoid log locks in operators

2015-02-23 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9750:


 Summary: LLAP: avoid log locks in operators
 Key: HIVE-9750
 URL: https://issues.apache.org/jira/browse/HIVE-9750
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Basically wrap all LOG.xx calls in isLogXXXEnabled to avoid unnecessary locks 
on these calls.
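
The guard pattern from HIVE-9750 looks roughly like this. The sketch uses 
java.util.logging only so that it is self-contained; Hive's own logging API is 
different, and RowForwarder is a hypothetical class. The idea is the same: 
check the level first, so that the logging call and the message construction 
are skipped entirely when the level is disabled.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical operator-like class; uses java.util.logging for self-containment.
class RowForwarder {
    private static final Logger LOG = Logger.getLogger(RowForwarder.class.getName());
    long rows = 0;

    void forward(Object row) {
        rows++;
        // Cheap level check first: when FINE is disabled, neither the string
        // concatenation nor the logging call itself is executed.
        if (LOG.isLoggable(Level.FINE)) {
            LOG.fine("forwarded row " + rows + ": " + row);
        }
    }
}
```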





[jira] [Created] (HIVE-9762) LLAP: hook up decider + dag utils

2015-02-23 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9762:


 Summary: LLAP: hook up decider + dag utils
 Key: HIVE-9762
 URL: https://issues.apache.org/jira/browse/HIVE-9762
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Created] (HIVE-9765) LLAP: uber mode where applicable

2015-02-23 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9765:


 Summary: LLAP: uber mode where applicable
 Key: HIVE-9765
 URL: https://issues.apache.org/jira/browse/HIVE-9765
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


When a task is small enough and it's the last reduce stage, we can just run it 
in the AM.





[jira] [Updated] (HIVE-9694) LLAP: add check for udfs/udafs to llapdecider

2015-02-14 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9694:
-
Attachment: HIVE-9694.1.patch

 LLAP: add check for udfs/udafs to llapdecider
 -

 Key: HIVE-9694
 URL: https://issues.apache.org/jira/browse/HIVE-9694
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9694.1.patch








[jira] [Resolved] (HIVE-9694) LLAP: add check for udfs/udafs to llapdecider

2015-02-14 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner resolved HIVE-9694.
--
Resolution: Fixed

Committed to branch.

 LLAP: add check for udfs/udafs to llapdecider
 -

 Key: HIVE-9694
 URL: https://issues.apache.org/jira/browse/HIVE-9694
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9694.1.patch








[jira] [Created] (HIVE-9694) add check for udfs/udafs to llapdecider

2015-02-14 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9694:


 Summary: add check for udfs/udafs to llapdecider
 Key: HIVE-9694
 URL: https://issues.apache.org/jira/browse/HIVE-9694
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner








[jira] [Updated] (HIVE-9694) LLAP: add check for udfs/udafs to llapdecider

2015-02-14 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9694:
-
Summary: LLAP: add check for udfs/udafs to llapdecider  (was: add check for 
udfs/udafs to llapdecider)

 LLAP: add check for udfs/udafs to llapdecider
 -

 Key: HIVE-9694
 URL: https://issues.apache.org/jira/browse/HIVE-9694
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner







[jira] [Resolved] (HIVE-9635) LLAP: I'm the decider

2015-02-13 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner resolved HIVE-9635.
--
Resolution: Fixed

Committed to branch.

 LLAP: I'm the decider
 -

 Key: HIVE-9635
 URL: https://issues.apache.org/jira/browse/HIVE-9635
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9635.1.patch, HIVE-9635.2.patch


 https://www.youtube.com/watch?v=r8VbzrZ9yHQ
 Physical optimizer to choose what to run inside/outside llap. Tests first 
 whether user code has to be shipped then if the specific query fragment is 
 suitable to run.





[jira] [Updated] (HIVE-9673) Set operationhandle in ATS entities for lookups

2015-02-13 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9673:
-
Issue Type: Improvement  (was: Bug)

 Set operationhandle in ATS entities for lookups
 ---

 Key: HIVE-9673
 URL: https://issues.apache.org/jira/browse/HIVE-9673
 Project: Hive
  Issue Type: Improvement
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 1.2.0

 Attachments: HIVE-9673.1.patch, HIVE-9673.2.patch


 Yarn App Timeline Server (ATS) users can find their query using hive query-id.
 However, query id is available only through the logs at the moment.
 Thrift api users such as Hue have another unique id for queries, which the 
 operation handle contains 
 (TExecuteStatementResp.TOperationHandle.THandleIdentifier.guid) . Adding the 
 operationhandle guid to ATS will enable such thrift users to get information 
 from ATS for the queries that they have spawned.





[jira] [Updated] (HIVE-9673) Set operationhandle in ATS entities for lookups

2015-02-13 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9673:
-
Fix Version/s: 1.2.0

 Set operationhandle in ATS entities for lookups
 ---

 Key: HIVE-9673
 URL: https://issues.apache.org/jira/browse/HIVE-9673
 Project: Hive
  Issue Type: Bug
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 1.2.0

 Attachments: HIVE-9673.1.patch, HIVE-9673.2.patch


 Yarn App Timeline Server (ATS) users can find their query using hive query-id.
 However, query id is available only through the logs at the moment.
 Thrift api users such as Hue have another unique id for queries, which the 
 operation handle contains 
 (TExecuteStatementResp.TOperationHandle.THandleIdentifier.guid) . Adding the 
 operationhandle guid to ATS will enable such thrift users to get information 
 from ATS for the queries that they have spawned.





[jira] [Updated] (HIVE-9673) Set operationhandle in ATS entities for lookups

2015-02-13 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9673:
-
Resolution: Fixed
Status: Resolved  (was: Patch Available)

committed to trunk. thanks [~thejas]!

 Set operationhandle in ATS entities for lookups
 ---

 Key: HIVE-9673
 URL: https://issues.apache.org/jira/browse/HIVE-9673
 Project: Hive
  Issue Type: Bug
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Attachments: HIVE-9673.1.patch, HIVE-9673.2.patch


 Yarn App Timeline Server (ATS) users can find their query using hive query-id.
 However, query id is available only through the logs at the moment.
 Thrift api users such as Hue have another unique id for queries, which the 
 operation handle contains 
 (TExecuteStatementResp.TOperationHandle.THandleIdentifier.guid) . Adding the 
 operationhandle guid to ATS will enable such thrift users to get information 
 from ATS for the queries that they have spawned.





[jira] [Commented] (HIVE-9683) Hive metastore thrift client connections hang indefinitely

2015-02-13 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320486#comment-14320486
 ] 

Gunther Hagleitner commented on HIVE-9683:
--

[~vikram.dixit] ok for 1.0 branch?

 Hive metastore thrift client connections hang indefinitely
 --

 Key: HIVE-9683
 URL: https://issues.apache.org/jira/browse/HIVE-9683
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 1.0.0, 1.0.1
Reporter: Gopal V
Assignee: Gopal V
Priority: Minor
 Fix For: 1.0.1

 Attachments: HIVE-9683.1.patch


 THRIFT-2788 fixed network-partition problems that affect Thrift client 
 connections.
 Since hive-1.0 is on thrift-0.9.0 which is affected by the bug, a workaround 
 can be applied to prevent indefinite connection hangs during net-splits.





[jira] [Commented] (HIVE-9635) LLAP: I'm the decider

2015-02-12 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318807#comment-14318807
 ] 

Gunther Hagleitner commented on HIVE-9635:
--

If you like, but it's not necessary. I do need to finish it, though.

 LLAP: I'm the decider
 -

 Key: HIVE-9635
 URL: https://issues.apache.org/jira/browse/HIVE-9635
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9635.1.patch


 https://www.youtube.com/watch?v=r8VbzrZ9yHQ
 Physical optimizer that chooses what to run inside or outside LLAP. It first 
 tests whether user code has to be shipped, then whether the specific query 
 fragment is suitable to run.
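The two-stage check described above can be sketched as follows. This is only an illustration of the ordering of the tests; the class, enum, and parameter names are hypothetical, and treating "user code has to be shipped" as a reason to stay outside LLAP is an assumption, not something the description states.

```java
// Hypothetical example class; not the optimizer from the attached patch.
public class DeciderSketch {
    public enum Placement { INSIDE_LLAP, OUTSIDE_LLAP }

    /**
     * Stage 1: check whether user code would have to be shipped.
     * Stage 2: check whether the fragment itself is suitable.
     */
    public static Placement decide(boolean needsUserCode, boolean fragmentSuitable) {
        if (needsUserCode) {
            // Assumption: fragments that need shipped user code stay outside LLAP.
            return Placement.OUTSIDE_LLAP;
        }
        return fragmentSuitable ? Placement.INSIDE_LLAP : Placement.OUTSIDE_LLAP;
    }
}
```

Only a fragment that passes both tests ends up inside; the first test short-circuits the second.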





[jira] [Updated] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-12 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9665:
-
Status: Patch Available  (was: Open)

 Parallel move task optimization causes race condition
 -

 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical
 Attachments: HIVE-9665.1.patch


 The change in HIVE-8042 doesn't actually work. Running it at scale produces 
 race conditions which lead to broken thrift messages and OOMs. E.g.:
 {noformat}
 java.lang.OutOfMemoryError: Java heap space
   at org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
   at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
   at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
   at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
   at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
   at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
   at org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
   at sun.reflect.GeneratedMethodAccessor19.invoke(Unknown Source)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:606)
   at org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:90)
   at com.sun.proxy.$Proxy9.getTable(Unknown Source)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1064)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1019)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1006)
   at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:250)
   at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:160)
   at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
   at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:72)
 java.lang.OutOfMemoryError: Java heap space
   at org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
   at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
   at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
   at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
   at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
   at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
   at org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:606)
   at org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:90)
   at com.sun.proxy.$Proxy9.getTable(Unknown Source)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1064)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1019)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1006)
   at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:250)
   at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:160)
   at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
   at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:72)
 java.lang.OutOfMemoryError: Java heap space
   at org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
   at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
   at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
   at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
   at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
   at 
 

[jira] [Commented] (HIVE-9673) Set operationhandle in ATS entities for lookups

2015-02-12 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319360#comment-14319360
 ] 

Gunther Hagleitner commented on HIVE-9673:
--

Looks good, but how do you know that none of the following objects 
(handle/id/...) is going to be null?

{noformat}
+  String guid64 = Base64.encodeBase64URLSafeString(getHandle().getHandleIdentifier()
+      .toTHandleIdentifier().getGuid()).trim();
{noformat}
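One way to address the null question is to make each step of the chain tolerate a missing value. The sketch below is a minimal illustration, not the patch's code: `OperationHandle` and `HandleIdentifier` are simplified hypothetical stand-ins for the real handle classes, and `java.util.Base64` (URL-safe, unpadded) is swapped in for the commons-codec call so that the example is self-contained.

```java
import java.util.Base64;
import java.util.Optional;

// Hypothetical example class with simplified stand-ins for the handle types.
public class GuidSketch {
    public static final class HandleIdentifier {
        private final byte[] guid;
        public HandleIdentifier(byte[] guid) { this.guid = guid; }
        public byte[] getGuid() { return guid; }
    }

    public static final class OperationHandle {
        private final HandleIdentifier id;
        public OperationHandle(HandleIdentifier id) { this.id = id; }
        public HandleIdentifier getHandleIdentifier() { return id; }
    }

    /** Returns the URL-safe Base64 GUID, or empty if any link in the chain is null. */
    public static Optional<String> guid64(OperationHandle handle) {
        return Optional.ofNullable(handle)
            .map(OperationHandle::getHandleIdentifier) // a null result short-circuits to empty
            .map(HandleIdentifier::getGuid)
            .map(g -> Base64.getUrlEncoder().withoutPadding().encodeToString(g));
    }
}
```

The caller can then skip setting the ATS field when the result is empty instead of risking a NullPointerException.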

 Set operationhandle in ATS entities for lookups
 ---

 Key: HIVE-9673
 URL: https://issues.apache.org/jira/browse/HIVE-9673
 Project: Hive
  Issue Type: Bug
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Attachments: HIVE-9673.1.patch, HIVE-9673.2.patch


 YARN Application Timeline Server (ATS) users can find their query using the Hive query ID.
 However, the query ID is currently available only through the logs.
 Thrift API users such as Hue have another unique ID for queries, which the 
 operation handle contains 
 (TExecuteStatementResp.TOperationHandle.THandleIdentifier.guid). Adding the 
 operation handle GUID to ATS will enable such Thrift users to get information 
 from ATS for the queries that they have spawned.





[jira] [Commented] (HIVE-9683) Hive metastore thrift client connections hang indefinitely

2015-02-12 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319584#comment-14319584
 ] 

Gunther Hagleitner commented on HIVE-9683:
--

+1

 Hive metastore thrift client connections hang indefinitely
 --

 Key: HIVE-9683
 URL: https://issues.apache.org/jira/browse/HIVE-9683
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 1.0.0, 1.0.1
Reporter: Gopal V
Assignee: Gopal V
Priority: Minor
 Fix For: 1.0.1

 Attachments: HIVE-9683.1.patch


 THRIFT-2788 fixed network-partition problems that affect Thrift client 
 connections.
 Since hive-1.0 is on thrift-0.9.0 which is affected by the bug, a workaround 
 can be applied to prevent indefinite connection hangs during net-splits.





[jira] [Updated] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-12 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9665:
-
   Resolution: Fixed
Fix Version/s: 1.2.0
   Status: Resolved  (was: Patch Available)

 Parallel move task optimization causes race condition
 -

 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical
 Fix For: 1.2.0

 Attachments: HIVE-9665.1.patch


 The change in HIVE-8042 doesn't actually work. Running it at scale produces 
 race conditions which lead to broken thrift messages and OOMs.

[jira] [Commented] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-12 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319473#comment-14319473
 ] 

Gunther Hagleitner commented on HIVE-9665:
--

Committed to trunk. Thanks [~thejas].

 Parallel move task optimization causes race condition
 -

 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical
 Fix For: 1.2.0

 Attachments: HIVE-9665.1.patch


 The change in HIVE-8042 doesn't actually work. Running it at scale produces 
 race conditions which lead to broken thrift messages and OOMs.

[jira] [Commented] (HIVE-9673) Set operationhandle in ATS entities for lookups

2015-02-12 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319507#comment-14319507
 ] 

Gunther Hagleitner commented on HIVE-9673:
--

+1

 Set operationhandle in ATS entities for lookups
 ---

 Key: HIVE-9673
 URL: https://issues.apache.org/jira/browse/HIVE-9673
 Project: Hive
  Issue Type: Bug
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Attachments: HIVE-9673.1.patch, HIVE-9673.2.patch


 YARN Application Timeline Server (ATS) users can find their query using the Hive query ID.
 However, the query ID is currently available only through the logs.
 Thrift API users such as Hue have another unique ID for queries, which the 
 operation handle contains 
 (TExecuteStatementResp.TOperationHandle.THandleIdentifier.guid). Adding the 
 operation handle GUID to ATS will enable such Thrift users to get information 
 from ATS for the queries that they have spawned.





[jira] [Updated] (HIVE-9635) LLAP: I'm the decider

2015-02-12 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9635:
-
Attachment: HIVE-9635.2.patch

 LLAP: I'm the decider
 -

 Key: HIVE-9635
 URL: https://issues.apache.org/jira/browse/HIVE-9635
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9635.1.patch, HIVE-9635.2.patch


 https://www.youtube.com/watch?v=r8VbzrZ9yHQ
 Physical optimizer that chooses what to run inside or outside LLAP. It first 
 tests whether user code has to be shipped, then whether the specific query 
 fragment is suitable to run.





[jira] [Created] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-11 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9665:


 Summary: Parallel move task optimization causes race condition
 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical


The change in HIVE-8042 doesn't actually work. Running it at scale produces 
race conditions which lead to broken thrift messages and OOMs. E.g.:

{noformat}
java.lang.OutOfMemoryError: Java heap space
    at org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
    at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
    at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
    at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
    at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
    at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
    at org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
    at sun.reflect.GeneratedMethodAccessor19.invoke(Unknown Source)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:90)
    at com.sun.proxy.$Proxy9.getTable(Unknown Source)
    at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1064)
    at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1019)
    at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1006)
    at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:250)
    at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:160)
    at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
    at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:72)
java.lang.OutOfMemoryError: Java heap space
    at org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
    at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
    at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
    at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
    at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
    at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
    at org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:90)
    at com.sun.proxy.$Proxy9.getTable(Unknown Source)
    at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1064)
    at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1019)
    at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1006)
    at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:250)
    at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:160)
    at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
    at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:72)
java.lang.OutOfMemoryError: Java heap space
    at org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
    at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
    at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
    at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
    at org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
    at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
    at org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at 
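The top frames of these traces (readMessageBegin calling readStringBody) are consistent with a reader that has lost its place in a length-prefixed stream, which is one plausible way a race on a shared connection turns into an OOM. The sketch below is an illustration under that assumption, using a simplified 4-byte length prefix rather than the Thrift wire format; the class and method names are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical example class; a simplified frame, not the Thrift protocol.
public class FramingSketch {
    /** Encodes a name as a 4-byte big-endian length followed by its UTF-8 bytes. */
    public static byte[] encode(String name) {
        byte[] body = name.getBytes(StandardCharsets.UTF_8);
        return ByteBuffer.allocate(4 + body.length).putInt(body.length).put(body).array();
    }

    /** Interprets the four bytes at the given offset as a length prefix. */
    public static int lengthPrefixAt(byte[] frame, int offset) {
        return ByteBuffer.wrap(frame, offset, 4).getInt();
    }
}
```

For `encode("get_table")`, reading at offset 0 yields the real length, 9. Reading at offset 3, as if another thread had already consumed three bytes, yields 0x09676574 (about 158 million), and a reader that allocates a buffer of that size on every call can exhaust the heap quickly.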

[jira] [Updated] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-11 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9665:
-
Attachment: HIVE-9665.1.patch

 Parallel move task optimization causes race condition
 -

 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical

 The change in HIVE-8042 doesn't actually work. Running it at scale produces 
 race conditions which lead to broken thrift messages and OOMs.

[jira] [Updated] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-11 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9665:
-
Attachment: (was: HIVE-9665.1.patch)

 Parallel move task optimization causes race condition
 -

 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical

 The change in HIVE-8042 doesn't actually work. Running it at scale produces 
 race conditions which lead to broken thrift messages and OOMs.

[jira] [Updated] (HIVE-9665) Parallel move task optimization causes race condition

2015-02-11 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9665:
-
Attachment: HIVE-9665.1.patch

 Parallel move task optimization causes race condition
 -

 Key: HIVE-9665
 URL: https://issues.apache.org/jira/browse/HIVE-9665
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
Priority: Critical
 Attachments: HIVE-9665.1.patch


 The change in HIVE-8042 doesn't actually work. Running it at scale produces 
 race conditions which lead to broken thrift messages and OOMs. E.g.:
 {noformat}
 java.lang.OutOfMemoryError: Java heap space
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
   at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
   at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
   at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
   at 
 org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
   at 
 org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
   at sun.reflect.GeneratedMethodAccessor19.invoke(Unknown Source)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:606)
   at 
 org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:90)
   at com.sun.proxy.$Proxy9.getTable(Unknown Source)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1064)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1019)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1006)
   at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:250)
   at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:160)
   at 
 org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
   at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:72)
 java.lang.OutOfMemoryError: Java heap space
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
   at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
   at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
   at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
   at 
 org.apache.hadoop.hive.metastore.HiveMetaStoreClient.getTable(HiveMetaStoreClient.java:1091)
   at 
 org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.getTable(SessionHiveMetaStoreClient.java:131)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:606)
   at 
 org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:90)
   at com.sun.proxy.$Proxy9.getTable(Unknown Source)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1064)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1019)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:1006)
   at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:250)
   at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:160)
   at 
 org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
   at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:72)
 java.lang.OutOfMemoryError: Java heap space
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readStringBody(TBinaryProtocol.java:353)
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:215)
   at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
   at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_get_table(ThriftHiveMetastore.java:1122)
   at 
 org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.get_table(ThriftHiveMetastore.java:1108)
   at 
 

[jira] [Updated] (HIVE-9635) LLAP: I'm the decider

2015-02-10 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9635:
-
Attachment: HIVE-9635.1.patch

first draft - doesn't handle udfs yet.

 LLAP: I'm the decider
 -

 Key: HIVE-9635
 URL: https://issues.apache.org/jira/browse/HIVE-9635
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9635.1.patch


 https://www.youtube.com/watch?v=r8VbzrZ9yHQ
 Physical optimizer to choose what to run inside/outside llap. It first tests 
 whether user code has to be shipped, then whether the specific query fragment 
 is suitable to run.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-9635) LLAP: I'm the decider

2015-02-10 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9635:


 Summary: LLAP: I'm the decider
 Key: HIVE-9635
 URL: https://issues.apache.org/jira/browse/HIVE-9635
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


https://www.youtube.com/watch?v=r8VbzrZ9yHQ

Physical optimizer to choose what to run inside/outside llap. It first tests 
whether user code has to be shipped, then whether the specific query fragment 
is suitable to run.





[jira] [Commented] (HIVE-9614) Encrypt mapjoin tables

2015-02-10 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14314621#comment-14314621
 ] 

Gunther Hagleitner commented on HIVE-9614:
--

3-month sprints? I believe that's called a marathon.

 Encrypt mapjoin tables
 --

 Key: HIVE-9614
 URL: https://issues.apache.org/jira/browse/HIVE-9614
 Project: Hive
  Issue Type: Sub-task
Reporter: Brock Noland

 When performing a MapJoin we store data which is put in the distributed cache 
 which is stored on local disk. Ideally we would encrypt these tables to the 
 same degree that they are encrypted in HDFS or find some other way to 
 ensure they are encrypted.





[jira] [Commented] (HIVE-9529) alter table .. concatenate under Tez mode should create TezTask

2015-02-02 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302091#comment-14302091
 ] 

Gunther Hagleitner commented on HIVE-9529:
--

+1

 alter table .. concatenate under Tez mode should create TezTask
 -

 Key: HIVE-9529
 URL: https://issues.apache.org/jira/browse/HIVE-9529
 Project: Hive
  Issue Type: Bug
Affects Versions: 1.0.0, 1.2.0, 1.1.0
Reporter: Prasanth Jayachandran
Assignee: Prasanth Jayachandran
 Attachments: HIVE-9529.1.patch, HIVE-9529.2.patch


 The alter table .. concatenate DDL command creates an MR task by default. When 
 the hive cli is launched with tez as the execution engine, the scheduling of the 
 MR task for file merging could be delayed until tez session expiration. This 
 happens because YARN will not have capacity to launch another AppMaster for 
 the MR task. We should create a tez task to overcome this: when the execution 
 engine is tez, a TezTask will be created; otherwise an MRTask will be created.





[jira] [Resolved] (HIVE-9506) LLAP: Add an execution daemon

2015-01-29 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner resolved HIVE-9506.
--
Resolution: Fixed

Committed to branch. Thanks [~sseth]!

 LLAP: Add an execution daemon
 -

 Key: HIVE-9506
 URL: https://issues.apache.org/jira/browse/HIVE-9506
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: llap

 Attachments: HIVE-9506.1.patch








[jira] [Commented] (HIVE-9188) BloomFilter in ORC row group index

2015-01-29 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298021#comment-14298021
 ] 

Gunther Hagleitner commented on HIVE-9188:
--

Should look at the test failures (probably unrelated). Otherwise: +1

 BloomFilter in ORC row group index
 --

 Key: HIVE-9188
 URL: https://issues.apache.org/jira/browse/HIVE-9188
 Project: Hive
  Issue Type: New Feature
  Components: File Formats
Affects Versions: 0.15.0
Reporter: Prasanth Jayachandran
Assignee: Prasanth Jayachandran
  Labels: orcfile
 Attachments: HIVE-9188.1.patch, HIVE-9188.2.patch, HIVE-9188.3.patch, 
 HIVE-9188.4.patch, HIVE-9188.5.patch


 Bloom filters are a well-known probabilistic data structure for set membership 
 checking. We can use bloom filters in the ORC index for better row group pruning. 
 Currently, the ORC row group index uses min/max statistics to eliminate row 
 groups (stripes as well) that do not satisfy the predicate condition specified in 
 the query. But in some cases, min/max based elimination is not effective 
 (unsorted columns with a wide range of entries). Bloom filters can 
 be an effective and efficient alternative for row group/split elimination for 
 point queries or queries with an IN clause.
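
 To illustrate the set-membership idea described above, here is a minimal 
 sketch of a Bloom filter. It is not the ORC implementation: the class name 
 SimpleBloomFilter, the FNV-1a hash and the double-hashing scheme are all 
 assumptions chosen only to keep the example short.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Hypothetical, minimal Bloom filter for illustration; not the ORC code.
final class SimpleBloomFilter {
    private final BitSet bits;
    private final int numBits;
    private final int numHashes;

    SimpleBloomFilter(int numBits, int numHashes) {
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    // 64-bit FNV-1a hash over the UTF-8 bytes of the value.
    private static long fnv1a(String value) {
        long h = 0xcbf29ce484222325L;
        for (byte b : value.getBytes(StandardCharsets.UTF_8)) {
            h ^= (b & 0xff);
            h *= 0x100000001b3L;
        }
        return h;
    }

    // Derive the i-th bit position from one hash via double hashing.
    private int position(long hash, int i) {
        int h1 = (int) hash;
        int h2 = (int) (hash >>> 32);
        return Math.floorMod(h1 + i * h2, numBits);
    }

    void add(String value) {
        long hash = fnv1a(value);
        for (int i = 0; i < numHashes; i++) {
            bits.set(position(hash, i));
        }
    }

    // false: the value was definitely never added.
    // true: the value may have been added (false positives are possible).
    boolean mightContain(String value) {
        long hash = fnv1a(value);
        for (int i = 0; i < numHashes; i++) {
            if (!bits.get(position(hash, i))) {
                return false;
            }
        }
        return true;
    }
}
```

 A reader holding one such filter per row group could skip any row group for 
 which mightContain returns false for the value in a point query. A true 
 result only means the row group still has to be read.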





[jira] [Commented] (HIVE-8307) null character in columns.comments schema property breaks jobconf.xml

2015-01-28 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14296469#comment-14296469
 ] 

Gunther Hagleitner commented on HIVE-8307:
--

Oh yeah, I remember. But presumably you could have a non-printable or null byte 
in a column comment too (the parser allows it, I think). So escaping might be 
safer yet. Also, since we use a Properties object it seems forgivable to 
assume that the object is serializable. Might be less of a gotcha if we change 
it, no?
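
As a rough sketch of what such escaping could look like (this is not the 
actual Hive change; the class CommentEscaper and its backslash-u scheme are 
assumptions made for illustration), control characters that XML 1.0 does not 
allow can be rewritten before the value reaches the job conf and restored on 
read:

```java
// Hypothetical helper, not part of Hive: escapes control characters that
// XML 1.0 forbids so a property value can be written to an XML file.
final class CommentEscaper {

    // Backslashes are doubled; control chars other than tab, LF and CR
    // become a literal backslash, 'u' and four hex digits.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c == '\\') {
                sb.append("\\\\");
            } else if (c < 0x20 && c != '\t' && c != '\n' && c != '\r') {
                sb.append(String.format("\\u%04x", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    // Inverse of escape(): restores doubled backslashes and escaped chars.
    static String unescape(String s) {
        StringBuilder sb = new StringBuilder(s.length());
        int i = 0;
        while (i < s.length()) {
            char c = s.charAt(i);
            if (c == '\\' && i + 1 < s.length()) {
                char next = s.charAt(i + 1);
                if (next == '\\') {
                    sb.append('\\');
                    i += 2;
                    continue;
                }
                if (next == 'u' && i + 6 <= s.length()) {
                    sb.append((char) Integer.parseInt(s.substring(i + 2, i + 6), 16));
                    i += 6;
                    continue;
                }
            }
            sb.append(c);
            i++;
        }
        return sb.toString();
    }
}
```

The sketch only covers characters below 0x20. An XML character reference is 
not an option for the null character itself, since XML 1.0 rejects it even in 
that form, which is why an application-level escape is used here.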

 null character in columns.comments schema property breaks jobconf.xml
 -

 Key: HIVE-8307
 URL: https://issues.apache.org/jira/browse/HIVE-8307
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.13.0, 0.13.1
Reporter: Carl Laird

 It would appear that the fix for 
 https://issues.apache.org/jira/browse/HIVE-6681 is causing the null character 
 to show up in job config xml files:
 I get the following when trying to insert into an elasticsearch backed table:
 [Fatal Error] :336:51: Character reference #
 14/06/17 14:40:11 FATAL conf.Configuration: error parsing conf file: 
 org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; Character 
 reference #
 Exception in thread main java.lang.RuntimeException: 
 org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; Character 
 reference #
 at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1263)
 at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1129)
 at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1063)
 at org.apache.hadoop.conf.Configuration.get(Configuration.java:416)
 at org.apache.hadoop.conf.Configuration.getBoolean(Configuration.java:604)
 at org.apache.hadoop.hive.conf.HiveConf.getBoolVar(HiveConf.java:1273)
 at org.apache.hadoop.hive.ql.exec.mr.ExecDriver.main(ExecDriver.java:667)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
 Caused by: org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; 
 Character reference #
 at 
 com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:251)
 at 
 com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:300)
 at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:121)
 at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1181)
 ... 11 more
 Execution failed with exit status: 1
 Line 336 of jobconf.xml:
 <property><name>columns.comments</name><value>&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;</value></property>
 See 
 https://groups.google.com/forum/#!msg/mongodb-user/lKbha0SzMP8/jvE8ZrJom4AJ 
 for more discussion.





[jira] [Commented] (HIVE-8307) null character in columns.comments schema property breaks jobconf.xml

2015-01-28 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295634#comment-14295634
 ] 

Gunther Hagleitner commented on HIVE-8307:
--

Meant to say [~ashutoshc]

 null character in columns.comments schema property breaks jobconf.xml
 -

 Key: HIVE-8307
 URL: https://issues.apache.org/jira/browse/HIVE-8307
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.13.0, 0.13.1
Reporter: Carl Laird

 It would appear that the fix for 
 https://issues.apache.org/jira/browse/HIVE-6681 is causing the null character 
 to show up in job config xml files:
 I get the following when trying to insert into an elasticsearch backed table:
 [Fatal Error] :336:51: Character reference #
 14/06/17 14:40:11 FATAL conf.Configuration: error parsing conf file: 
 org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; Character 
 reference #
 Exception in thread main java.lang.RuntimeException: 
 org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; Character 
 reference #
 at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1263)
 at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1129)
 at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1063)
 at org.apache.hadoop.conf.Configuration.get(Configuration.java:416)
 at org.apache.hadoop.conf.Configuration.getBoolean(Configuration.java:604)
 at org.apache.hadoop.hive.conf.HiveConf.getBoolVar(HiveConf.java:1273)
 at org.apache.hadoop.hive.ql.exec.mr.ExecDriver.main(ExecDriver.java:667)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
 Caused by: org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; 
 Character reference #
 at 
 com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:251)
 at 
 com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:300)
 at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:121)
 at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1181)
 ... 11 more
 Execution failed with exit status: 1
 Line 336 of jobconf.xml:
 <property><name>columns.comments</name><value>&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;</value></property>
 See 
 https://groups.google.com/forum/#!msg/mongodb-user/lKbha0SzMP8/jvE8ZrJom4AJ 
 for more discussion.





[jira] [Commented] (HIVE-8307) null character in columns.comments schema property breaks jobconf.xml

2015-01-28 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14295633#comment-14295633
 ] 

Gunther Hagleitner commented on HIVE-8307:
--

It's true that it's fixable on their end, but the null byte in columns.comments 
is ugly too. @Ashutosh - I think you put that in, what was that needed for?

 null character in columns.comments schema property breaks jobconf.xml
 -

 Key: HIVE-8307
 URL: https://issues.apache.org/jira/browse/HIVE-8307
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.13.0, 0.13.1
Reporter: Carl Laird

 It would appear that the fix for 
 https://issues.apache.org/jira/browse/HIVE-6681 is causing the null character 
 to show up in job config xml files:
 I get the following when trying to insert into an elasticsearch backed table:
 [Fatal Error] :336:51: Character reference #
 14/06/17 14:40:11 FATAL conf.Configuration: error parsing conf file: 
 org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; Character 
 reference #
 Exception in thread main java.lang.RuntimeException: 
 org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; Character 
 reference #
 at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1263)
 at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1129)
 at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1063)
 at org.apache.hadoop.conf.Configuration.get(Configuration.java:416)
 at org.apache.hadoop.conf.Configuration.getBoolean(Configuration.java:604)
 at org.apache.hadoop.hive.conf.HiveConf.getBoolVar(HiveConf.java:1273)
 at org.apache.hadoop.hive.ql.exec.mr.ExecDriver.main(ExecDriver.java:667)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
 Caused by: org.xml.sax.SAXParseException; lineNumber: 336; columnNumber: 51; 
 Character reference #
 at 
 com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:251)
 at 
 com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:300)
 at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:121)
 at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1181)
 ... 11 more
 Execution failed with exit status: 1
 Line 336 of jobconf.xml:
 <property><name>columns.comments</name><value>&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;&#0;</value></property>
 See 
 https://groups.google.com/forum/#!msg/mongodb-user/lKbha0SzMP8/jvE8ZrJom4AJ 
 for more discussion.





[jira] [Resolved] (HIVE-9461) LLAP: Enable local mode tests on tez to facilitate llap testing

2015-01-27 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner resolved HIVE-9461.
--
Resolution: Fixed

 LLAP: Enable local mode tests on tez to facilitate llap testing
 ---

 Key: HIVE-9461
 URL: https://issues.apache.org/jira/browse/HIVE-9461
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9461.1.patch


 local mode tez tests will help the testing of multiple fragments running at 
 the same time.





[jira] [Resolved] (HIVE-9460) LLAP: Fix some static vars in the operator pipeline

2015-01-27 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner resolved HIVE-9460.
--
Resolution: Fixed

 LLAP: Fix some static vars in the operator pipeline
 ---

 Key: HIVE-9460
 URL: https://issues.apache.org/jira/browse/HIVE-9460
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9460.1.patch


 There are a few static vars left in the operator pipeline. Can't have those 
 with multi-threaded execution...
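
 A small sketch of why static state is a problem once several fragments run 
 in one JVM. The two operator classes below are hypothetical and are not 
 taken from the Hive operator pipeline. They only contrast a static field 
 with an instance field.

```java
// Hypothetical operator keeping per-fragment state in a static field.
// Every instance in the JVM reads and writes the same counter.
final class StaticStateOperator {
    static int rowsSeen = 0;

    void process() {
        rowsSeen++; // also an unsynchronized data race across threads
    }

    int rowsSeen() {
        return rowsSeen;
    }
}

// Same operator with the counter moved to an instance field.
// Each fragment's operator now owns its own state.
final class InstanceStateOperator {
    private int rowsSeen = 0;

    void process() {
        rowsSeen++;
    }

    int rowsSeen() {
        return rowsSeen;
    }
}
```

 If two StaticStateOperator instances each process three rows, both report 
 six, even without any threads involved. With concurrent threads the 
 unsynchronized increment can additionally lose updates. The instance-field 
 variant reports three per operator.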





[jira] [Created] (HIVE-9461) LLAP: Enable local mode tests on tez to facilitate llap testing

2015-01-25 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9461:


 Summary: LLAP: Enable local mode tests on tez to facilitate llap 
testing
 Key: HIVE-9461
 URL: https://issues.apache.org/jira/browse/HIVE-9461
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


local mode tez tests will help the testing of multiple fragments running at the 
same time.





[jira] [Updated] (HIVE-9461) LLAP: Enable local mode tests on tez to facilitate llap testing

2015-01-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9461:
-
Attachment: (was: HIVE-9460.1.patch)

 LLAP: Enable local mode tests on tez to facilitate llap testing
 ---

 Key: HIVE-9461
 URL: https://issues.apache.org/jira/browse/HIVE-9461
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9461.1.patch


 local mode tez tests will help the testing of multiple fragments running at 
 the same time.





[jira] [Updated] (HIVE-9461) LLAP: Enable local mode tests on tez to facilitate llap testing

2015-01-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9461:
-
Attachment: HIVE-9461.1.patch

 LLAP: Enable local mode tests on tez to facilitate llap testing
 ---

 Key: HIVE-9461
 URL: https://issues.apache.org/jira/browse/HIVE-9461
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9461.1.patch


 local mode tez tests will help the testing of multiple fragments running at 
 the same time.





[jira] [Updated] (HIVE-9461) LLAP: Enable local mode tests on tez to facilitate llap testing

2015-01-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9461:
-
Attachment: HIVE-9460.1.patch

 LLAP: Enable local mode tests on tez to facilitate llap testing
 ---

 Key: HIVE-9461
 URL: https://issues.apache.org/jira/browse/HIVE-9461
 Project: Hive
  Issue Type: Sub-task
Affects Versions: llap
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9460.1.patch


 local mode tez tests will help the testing of multiple fragments running at 
 the same time.





[jira] [Created] (HIVE-9460) LLAP: Fix some static vars in the operator pipeline

2015-01-25 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-9460:


 Summary: LLAP: Fix some static vars in the operator pipeline
 Key: HIVE-9460
 URL: https://issues.apache.org/jira/browse/HIVE-9460
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


There are a few static vars left in the operator pipeline. Can't have those 
with multi-threaded execution...





[jira] [Updated] (HIVE-9460) LLAP: Fix some static vars in the operator pipeline

2015-01-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9460:
-
Attachment: HIVE-9460.1.patch

 LLAP: Fix some static vars in the operator pipeline
 ---

 Key: HIVE-9460
 URL: https://issues.apache.org/jira/browse/HIVE-9460
 Project: Hive
  Issue Type: Sub-task
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-9460.1.patch


 There are a few static vars left in the operator pipeline. Can't have those 
 with multi-threaded execution...





[jira] [Commented] (HIVE-9401) Backport: Fastpath for limited fetches from unpartitioned tables

2015-01-19 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14283212#comment-14283212
 ] 

Gunther Hagleitner commented on HIVE-9401:
--

+1

 Backport: Fastpath for limited fetches from unpartitioned tables
 

 Key: HIVE-9401
 URL: https://issues.apache.org/jira/browse/HIVE-9401
 Project: Hive
  Issue Type: Bug
  Components: Physical Optimizer
Affects Versions: 0.14.0, 0.15.0
Reporter: Gopal V
Assignee: Gopal V
Priority: Minor
  Labels: Backport
 Fix For: 0.14.1

 Attachments: HIVE-9401.1-branch-0.14.patch


 Currently, for flat tables, the threshold check is applicable for a query 
 like {{select * from lineitem limit 1;}}.
 This is not necessary as without a filter clause, this can be executed 
 entirely via FetchTask. Running a cluster task is redundant for this case.
 This fastpath is applicable for partitioned tables already.





[jira] [Assigned] (HIVE-9406) Some TestMiniTezCliDriver tests not returning results after two hours

2015-01-19 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner reassigned HIVE-9406:


Assignee: Gunther Hagleitner

 Some TestMiniTezCliDriver tests not returning results after two hours
 -

 Key: HIVE-9406
 URL: https://issues.apache.org/jira/browse/HIVE-9406
 Project: Hive
  Issue Type: Bug
  Components: Tests, Tez
Reporter: Brock Noland
Assignee: Gunther Hagleitner
Priority: Blocker

 See here:
 https://issues.apache.org/jira/browse/HIVE-9402?focusedCommentId=14281690&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14281690
 and here:
 https://issues.apache.org/jira/browse/HIVE-9264?focusedCommentId=14281319&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14281319





[jira] [Commented] (HIVE-9406) Some TestMiniTezCliDriver tests not returning results after two hours

2015-01-19 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14282881#comment-14282881
 ] 

Gunther Hagleitner commented on HIVE-9406:
--

Yeah, I think monitoring makes sense. I don't know if anything has changed, but 
since build 2015 everything seems fine. Before that we had 2+ hour 
builds, presumably because of hanging tests. Please let me know if you see it 
again.

 Some TestMiniTezCliDriver tests not returning results after two hours
 -

 Key: HIVE-9406
 URL: https://issues.apache.org/jira/browse/HIVE-9406
 Project: Hive
  Issue Type: Bug
  Components: Tests, Tez
Reporter: Brock Noland
Priority: Blocker

 See here:
 https://issues.apache.org/jira/browse/HIVE-9402?focusedCommentId=14281690&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14281690
 and here:
 https://issues.apache.org/jira/browse/HIVE-9264?focusedCommentId=14281319&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14281319





[jira] [Commented] (HIVE-7313) Allow in-memory/ssd session-level temp-tables

2015-01-13 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276157#comment-14276157
 ] 

Gunther Hagleitner commented on HIVE-7313:
--

Looks good to me. +1. (pending tests).

I think eventually it'd be good to expose this flag on the create tmp table 
sql, so you can choose the storage medium through sql. Maybe a follow up?

 Allow in-memory/ssd session-level temp-tables
 -

 Key: HIVE-7313
 URL: https://issues.apache.org/jira/browse/HIVE-7313
 Project: Hive
  Issue Type: Improvement
  Components: Tez
Affects Versions: 0.14.0
Reporter: Gopal V
Assignee: Gopal V
  Labels: InMemory, Performance
 Attachments: HIVE-7313.1.patch, HIVE-7313.2.patch


 With HDFS storage policies implementation, temporary tables can be written 
 with different storage/reliability policies. 
 In-session temporary tables can be targeted at both SSD and memory storage 
 policies, with fallbacks onto the disk and the associated reliability 
 trade-offs.





[jira] [Updated] (HIVE-9038) Join tests fail on Tez

2014-12-30 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-9038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-9038:
-
Assignee: Vikram Dixit K

 Join tests fail on Tez
 --

 Key: HIVE-9038
 URL: https://issues.apache.org/jira/browse/HIVE-9038
 Project: Hive
  Issue Type: Bug
  Components: Tests, Tez
Reporter: Ashutosh Chauhan
Assignee: Vikram Dixit K

 Tez doesn't run all tests. But if you run them, the following tests fail with 
 runtime exceptions pointing to bugs: 
 {{auto_join21.q,auto_join29.q,auto_join30.q,auto_join_filters.q,auto_join_nulls.q}} 





[jira] [Commented] (HIVE-9205) Change default tez install directory to use /tmp instead of /user and create the directory if it does not exist

2014-12-30 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14261486#comment-14261486
 ] 

Gunther Hagleitner commented on HIVE-9205:
--

+1

 Change default tez install directory to use /tmp instead of /user and create 
 the directory if it does not exist
 ---

 Key: HIVE-9205
 URL: https://issues.apache.org/jira/browse/HIVE-9205
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0, 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Fix For: 0.15.0, 0.14.1

 Attachments: HIVE-9205.1.patch, HIVE-9205.2.patch


 The common deployment scenario is to install the packages and start services. 
 Creating the /user/user directory is currently an extra step during manual 
 installation. In case the user tries to bring up the hive shell with tez 
 enabled, this would result in an exception. The solution is to change the 
 default install directory to /tmp (so that we have the permissions to create 
 the directory /tmp/user) and create the /tmp/user directory if it did not 
 exist earlier.
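
 The "create the directory if it does not exist" step can be sketched as 
 follows. This is a minimal Python illustration, not the patch itself: a local 
 temporary directory stands in for the /tmp root on the cluster file system, 
 and the helper name ensure_install_dir and the user name are hypothetical.

```python
import os
import tempfile


def ensure_install_dir(base_dir, user):
    """Create <base_dir>/<user> if it is missing and return its path."""
    path = os.path.join(base_dir, user)
    # exist_ok=True makes the call idempotent: no error if the directory exists.
    os.makedirs(path, exist_ok=True)
    return path


# A throwaway local directory stands in for the /tmp root (assumption).
base = tempfile.mkdtemp()
first = ensure_install_dir(base, "hive_user")
second = ensure_install_dir(base, "hive_user")  # second call changes nothing
```

 Because the call is idempotent, it can run on every shell start without 
 first checking whether the directory is already there.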



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-9155) HIVE_LOCKS uses int instead of bigint hive-txn-schema-0.14.0.mssql.sql

2014-12-22 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256572#comment-14256572
 ] 

Gunther Hagleitner commented on HIVE-9155:
--

+1 for 0.14.1

 HIVE_LOCKS uses int instead of bigint hive-txn-schema-0.14.0.mssql.sql
 --

 Key: HIVE-9155
 URL: https://issues.apache.org/jira/browse/HIVE-9155
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.14.0
Reporter: Eugene Koifman
Assignee: Eugene Koifman
 Fix For: 0.15.0

 Attachments: HIVE-9155.2.patch, HIVE-9155.3.patch, HIVE-9155.patch


 HL_LAST_HEARTBEAT and HL_ACQUIRED_AT should use bigint, which is 8 bytes and 
 big enough for a timestamp. hive-schema-0.15.0.mssql.sql is already correct.
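
 The size argument can be checked with a little arithmetic. The sketch below 
 is only an illustration and assumes the columns hold milliseconds since the 
 Unix epoch, which the description does not state; the names fits and 
 heartbeat_ms are made up for the example.

```python
from datetime import datetime, timezone

INT_MAX = 2**31 - 1      # largest signed 4-byte integer
BIGINT_MAX = 2**63 - 1   # largest signed 8-byte integer


def fits(value, max_value):
    """Return True if value lies in the signed range bounded by max_value."""
    return -max_value - 1 <= value <= max_value


# Assumption: the timestamp is stored as milliseconds since the Unix epoch.
heartbeat_ms = int(datetime(2014, 12, 22, tzinfo=timezone.utc).timestamp() * 1000)

print(fits(heartbeat_ms, INT_MAX))     # False: too large for a 4-byte int
print(fits(heartbeat_ms, BIGINT_MAX))  # True: fits in an 8-byte bigint
```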



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8888) Mapjoin with LateralViewJoin generates wrong plan in Tez

2014-11-28 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228138#comment-14228138
 ] 

Gunther Hagleitner commented on HIVE-8888:
--

failures are unrelated.

 Mapjoin with LateralViewJoin generates wrong plan in Tez
 

 Key: HIVE-8888
 URL: https://issues.apache.org/jira/browse/HIVE-8888
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0, 0.14.0, 0.13.1, 0.15.0
Reporter: Prasanth J
Assignee: Prasanth J
 Fix For: 0.14.1

 Attachments: HIVE-8888.1.patch, HIVE-8888.2.patch, HIVE-8888.3.patch, 
 HIVE-8888.4.patch, HIVE-8888.5.patch


 Queries like these 
 {code}
 with sub1 as
 (select aid, avalue from expod1 lateral view explode(av) avs as avalue ),
 sub2 as
 (select bid, bvalue from expod2 lateral view explode(bv) bvs as bvalue)
 select sub1.aid, sub1.avalue, sub2.bvalue
 from sub1,sub2
 where sub1.aid=sub2.bid;
 {code}
 generate twice the number of rows in Tez when compared to MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8964) Some TestMiniTezCliDriver tests taking two hours

2014-11-28 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228139#comment-14228139
 ] 

Gunther Hagleitner commented on HIVE-8964:
--

Alright, with the Tez guys' help I figured out that the planner was producing a 
cyclic graph. I've re-opened HIVE-8888 and added a new patch. [~brocknoland], 
once the new patch in HIVE-8888 is in, can you re-enable the test on the build 
machine? (Or did you disable it in the source somewhere?)
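
For reference, a cyclic plan graph of the kind mentioned above can be detected 
with a depth-first search. The following is a minimal Python sketch, not the 
check that Tez or Hive actually performs; the function name has_cycle and the 
vertex names are hypothetical.

```python
def has_cycle(graph):
    """Return True if the directed graph (dict: node -> successors) has a cycle."""
    WHITE, GRAY, BLACK = 0, 1, 2  # unvisited, on the current DFS path, finished
    color = {}

    def visit(node):
        color[node] = GRAY
        for succ in graph.get(node, []):
            state = color.get(succ, WHITE)
            if state == GRAY:  # back edge: succ is on the current path
                return True
            if state == WHITE and visit(succ):
                return True
        color[node] = BLACK
        return False

    return any(color.get(n, WHITE) == WHITE and visit(n) for n in graph)


# Hypothetical vertex names, for illustration only.
acyclic_plan = {"Map 1": ["Reducer 2"], "Map 3": ["Reducer 2"], "Reducer 2": []}
cyclic_plan = {"Map 1": ["Map 2"], "Map 2": ["Map 1"]}

print(has_cycle(acyclic_plan))  # False
print(has_cycle(cyclic_plan))   # True
```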

 Some TestMiniTezCliDriver tests taking two hours
 

 Key: HIVE-8964
 URL: https://issues.apache.org/jira/browse/HIVE-8964
 Project: Hive
  Issue Type: Bug
Reporter: Brock Noland
Assignee: Gunther Hagleitner
Priority: Blocker

 The test {{TestMiniTezCliDriver}} with the following query files:
 vectorization_16.q,mapjoin_mapjoin.q,groupby2.q,lvj_mapjoin.q,vectorization_5.q,vectorization_pushdown.q,orc_merge_incompat1.q,cbo_gby.q,vectorization_4.q,auto_join0.q,cross_product_check_1.q,vectorization_not.q,update_where_no_match.q,ctas.q,cbo_udf_udaf.q
 is timing out after two hours, severely delaying the Hive precommits
 http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-1898/failed/TestMiniTezCliDriver-vectorization_16.q-mapjoin_mapjoin.q-groupby2.q-and-12-more/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8990) mapjoin_mapjoin.q is failing on Tez (missed golden file update)

2014-11-28 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-8990:
-
Attachment: HIVE-8990.1.patch

 mapjoin_mapjoin.q is failing on Tez (missed golden file update)
 ---

 Key: HIVE-8990
 URL: https://issues.apache.org/jira/browse/HIVE-8990
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-8990.1.patch


 mapjoin_mapjoin.q was updated (SORT_BEFORE_DIFF). However, since the Tez tests 
 were stuck, the accompanying update to the golden file was missed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8990) mapjoin_mapjoin.q is failing on Tez (missed golden file update)

2014-11-28 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-8990:
-
Status: Patch Available  (was: Open)

 mapjoin_mapjoin.q is failing on Tez (missed golden file update)
 ---

 Key: HIVE-8990
 URL: https://issues.apache.org/jira/browse/HIVE-8990
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-8990.1.patch


 mapjoin_mapjoin.q was updated (SORT_BEFORE_DIFF). However, since the Tez tests 
 were stuck, the accompanying update to the golden file was missed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8990) mapjoin_mapjoin.q is failing on Tez (missed golden file update)

2014-11-28 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-8990:


 Summary: mapjoin_mapjoin.q is failing on Tez (missed golden file 
update)
 Key: HIVE-8990
 URL: https://issues.apache.org/jira/browse/HIVE-8990
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


mapjoin_mapjoin.q was updated (SORT_BEFORE_DIFF). However, since the Tez tests 
were stuck, the accompanying update to the golden file was missed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8888) Mapjoin with LateralViewJoin generates wrong plan in Tez

2014-11-28 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228485#comment-14228485
 ] 

Gunther Hagleitner commented on HIVE-8888:
--

[~brocknoland] I have verified that lvj_mapjoin works locally. Thanks.

 Mapjoin with LateralViewJoin generates wrong plan in Tez
 

 Key: HIVE-8888
 URL: https://issues.apache.org/jira/browse/HIVE-8888
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0, 0.14.0, 0.13.1, 0.15.0
Reporter: Prasanth J
Assignee: Prasanth J
 Fix For: 0.14.1

 Attachments: HIVE-8888.1.patch, HIVE-8888.2.patch, HIVE-8888.3.patch, 
 HIVE-8888.4.patch, HIVE-8888.5.patch


 Queries like these 
 {code}
 with sub1 as
 (select aid, avalue from expod1 lateral view explode(av) avs as avalue ),
 sub2 as
 (select bid, bvalue from expod2 lateral view explode(bv) bvs as bvalue)
 select sub1.aid, sub1.avalue, sub2.bvalue
 from sub1,sub2
 where sub1.aid=sub2.bid;
 {code}
 generate twice the number of rows in Tez when compared to MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8964) Some TestMiniTezCliDriver tests taking two hours

2014-11-27 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228019#comment-14228019
 ] 

Gunther Hagleitner commented on HIVE-8964:
--

Yes, it's definitely lvj_mapjoin.q. I can repro the hang - no idea why yet.

 Some TestMiniTezCliDriver tests taking two hours
 

 Key: HIVE-8964
 URL: https://issues.apache.org/jira/browse/HIVE-8964
 Project: Hive
  Issue Type: Bug
Reporter: Brock Noland
Assignee: Gunther Hagleitner
Priority: Blocker

 The test {{TestMiniTezCliDriver}} with the following query files:
 vectorization_16.q,mapjoin_mapjoin.q,groupby2.q,lvj_mapjoin.q,vectorization_5.q,vectorization_pushdown.q,orc_merge_incompat1.q,cbo_gby.q,vectorization_4.q,auto_join0.q,cross_product_check_1.q,vectorization_not.q,update_where_no_match.q,ctas.q,cbo_udf_udaf.q
 is timing out after two hours, severely delaying the Hive precommits
 http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-1898/failed/TestMiniTezCliDriver-vectorization_16.q-mapjoin_mapjoin.q-groupby2.q-and-12-more/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8964) Some TestMiniTezCliDriver tests taking two hours

2014-11-27 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228026#comment-14228026
 ] 

Gunther Hagleitner commented on HIVE-8964:
--

Happens with Tez 0.6.0 also. Same as [~brocknoland], the only exception I see is:

{noformat}
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.hdfs.server.namenode.LeaseExpiredException): No lease on /Users/ghagleitner/Projects/hive-trunk2/itests/qtest/target/tmp/scratchdir/ghagleitner/_tez_session_dir/dc4fca20-4a39-4452-975a-467bda4947ca/.tez/application_1417137410462_0001/recovery/1/summary (inode 16430): File does not exist. Holder DFSClient_NONMAPREDUCE_1900574341_1 does not have any open files.
  at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(FSNamesystem.java:3083)
  at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFileInternal(FSNamesystem.java:3170)
  at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFile(FSNamesystem.java:3140)
  at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.complete(NameNodeRpcServer.java:665)
  at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.complete(ClientNamenodeProtocolServerSideTranslatorPB.java:499)
  at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
  at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)
  at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2013)
  at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2009)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:394)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2007)

  at org.apache.hadoop.ipc.Client.call(Client.java:1411)
  at org.apache.hadoop.ipc.Client.call(Client.java:1364)
  at 

[jira] [Reopened] (HIVE-8888) Mapjoin with LateralViewJoin generates wrong plan in Tez

2014-11-27 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner reopened HIVE-8888:
--

The original patch doesn't actually work. The problem is that we still collect 
the mapjoin (mj) while visiting the nodes. We need to clear that before bailing 
out of work generation.

 Mapjoin with LateralViewJoin generates wrong plan in Tez
 

 Key: HIVE-8888
 URL: https://issues.apache.org/jira/browse/HIVE-8888
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0, 0.14.0, 0.13.1, 0.15.0
Reporter: Prasanth J
Assignee: Prasanth J
 Fix For: 0.14.1

 Attachments: HIVE-8888.1.patch, HIVE-8888.2.patch, HIVE-8888.3.patch, 
 HIVE-8888.4.patch


 Queries like these 
 {code}
 with sub1 as
 (select aid, avalue from expod1 lateral view explode(av) avs as avalue ),
 sub2 as
 (select bid, bvalue from expod2 lateral view explode(bv) bvs as bvalue)
 select sub1.aid, sub1.avalue, sub2.bvalue
 from sub1,sub2
 where sub1.aid=sub2.bid;
 {code}
 generate twice the number of rows in Tez when compared to MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8888) Mapjoin with LateralViewJoin generates wrong plan in Tez

2014-11-27 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-8888:
-
Attachment: HIVE-8888.5.patch

 Mapjoin with LateralViewJoin generates wrong plan in Tez
 

 Key: HIVE-8888
 URL: https://issues.apache.org/jira/browse/HIVE-8888
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0, 0.14.0, 0.13.1, 0.15.0
Reporter: Prasanth J
Assignee: Prasanth J
 Fix For: 0.14.1

 Attachments: HIVE-8888.1.patch, HIVE-8888.2.patch, HIVE-8888.3.patch, 
 HIVE-8888.4.patch, HIVE-8888.5.patch


 Queries like these 
 {code}
 with sub1 as
 (select aid, avalue from expod1 lateral view explode(av) avs as avalue ),
 sub2 as
 (select bid, bvalue from expod2 lateral view explode(bv) bvs as bvalue)
 select sub1.aid, sub1.avalue, sub2.bvalue
 from sub1,sub2
 where sub1.aid=sub2.bid;
 {code}
 generate twice the number of rows in Tez when compared to MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8888) Mapjoin with LateralViewJoin generates wrong plan in Tez

2014-11-27 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-8888:
-
Status: Patch Available  (was: Reopened)

 Mapjoin with LateralViewJoin generates wrong plan in Tez
 

 Key: HIVE-8888
 URL: https://issues.apache.org/jira/browse/HIVE-8888
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.1, 0.13.0, 0.14.0, 0.15.0
Reporter: Prasanth J
Assignee: Prasanth J
 Fix For: 0.14.1

 Attachments: HIVE-8888.1.patch, HIVE-8888.2.patch, HIVE-8888.3.patch, 
 HIVE-8888.4.patch, HIVE-8888.5.patch


 Queries like these 
 {code}
 with sub1 as
 (select aid, avalue from expod1 lateral view explode(av) avs as avalue ),
 sub2 as
 (select bid, bvalue from expod2 lateral view explode(bv) bvs as bvalue)
 select sub1.aid, sub1.avalue, sub2.bvalue
 from sub1,sub2
 where sub1.aid=sub2.bid;
 {code}
 generate twice the number of rows in Tez when compared to MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8966) Delta files created by hive hcatalog streaming cannot be compacted

2014-11-26 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14227045#comment-14227045
 ] 

Gunther Hagleitner commented on HIVE-8966:
--

+1 for 0.14.1

 Delta files created by hive hcatalog streaming cannot be compacted
 --

 Key: HIVE-8966
 URL: https://issues.apache.org/jira/browse/HIVE-8966
 Project: Hive
  Issue Type: Bug
  Components: HCatalog
Affects Versions: 0.14.0
 Environment: hive
Reporter: Jihong Liu
Assignee: Alan Gates
Priority: Critical

 Hive HCatalog streaming also creates a file named bucket_n_flush_length in 
 each delta directory, where n is the bucket number. compactor.CompactorMR 
 thinks this file also needs to be compacted. However, this file cannot be 
 compacted, so compactor.CompactorMR does not continue with the compaction. 
 In a test, after the bucket_n_flush_length file was removed, the alter 
 table partition compact finished successfully. If that file is not deleted, 
 nothing is compacted. 
 This is probably a very severe bug. Both 0.13 and 0.14 have this issue.
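
 One way to picture the behavior the report asks for is a filter that leaves 
 the flush-length side files out of the compaction input. The Python sketch 
 below is only an illustration under that assumption, not the actual fix in 
 compactor.CompactorMR; the helper name compactable_files and the file names 
 in the listing are hypothetical.

```python
FLUSH_LENGTH_SUFFIX = "_flush_length"  # side-file suffix named in the report


def compactable_files(delta_files):
    """Return only the bucket files, dropping the *_flush_length side files."""
    return [name for name in delta_files if not name.endswith(FLUSH_LENGTH_SUFFIX)]


# Hypothetical listing of one delta directory.
listing = ["bucket_0", "bucket_0_flush_length", "bucket_1", "bucket_1_flush_length"]
print(compactable_files(listing))  # ['bucket_0', 'bucket_1']
```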



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8880) non-synchronized access to split list in OrcInputFormat

2014-11-26 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14227048#comment-14227048
 ] 

Gunther Hagleitner commented on HIVE-8880:
--

+1, also for 0.14.1

 non-synchronized access to split list in OrcInputFormat
 ---

 Key: HIVE-8880
 URL: https://issues.apache.org/jira/browse/HIVE-8880
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Alan Gates
Assignee: Alan Gates
 Fix For: 0.14.1

 Attachments: HIVE-8880.patch


 When adding delta files to the list of ORC splits, access to the list is not 
 synchronized even though it is shared across threads. All other additions to the 
 list are synchronized.
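
 The general pattern, every addition to a shared list going through the same 
 lock, can be sketched as below. This is a Python illustration of the idea 
 only, not the OrcInputFormat code; the class name SplitCollector and the 
 split labels are hypothetical.

```python
import threading


class SplitCollector:
    """Collects splits from several worker threads behind a single lock."""

    def __init__(self):
        self._lock = threading.Lock()
        self._splits = []

    def add_all(self, new_splits):
        # Every mutation of the shared list goes through the same lock.
        with self._lock:
            self._splits.extend(new_splits)

    def snapshot(self):
        # Copy under the lock so callers never see a list that is mid-update.
        with self._lock:
            return list(self._splits)


collector = SplitCollector()
workers = [
    threading.Thread(
        target=collector.add_all,
        args=([f"split-{i}-{j}" for j in range(100)],),
    )
    for i in range(8)
]
for w in workers:
    w.start()
for w in workers:
    w.join()
```

 Keeping the lock inside the collector means no caller can add to the list 
 without holding it, which is the kind of gap the description points out.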



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

