[jira] [Created] (HIVE-11305) LLAP: Hybrid Map-join cache returns invalid data
Gopal V created HIVE-11305: -- Summary: LLAP: Hybrid Map-join cache returns invalid data Key: HIVE-11305 URL: https://issues.apache.org/jira/browse/HIVE-11305 Project: Hive Issue Type: Sub-task Affects Versions: llap Environment: TPC-DS 200 scale data Reporter: Gopal V Assignee: Sergey Shelukhin Priority: Critical Fix For: llap Start a 1-node LLAP cluster with 16 executors and run attached test-case on the single node instance. {code} Caused by: java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.persistence.HybridHashTableContainer cannot be cast to org.apache.hadoop.hive.ql.exec.vector.mapjoin.hashtable.VectorMapJoinTableContainer at org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.loadHashTable(VectorMapJoinCommonOperator.java:648) at org.apache.hadoop.hive.ql.exec.MapJoinOperator.cleanUpInputFileChangedOp(MapJoinOperator.java:314) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1104) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1108) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1108) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1108) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1108) at org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:37) at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) ... 17 more {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HIVE-11306) Add a bloom-1 filter for Hybrid MapJoin spills
Gopal V created HIVE-11306: -- Summary: Add a bloom-1 filter for Hybrid MapJoin spills Key: HIVE-11306 URL: https://issues.apache.org/jira/browse/HIVE-11306 Project: Hive Issue Type: Improvement Components: Hive Affects Versions: 1.3.0, 2.0.0 Reporter: Gopal V Assignee: Gopal V HIVE-9277 implemented Spillable joins for Tez, which suffers from a corner-case performance issue when joining wide small tables against a narrow big table (like a user info table join events stream). The fact that the wide table is spilled causes extra IO, even though the nDV of the join key might be in the thousands. A cheap bloom-1 filter would add a massive performance gain for such queries, massively cutting down on the spill IO costs for the big-table spills. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Hive unit tests with JDBC connection (Submit patch)
I added a unit test for Hive HPL/SQL tool that uses a JDBC connection to access src table. When I submit my patch, and use URL jdbc:hive2://localhost:1 I got: java.net.ConnectException: Connection refused When I use URL jdbc:hive2:// I got java.lang.NoClassDefFoundError: org/apache/hadoop/mapreduce/TaskAttemptContext In .pom I use dependency groupIdorg.apache.hive/groupId artifactIdhive-jdbc/artifactId version1.2.1/version scopetest/scope /dependency Can someone please suggest what is the proper way to access src table using JDBC in Hive unit tests when submitting a patch? Thank you, Dmitry
[jira] [Created] (HIVE-11308) Implement NullAppender using Log4j2 API
Prasanth Jayachandran created HIVE-11308: Summary: Implement NullAppender using Log4j2 API Key: HIVE-11308 URL: https://issues.apache.org/jira/browse/HIVE-11308 Project: Hive Issue Type: Sub-task Reporter: Prasanth Jayachandran Assignee: Prasanth Jayachandran NullAppender pulls in log4j 1.x jars and uses AppenderSkeleton API. Reimplement NullAppender to make use of Log4j2 API. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Hive-0.14 - Build # 1016 - Still Failing
Changes for Build #996 Changes for Build #997 Changes for Build #998 Changes for Build #999 Changes for Build #1000 Changes for Build #1001 Changes for Build #1002 Changes for Build #1003 Changes for Build #1004 Changes for Build #1005 Changes for Build #1006 Changes for Build #1007 Changes for Build #1008 Changes for Build #1009 Changes for Build #1010 Changes for Build #1011 Changes for Build #1012 Changes for Build #1013 Changes for Build #1014 Changes for Build #1015 Changes for Build #1016 No tests ran. The Apache Jenkins build system has built Hive-0.14 (build #1016) Status: Still Failing Check console output at https://builds.apache.org/job/Hive-0.14/1016/ to view the results.
[jira] [Created] (HIVE-11307) Remove getWritableObject from ColumnVectorBatch
Owen O'Malley created HIVE-11307: Summary: Remove getWritableObject from ColumnVectorBatch Key: HIVE-11307 URL: https://issues.apache.org/jira/browse/HIVE-11307 Project: Hive Issue Type: Sub-task Components: Vectorization Reporter: Owen O'Malley Assignee: Owen O'Malley Fix For: 2.0.0 ColumnVectorBatch.getWritableObject is only used in a few tests and is really problematic when adding the complex types to vectorization. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HIVE-11309) Replace PidDailyRollingFileAppender with equivalent log4j2 implementation
Prasanth Jayachandran created HIVE-11309: Summary: Replace PidDailyRollingFileAppender with equivalent log4j2 implementation Key: HIVE-11309 URL: https://issues.apache.org/jira/browse/HIVE-11309 Project: Hive Issue Type: Sub-task Affects Versions: 2.0.0 Reporter: Prasanth Jayachandran Assignee: Prasanth Jayachandran PidDailyRollingFileAppender appends pid@hostname information to file name output. Similar thing can be achieved by adding a custom file pattern converter in log4j2. -- This message was sent by Atlassian JIRA (v6.3.4#6332)