[jira] [Created] (HIVE-25032) Optimise PartitionManagementTask

2021-04-19 Thread Rajesh Balamohan (Jira)
Rajesh Balamohan created HIVE-25032:
---

 Summary: Optimise PartitionManagementTask
 Key: HIVE-25032
 URL: https://issues.apache.org/jira/browse/HIVE-25032
 Project: Hive
  Issue Type: Improvement
Reporter: Rajesh Balamohan


When large number of tables and dbs are present, it takes lot of time to 
iterate over all tables in PartitionManagementTask.

If table parameters are populated in TableMeta, it will cut down lots of 
interactions with HMS in PartitionManagementTask.

 

Code Ref:

[https://github.com/apache/hive/blob/master/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/PartitionManagementTask.java#L126]

 

[https://github.com/apache/hive/blob/master/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/PartitionManagementTask.java#L132]

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25031) Flaky test : Testing / split-01 / PostProcess / testCliDriver[orc_ppd_basic] – org.apache.hadoop.hive.cli.split1.TestMiniLlapCliDriver

2021-04-19 Thread Kishen Das (Jira)
Kishen Das created HIVE-25031:
-

 Summary: Flaky test : Testing / split-01 / PostProcess / 
testCliDriver[orc_ppd_basic] – 
org.apache.hadoop.hive.cli.split1.TestMiniLlapCliDriver
 Key: HIVE-25031
 URL: https://issues.apache.org/jira/browse/HIVE-25031
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2
Reporter: Kishen Das


java.lang.AssertionError: 
Client Execution succeeded but contained differences (error code = 1) after 
executing orc_ppd_basic.q 
1795c1795
< HDFS_BYTES_READ: 0
---
> HDFS_BYTES_READ: 1759
1797c1797
< HDFS_READ_OPS: 2
---
> HDFS_READ_OPS: 3
 at org.junit.Assert.fail(Assert.java:89)
 at org.apache.hadoop.hive.ql.QTestUtil.failedDiff(QTestUtil.java:1045)
 at 
org.apache.hadoop.hive.cli.control.CoreCliDriver.runTest(CoreCliDriver.java:183)
 at org.apache.hadoop.hive.cli.control.CliAdapter.runTest(CliAdapter.java:157)
 at 
org.apache.hadoop.hive.cli.split1.TestMiniLlapCliDriver.testCliDriver(TestMiniLlapCliDriver.java:62)
 at sun.reflect.GeneratedMethodAccessor307.invoke(Unknown Source)
 at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498)
 at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
 at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
 at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
 at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
 at 
org.apache.hadoop.hive.cli.control.CliAdapter$2$1.evaluate(CliAdapter.java:135)
 at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
 at 
org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
 at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
 at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
 at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
 at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
 at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
 at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
 at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
 at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
 at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
 at org.junit.runners.Suite.runChild(Suite.java:128)
 at org.junit.runners.Suite.runChild(Suite.java:27)
 at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
 at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
 at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
 at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
 at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
 at 
org.apache.hadoop.hive.cli.control.CliAdapter$1$1.evaluate(CliAdapter.java:95)
 at org.junit.rules.RunRules.evaluate(RunRules.java:20)
 at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
 at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
 at 
org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
 at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
 at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
 at 
org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
 at 
org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:377)
 at org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:138)
 at org.apache.maven.surefire.booter.ForkedBooter.run(ForkedBooter.java:465)
 at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:451)
 
021-04-12T23:31:19,937 ERROR [main] QTestUtil: Client Execution succeeded but 
contained differences (error code = 1) after executing orc_ppd_basic.q 
1795c1795
< HDFS_BYTES_READ: 0
---
> HDFS_BYTES_READ: 1759
1797c1797
< HDFS_READ_OPS: 2
---
> HDFS_READ_OPS: 3
 
2021-04-12T23:31:19,938 INFO [main] control.CoreCliDriver: Done query 
orc_ppd_basic.q. succeeded=false, skipped=false. ElapsedTime(ms)=45693
Done query orc_ppd_basic.q. succeeded=false, skipped=false. 
ElapsedTime(ms)=45693
2021-04-12T23:31:20,044 INFO [main] zookeeper.ZooKeeper: Session: 
0x100054ba2210061 closed
2021-04-12T23:31:20,049 INFO [main] zookeeper.ZooKeeper: Initiatin



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25030) Flaky test : Testing / split-16 / PostProcess / testWmPoolMetricsAfterKillTrigger – org.apache.hive.jdbc.TestWMMetricsWithTrigger

2021-04-19 Thread Kishen Das (Jira)
Kishen Das created HIVE-25030:
-

 Summary: Flaky test : Testing / split-16 / PostProcess / 
testWmPoolMetricsAfterKillTrigger – 
org.apache.hive.jdbc.TestWMMetricsWithTrigger
 Key: HIVE-25030
 URL: https://issues.apache.org/jira/browse/HIVE-25030
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2
Reporter: Kishen Das


h4. Stacktrace
java.lang.AssertionError: Exception while executing statement expected null, 
but was:
 at org.junit.Assert.fail(Assert.java:89)
 at org.junit.Assert.failNotNull(Assert.java:756)
 at org.junit.Assert.assertNull(Assert.java:738)
 at 
org.apache.hive.jdbc.TestWMMetricsWithTrigger.testWmPoolMetricsAfterKillTrigger(TestWMMetricsWithTrigger.java:207)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
 at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498)
 at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
 at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
 at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
 at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
 at 
org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:288)
 at 
org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:282)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at java.lang.Thread.run(Thread.java:748)
 
 
 
ound-Pool: Thread-1120] reexec.ReExecuteLostAMQueryPlugin: Got exception 
message: Vertex failed, vertexName=Reducer 2, 
vertexId=vertex_1618310614458_0001_1_02, diagnostics=[Task failed, 
taskId=task_1618310614458_0001_1_02_00, diagnostics=[TaskAttempt 0 failed, 
info=[Error: Error while running task ( failure ) : 
attempt_1618310614458_0001_1_02_00_0:java.lang.RuntimeException: 
java.lang.SecurityException: UDF 
org.apache.hive.jdbc.TestKillQueryWithAuthorizationDisabled.SleepMsUDF is not 
allowed
 at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:365)
 at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:277)
 at 
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:381)
 at 
org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:75)
 at 
org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:62)
 at java.security.AccessController.doPrivileged(Native Method)
 at javax.security.auth.Subject.doAs(Subject.java:422)
 at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1682)
 at 
org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:62)
 at 
org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:38)
 at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
 at 
org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:118)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
 at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
 at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.SecurityException: UDF 
org.apache.hive.jdbc.TestKillQueryWithAuthorizationDisabled.SleepMsUDF is not 
allowed
 at 
org.apache.hadoop.hive.ql.udf.generic.GenericUDFBridge.getUdfClassInternal(GenericUDFBridge.java:146)
 at 
org.apache.hadoop.hive.ql.udf.generic.GenericUDFBridge.getUdfClass(GenericUDFBridge.java:133)
 at 
org.apache.hadoop.hive.ql.exec.FunctionRegistry.isStateful(FunctionRegistry.java:1271)
 at 
org.apache.hadoop.hive.ql.exec.FunctionRegistry.isDeterministic(FunctionRegistry.java:1234)
 at 
org.apache.hadoop.hive.ql.exec.ExprNodeGenericFuncEvaluator.isDeterministic(ExprNodeGenericFuncEvaluator.java:192)
 at 
org.apache.hadoop.hive.ql.exec.ExprNodeEvaluator.isConsistentWithinQuery(ExprNodeEvaluator.java:117)
 at 
org.apache.hadoop.hive.ql.exec.ExprNodeEvaluatorFactory.iterate(ExprNodeEvaluatorFactory.java:102)
 at 
org.apache.hadoop.hive.ql.exec.ExprNodeEvaluatorFactory.toCachedEvals(ExprNodeEvaluatorFactory.java:76)
 at 
org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:70)
 at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:360)
 at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:549)
 at 
org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:503)
 at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:369)
 at 
org.apache.hadoop.hive.

[jira] [Created] (HIVE-25029) Remove travis builds

2021-04-19 Thread Zoltan Haindrich (Jira)
Zoltan Haindrich created HIVE-25029:
---

 Summary: Remove travis builds
 Key: HIVE-25029
 URL: https://issues.apache.org/jira/browse/HIVE-25029
 Project: Hive
  Issue Type: Improvement
Reporter: Zoltan Haindrich
Assignee: Zoltan Haindrich


travis only compiles the project - we already do much more than that during 
precommit testing.
(and it it sometimes delays build because travis cant allocate executors/etc)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25028) Hive: Select query with IS operator producing unexpected result

2021-04-19 Thread Soumyakanti Das (Jira)
Soumyakanti Das created HIVE-25028:
--

 Summary: Hive: Select query with IS operator producing unexpected 
result
 Key: HIVE-25028
 URL: https://issues.apache.org/jira/browse/HIVE-25028
 Project: Hive
  Issue Type: Bug
  Components: Parser
Reporter: Soumyakanti Das
Assignee: Soumyakanti Das


Hive: Select query with IS operator is producing unexpected result.

The following was executed on postgres:
{code:java}
sqlancer=# create table if not exists emp(name text, age int);
CREATE TABLE
sqlancer=# insert into emp values ('a', 5), ('b', 15), ('c', 12);
INSERT 0 3
sqlancer=# select emp.age from emp where emp.age > 10;
 age
-
  15
  12
(2 rows)sqlancer=# select emp.age > 10 is true from emp;
 ?column?
--
 f
 t
 t
(3 rows){code}

This is happening because IS operator has higher precedence than comparison 
operators in Hive. In most other databases, comparison operator has higher 
precedence. The grammar needs to be changed to fix the precedence.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25027) Hide Iceberg module behind a profile

2021-04-19 Thread Peter Vary (Jira)
Peter Vary created HIVE-25027:
-

 Summary: Hide Iceberg module behind a profile
 Key: HIVE-25027
 URL: https://issues.apache.org/jira/browse/HIVE-25027
 Project: Hive
  Issue Type: Task
Reporter: Peter Vary
Assignee: Peter Vary


After creating {{patched-iceberg-core}} and {{patched-iceberg-api}} modules the 
maven build works fine, but IntelliJ needs manual classpath setup for the build 
in the IntelliJ to succeed.

Most of the community does not use Iceberg and eventually the "patched" modules 
will be removed as the Hive-Iceberg integration stabilizes and the Iceberg 
project releases the changes we need. In the meantime we just hide the whole 
{{Iceberg}} module behind a profile which is only used on the CI and if the 
developer specifically sets it. 

It could be used like"
{code:java}
 mvn clean install -DskipTests -Piceberg{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25026) hive sql result is duplicate data cause of same task resubmission

2021-04-19 Thread hezhang (Jira)
hezhang created HIVE-25026:
--

 Summary: hive sql result is duplicate data cause of same task 
resubmission
 Key: HIVE-25026
 URL: https://issues.apache.org/jira/browse/HIVE-25026
 Project: Hive
  Issue Type: Bug
  Components: Hive
Affects Versions: 3.1.1
Reporter: hezhang
Assignee: hezhang






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25025) Distcp In MoveTask may cause stats info lost

2021-04-19 Thread WangHualei (Jira)
WangHualei created HIVE-25025:
-

 Summary: Distcp In MoveTask may cause stats info lost
 Key: HIVE-25025
 URL: https://issues.apache.org/jira/browse/HIVE-25025
 Project: Hive
  Issue Type: Bug
  Components: Hive
 Environment: example:

set hive.exec.copyfile.maxsize=0;
set hive.exec.copyfile.maxnumfiles=0;

insert overwrite table abd_new select * from abc;

select count(*) from abd_new ;

select * from abd_new ;

then the  count(*) result will be 0, but select * will display real data, 
because stats info lost.
Reporter: WangHualei
Assignee: WangHualei


after set  _Run_ _as_ _end_ _user_ _instead_ _of_ _Hive_ _user_ , when execute 
insert overwrite , In MoveTask ,if source byte > HIVE_EXEC_COPYFILE_MAXSIZE  
and source file count> HIVE_EXEC_COPYFILE_MAXNUMFILES , HIve will use distcp 
method, it may cause tmp stats file lost. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)