[jira] [Work logged] (HIVE-24742) Support router path or view fs path in Hive table location

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24742?focusedWorklogId=687077&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687077
 ]

ASF GitHub Bot logged work on HIVE-24742:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2574:
URL: https://github.com/apache/hive/pull/2574


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 687077)
Time Spent: 1h 50m  (was: 1h 40m)

> Support router path or view fs path in Hive table location
> --
>
> Key: HIVE-24742
> URL: https://issues.apache.org/jira/browse/HIVE-24742
> Project: Hive
> Issue Type: Improvement
> Components: Hive
> Affects Versions: 3.1.2
> Reporter: Aihua Xu
> Assignee: Aihua Xu
> Priority: Major
> Labels: pull-request-available
> Attachments: HIVE-24742.patch
>
> Time Spent: 1h 50m
> Remaining Estimate: 0h
>
> In 
> [FileUtils.java|https://github.com/apache/hive/blob/master/common/src/java/org/apache/hadoop/hive/common/FileUtils.java#L747],
>  the equalsFileSystem function checks the base URL to determine whether the 
> source and destination are on the same cluster, and decides whether to copy 
> or move the data. That does not work for viewfs or router-based file systems, 
> since viewfs://ns-default/a and viewfs://ns-default/b may be on different 
> physical clusters.
> FileSystem in HDFS supports a resolvePath() function that resolves to the 
> physical path. We can support viewfs and router paths through that function.
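
The difference between the two checks can be sketched as follows. This is a minimal illustration and not Hive's actual code: `resolve` is a hypothetical stand-in for Hadoop's `FileSystem.resolvePath()`, and the mount table entries (`hdfs://cluster1`, `hdfs://cluster2`) are invented for the example.

```java
import java.net.URI;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;

final class ResolvedFsCompare {
    // Hypothetical mount table: logical viewfs prefix -> physical location.
    static final Map<String, String> MOUNTS = new LinkedHashMap<>();
    static {
        MOUNTS.put("viewfs://ns-default/a", "hdfs://cluster1/a");
        MOUNTS.put("viewfs://ns-default/b", "hdfs://cluster2/b");
    }

    // Stand-in for FileSystem.resolvePath(): maps a logical path to its physical one.
    // Paths that match no mount point are returned unchanged.
    static URI resolve(URI logical) {
        String s = logical.toString();
        for (Map.Entry<String, String> e : MOUNTS.entrySet()) {
            String prefix = e.getKey();
            if (s.equals(prefix) || s.startsWith(prefix + "/")) {
                return URI.create(e.getValue() + s.substring(prefix.length()));
            }
        }
        return logical;
    }

    // Base-URL comparison: only scheme and authority are compared.
    static boolean sameBase(URI a, URI b) {
        return Objects.equals(a.getScheme(), b.getScheme())
            && Objects.equals(a.getAuthority(), b.getAuthority());
    }

    // Comparison after resolving both sides to their physical paths.
    static boolean samePhysicalFs(URI a, URI b) {
        return sameBase(resolve(a), resolve(b));
    }
}
```

With this mount table, `viewfs://ns-default/a/t1` and `viewfs://ns-default/b/t2` share a base URL but resolve to different clusters, so only the resolved comparison tells them apart.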



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-21614) Derby does not support CLOB comparisons

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21614?focusedWorklogId=687076&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687076
 ]

ASF GitHub Bot logged work on HIVE-21614:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2484:
URL: https://github.com/apache/hive/pull/2484


   




Issue Time Tracking
---

Worklog Id: (was: 687076)
Time Spent: 1h 40m  (was: 1.5h)

> Derby does not support CLOB comparisons
> ---
>
> Key: HIVE-21614
> URL: https://issues.apache.org/jira/browse/HIVE-21614
> Project: Hive
> Issue Type: Bug
> Components: Metastore
> Affects Versions: 2.3.4, 3.0.0
> Reporter: Vlad Rozov
> Assignee: Hank Fanchiu
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> HiveMetaStoreClient.listTableNamesByFilter() with a non-empty filter causes 
> an exception with Derby DB:
> {noformat}
> Caused by: ERROR 42818: Comparisons between 'CLOB (UCS_BASIC)' and 'CLOB (UCS_BASIC)' are not supported. Types must be comparable. String types must also have matching collation. If collation does not match, a possible solution is to cast operands to force them to the default collation (e.g. SELECT tablename FROM sys.systables WHERE CAST(tablename AS VARCHAR(128)) = 'T1')
>   at org.apache.derby.iapi.error.StandardException.newException(Unknown Source)
>   at org.apache.derby.iapi.error.StandardException.newException(Unknown Source)
>   at org.apache.derby.impl.sql.compile.BinaryComparisonOperatorNode.bindComparisonOperator(Unknown Source)
>   at org.apache.derby.impl.sql.compile.BinaryComparisonOperatorNode.bindExpression(Unknown Source)
>   at org.apache.derby.impl.sql.compile.BinaryOperatorNode.bindExpression(Unknown Source)
>   at org.apache.derby.impl.sql.compile.BinaryLogicalOperatorNode.bindExpression(Unknown Source)
>   at org.apache.derby.impl.sql.compile.AndNode.bindExpression(Unknown Source)
>   at org.apache.derby.impl.sql.compile.SelectNode.bindExpressions(Unknown Source)
>   at org.apache.derby.impl.sql.compile.DMLStatementNode.bindExpressions(Unknown Source)
>   at org.apache.derby.impl.sql.compile.DMLStatementNode.bind(Unknown Source)
>   at org.apache.derby.impl.sql.compile.CursorNode.bindStatement(Unknown Source)
>   at org.apache.derby.impl.sql.GenericStatement.prepMinion(Unknown Source)
>   at org.apache.derby.impl.sql.GenericStatement.prepare(Unknown Source)
>   at org.apache.derby.impl.sql.conn.GenericLanguageConnectionContext.prepareInternalStatement(Unknown Source)
>   ... 42 more
> {noformat}
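
The error text itself suggests casting the operands to VARCHAR. A small sketch of building such a predicate is shown below. The class and method names are hypothetical, the source does not say whether the eventual Hive fix takes this route, and it assumes Derby accepts the cast for the column in question.

```java
final class DerbyClobFilter {
    // Derby's documented upper bound for VARCHAR length.
    static final int DERBY_MAX_VARCHAR = 32672;

    // Wraps a column reference in a CAST to VARCHAR so it can be compared.
    static String castToVarchar(String column, int length) {
        if (length < 1 || length > DERBY_MAX_VARCHAR) {
            throw new IllegalArgumentException("VARCHAR length out of range: " + length);
        }
        return "CAST(" + column + " AS VARCHAR(" + length + "))";
    }

    // Builds an equality predicate with a bind parameter on the right-hand side.
    static String equalsPredicate(String column, int length) {
        return castToVarchar(column, length) + " = ?";
    }
}
```

For the column in the error message, `equalsPredicate("tablename", 128)` yields the same shape as the example quoted by Derby, with the literal replaced by a bind parameter.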





[jira] [Work logged] (HIVE-25396) Improve Exception handling in Hive Server 2

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25396?focusedWorklogId=687073&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687073
 ]

ASF GitHub Bot logged work on HIVE-25396:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2572:
URL: https://github.com/apache/hive/pull/2572


   




Issue Time Tracking
---

Worklog Id: (was: 687073)
Time Spent: 2h 50m  (was: 2h 40m)

> Improve Exception handling in Hive Server 2
> ---
>
> Key: HIVE-25396
> URL: https://issues.apache.org/jira/browse/HIVE-25396
> Project: Hive
> Issue Type: Bug
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Major
> Labels: pull-request-available
> Attachments: hive-improve-exception-handling-04-master.patch
>
> Time Spent: 2h 50m
> Remaining Estimate: 0h
>
> Hive's org.apache.hive.service.thrift.ThriftHttpServlet.doPost method does 
> not handle all kinds of Exception. This leaves the handling of uncaught 
> exceptions to the Jetty HTTP library. We fix that.
> Also, a Thread.UncaughtExceptionHandler is added to Hive Server 2 so that 
> uncaught exceptions are handled uniformly and are logged rather than just 
> printed to stderr.
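
A minimal sketch of a logging `Thread.UncaughtExceptionHandler`, using only the JDK. The class name and the use of `java.util.logging` are assumptions for illustration; the actual patch may use a different logger and handler layout.

```java
import java.util.concurrent.atomic.AtomicReference;
import java.util.logging.Level;
import java.util.logging.Logger;

final class LoggingUncaughtHandler implements Thread.UncaughtExceptionHandler {
    private static final Logger LOG =
        Logger.getLogger(LoggingUncaughtHandler.class.getName());

    // Remembers the most recent uncaught throwable, mainly for inspection in tests.
    final AtomicReference<Throwable> last = new AtomicReference<>();

    @Override
    public void uncaughtException(Thread t, Throwable e) {
        last.set(e);
        // Route the failure through the logger instead of the default stderr dump.
        LOG.log(Level.SEVERE, "Uncaught exception in thread " + t.getName(), e);
    }

    // Installs one handler as the JVM-wide default so all threads behave the same.
    static LoggingUncaughtHandler install() {
        LoggingUncaughtHandler handler = new LoggingUncaughtHandler();
        Thread.setDefaultUncaughtExceptionHandler(handler);
        return handler;
    }
}
```

Calling `LoggingUncaughtHandler.install()` once at startup is enough for threads that do not set their own handler.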





[jira] [Work logged] (HIVE-25411) Don't parse zk path created by PrivilegeSynchronizer to get HiveServer2 URI

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25411?focusedWorklogId=687074&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687074
 ]

ASF GitHub Bot logged work on HIVE-25411:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2553:
URL: https://github.com/apache/hive/pull/2553


   




Issue Time Tracking
---

Worklog Id: (was: 687074)
Time Spent: 40m  (was: 0.5h)

> Don't parse zk path created by PrivilegeSynchronizer to get HiveServer2 URI 
> 
>
> Key: HIVE-25411
> URL: https://issues.apache.org/jira/browse/HIVE-25411
> Project: Hive
> Issue Type: Bug
> Reporter: Chia-Ping Tsai
> Assignee: Chia-Ping Tsai
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> The zk path "/{hive.server2.zookeeper.namespace}/leader" is used by 
> PrivilegeSynchronizer, so we should remove it from the server hosts when 
> parsing the HiveServer2 URI.
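
The filtering step can be sketched as below. This is an illustration only: the class name is hypothetical, and the child node names used in the usage note are placeholders rather than the exact znode format Hive writes.

```java
import java.util.List;
import java.util.stream.Collectors;

final class ZkServerHosts {
    // Child node under the namespace that PrivilegeSynchronizer uses.
    static final String LEADER_NODE = "leader";

    // Returns the children of the namespace znode that can describe a server,
    // dropping the "leader" node before any URI parsing happens.
    static List<String> serverNodes(List<String> children) {
        return children.stream()
            .filter(child -> !LEADER_NODE.equals(child))
            .collect(Collectors.toList());
    }
}
```

Given children such as `["serverUri=host1:10000", "leader"]`, only the first entry is passed on to URI parsing.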





[jira] [Work logged] (HIVE-25340) Refactor UDFToByte to GenericUDFToByte

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25340?focusedWorklogId=687071&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687071
 ]

ASF GitHub Bot logged work on HIVE-25340:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2487:
URL: https://github.com/apache/hive/pull/2487


   




Issue Time Tracking
---

Worklog Id: (was: 687071)
Time Spent: 40m  (was: 0.5h)

> Refactor UDFToByte to GenericUDFToByte
> --
>
> Key: HIVE-25340
> URL: https://issues.apache.org/jira/browse/HIVE-25340
> Project: Hive
> Issue Type: Sub-task
> Reporter: Ashish Sharma
> Assignee: Ashish Sharma
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Description
> Move the implementation of UDFToByte from the old UDF.class to 
> GenericUDFToByte, which uses GenericUDF.class.
>  





[jira] [Work logged] (HIVE-25321) [HMS] Advance write Id during AlterTableDropPartition

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25321?focusedWorklogId=687075&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687075
 ]

ASF GitHub Bot logged work on HIVE-25321:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2465:
URL: https://github.com/apache/hive/pull/2465


   




Issue Time Tracking
---

Worklog Id: (was: 687075)
Time Spent: 1h 50m  (was: 1h 40m)

> [HMS] Advance write Id during AlterTableDropPartition
> -
>
> Key: HIVE-25321
> URL: https://issues.apache.org/jira/browse/HIVE-25321
> Project: Hive
> Issue Type: Sub-task
> Reporter: Kishen Das
> Assignee: Kishen Das
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Time Spent: 1h 50m
> Remaining Estimate: 0h
>
> All DDLs should advance the write ID, so that we can provide consistent data 
> from the cache, based on the validWriteIds. 
>  





[jira] [Work logged] (HIVE-24471) Add support for combiner in hash mode group aggregation

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24471?focusedWorklogId=687072&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687072
 ]

ASF GitHub Bot logged work on HIVE-24471:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:13
Start Date: 28/Nov/21 00:13
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2611:
URL: https://github.com/apache/hive/pull/2611


   




Issue Time Tracking
---

Worklog Id: (was: 687072)
Time Spent: 3h  (was: 2h 50m)

> Add support for combiner in hash mode group aggregation 
> 
>
> Key: HIVE-24471
> URL: https://issues.apache.org/jira/browse/HIVE-24471
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Reporter: mahesh kumar behera
> Assignee: mahesh kumar behera
> Priority: Major
> Labels: pull-request-available
> Time Spent: 3h
> Remaining Estimate: 0h
>
> In map-side group aggregation, partial grouped aggregation is calculated to 
> reduce the data written to disk by the map task. In the case of hash 
> aggregation, where the input data is not sorted, a hash table is used (with 
> sorting also performed before flushing). If the hash table size grows beyond 
> a configurable limit, the data is flushed to disk and a new hash table is 
> created. If the reduction achieved by the hash table is less than the minimum 
> hash aggregation reduction calculated at compile time, the map-side 
> aggregation is converted to streaming mode. So if the first few batches of 
> records do not result in a significant reduction, the mode is switched to 
> streaming. This may hurt performance if the subsequent batches of records 
> have fewer distinct values. 
> To improve performance in both hash and streaming mode, a combiner can be 
> added to the map task after the keys are sorted. This ensures that 
> aggregation is done where possible and reduces the data written to disk.
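
What a combiner over sorted keys does can be sketched in a few lines. This is a simplified illustration with a hypothetical class name, using a sum over `Long` partial counts and assuming non-null keys; it is not the operator added by the pull request.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class SortedCombiner {
    // Merges adjacent records with equal keys by summing their partial values.
    // The input must already be sorted by key, as it is after the map-side sort.
    static List<Map.Entry<String, Long>> combine(List<Map.Entry<String, Long>> sorted) {
        List<Map.Entry<String, Long>> out = new ArrayList<>();
        String currentKey = null;
        long sum = 0;
        for (Map.Entry<String, Long> rec : sorted) {
            if (currentKey != null && !currentKey.equals(rec.getKey())) {
                // Key changed: emit the finished group and start a new one.
                out.add(new SimpleEntry<String, Long>(currentKey, sum));
                sum = 0;
            }
            currentKey = rec.getKey();
            sum += rec.getValue();
        }
        if (currentKey != null) {
            out.add(new SimpleEntry<String, Long>(currentKey, sum));
        }
        return out;
    }
}
```

Because equal keys are adjacent after sorting, one pass with constant extra state is enough, which is why the combiner helps even when the map task has fallen back to streaming mode.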





[jira] [Work logged] (HIVE-25251) Reduce overhead of adding partitions during batch loading of partitions.

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25251?focusedWorklogId=687069&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687069
 ]

ASF GitHub Bot logged work on HIVE-25251:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:12
Start Date: 28/Nov/21 00:12
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2399:
URL: https://github.com/apache/hive/pull/2399


   




Issue Time Tracking
---

Worklog Id: (was: 687069)
Time Spent: 0.5h  (was: 20m)

> Reduce overhead of adding partitions during batch loading of partitions.
> 
>
> Key: HIVE-25251
> URL: https://issues.apache.org/jira/browse/HIVE-25251
> Project: Hive
> Issue Type: Sub-task
> Reporter: mahesh kumar behera
> Assignee: mahesh kumar behera
> Priority: Major
> Labels: pull-request-available
> Time Spent: 0.5h
> Remaining Estimate: 0h
>
> The add-partitions call to HMS executes DataNucleus calls serially to add 
> the partitions to the backend DB. This can be further optimised by batching 
> those SQL statements.
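
The general batching pattern with plain JDBC looks roughly like this. It is a sketch under assumptions: the class name, the `DEMO_PARTITIONS` table and its `PART_NAME` column are hypothetical and do not reflect the metastore schema or the way the patch talks to the backend DB.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.List;

final class PartitionBatcher {
    // Splits a list into consecutive chunks of at most batchSize elements.
    static <T> List<List<T>> chunk(List<T> items, int batchSize) {
        if (batchSize < 1) {
            throw new IllegalArgumentException("batchSize must be >= 1");
        }
        List<List<T>> out = new ArrayList<>();
        for (int i = 0; i < items.size(); i += batchSize) {
            int end = Math.min(i + batchSize, items.size());
            out.add(new ArrayList<>(items.subList(i, end)));
        }
        return out;
    }

    // Sends one round trip per chunk instead of one per partition.
    // DEMO_PARTITIONS and PART_NAME are hypothetical names.
    static void insertBatched(Connection conn, List<String> partNames, int batchSize)
            throws SQLException {
        String sql = "INSERT INTO DEMO_PARTITIONS (PART_NAME) VALUES (?)";
        try (PreparedStatement ps = conn.prepareStatement(sql)) {
            for (List<String> batch : chunk(partNames, batchSize)) {
                for (String name : batch) {
                    ps.setString(1, name);
                    ps.addBatch();
                }
                ps.executeBatch();
            }
        }
    }
}
```

The batch size trades memory and statement size against the number of round trips; a suitable value depends on the backend database and driver.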





[jira] [Work logged] (HIVE-25174) HiveMetastoreAuthorizer didn't check URI permission for AlterTableEvent

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25174?focusedWorklogId=687070&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687070
 ]

ASF GitHub Bot logged work on HIVE-25174:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:12
Start Date: 28/Nov/21 00:12
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2327:
URL: https://github.com/apache/hive/pull/2327


   




Issue Time Tracking
---

Worklog Id: (was: 687070)
Time Spent: 1h 10m  (was: 1h)

> HiveMetastoreAuthorizer didn't check URI permission for AlterTableEvent
> ---
>
> Key: HIVE-25174
> URL: https://issues.apache.org/jira/browse/HIVE-25174
> Project: Hive
> Issue Type: Improvement
> Reporter: Janus Chow
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> When using Ranger on Hive Metastore, we found that users without permission 
> on a table's HDFS path could successfully run "msck repair table 
> TABLENAME".
> This command is denied when we use `StorageBasedAuthorizer`. After checking 
> the code, we found that `StorageBasedAuthorizer` checks the permission on 
> the table's HDFS path, while `HiveMetastoreAuthorizer`, used by Ranger, does 
> not do so when handling an `AlterTableEvent`.
> This ticket adds the URI permission check on AlterTableEvent for 
> `HiveMetastoreAuthorizer`.





[jira] [Work logged] (HIVE-25398) Converted external tables should be able to configure purge behaviour

2021-11-27 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25398?focusedWorklogId=687068&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-687068
 ]

ASF GitHub Bot logged work on HIVE-25398:
-

Author: ASF GitHub Bot
Created on: 28/Nov/21 00:12
Start Date: 28/Nov/21 00:12
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] closed pull request #2541:
URL: https://github.com/apache/hive/pull/2541


   




Issue Time Tracking
---

Worklog Id: (was: 687068)
Time Spent: 40m  (was: 0.5h)

> Converted external tables should be able to configure purge behaviour
> -
>
> Key: HIVE-25398
> URL: https://issues.apache.org/jira/browse/HIVE-25398
> Project: Hive
> Issue Type: Bug
> Components: Standalone Metastore
> Reporter: Panagiotis Garefalakis
> Assignee: Panagiotis Garefalakis
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Creating non-ACID MANAGED tables is not allowed in Hive, which instead 
> converts these tables to External: 
> https://issues.apache.org/jira/browse/HIVE-22158
> During table translation, both TRANSLATED_TO_EXTERNAL and 
> 'external.table.purge' are set to True. However, the second parameter may 
> already be set in the table properties by the user. This ticket adds an 
> extra check to preserve that property if it is set.
> PS: A cleaner solution would be to create these tables as External directly, 
> but the user may be relying on the translation and expecting the data NOT 
> to be purged!
> Example:
> {code:java}
> -- Non-ACID table will be translated to EXTERNAL
> create table c(c int) LOCATION 'etp_1' 
> TBLPROPERTIES('transactional'='false','external.table.purge'='false');
> insert into c values(1);
> -- Maintain the purge=false property set above
> desc formatted c;
> select count(*) from c;
> drop table c;
> -- Create table in same location, data should still be there
> create table c(c int) LOCATION 'etp_1' 
> TBLPROPERTIES('transactional'='false','external.table.purge'='false');
> desc formatted c;
> select count(*) from c;
> {code}
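
The extra check amounts to setting the purge property only when the user has not already set it. A minimal sketch follows, with a hypothetical class and method name and a plain `Map` standing in for the table parameters; the real translation code in the metastore is not shown in this thread.

```java
import java.util.HashMap;
import java.util.Map;

final class ExternalTranslation {
    // Returns a copy of the table parameters as they would look after translation.
    static Map<String, String> translateToExternal(Map<String, String> tableParams) {
        Map<String, String> out = new HashMap<>(tableParams);
        // The translation marker is always set.
        out.put("TRANSLATED_TO_EXTERNAL", "TRUE");
        // The purge flag defaults to TRUE but a user-supplied value is kept.
        out.putIfAbsent("external.table.purge", "TRUE");
        return out;
    }
}
```

With `'external.table.purge'='false'` in the input, as in the SQL example above, the value stays `false` after translation, so dropping the table leaves the data in place.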


