[jira] [Created] (HIVE-23955) Classification of Error Codes in Replication

2020-07-30 Thread Aasha Medhi (Jira)
Aasha Medhi created HIVE-23955:
--

 Summary: Classification of Error Codes in Replication
 Key: HIVE-23955
 URL: https://issues.apache.org/jira/browse/HIVE-23955
 Project: Hive
  Issue Type: Task
Reporter: Aasha Medhi
Assignee: Aasha Medhi






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23956) Delete delta directory file information should be pushed to execution side

2020-07-30 Thread Peter Varga (Jira)
Peter Varga created HIVE-23956:
--

 Summary: Delete delta directory file information should be pushed 
to execution side
 Key: HIVE-23956
 URL: https://issues.apache.org/jira/browse/HIVE-23956
 Project: Hive
  Issue Type: Improvement
Reporter: Peter Varga
Assignee: Peter Varga


Since HIVE-23840 LLAP cache is used to retrieve the tail of the ORC bucket 
files in the delete deltas, but to use the cache the fileId must be determined, 
so one more FileSystem call is issued for each bucket.

This fileId is already available during compilation in the AcidState 
calculation, we should serialise this to the OrcSplit, and remove the 
unnecessary FS calls.

Furthermore instead of sending the SyntheticFileId directly, we should pass the 
attemptId instead of the standard path hash, this way the path and the 
SyntheticFileId. can be calculated, and it will work even, if the move free 
delete operations will be introduced.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23957) Limit followed by TopNKey improvement

2020-07-30 Thread Attila Magyar (Jira)
Attila Magyar created HIVE-23957:


 Summary: Limit followed by TopNKey improvement
 Key: HIVE-23957
 URL: https://issues.apache.org/jira/browse/HIVE-23957
 Project: Hive
  Issue Type: Improvement
Reporter: Attila Magyar
Assignee: Attila Magyar


The Limit + topnkey pushdown might result a limit operator followed by a TNK in 
the physical plan. This likely makes the TNK unnecessary in cases like this. 
Need to investigate if/when we can remove the TNK.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23958) HiveServer2 should support additional keystore/truststores types besides JKS

2020-07-30 Thread Kevin Risden (Jira)
Kevin Risden created HIVE-23958:
---

 Summary: HiveServer2 should support additional 
keystore/truststores types besides JKS
 Key: HIVE-23958
 URL: https://issues.apache.org/jira/browse/HIVE-23958
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2
Reporter: Kevin Risden
Assignee: Kevin Risden


Currently HiveServer2 (through Jetty and Thrift) only supports JKS (and PCKS12 
based on JDK fallback) keystore/truststore types. There are additional 
keystore/truststore types used for different applications like for FIPS crypto 
algorithms. HS2 should support the default keystore type specified for the JDK 
and not always use JKS.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23959) Provide an option to wipe out column stats for partitioned tables in case of column removal

2020-07-30 Thread Zoltan Haindrich (Jira)
Zoltan Haindrich created HIVE-23959:
---

 Summary: Provide an option to wipe out column stats for 
partitioned tables in case of column removal
 Key: HIVE-23959
 URL: https://issues.apache.org/jira/browse/HIVE-23959
 Project: Hive
  Issue Type: Improvement
Reporter: Zoltan Haindrich
Assignee: Zoltan Haindrich


in case of column removal / replacement - an update for each partition is 
neccessary; which could take a while.

goal here is to provide an option to switch to the bulk removal of column 
statistics instead of working hard to retain as much as possible from the old 
stats.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23960) Partition with no column statistics leads to unbalanced calls to openTransaction/commitTransaction error during get_partitions_by_names

2020-07-30 Thread Pravin Sinha (Jira)
Pravin Sinha created HIVE-23960:
---

 Summary: Partition with no column statistics leads to unbalanced 
calls to openTransaction/commitTransaction error during get_partitions_by_names
 Key: HIVE-23960
 URL: https://issues.apache.org/jira/browse/HIVE-23960
 Project: Hive
  Issue Type: Task
Reporter: Pravin Sinha
Assignee: Pravin Sinha


{color:#172b4d}Creating a partition with data and adding another partition is 
leading to unbalanced calls to open/commit transaction during 
get_partitions_by_names call.
{color}

{color:#172b4d}Issue was discovered during REPL DUMP operation which uses  this 
HMS call to get the metadata of partition. This error occurs when there is a 
partition with no column statistics.{color}

{color:#172b4d}To reproduce:{color}
{code:java}
CREATE TABLE student_part_acid(name string, age int, gpa double) PARTITIONED BY 
(ds string) STORED AS orc;
LOAD DATA INPATH ‘/user/hive/partDir/student_part_acid/ds=20110924’ INTO TABLE 
student_part_acid partition(ds=20110924);
ALTER TABLE student_part_acid ADD PARTITION (ds=20110925);

Now we try to preform REPL DUMP it fails with this the error "Unbalanced calls 
to open/commit transaction" on the HS2 side. 
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23961) Enable external table replication by default

2020-07-30 Thread Aasha Medhi (Jira)
Aasha Medhi created HIVE-23961:
--

 Summary: Enable external table replication by default
 Key: HIVE-23961
 URL: https://issues.apache.org/jira/browse/HIVE-23961
 Project: Hive
  Issue Type: Task
Reporter: Aasha Medhi
Assignee: Aasha Medhi






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23962) Make bin/hive pick user defined jdbc url

2020-07-30 Thread Xiaomeng Zhang (Jira)
Xiaomeng Zhang created HIVE-23962:
-

 Summary: Make bin/hive pick user defined jdbc url 
 Key: HIVE-23962
 URL: https://issues.apache.org/jira/browse/HIVE-23962
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2
Affects Versions: 4.0.0
Reporter: Xiaomeng Zhang


Currently hive command will trigger bin/hive which run "beeline" by default.

We want to pass a env variable so that user can define which url beeline use.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)