[jira] [Created] (HIVE-25954) LOAD DATA INPATH can't load data from empty Sequence File (.seq)

2022-02-11 Thread Vladislav (Jira)
Vladislav created HIVE-25954:


 Summary: LOAD DATA INPATH can't load data from empty Sequence File 
(.seq)
 Key: HIVE-25954
 URL: https://issues.apache.org/jira/browse/HIVE-25954
 Project: Hive
  Issue Type: Bug
  Components: API
Reporter: Vladislav


Whether using pyhive or impala, I'v got the same error when loading data to 
external table via 

LOAD DATA INPATH 'path to empty .seq file' into table external_table_name 
partition (part='value')

OperationalError: Error while processing statement: FAILED: Execution Error, 
return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask

P.S. In the case of loading .seq file of the size > 0M or loading that emty 
file via Spark I haven't any problem.

impala v0.17.0

pyhive '0.6.4'

Spark  2.3.0.2.6.5.0-292

 

Maybe, there is any setting in pyhive to fix this problem via?

More detailed response from beeline 

ERROR : Failed with exception Wrong file format. Please check the file's format.
org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please 
check the file's format.
        at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
        at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
        at 
org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
        at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
        at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
        at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165)        at 
org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
        at 
org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
        at 
org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
        at 
org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)

Error: Error while processing statement: FAILED: Execution Error, return code 1 
from org.apache.hadoop.hive.ql.exec.MoveTas
k (state=08S01,code=1)



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (HIVE-25953) Unify getPredicates() for Join between HiveRelMdPredicates and RelMdPredicates

2022-02-11 Thread Alessandro Solimando (Jira)
Alessandro Solimando created HIVE-25953:
---

 Summary: Unify getPredicates() for Join between 
HiveRelMdPredicates and RelMdPredicates
 Key: HIVE-25953
 URL: https://issues.apache.org/jira/browse/HIVE-25953
 Project: Hive
  Issue Type: Sub-task
  Components: CBO
Affects Versions: 4.0.0
Reporter: Alessandro Solimando
Assignee: Alessandro Solimando


The goal of the ticket is to unify the two implementations and remove the 
override in HiveRelMdPredicates. 

At the moment, the main blocker is that the Hive variant still relies in 
RexNode's comparison via its String digest, while Calcite does not need that 
anymore.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (HIVE-25952) Unify getPredicates() for Projection between HiveRelMdPredicates and RelMdPredicates

2022-02-11 Thread Alessandro Solimando (Jira)
Alessandro Solimando created HIVE-25952:
---

 Summary: Unify getPredicates() for Projection between 
HiveRelMdPredicates and RelMdPredicates
 Key: HIVE-25952
 URL: https://issues.apache.org/jira/browse/HIVE-25952
 Project: Hive
  Issue Type: Sub-task
  Components: CBO
Affects Versions: 4.0.0
Reporter: Alessandro Solimando
Assignee: Alessandro Solimando


There are some differences on this method between Hive and Calcite, the idea of 
this ticket is to unify the two methods, and then drop the override in 
HiveRelMdPredicates in favour of the method of RelMdPredicates.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (HIVE-25951) Re-use methods from RelMdPredicates in HiveRelMdPredicates

2022-02-11 Thread Alessandro Solimando (Jira)
Alessandro Solimando created HIVE-25951:
---

 Summary: Re-use methods from RelMdPredicates in HiveRelMdPredicates
 Key: HIVE-25951
 URL: https://issues.apache.org/jira/browse/HIVE-25951
 Project: Hive
  Issue Type: Sub-task
  Components: CBO
Affects Versions: 4.0.0
Reporter: Alessandro Solimando
Assignee: Alessandro Solimando


This ticket makes HiveRelMdPredicates extend RelMdPredicates, and remove the 
duplicate methods which share the same behaviour, while overriding those for 
which there is a difference.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)