Vladislav created HIVE-25954:
Summary: LOAD DATA INPATH can't load data from empty Sequence File (.seq)
Key: HIVE-25954
URL: https://issues.apache.org/jira/browse/HIVE-25954
Project: Hive
Issue Type: Bug
Components: API
Reporter: Vladislav
Whether I use pyhive or impala, I get the same error when loading data into an
external table via
LOAD DATA INPATH 'path to empty .seq file' into table external_table_name
partition (part='value')
OperationalError: Error while processing statement: FAILED: Execution Error,
return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
P.S. Loading a .seq file with size > 0M, or loading the same empty file via
Spark, works without any problem.
impala v0.17.0
pyhive '0.6.4'
Spark 2.3.0.2.6.5.0-292
Is there perhaps a setting in pyhive that would work around this problem?
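In the absence of such a setting, one possible client-side workaround is to skip zero-length files before issuing the statement. The sketch below assumes that "empty" means a file of 0 bytes and that the caller already knows the file size. `load_if_nonempty` and its parameters are hypothetical names, not part of pyhive, and `cursor` stands for any DB-API cursor with an `execute` method.

```python
def load_if_nonempty(cursor, path, size_bytes, table, part_value):
    """Issue LOAD DATA INPATH unless the file is zero-length.

    Hypothetical helper: returns True if the statement was executed,
    False if the file was skipped because it is empty.
    """
    if size_bytes == 0:
        # Assumption: a 0-byte .seq file is what triggers the MoveTask
        # failure, so it is skipped instead of being loaded.
        return False

    # Values are interpolated directly, so they are assumed to be trusted.
    statement = (
        "LOAD DATA INPATH '{path}' INTO TABLE {table} "
        "PARTITION (part='{part}')"
    ).format(path=path, table=table, part=part_value)
    cursor.execute(statement)
    return True
```

Skipping the load means the partition is not created for an empty file, so this only helps when an absent partition is acceptable.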
More detailed response from beeline:
ERROR : Failed with exception Wrong file format. Please check the file's format.
org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
    at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
    at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
    at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
    at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
    at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
    at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
    at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
    at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165)
    at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
    at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
    at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
    at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask (state=08S01,code=1)