Eugene Koifman created ORC-195: ---------------------------------- Summary: FileFormatException should include file name in the message Key: ORC-195 URL: https://issues.apache.org/jira/browse/ORC-195 Project: ORC Issue Type: Bug Affects Versions: 1.3.3 Reporter: Eugene Koifman
Here is 1 example: {noformat} ReaderImpl.extractFileTail(FileSystem fs, Path path, long maxFileLength) throws IOException has if (size <= OrcFile.MAGIC.length()) { throw new FileFormatException("Not a valid ORC file"); } {noformat} which in the logs looks like {noformat} 2017-05-18T12:08:23,572 WARN [Thread-360] mapred.LocalJobRunner: job_local150767050_0007 java.lang.Exception: org.apache.orc.FileFormatException: Not a valid ORC file at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:489) ~[hadoop-mapreduce-client-common-2.8.0.jar:?] at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:549) [hadoop-mapreduce-client-common-2.8.0.jar:?] Caused by: org.apache.orc.FileFormatException: Not a valid ORC file at org.apache.orc.impl.ReaderImpl.extractFileTail(ReaderImpl.java:511) ~[orc-core-1.3.3.jar:1.3.3] at org.apache.orc.impl.ReaderImpl.<init>(ReaderImpl.java:378) ~[orc-core-1.3.3.jar:1.3.3] at org.apache.hadoop.hive.ql.io.orc.ReaderImpl.<init>(ReaderImpl.java:63) ~[classes/:?] at org.apache.hadoop.hive.ql.io.orc.OrcFile.createReader(OrcFile.java:90) ~[classes/:?] at org.apache.hadoop.hive.ql.io.orc.OrcInputFormat.getRawReader(OrcInputFormat.java:2279) ~[classes/:?] at org.apache.hadoop.hive.ql.txn.compactor.CompactorMR$CompactorMap.map(CompactorMR.java:665) ~[classes/:?] at org.apache.hadoop.hive.ql.txn.compactor.CompactorMR$CompactorMap.map(CompactorMR.java:642) ~[classes/:?] at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54) ~[hadoop-mapreduce-client-core-2.8.0.jar:?] at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453) ~[hadoop-mapreduce-client-core-2.8.0.jar:?] at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343) ~[hadoop-mapreduce-client-core-2.8.0.jar:?] at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:270) ~[hadoop-mapreduce-client-common-2.8.0.jar:?] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[?:1.8.0_25] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_25] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[?:1.8.0_25] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[?:1.8.0_25] at java.lang.Thread.run(Thread.java:745) ~[?:1.8.0_25] {noformat} -- This message was sent by Atlassian JIRA (v6.3.15#6346)