It looks like the same error:

Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /test-warehouse/tpcds.store_sales/.hive-staging_hive_2017-07-31_23-55-05_306_8385818677737494274-760/_task_tmp.-ext-10000/ss_sold_date_sk=2450988/_tmp.000000_0 could only be replicated to 0 nodes instead of minReplication (=1). There are 3 datanode(s) running and no node(s) are excluded in this operation.
    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget4NewBlock(BlockManager.java:1724)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:3385)
    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:683)
    at org.apache.hadoop.hdfs.server.namenode.AuthorizationProviderProxyClientProtocol.addBlock(AuthorizationProviderProxyClientProtocol.java:214)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:495)
    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617)
    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2217)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2213)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2211)

    at org.apache.hadoop.hive.ql.exec.FileSinkOperator.processOp(FileSinkOperator.java:751)
    at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
    at org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:84)
    at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:244)
    ... 8 more
Caused by: org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /test-warehouse/tpcds.store_sales/.hive-staging_hive_2017-07-31_23-55-05_306_8385818677737494274-760/_task_tmp.-ext-10000/ss_sold_date_sk=2450988/_tmp.000000_0 could only be replicated to 0 nodes instead of minReplication (=1). There are 3 datanode(s) running and no node(s) are excluded in this operation.
    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget4NewBlock(BlockManager.java:1724)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:3385)
    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:683)
    at org.apache.hadoop.hdfs.server.namenode.AuthorizationProviderProxyClientProtocol.addBlock(AuthorizationProviderProxyClientProtocol.java:214)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:495)
    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617)
    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2217)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2213)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2211)
    at org.apache.hadoop.ipc.Client.call(Client.java:1502)
    at org.apache.hadoop.ipc.Client.call(Client.java:1439)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:230)
    at com.sun.proxy.$Proxy12.addBlock(Unknown Source)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.addBlock(ClientNamenodeProtocolTranslatorPB.java:413)
    at sun.reflect.GeneratedMethodAccessor68.invoke(Unknown Source)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:260)
    at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:104)
    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.locateFollowingBlock(DFSOutputStream.java:1814)
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1610)
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:773)
2017-07-31 23:55:38,630 ERROR exec.Task (SessionState.java:printError(1103)) - Ended Job = job_local1252085428_0826 with errors
2017-07-31 23:55:38,631 ERROR exec.Task (SessionState.java:printError(1103)) - Error during job, obtaining debugging information...
2017-07-31 23:55:38,641 ERROR ql.Driver (SessionState.java:printError(1103)) - FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask
2017-07-31 23:55:38,641 INFO log.PerfLogger (PerfLogger.java:PerfLogEnd(168)) - </PERFLOG method=Driver.execute start=1501545305365 end=1501545338641 duration=33276 from=org.apache.hadoop.hive.ql.Driver>

On Mon, Jul 31, 2017 at 8:03 PM, Tim Armstrong <tarmstr...@cloudera.com> wrote:

> I saw this on GVO: https://jenkins.impala.io/job/ubuntu-14.04-from-scratch/1807/
>
> I haven't pulled out the error from hive.log yet - for some reason that
> log is almost 500mb.
>
> On Thu, Jul 13, 2017 at 3:52 PM, Tim Armstrong <tarmstr...@cloudera.com>
> wrote:
>
>> I'm not sure exactly what is going on, but I can confirm that I was able
>> to load data on Ubuntu 16.04 with OpenJDK 8 a while back.
>>
>> On Thu, Jul 13, 2017 at 2:58 PM, Jim Apple <jbap...@cloudera.com> wrote:
>>
>>> I also see this with the Oracle JDK. I have also now checked I am not
>>> running out of memory.
>>>
>>> Oracle JDK7 is harder to get one's hands on, and OpenJDK7 isn't packaged
>>> by canonical for Ubuntu 16.04.
>>>
>>> On Wed, Jul 12, 2017 at 11:20 PM, Jim Apple <jbap...@cloudera.com>
>>> wrote:
>>>
>>> > I'm getting data loading errors on Ubuntu 16.04 in TPC-DS. The terminal
>>> > shows:
>>> >
>>> > ERROR : FAILED: Execution Error, return code 2 from
>>> > org.apache.hadoop.hive.ql.exec.mr.MapRedTask
>>> >
>>> > logs/cluster/hive/hive.log shows the error below, which previous bugs have
>>> > called an issue with the disk being out of space, but my disk has at least
>>> > 45GB left on it
>>> >
>>> > IMPALA-3246, IMPALA-2856, IMPALA-2617
>>> >
>>> > I see this with openJDK8. I haven't tried Oracle's JDK yet.
>>> >
>>> > Has anyone else seen this and been able to diagnose it as something that
>>> > doesn't mean a full disk?
>>> >
>>> >
>>> > FATAL ExecReducer (ExecReducer.java:reduce(264)) -
>>> > org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error
>>> > while processing row (tag=0) {"key":{},"value":{"_col0":48147,"_col1":17805,"_col2":27944,"_col3":606992,"_col4":3193,"_col5":16641,"_col6":10,"_col7":209,"_col8":44757,"_col9":20,"_col10":5.51,"_col11":9.36,"_col12":9.17,"_col13":0,"_col14":183.4,"_col15":110.2,"_col16":187.2,"_col17":3.66,"_col18":0,"_col19":183.4,"_col20":187.06,"_col21":73.2,"_col22":2452013}}
>>> >     at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:253)
>>> >     at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:444)
>>> >     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)
>>> >     at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:346)
>>> >     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>>> >     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>>> >     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>>> >     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>>> >     at java.lang.Thread.run(Thread.java:748)
>>> > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException:
>>> > org.apache.hadoop.ipc.RemoteException(java.io.IOException): File
>>> > /test-warehouse/tpcds.store_sales/.hive-staging_hive_2017-07-12_22-51-18_139_3687815919405186455-760/_task_tmp.-ext-10000/ss_sold_date_sk=2452013/_tmp.000001_0
>>> > could only be replicated to 0 nodes instead of minReplication (=1). There are 3
>>> > datanode(s) running and no node(s) are excluded in this operation.
>>> >     at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget4NewBlock(BlockManager.java:1724)
>>> >     at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:3385)
>>> >     at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:683)
>>> >     at org.apache.hadoop.hdfs.server.namenode.AuthorizationProviderProxyClientProtocol.addBlock(AuthorizationProviderProxyClientProtocol.java:214)
>>> >     at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:495)
>>> >     at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
>>> >     at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617)
>>> >     at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073)
>>> >     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2217)
>>> >     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2213)
>>> >     at java.security.AccessController.doPrivileged(Native Method)
>>> >     at javax.security.auth.Subject.doAs(Subject.java:422)
>>> >     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917)
>>> >     at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2211)
>>> >
>>> >     at org.apache.hadoop.hive.ql.exec.FileSinkOperator.processOp(FileSinkOperator.java:751)
>>> >     at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
>>> >     at org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:84)
>>> >     at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:244)
>>> >
>>>
>>
>>
>