[ 
https://issues.apache.org/jira/browse/TRAFODION-2109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15376315#comment-15376315
 ] 

Selvaganesan Govindarajan commented on TRAFODION-2109:
------------------------------------------------------

Analysis of the issue reveals the following:

One of the ESPs returned the internal error below while writing the error log 
record in hdfs. 
 
data_ = 0x7f8a38756d70 "\njava.nio.channels.ClosedChannelException\n
org.apache.hadoop.hdfs.DFSOutputStream.checkClosed(DFSOutputStream.java:1635)\n
org.apache.hadoop.fs.FSOutputSummer.write(FSOutputSummer.java:104)\n
org.apache.hadoop.fs.FSDataOutputStream$PositionCache.write(FSDataOutputStream.java:58)\n
java.io.DataOutputStream.write(DataOutputStream.java:107)\n
java.io.FilterOutputStream.write(FilterOutputStream.java:97)\n
org.trafodion.sql.HiveClient.hdfsWrite(HiveClient.java:278)\n"
 
Hdfs scan returned an error -8413.  This error row along with the error 
information is written to error log file in hdfs. But the writing to the error 
log file returned an error as shown in the above stack trace. This ESP is 
supposed to core with the asserting message shown below:
 
ex_assert((retcode == HBASE_ACCESS_SUCCESS), "Error while writing the log 
file");
 
I have seen earlier ex_assert is ignored in release mode. Tried with 
ABORT_ON_ERROR=2034 in  $MY_SQROOT/etc/ms.env..  This setting dumped the core 
for the ESP process or error -8413 though ABORT_ON_ERROR is set to 2034 because 
the error is thrown due to ex_assert.
 
I think though it didn’t dump core earlier,  it triggered error 2034. But, the 
actual error message was lost.


> Load with log error rows returns the following error at times
> -------------------------------------------------------------
>
>                 Key: TRAFODION-2109
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-2109
>             Project: Apache Trafodion
>          Issue Type: Bug
>          Components: sql-exe
>    Affects Versions: 1.3-incubating
>            Reporter: Selvaganesan Govindarajan
>            Assignee: Selvaganesan Govindarajan
>             Fix For: 2.1-incubating
>
>
> load with log error rows into <trafodion_table>
> select * from hive.hive.<hive_table>
> *** ERROR[2034] $Z0014JQ:5192: Operating system error 201 while communicating 
> with server process $Z021FS0:425.
>  
> *** ERROR[2034] $Z0014JQ:5192: Operating system error 201 while communicating 
> with server process $Z021FS0:425.
>  
> *** ERROR[2034] $Z0014JQ:5192: Operating system error 201 while communicating 
> with server process $Z021FS0:425.
>  
> --- 0 row(s) loaded.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to