[ 
https://issues.apache.org/jira/browse/TEZ-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512246#comment-14512246
 ] 

Siddharth Seth commented on TEZ-2314:
-------------------------------------

Heartbeating and sending counters/stats for IOs which have initialized should 
be absolutely fine. Initialization can change these values, and updates will be 
available for them. That's one of the reasons counters are serialized: to make 
sure the data arrives intact, since the values can change at any point.
However, given that this window is small (during initialization), this approach 
is probably fine as well. Will look more later.
Will this information be exposed correctly to plugins after a task starts 
but before any updates have been received for it?
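
To illustrate the point about serializing counters so that a consistent set of values goes out even while they keep changing, here is a minimal sketch. It is not Tez code: the class CounterSnapshot and its methods are hypothetical, it uses plain java.io streams rather than the Tez/Hadoop counter classes, and it assumes writers synchronize on the same map.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Tez: freezes counter values into bytes.
final class CounterSnapshot {
    private CounterSnapshot() {}

    // Serialize a point-in-time copy; later changes to 'counters' do not
    // affect the returned bytes.
    static byte[] serialize(Map<String, Long> counters) {
        // Copy under the map's monitor so the entry count and the entries agree.
        // Assumption: code that updates the map synchronizes on it as well.
        Map<String, Long> copy;
        synchronized (counters) {
            copy = new LinkedHashMap<>(counters);
        }
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeInt(copy.size());           // length prefix read back first
            for (Map.Entry<String, Long> e : copy.entrySet()) {
                out.writeUTF(e.getKey());
                out.writeLong(e.getValue());
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Rebuild the counters from bytes produced by serialize().
    static Map<String, Long> deserialize(byte[] data) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
            int n = in.readInt();
            Map<String, Long> result = new LinkedHashMap<>();
            for (int i = 0; i < n; i++) {
                String name = in.readUTF();
                result.put(name, in.readLong());
            }
            return result;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

The length prefix is the part that matters for this kind of failure: if the count written and the entries written ever disagree, the reader walks off into unrelated bytes, so taking the copy first keeps the two consistent.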

> Tez task attempt failures due to bad event serialization
> --------------------------------------------------------
>
>                 Key: TEZ-2314
>                 URL: https://issues.apache.org/jira/browse/TEZ-2314
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.7.0
>            Reporter: Rohini Palaniswamy
>            Assignee: Bikas Saha
>            Priority: Blocker
>         Attachments: TEZ-2314.1.patch, TEZ-2314.log.patch
>
>
> {code}
> 2015-04-13 19:21:48,516 WARN [Socket Reader #3 for port 53530] ipc.Server: Unable to read call parameters for client 10.216.13.112on connection protocol org.apache.tez.common.TezTaskUmbilicalProtocol for rpcKind RPC_WRITABLE
> java.lang.ArrayIndexOutOfBoundsException: 1935896432
>         at org.apache.tez.runtime.api.impl.EventMetaData.readFields(EventMetaData.java:120)
>         at org.apache.tez.runtime.api.impl.TezEvent.readFields(TezEvent.java:271)
>         at org.apache.tez.runtime.api.impl.TezHeartbeatRequest.readFields(TezHeartbeatRequest.java:110)
>         at org.apache.hadoop.io.ObjectWritable.readObject(ObjectWritable.java:285)
>         at org.apache.hadoop.ipc.WritableRpcEngine$Invocation.readFields(WritableRpcEngine.java:160)
>         at org.apache.hadoop.ipc.Server$Connection.processRpcRequest(Server.java:1884)
>         at org.apache.hadoop.ipc.Server$Connection.processOneRpc(Server.java:1816)
>         at org.apache.hadoop.ipc.Server$Connection.readAndProcess(Server.java:1574)
>         at org.apache.hadoop.ipc.Server$Listener.doRead(Server.java:806)
>         at org.apache.hadoop.ipc.Server$Listener$Reader.doRunLoop(Server.java:673)
>         at org.apache.hadoop.ipc.Server$Listener$Reader.run(Server.java:644)
> {code}
> cc/ [~hitesh] and [~bikassaha]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
