Rohini Palaniswamy created TEZ-1257:
---------------------------------------

             Summary: Error on empty partition when using 
OnFileUnorderedKVOutput and ShuffledMergedInput
                 Key: TEZ-1257
                 URL: https://issues.apache.org/jira/browse/TEZ-1257
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Rohini Palaniswamy


Encountering exception

{code}
org.apache.tez.dag.api.TezUncheckedException: Path component must start with: 
attempt InputAttemptIdentifier [inputIdentifier=InputIdentifier [inputIndex=0], 
attemptNumber=0, pathComponent=]
        at 
org.apache.tez.runtime.library.common.InputAttemptIdentifier.<init>(InputAttemptIdentifier.java:45)
        at 
org.apache.tez.runtime.library.common.InputAttemptIdentifier.<init>(InputAttemptIdentifier.java:51)
        at 
org.apache.tez.runtime.library.common.shuffle.impl.ShuffleInputEventHandler.processDataMovementEvent(ShuffleInputEventHandler.java:81)
        at 
org.apache.tez.runtime.library.common.shuffle.impl.ShuffleInputEventHandler.handleEvent(ShuffleInputEventHandler.java:66)
        at 
org.apache.tez.runtime.library.common.shuffle.impl.ShuffleInputEventHandler.handleEvents(ShuffleInputEventHandler.java:59)
{code}

This is because the pathComponent is not set by UnorderedPartitionedKVWriter 
for empty partition

{code}
if (emptyPartitions.cardinality() != numPartitions) {
      // Populate payload only if at least 1 partition has data
      payloadBuidler.setHost(host);
      payloadBuidler.setPort(shufflePort);
      payloadBuidler.setPathComponent(outputContext.getUniqueIdentifier());
    }
{code}

The combination of OnFileUnorderedKVOutput and ShuffledMergedInput works fine 
otherwise if there are no empty partitions.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to