[ 
https://issues.apache.org/jira/browse/PIG-5179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15902681#comment-15902681
 ] 

liyunzhang_intel commented on PIG-5179:
---------------------------------------

[~nkollar]: the exception is caused by the invalid format of the task attempt ID. 
Maybe we need to change the code that handles TASK_ID in LoadConvert#apply.
{code}
  /** Construct a TaskAttemptID object from given string 
   * @return constructed TaskAttemptID object or null if the given String is null
   * @throws IllegalArgumentException if the given string is malformed
   */
  public static TaskAttemptID forName(String str
                                      ) throws IllegalArgumentException {
    if(str == null)
      return null;
    String exceptionMsg = null;
    try {
      String[] parts = str.split(Character.toString(SEPARATOR));
      if(parts.length == 6) {
        if(parts[0].equals(ATTEMPT)) {
          String type = parts[3];
          TaskType t = TaskID.getTaskType(type.charAt(0));
          if(t != null) {
            return new org.apache.hadoop.mapred.TaskAttemptID
            (parts[1],
             Integer.parseInt(parts[2]),
             t, Integer.parseInt(parts[4]), 
             Integer.parseInt(parts[5]));  
          } else
            exceptionMsg = "Bad TaskType identifier. TaskAttemptId string : "
                + str + " is not properly formed.";
        }
      }
    } catch (Exception ex) {
      //fall below
    }
    if (exceptionMsg == null) {
      exceptionMsg = "TaskAttemptId string : " + str
          + " is not properly formed";
    }
    throw new IllegalArgumentException(exceptionMsg);
  }

{code}
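
To illustrate why the bare Spark value "1" is rejected, here is a minimal, self-contained sketch of the same shape check. It does not call Hadoop; the class TaskAttemptIdSketch and its methods format and looksWellFormed are hypothetical names used only for this example. The zero-padding widths follow the example id in the Hadoop javadoc (attempt_200707121733_0003_m_000005_0), the "spark" identifier is a placeholder, and the check accepts only the 'm' and 'r' task types, which is a simplification of TaskID.getTaskType.

```java
import java.util.Locale;

public class TaskAttemptIdSketch {
    static final String SEPARATOR = "_";
    static final String ATTEMPT = "attempt";

    // Hypothetical helper: builds a string in the six-part layout that
    // forName splits on: attempt_<jtIdentifier>_<jobId>_<type>_<taskId>_<attemptId>.
    // jtIdentifier must not contain '_' or the split would yield extra parts.
    public static String format(String jtIdentifier, int jobId, char taskType,
                                int taskId, int attemptId) {
        return String.format(Locale.ROOT, "%s_%s_%04d_%c_%06d_%d",
                ATTEMPT, jtIdentifier, jobId, taskType, taskId, attemptId);
    }

    // Simplified version of the checks in forName: six parts, the "attempt"
    // prefix, a one-character task type (only 'm' or 'r' here), and three
    // integer fields.
    public static boolean looksWellFormed(String str) {
        if (str == null) {
            return false;
        }
        String[] parts = str.split(SEPARATOR);
        if (parts.length != 6 || !parts[0].equals(ATTEMPT)) {
            return false;
        }
        if (parts[3].length() != 1 || "mr".indexOf(parts[3].charAt(0)) < 0) {
            return false;
        }
        try {
            Integer.parseInt(parts[2]);
            Integer.parseInt(parts[4]);
            Integer.parseInt(parts[5]);
            return true;
        } catch (NumberFormatException ex) {
            return false;
        }
    }

    public static void main(String[] args) {
        // The value from the stack trace: a single part, so it is rejected.
        System.out.println(looksWellFormed("1"));
        // A padded id built from a placeholder identifier and task number 1.
        String id = format("spark", 0, 'm', 1, 0);
        System.out.println(id + " " + looksWellFormed(id));
    }
}
```

If the Spark side produced a string in this layout instead of the plain task number, forName should accept it; which job and task numbers to use there is an open question for the patch.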

> BigData_Streaming_1 is failing
> ------------------------------
>
>                 Key: PIG-5179
>                 URL: https://issues.apache.org/jira/browse/PIG-5179
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: Nandor Kollar
>            Assignee: Nandor Kollar
>             Fix For: spark-branch
>
>         Attachments: PIG-5179_1.patch
>
>
> BigData_Streaming_1 is failing, because Spark uses HadoopExecutableManager, 
> and looks like spark task id is not a valid task attempt id.
> {code}
> Caused by: org.apache.pig.backend.executionengine.ExecException: ERROR 2083: Error while trying to get next result in POStream.
>         at org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POStream.getNextHelper(POStream.java:309)
>         at org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POStream.getNextTuple(POStream.java:171)
>         ... 19 more
> Caused by: java.lang.IllegalArgumentException: TaskAttemptId string : 1 is not properly formed
>         at org.apache.hadoop.mapreduce.TaskAttemptID.forName(TaskAttemptID.java:201)
>         at org.apache.pig.backend.hadoop.streaming.HadoopExecutableManager.writeErrorToHDFS(HadoopExecutableManager.java:177)
>         at org.apache.pig.backend.hadoop.streaming.HadoopExecutableManager.exec(HadoopExecutableManager.java:107)
>         at org.apache.pig.impl.streaming.ExecutableManager.run(ExecutableManager.java:259)
>         at org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POStream.getNextHelper(POStream.java:281)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
