Guillem LEFAIT created PIG-4063: ----------------------------------- Summary: BinSedesTuple cannot be cast to org.apache.pig.pen.util.ExampleTuple Key: PIG-4063 URL: https://issues.apache.org/jira/browse/PIG-4063 Project: Pig Issue Type: Bug Affects Versions: 0.12.0 Environment: local environment Reporter: Guillem LEFAIT
I got the following (simplified) input : {code:title=input|borderStyle=solid} 0 1135429845 -7727023712262303168 2004267823 -7727023712262303168 2004267823 {code} If I use the following script: {code:title=script.pig|borderStyle=solid} data = LOAD 'input' USING PigStorage('\t') AS (uid:chararray, hash:chararray); dataUidNotNull = FILTER data BY (uid != '0') AND (uid != '') AND (uid is not null); dataUidNotNullDistinct = DISTINCT dataUidNotNull; dataUidGroup = GROUP dataUidNotNullDistinct BY uid; ILLUSTRATE dataUidGroup; {code} I got this error : {quote} Pig Stack Trace --------------- ERROR 2997: Encountered IOException. Exception java.io.IOException: Exception at org.apache.pig.PigServer.getExamples(PigServer.java:1246) at org.apache.pig.tools.grunt.GruntParser.processIllustrate(GruntParser.java:831) at org.apache.pig.tools.pigscript.parser.PigScriptParser.Illustrate(PigScriptParser.java:802) at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:381) at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:198) at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:173) at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:69) at org.apache.pig.Main.run(Main.java:541) at org.apache.pig.Main.main(Main.java:156) Caused by: java.lang.ClassCastException: org.apache.pig.data.BinSedesTuple cannot be cast to org.apache.pig.pen.util.ExampleTuple at org.apache.pig.pen.AugmentBaseDataVisitor.visit(AugmentBaseDataVisitor.java:444) at org.apache.pig.newplan.logical.relational.LODistinct.accept(LODistinct.java:51) at org.apache.pig.pen.util.PreOrderDepthFirstWalker.depthFirst(PreOrderDepthFirstWalker.java:82) at org.apache.pig.pen.util.PreOrderDepthFirstWalker.depthFirst(PreOrderDepthFirstWalker.java:84) at org.apache.pig.pen.util.PreOrderDepthFirstWalker.walk(PreOrderDepthFirstWalker.java:66) at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:52) at org.apache.pig.pen.ExampleGenerator.getExamples(ExampleGenerator.java:180) at org.apache.pig.PigServer.getExamples(PigServer.java:1240) ... 8 more {quote} I also get the following debug message : {quote} DEBUG mapReduceLayer.PigMapReduce$Reduce: Reduce Plan empty! {quote} However, if I save dataUidNotNullDistinct and load it again, I got no more error, i.e. that script below is working: {code:title=scriptOK.pig|borderStyle=solid} data = LOAD 'inputNoBag' USING PigStorage('\t') AS (uid:chararray, hash:chararray); dataUid = FOREACH data GENERATE uid as uid, hash as hash; dataUidNotNull = FILTER dataUid BY (uid != '0') AND (uid != '') AND (uid is not null); dataUidNotNullDistinct = DISTINCT dataUidNotNull; STORE dataUidNotNullDistinct INTO 'output'; dataUidNotNullDistinct2 = LOAD 'output' USING PigStorage('\t') AS (uid:chararray, hash:chararray); dataUidGroup = GROUP dataUidNotNullDistinct2 BY uid; ILLUSTRATE dataUidGroup; {code} -- This message was sent by Atlassian JIRA (v6.2#6252)