[ https://issues.apache.org/jira/browse/PIG-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13852133#comment-13852133 ]
Dmitriy V. Ryaboy commented on PIG-3630: ---------------------------------------- Is this a AvroStorage or data issue? grunt> import '/Users/dmitriy/tmp/tf_idf.macro'; grunt> register build/ivy/lib/Pig/avro-1.7.4.jar grunt> register build/ivy/lib/Pig/json-simple-1.1.jar grunt> register contrib/piggybank/java/piggybank.jar grunt> define AvroStorage org.apache.pig.piggybank.storage.avro.AvroStorage(); grunt> emails = load '/Users/dmitriy/Downloads/enron.avro'; grunt> describe emails Schema for emails unknown. (this is the same in both pig 0.11 and pig 0.12). Can you provide a simple reproducible use case that doesn't involve Avro, etc? Can you share what debugging you've done so far? > Macros that work in Pig 0.11 fail in Pig 0.12 :( > ------------------------------------------------ > > Key: PIG-3630 > URL: https://issues.apache.org/jira/browse/PIG-3630 > Project: Pig > Issue Type: Bug > Components: parser > Affects Versions: 0.12.0 > Reporter: Russell Jurney > > http://my.safaribooksonline.com/book/databases/9781449326890/7dot-exploring-data-with-reports/i_sect13_id196600_html > The ntf-idf macro listed there works under 0.11. Under 0.12, it results in > this: > 13/12/16 22:09:19 INFO pig.ExecTypeProvider: Trying ExecType : LOCAL > 2013-12-16 22:09:19,159 [main] INFO org.apache.pig.Main - Apache Pig version > 0.13.0-SNAPSHOT (rUnversioned directory) compiled Dec 09 2013, 14:37:29 > 2013-12-16 22:09:19,159 [main] INFO org.apache.pig.Main - Logging error > messages to: /private/tmp/pig_1387260559120.log > 2013-12-16 22:09:19.268 java[38060:1903] Unable to load realm info from > SCDynamicStore > 2013-12-16 22:09:19,528 [main] INFO > org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting > to hadoop file system at: file:/// > 2013-12-16 22:09:20,189 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR > 1025: > at expanding macro 'tf_idf' (per_business.pig:9) > <file per_business.pig, line 35, column 17> Invalid field projection. > Projected field [tf_idf] does not exist in schema: > business_id:chararray,token:chararray,term_freq:double,num_docs_with_token:long. > 2013-12-16 22:09:20,189 [main] ERROR org.apache.pig.tools.grunt.Grunt - > org.apache.pig.impl.plan.PlanValidationException: ERROR 1025: > at expanding macro 'tf_idf' (per_business.pig:9) > <file per_business.pig, line 35, column 17> Invalid field projection. > Projected field [tf_idf] does not exist in schema: > business_id:chararray,token:chararray,term_freq:double,num_docs_with_token:long. > at > org.apache.pig.newplan.logical.expression.ProjectExpression.findColNum(ProjectExpression.java:191) > at > org.apache.pig.newplan.logical.expression.ProjectExpression.setColumnNumberFromAlias(ProjectExpression.java:174) > at > org.apache.pig.newplan.logical.visitor.ColumnAliasConversionVisitor$1.visit(ColumnAliasConversionVisitor.java:53) > at > org.apache.pig.newplan.logical.expression.ProjectExpression.accept(ProjectExpression.java:215) > at > org.apache.pig.newplan.DependencyOrderWalker.walk(DependencyOrderWalker.java:75) > at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:52) > at > org.apache.pig.newplan.logical.optimizer.AllExpressionVisitor.visit(AllExpressionVisitor.java:142) > at > org.apache.pig.newplan.logical.relational.LOInnerLoad.accept(LOInnerLoad.java:128) > at > org.apache.pig.newplan.DependencyOrderWalker.walk(DependencyOrderWalker.java:75) > at > org.apache.pig.newplan.logical.optimizer.AllExpressionVisitor.visit(AllExpressionVisitor.java:124) > at > org.apache.pig.newplan.logical.relational.LOForEach.accept(LOForEach.java:76) > at > org.apache.pig.newplan.DependencyOrderWalker.walk(DependencyOrderWalker.java:75) > at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:52) > at org.apache.pig.PigServer$Graph.compile(PigServer.java:1694) > at org.apache.pig.PigServer$Graph.compile(PigServer.java:1686) > at org.apache.pig.PigServer$Graph.access$200(PigServer.java:1387) > at org.apache.pig.PigServer.execute(PigServer.java:1302) > at org.apache.pig.PigServer.executeBatch(PigServer.java:391) > at org.apache.pig.PigServer.executeBatch(PigServer.java:369) > at > org.apache.pig.tools.grunt.GruntParser.executeBatch(GruntParser.java:133) > at > org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:195) > at > org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:166) > at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:84) > at org.apache.pig.Main.run(Main.java:600) > at org.apache.pig.Main.main(Main.java:156) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:606) > at org.apache.hadoop.util.RunJar.main(RunJar.java:156) -- This message was sent by Atlassian JIRA (v6.1.4#6159)