[ https://issues.apache.org/jira/browse/PIG-592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Dai updated PIG-592:
---------------------------

    Status: Patch Available  (was: Open)

> schema inferred incorrectly
> ---------------------------
>
>                 Key: PIG-592
>                 URL: https://issues.apache.org/jira/browse/PIG-592
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Christopher Olston
>             Fix For: 0.6.0
>
>         Attachments: PIG-592-1.patch, PIG-592-2.patch, PIG-592-3.patch
>
>
> A simple pig script, that never introduces any schema information:
> A = load 'foo';
> B = foreach (group A by $8) generate group, COUNT($1);
> C = load 'bar';      // ('bar' has two columns)
> D = join B by $0, C by $0;
> E = foreach D generate $0, $1, $3;
> Fails, complaining that $3 does not exist:
> java.io.IOException: Out of bound access. Trying to access non-existent
> column: 3. Schema {B::group: bytearray,long,bytearray} has 3 column(s).
> Apparently Pig gets confused, and thinks it knows the schema for C (a single
> bytearray column).
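
The reported failure can be illustrated with a small model of schema propagation. The Python sketch below is not Pig's implementation: `join_schema` and `check_projection` are hypothetical helpers, and the sketch assumes that a relation loaded without a schema has an unknown schema and that a join involving such a relation should have an unknown schema as well. Under that assumption the projection `$3` cannot be rejected before the data is read, whereas a wrongly inferred one-column schema for C reproduces the three-column schema from the error message.

```python
from typing import List, Optional

# None models an unknown schema (a relation loaded without schema information).
Schema = Optional[List[str]]


def join_schema(left: Schema, right: Schema) -> Schema:
    """Hypothetical rule: concatenate the columns if both sides are known,
    otherwise the joined schema is unknown."""
    if left is None or right is None:
        return None
    return left + right


def check_projection(schema: Schema, index: int) -> bool:
    """Return True if the positional reference $index can be accepted up front."""
    if schema is None:
        # Unknown schema: the column count is not known, so do not reject.
        return True
    return 0 <= index < len(schema)


# B = foreach (group A by $8) generate group, COUNT($1);
b = ["group: bytearray", "long"]

# What the error message suggests was inferred for C: a single bytearray column.
c_inferred = ["bytearray"]
# What the script actually states about C: nothing.
c_unknown = None

buggy = join_schema(b, c_inferred)    # three columns, as in the error message
expected = join_schema(b, c_unknown)  # unknown schema

print(check_projection(buggy, 3))
print(check_projection(expected, 3))
```

With the one-column schema for C, `$3` is out of bounds in the joined relation; with C left unknown, the check is deferred and the script from the report would not be rejected on schema grounds.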