[ 
https://issues.apache.org/jira/browse/PIG-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Olga Natkovich updated PIG-2023:
--------------------------------

    Fix Version/s:     (was: 0.10)
    
> lineage tracking  for casting should compare LoadCaster returned from 
> LoadFunc instead of comparing the FuncSpec
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: PIG-2023
>                 URL: https://issues.apache.org/jira/browse/PIG-2023
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Thejas M Nair
>
> When lineage of a column is tracked for the purpose of finding the LoadCaster 
> associated with a column, and it finds that a column has two possible 
> sources, it associates a LoadCaster (through a LoadFunc) only if the funcspec 
> for LoadFunc in both cases are the same. But it is possible that the two 
> LoadFunc with different func spec actually use the same LoadCaster (for 
> example the default of Utf8StorageConverter). If the LoadFunc funcspec don't 
> match, the LoadCaster returned by the LoadFunc should also be compred. If 
> they are equal, this LoadCaster should be associated with the column . The 
> LoadCaster implementation would need to override equals().
> For example, in this case the columns in relation u use the same LoadCaster -
> {code}
> l1 = load 'x' using PigStorage(',') as (a,b);
> l2 = load 'y' using PigStorage(':') as (a,b);
> u = union l1,l2;
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to