[ https://issues.apache.org/jira/browse/PIG-2541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Dai resolved PIG-2541. ----------------------------- Resolution: Fixed Fix Version/s: 0.11 Release Note: We add a new option -tagsource to PigStorage. With this flag, we can get the INPUT_FILE_NAME as the first column of the output data. eg: a = load '1.txt' using PigStorage('\t', '-tagsource'); Hadoop Flags: Reviewed Unit tests pass. test-patch: [exec] -1 overall. [exec] [exec] +1 @author. The patch does not contain any @author tags. [exec] [exec] +1 tests included. The patch appears to include 3 new or modified tests. [exec] [exec] -1 javadoc. The javadoc tool appears to have generated 1 warning messages. [exec] [exec] +1 javac. The applied patch does not increase the total number of javac compiler warnings. [exec] [exec] +1 findbugs. The patch does not introduce any new Findbugs warnings. [exec] [exec] -1 release audit. The applied patch generated 533 release audit warnings (more than the trunk's current 530 warnings). javac and release audit warning is unrelated. Patch committed to trunk, thanks Prashant! > Automatic record provenance (source tagging) for PigStorage > ----------------------------------------------------------- > > Key: PIG-2541 > URL: https://issues.apache.org/jira/browse/PIG-2541 > Project: Pig > Issue Type: Improvement > Components: impl > Affects Versions: 0.9.1 > Reporter: Richard Ding > Assignee: Prashant Kommireddi > Fix For: 0.11 > > Attachments: PIG-2541.patch, PIG-2541_2.patch, PIG-2541_3.patch > > > There are a lot of interests in knowing where the data comes from when > loading from a directory (or a set of directories). One can do it manually > (see https://cwiki.apache.org/confluence/display/PIG/FAQ). But it will be > more convenient for users if we implement this in the PigStorage with a > command line option (e.g., pig.source.tagging=true/false) to turn it on/off. > By default it will be off. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira