[
https://issues.apache.org/jira/browse/PIG-4355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Daniel Dai updated PIG-4355:
----------------------------
Attachment: 14.diff
Pull the patch and attach to Jira since Pig need to leave a self-contained
trace for the patch.
Patch committed to trunk cleanly. Thanks John!
> Piggybank: XPath cant handle namespace in xpath, nor can it return more than
> one match
> ---------------------------------------------------------------------------------------
>
> Key: PIG-4355
> URL: https://issues.apache.org/jira/browse/PIG-4355
> Project: Pig
> Issue Type: Bug
> Components: piggybank
> Affects Versions: 0.14.0
> Reporter: John Cavanaugh
> Assignee: John Cavanaugh
> Attachments: 14.diff
>
>
> If you pass an xpath that contains a namespace the XPath UDF will always fail
> to match.
> It would be better to either silently remove the namespace or provide a
> parameter that will remove it.
> The reason it is desirable to ignore xpath's with namespaces is that many xml
> tools when selecting an xpath provide the namespace. It makes cutting &
> pasting into a pig script painful if you need to manually remove it.
> Additionally XPath only returns the *first* match. It is often desirable to
> return all matches and allow for a flattening to process multiple records.
> An XPathAll would be useful to have.
> A patch is available as a git pullrequest at
> https://github.com/apache/pig/pull/14
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)