[ https://issues.apache.org/jira/browse/PIG-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13888113#comment-13888113 ]
Daniel Dai commented on PIG-3373: --------------------------------- I am fine to checkin a test data file. However, you will need to put it into a proper directory (eg contrib/piggybank/java/src/test/java/org/apache/pig/piggybank/test/evaluation/xml/data) and make your test work with it. > XMLLoader returns non-matching nodes when a tag name spans through the block > boundary > ------------------------------------------------------------------------------------- > > Key: PIG-3373 > URL: https://issues.apache.org/jira/browse/PIG-3373 > Project: Pig > Issue Type: Bug > Components: piggybank > Affects Versions: site > Reporter: Ahmed Eldawy > Assignee: Ahmed Eldawy > Labels: patch > Attachments: PIG3373.patch, PIG3373_1.patch, PIG3373_2.patch, > bad-file.xml.bz2 > > > When node start tag spans two blocks this tag is returned even if it is not > of the type. > Example: For the following input file > <event id="3423"> > <ev > -------- BLOCK BOUNDARY > entually id="dfasd"> > XMLoader with tag type 'event' should return only the first one but it > actually returns both of them -- This message was sent by Atlassian JIRA (v6.1.5#6160)