Hi list,

I would like to parse the following XML file using Pig:

<page>
  <id>1</id>
  <revision>
    <id>1</id>
    <username>muehlburger</username>
  </revision>
  <revision>
    <id>2</id>
    <username>muehlburger</username>
  </revision>
  <revision>
    <id>3</id>
    <username>user1</username>
  </revision>
  ...
  <revision>
    <id>34334398</id>
    <username>muehlburger</username>
  </revision>
</page>
<page>
  <id>2</id>
  <revision>
    <id>343434</id>
    <username>muehlburger</username>
  </revision>
  <revision>
    <id>25343232</id>
    <username>muehlburger</username>
  </revision>
  <revision>
    <id>43434333</id>
    <username>user2</username>
  </revision>
  ...
  <revision>
    <id>5409589854</id>
    <username>user5</username>
  </revision>
</page>
...

I would like to produce CSV-style output like the following, with one row
per revision and the id of the enclosing page repeated on each row:

page_id revision_id username
1 1 muehlburger
1 2 muehlburger
1 3 user1
1 34334398 muehlburger
2 343434 muehlburger
2 25343232 muehlburger
2 43434333 user2
2 5409589854 user5
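
To make the intended mapping concrete, here is a minimal sketch of the
transformation in plain Python (not Pig). It only illustrates the result I am
after. The function name flatten_pages and the SAMPLE string are made up for
this example, and the sketch assumes the input is a bare sequence of <page>
elements with no XML declaration and no enclosing root element.

```python
import xml.etree.ElementTree as ET


def flatten_pages(xml_text):
    """Return one (page_id, revision_id, username) tuple per revision."""
    # Assumption: the input has no single root element, so wrap it in a
    # synthetic <pages> root to make it well-formed for the parser.
    root = ET.fromstring("<pages>" + xml_text + "</pages>")
    rows = []
    for page in root.findall("page"):
        # findtext("id") only looks at direct children, so this is the
        # page id and not the id of one of its revisions.
        page_id = page.findtext("id")
        for revision in page.findall("revision"):
            rows.append(
                (page_id, revision.findtext("id"), revision.findtext("username"))
            )
    return rows


# Shortened, hypothetical sample in the same shape as the data above.
SAMPLE = """
<page>
  <id>1</id>
  <revision><id>1</id><username>muehlburger</username></revision>
  <revision><id>3</id><username>user1</username></revision>
</page>
<page>
  <id>2</id>
  <revision><id>343434</id><username>muehlburger</username></revision>
</page>
"""

if __name__ == "__main__":
    for row in flatten_pages(SAMPLE):
        print(" ".join(row))
```

The page id is read once per page and repeated for each of its revisions,
which is the flattening step I do not know how to express in Pig.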

How can I accomplish this using Pig?

Thank you very much for your help!

Kind regards,
Herbert
--
=================================================================
Herbert Muehlburger  Software Development and Business Management
                                    Graz University of Technology
www.muehlburger.at                   www.twitter.com/hmuehlburger
=================================================================
