Re: Propose UDF

2013-09-05 Thread Alan del Rio
Hi Alan, Here are my answers: 1) Why did you try to use RANK? I needed to create a second attribute in order to generate a one to one relationship between the two data bags, names and ids. RANK allowed me to generate a unique sequence per data bag item according to the tuple position within the

Re: Propose UDF

2013-09-04 Thread Alan Gates
A few questions: 1) Why did you try to use RANK? I don't see how rank is part of this. 2) The semantics here aren't clear to me. record_id appears to be crossed with name and id but name and id appear to be chosen in order. If this is join semantics I'd have expected two more entries in B,

Propose UDF

2013-08-31 Thread Alan del Rio Mendez
Hi Dev Team, I developed a UDF to handle the following situation on pig 10.0 and want to see if I could contribute with it to the project. Let us consider a BAG A with the following data: A:{record_id:{1),names:{(ALAN),(SARAI)}},ids:{(7),(8)}} and an expected bag B