Hi Alan,
Here are my answers:
1) Why did you try to use RANK?
I needed to create a second attribute in order to establish a one-to-one
relationship between the two data bags, names and ids. RANK allowed me to
generate a unique sequence number for each item in a bag, according to the tuple
position within the
A few questions:
1) Why did you try to use RANK? I don't see how RANK is part of this.
2) The semantics here aren't clear to me. record_id appears to be crossed with
name and id, but name and id appear to be chosen in order. If this is join
semantics I'd have expected two more entries in B,
Hi Dev Team,
I developed a UDF to handle the following situation on Pig 10.0 and would like
to know whether I could contribute it to the project.
Let us consider a bag A with the following data:
A: {record_id: (1), names: {(ALAN),(SARAI)}, ids: {(7),(8)}}
and an expected bag B