Re: Query | Join Internals

2015-07-29 Thread Gagan Juneja
Any help? Regards, Gagan On Tuesday 14 July 2015, Gagan Juneja wrote: > Hi Team, > > We are using Pig intensively in our various projects. We are doing > optimizations for that we wanted to know how join works. Though we have > moved to Skewed joins for some of our use cases. > > At many places

Re: Query | Join Internals

2015-07-29 Thread Gagan Juneja
Any help? Regards, Gagan On Tuesday 14 July 2015, Gagan Juneja wrote: > Hi Team, > > We are using Pig intensively in our various projects. We are doing > optimizations for that we wanted to know how join works. Though we have > moved to Skewed joins for some of our use cases. > > At many places

New PigIpTools UDFs

2015-07-29 Thread Christian F. Gonzalez Di Antonio
Hi everyone, I would like to share with you my new Pig UDFs library!, This is a wrapper function for JAva InetAddress and MaxMind GeoIP Here you have the link of my repository: https://github.com/christiangda/pigiptools

Re: Need help on writing a UDF

2015-07-29 Thread Aaron Zimmerman
I don't entirely follow what your problem is, but it sounds a bit like you might do better to load all of the data from HBase into its own relation and then join it? What is the overall objective of the pig script you are writing? On Wed, Jul 29, 2015 at 8:51 AM, Sandeep Mellacheruvu < sandeep.me

Need help on writing a UDF

2015-07-29 Thread Sandeep Mellacheruvu
Hi, I need to write a pig UDF which takes string and personId as an input tuple. The personId is a key to query hbase within this UDF. I have created a connection to hbase when the UDF class loads. The problem here is PigStorage actually treats each row as a tuple and I have to query each per