I think it will be very useful to put in Hive (or rather a contrib in hive) but I don't think GPL is compatible with Apache. So it will be a no go putting the maxmind code/data in Hive.
Prasad ________________________________ From: Edward Capriolo <[email protected]> Reply-To: <[email protected]> Date: Tue, 28 Apr 2009 07:42:48 -0700 To: <[email protected]> Subject: GEO-IP as User Defined Function Hey all, You may all be familiar with geo-ip from maxmind. http://www.maxmind.com/app/api. GNU General Public License (GPL) I am running a process where I have to geo locate IP addresses. I think this would be a good UDF. Right now I am using an external map reduce process to be inserted back into hive. GEO-CITY(columnname) GEO-STATE(columnname) The only drawback I can see is that GEO IP requires database files to be on the local file system. However the functions could return NULL if the local files do not exist. Does anyone think these would be useful in hive? Edward
