Check the DataFu library from LinkedIn. It should have all the statistical
functions you expect to use in Pig.
https://github.com/linkedin/datafu
Thank You,
Manish.
-Original Message-
From: Cheolsoo Park [mailto:cheol...@cloudera.com]
Sent: Wednesday, September 26, 2012 8:11 AM
T
Yogesh,
I think to get the desired result you need to group by cid, date, and time
together. If you don't want to group by all of them, break the original dataset
into two: keep the date and time separate from the dataset where you are running
the group by, and at a later point join them back based on ci
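The two options above can be sketched in plain Python (not Pig Latin), just to
show how the results differ. This is only an illustration under assumptions:
the amount column, the sample values, and the helper names group_by_all and
group_then_join are all made up, and the join key is assumed to be cid.

```python
from collections import defaultdict

# Hypothetical sample records: (cid, date, time, amount). The names cid, date
# and time come from the thread; amount and every value here are invented.
rows = [
    ("c1", "2012-09-25", "10:00", 5),
    ("c1", "2012-09-25", "10:00", 7),
    ("c1", "2012-09-26", "11:30", 1),
    ("c2", "2012-09-25", "09:15", 4),
]


def group_by_all(records):
    """Option 1: group on the composite key (cid, date, time)."""
    totals = defaultdict(int)
    for cid, date, time, amount in records:
        totals[(cid, date, time)] += amount
    return dict(totals)


def group_then_join(records):
    """Option 2: aggregate by cid only, then join date/time back on cid."""
    totals = defaultdict(int)
    for cid, _date, _time, amount in records:
        totals[cid] += amount
    # Date and time live in a separate, de-duplicated relation keyed by cid.
    stamps = sorted({(cid, date, time) for cid, date, time, _ in records})
    # Join step (assumed key: cid): attach the per-cid total to each stamp.
    return [(cid, date, time, totals[cid]) for cid, date, time in stamps]


by_all = group_by_all(rows)
joined = group_then_join(rows)
```

With option 1 each (cid, date, time) combination gets its own total. With
option 2 every row for a cid carries the same per-cid total, so which one is
right depends on the result you are after.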
Thanks Siddharth this is really new stuff for me.
Sent from my BlackBerry, pls excuse typo
-Original Message-
From: Siddharth Tiwari
Date: Thu, 6 Sep 2012 03:38:58
To: ;
Reply-To: user@pig.apache.org
Subject: Re: Machine Learning + Pig?
Hi Anurag,
Yes, it's of course possible.
In fact t
Anurag,
Pig gives you a data flow capability in which you can do conventional data
analysis and run aggregations. As far as ML is concerned, use Mahout, which
implements most of the algorithms, such as clustering (k-means), LR, etc. Let
me know if you need specific documentation on this.
Check the link cwiki.apache.org/PIG
Thank You,
Manish
-Original Message-
From: prabhu k
Date: Sat, 11 Aug 2012 08:42:49
To:
Reply-To: user@pig.apache.org
Subject: pig - Installation
Hi Users,
I'm new to Pig. Can anyone provide Pig installation &
I think 2.6.
From: Jagat
To: Manish Bhoge ; user@pig.apache.org
Sent: Sunday, May 20, 2012 8:56 AM
Subject: Re: Jython Interpretor error
Hi
Are you using Jython 2.5? Pig works with that version.
---
Sent from Mobile , short and crisp.
On 20-May-2012 8:49
Hi All,
I have written some UDFs in a Python script (test.py). Now I want to register
this Python script from the Pig Grunt shell, but when I register the script I
get the errors below.
Case 1: When I place the Python script in a local directory like
/home/manish/python/src/test/py then
It gives me be
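For reference, here is a minimal sketch of what a Jython UDF script such as
test.py might contain. The original script is not shown in the thread, so the
function name to_upper and its schema are hypothetical. The fallback decorator
is also an assumption, there only so the file can be run outside Pig, which
normally supplies outputSchema itself.

```python
# Hypothetical contents for test.py; the real script is not in the thread.

# Pig makes the outputSchema decorator available to Jython scripts it loads.
# This fallback is only used when the file is run outside Pig (e.g. to test).
try:
    outputSchema
except NameError:
    def outputSchema(schema):
        def wrap(func):
            # Record the declared schema so it can be inspected in tests.
            func.output_schema = schema
            return func
        return wrap


@outputSchema("word:chararray")
def to_upper(word):
    """Return the upper-cased input; nulls (None) pass through unchanged."""
    if word is None:
        return None
    return word.upper()
```

In Grunt such a script is normally registered with a statement like
REGISTER 'test.py' USING jython AS myfuncs; and the function is then called as
myfuncs.to_upper(...). The alias myfuncs is just an example name.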
Pig does its own interpretation of data types. I doubt whether chararray is the
right choice for storing and counting an integer value.
Thank You,
-Original Message-
From: Dmitriy Ryaboy
Date: Thu, 8 Mar 2012 09:07:21
To: user@pig.apache.org
Reply
Hi,
There are two important configuration settings that Pig looks for: the NameNode
path and the JobTracker path. Make sure you have set up PIG_CLASSPATH for these.
Looking at the error detail, it doesn't look like a configuration error. It
talks about a UTF-8 error, which is related to the data you're reading