Re: Pig script to load C++ library

2015-12-13 Thread inelu nagamallikarjuna
Hi Shashikant. Pig supports streaming through the *stream* operator, which lets you invoke any external executable (e.g., Perl, Python, C++, PHP) from inside your Pig scripts. Thanks Naga On Mon, Dec 14, 2015 at 12:18 PM, Shashikant K < shashikant.kulkarn...@gmail.com> wrote: > Hi All, > > Her
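
A minimal sketch of an external program that the stream operator could call, written in Python for illustration (the original question concerns a C++ library, and the same stdin/stdout contract would apply to a compiled binary). It assumes tab-delimited records, one per line; the file name upper.py, the function names and the choice of upper-casing the second field are hypothetical. On the Pig side this would be invoked with something like B = STREAM A THROUGH `python upper.py`; and the script would have to be available on the task nodes.

```python
import sys


def transform(line):
    """Upper-case the second field of one tab-separated record.

    The transformation itself is only a placeholder; any per-record
    logic could go here.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) > 1:
        fields[1] = fields[1].upper()
    return "\t".join(fields)


def main(stdin=sys.stdin, stdout=sys.stdout):
    """Read records from stdin and write transformed records to stdout."""
    for line in stdin:
        stdout.write(transform(line) + "\n")


# When saved as a script for streaming (hypothetically upper.py),
# end the file with a call to main().
```

The streams are passed as parameters so the logic can be checked locally with in-memory buffers before it is wired into a Pig job.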

Re: run pig script through eclipse without hadoop

2015-07-21 Thread inelu nagamallikarjuna
Hi, This happens because some jars are missing from the classpath. Please add the missing jars, such as log4j and commons-logging-1.1.1.jar, and any others that are required. Thanks Nagamallikarjuna On Tue, Jul 21, 2015 at 2:44 PM, Divya Gehlot wrote: > Hi, > Sorry for such a basic question but I am breaking my head to so

Re: create a pipeline

2015-04-15 Thread inelu nagamallikarjuna
Hi, Use the workflow manager Oozie to create the workflow (a DAG of jobs, i.e. Pig scripts). Thanks Nagamallikarjuna On Wed, Apr 15, 2015 at 1:46 PM, pth001 wrote: > Hi, > > How can I create a pipeline (containing a sequence of pig scripts)? > > BR, > Patcharee > -- Thanks and Regards Nagamall

Re: ClassNotFoundException while running pig in local mode

2014-12-26 Thread inelu nagamallikarjuna
Hi, Add all the required jars to the PIG_CLASSPATH variable; that will resolve the issue. Thanks Naga On Fri, Dec 26, 2014 at 3:06 PM, Venkat Ramakrishnan < venkat.archit...@gmail.com> wrote: > Thanks Praveen. I am running pig-14 on Windows 7. > > Can anyone confirm if Hadoop is really required f

Re: ToDate and GetMonth function help

2014-08-18 Thread inelu nagamallikarjuna
Hi, Write a UDF in Java to extract the month and any other values from your input string. Thanks Naga On Aug 18, 2014 8:49 PM, "murali krishna p" wrote: > > > Trying to read a table column defined as datetime in my pig script as > follows > > load ‘/tmp.psv’ using PIgStore() (open_dte : chara
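
The extraction logic such a UDF would carry can be sketched as follows. This is Python rather than the Java the reply suggests, and the input format is an assumption: the quoted question is cut off before showing any sample values, so the pattern "%Y-%m-%d %H:%M:%S" and the name get_month are purely hypothetical.

```python
from datetime import datetime

# Assumed layout of the datetime string; the original message does not show it.
ASSUMED_FORMAT = "%Y-%m-%d %H:%M:%S"


def get_month(value, fmt=ASSUMED_FORMAT):
    """Return the month (1-12) of a datetime string, or None for empty input."""
    if value is None or not value.strip():
        return None
    return datetime.strptime(value.strip(), fmt).month
```

Returning None for empty input mirrors how a Pig UDF would usually pass nulls through instead of failing the whole job.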

Re: Query on Pig

2014-07-10 Thread inelu nagamallikarjuna
Hi, We call an external MapReduce program inside our Pig script to perform a specific task. Let's take a crawling process as an example. -- Load all the seed URLs into the relation crawldata. *crawldata = load 'baseurls' using PigStorage() as (pageid: chararray, pageurl: chararray);* normalizedata = fo

Re: Adding days to Pig

2013-12-14 Thread inelu nagamallikarjuna
Hi, Write a UDF that takes a date and the number of days to add, and returns the resulting date. Thanks Naga On Dec 14, 2013 6:19 AM, "Krishnan Narayanan" wrote: > Hi All , > > I am trying to do something like (get_date +46 days) , how to achieve this > in pig. > > I am using pig 0.10 > help much appreciated. > >
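
A minimal sketch of the date arithmetic such a UDF would perform, in Python for illustration. The function name add_days and the yyyy-MM-dd string format are assumptions; the thread does not say how the date is represented.

```python
from datetime import datetime, timedelta


def add_days(date_str, days, fmt="%Y-%m-%d"):
    """Return date_str shifted by `days` days, in the same string format.

    The default format is an assumption, not something the thread specifies.
    """
    parsed = datetime.strptime(date_str, fmt)
    return (parsed + timedelta(days=days)).strftime(fmt)
```

For the case in the question, add_days("2013-12-14", 46) gives "2014-01-29"; negative values of days move the date backwards.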

Re: Simple word count in pig..

2013-11-20 Thread inelu nagamallikarjuna
Hi, Please go through the following code, Input Data: --- DocName Tokens -- cricket sachin,sehwag,dravid,dhoni movie amir,salman,hruthik,ranveer cricket sachin,ganguly,rohit,dhoni cricket sehwag,sachin,dravid,kohli movie salman,amir,sharukh ===
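
The Pig code of this reply is cut off in the preview, so the following is not the author's script. It is a plain-Python sketch of the counting logic applied to the sample input above, assuming the goal is a count of each token per DocName; whether the original counted per document or overall cannot be told from the preview. The names SAMPLE and count_tokens are hypothetical.

```python
from collections import Counter, defaultdict

# Sample rows from the message: (DocName, comma-separated Tokens).
SAMPLE = [
    ("cricket", "sachin,sehwag,dravid,dhoni"),
    ("movie", "amir,salman,hruthik,ranveer"),
    ("cricket", "sachin,ganguly,rohit,dhoni"),
    ("cricket", "sehwag,sachin,dravid,kohli"),
    ("movie", "salman,amir,sharukh"),
]


def count_tokens(rows):
    """Group rows by document name and count each token within the group."""
    counts = defaultdict(Counter)
    for doc_name, tokens in rows:
        # Split the token list and skip empty entries from stray commas.
        counts[doc_name].update(t for t in tokens.split(",") if t)
    return counts
```

In Pig terms this corresponds roughly to tokenizing and flattening the Tokens field, grouping by (DocName, token) and applying COUNT to each group.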

Re: Converting xml to csv

2013-09-11 Thread inelu nagamallikarjuna
Hi, Load those two files as two relations, parse the XML with REGEX_EXTRACT_ALL, and finally store the results as CSV files. Thanks Naga On Sep 12, 2013 5:44 AM, "jamal sasha" wrote: > Hi, > So I have different xml data sources...For example: > > src1.txt > > > 1 > > > 2

Re: Delete Output Folder in Pig Script

2013-09-11 Thread inelu nagamallikarjuna
Hi, Just include fs -rmr followed by the directory path as the first line of the Pig script. Thanks Naga On Sep 11, 2013 7:39 PM, "Dip Kharod" wrote: > Hi, > My Pig script stores data in an HDFS folder and I want to delete > those folders (like in MapReduce with File object) in the script, every > time I run it -

Re: Error during parsing

2013-03-05 Thread inelu nagamallikarjuna
gher version of script. > > > On Tue, Mar 5, 2013 at 3:32 PM, inelu nagamallikarjuna > wrote: > > > Hi, > > > > > > The function STRSPLIT is not in the list of built-in functions of > > pig-0.7.0. Please use any version from 0.8.0 on wo

Re: Error during parsing

2013-03-05 Thread inelu nagamallikarjuna
Hi, The function STRSPLIT is not in the list of built-in functions of pig-0.7.0. Please use any version from 0.8.0 onwards. There are lots of improvements from 0.7.0 to 0.10.0. Thanks Nagamallikarjuna On Wed, Mar 6, 2013 at 4:58 AM, inelu nagamallikarjuna wrote: > Hi, > > Th

Re: Error during parsing

2013-03-05 Thread inelu nagamallikarjuna
Hi, Run the command *pig -version* in a Linux shell. Thanks Nagamallikarjuna On Wed, Mar 6, 2013 at 4:56 AM, Mix Nin wrote: > I checked by removing REGISTER command, but still I get the error. How do I > check the PIG version? > > > On Tue, Mar 5, 2013 at 3:22 PM, inelu n

Re: Error during parsing

2013-03-05 Thread inelu nagamallikarjuna
Hi, STRSPLIT is a built-in function, so the REGISTER command is not required. Use the same script with the first line removed. I already tested the script against pig-0.10.0 and it works fine. Thanks Nagamallikarjuna On Wed, Mar 6, 2013 at 4:46 AM, Mix Nin wrote: > Below is my script > > > R

Re: Error during parsing

2013-03-05 Thread inelu nagamallikarjuna
.pig.builtin., > org.apache.pig.impl.builtin. > > > On Tue, Mar 5, 2013 at 3:07 PM, inelu nagamallikarjuna > wrote: > > > Hi, > > > > There is a small mistake in your script. You used relation name called > data > > in second line use X instead of data. >

Re: Error during parsing

2013-03-05 Thread inelu nagamallikarjuna
Hi, There is a small mistake in your script. You used a relation named data in the second line; use X instead of data. *Sample script: X = LOAD '/streamming/read' AS (line : chararray); Y = foreach X generate STRSPLIT(line,' '); dump Y;* Thanks Nagamallikarjuna On Wed, Mar 6, 2013 at 4:19 AM, Mi

Re: UDF to calculate Average of whole dataset

2013-03-05 Thread inelu nagamallikarjuna
* Thanks Nagamallikarjuna On Wed, Mar 6, 2013 at 3:42 AM, inelu nagamallikarjuna wrote: > Hi, > > Use the fully qualified class name like org.apache.udf.myudf.udfName in > the pig script while using udf. > Otherwise use only udf name in the script and while running use like

Re: UDF to calculate Average of whole dataset

2013-03-05 Thread inelu nagamallikarjuna
Hi, Use the fully qualified class name, like org.apache.udf.myudf.udfName, when calling the UDF in the Pig script. Otherwise, use only the UDF name in the script and pass the package on the command line, like pig -Dudf.import.list=org.apache.udf.myudf.evaluation.string scriptname.pig Thanks Nagamallikarjuna On Wed, Mar 6, 20

Re: Is there a way to limit the number of maps produced by HBaseStorage ?

2013-01-21 Thread inelu nagamallikarjuna
Hi Vincent, You can restrict the number of concurrent maps by setting this parameter *mapred.tasktracker.map.tasks.maximum = 1 or 2*. *Thanks Nagamallikarjuna* On Mon, Jan 21, 2013 at 7:13 PM, Mohammad Tariq wrote: > Hello Vincent, > > The number of map tasks for a job is primarily

Re: [ANNOUNCE] Welcome new Apache Pig Committers Rohini Palaniswamy

2012-11-01 Thread inelu nagamallikarjuna
Congrats Rohini.. On Thu, Nov 1, 2012 at 10:13 AM, Aniket Mokashi wrote: > Congrats Rohini... > > > On Mon, Oct 29, 2012 at 11:31 AM, Julien Le Dem > wrote: > > > Congrats Rohini ! > > > > > > On Sun, Oct 28, 2012 at 9:42 AM, Bill Graham > wrote: > > > Congrats Rohini! Great news indeed. > > >