Hi Shashikant.
Pig supports streaming with help of *stream *operator. This allows invoking
any external executables i.e., perl, python, c++, php etc... inside your
pig scripts.
Thanks
Naga
On Mon, Dec 14, 2015 at 12:18 PM, Shashikant K <
shashikant.kulkarn...@gmail.com> wrote:
> Hi All,
>
> Her
Hi,
This is because of some jars missing in the classpath. Please add missing
jars like log4j, commons-logging-1.1.1.jar and others if necessary.
Thanks
Nagamallikarjuna
On Tue, Jul 21, 2015 at 2:44 PM, Divya Gehlot
wrote:
> Hi,
> Sorry for such a basic question but I am breaking my head to so
Hi,
use work flow manager Oozie, to create the work flow (DAG of jobs i.e pig
scripts).
Thanks
Nagamallikarjuna
On Wed, Apr 15, 2015 at 1:46 PM, pth001 wrote:
> Hi,
>
> How can I create a pipeline (containing a sequence of pig scripts)?
>
> BR,
> Patcharee
>
--
Thanks and Regards
Nagamall
Hi,
Add all the required jars to the PIG CLASSPATH variable, It will resolve
the issue.
Thanks
Naga
On Fri, Dec 26, 2014 at 3:06 PM, Venkat Ramakrishnan <
venkat.archit...@gmail.com> wrote:
> Thanks Praveen. I am running pig-14 on Windows 7.
>
> Can anyone confirm if Hadoop is really required f
Hai,
Write UDF in java to extract the month and any other values from your input
string
Thanks
Naga
On Aug 18, 2014 8:49 PM, "murali krishna p"
wrote:
>
>
> Trying to read a table column defined as datetime in my pig script as
> follows
>
> load ‘/tmp.psv’ using PIgStore() (open_dte : chara
Hi,
We are calling external map reduce program inside our pig script to perform
a specific task. Lets take the example crawling process.
-- Load the all seed urls into the relation crawldata.
*crawldata = load 'baseurls' using PigStorage( pageid: chararray,
pageurl:chararray)*
normalizedata = fo
Hi
Write a UDF, it takes date and no of days to add and returns the date
Thanks
Naga
On Dec 14, 2013 6:19 AM, "Krishnan Narayanan"
wrote:
> Hi All ,
>
> I am trying to do something like (get_date +46 days) , how to achieve this
> in pig.
>
> I am using pig 0.10
> help much appreciated.
>
>
Hai,
Please go through the following code,
Input Data:
---
DocNameTokens
--
cricketsachin,sehwag,dravid,dhoni
movieamir,salman,hruthik,ranveer
cricketsachin,ganguly,rohit,dhoni
cricketsehwag,sachin,dravid,kohli
moviesalman,amir,sharukh
===
Hai,
Load those two files as two relations and extract or parse XML files by
using extract_regex_all and finally store them into CSV files.
Thanks
Naga
On Sep 12, 2013 5:44 AM, "jamal sasha" wrote:
> Hi,
> So I have different xml data sources...For example:
>
> src1.txt
>
>
> 1
>
>
> 2
Hai,
Just include fs -rmr directory path as first line of the pig script.
Thanks
Naga
On Sep 11, 2013 7:39 PM, "Dip Kharod" wrote:
> Hi,
> My Pig script stores data in an HDFS folder and I want to delete
> those folders (like in MapReduce with File object) in the script, every
> time I run it -
gher version of script.
>
>
> On Tue, Mar 5, 2013 at 3:32 PM, inelu nagamallikarjuna
> wrote:
>
> > Hi,
> >
> >
> > The function STRSPLIT is not there in the list of in built fuction of
> > hive-0.7.0. Please use any version from 0.8.0 on wo
Hi,
The function STRSPLIT is not there in the list of in built fuction of
hive-0.7.0. Please use any version from 0.8.0 on words. There are lots of
improvements from 0.7.0 to 0.10.0.
Thanks
Nagamallikarjuna
On Wed, Mar 6, 2013 at 4:58 AM, inelu nagamallikarjuna
wrote:
> Hi,
>
> Th
Hi,
This is the command *pig -version* in Linux shell.
Thanks
Nagamallikarjuna
On Wed, Mar 6, 2013 at 4:56 AM, Mix Nin wrote:
> I checked by removing REGISTER command, but still I get the error. How do I
> check the PIG version?
>
>
> On Tue, Mar 5, 2013 at 3:22 PM, inelu n
Hi,
strspit is a builtin function, so the register command is not required.
use same script by removing the first line. I already tested the script
against pig-0.10.0 version it is working fine.
Thanks
Nagamallikarjuna
On Wed, Mar 6, 2013 at 4:46 AM, Mix Nin wrote:
> Below is my script
>
>
> R
.pig.builtin.,
> org.apache.pig.impl.builtin.
>
>
> On Tue, Mar 5, 2013 at 3:07 PM, inelu nagamallikarjuna
> wrote:
>
> > Hi,
> >
> > There is a small mistake in your script. You used relation name called
> data
> > in second line use X instead of data.
>
Hi,
There is a small mistake in your script. You used relation name called data
in second line use X instead of data.
*Sample script:
X= LOAD '/streamming/read' AS (line : chararray);
Y = foreach X generate STRSPLIT(line,' ');
dump Y;*
Thanks
Nagamallikarjuna
On Wed, Mar 6, 2013 at 4:19 AM, Mi
*
Thanks
Nagamallikarjuna
On Wed, Mar 6, 2013 at 3:42 AM, inelu nagamallikarjuna
wrote:
> Hi,
>
> Use the fully qualified class name like org.apache.udf.myudf.udfName in
> the pig script while using udf.
> Otherwise use only udf name in the script and while running use like
Hi,
Use the fully qualified class name like org.apache.udf.myudf.udfName in the
pig script while using udf.
Otherwise use only udf name in the script and while running use like pig -
Dudf.import.list=org.apache.udf.myudf.evaluation.string scriptname.pig
Thanks
Nagamallikarjuna
On Wed, Mar 6, 20
Hi Vincent,
You can restrict the number of concurrent maps by setting this
parameter *mapred.tasktracker.map.tasks.maximum
= 1 or 2*.
*Thanks
Nagamallikarjuna*
On Mon, Jan 21, 2013 at 7:13 PM, Mohammad Tariq wrote:
> Hello Vincent,
>
> The number of map tasks for a job is primarily
Congrats Rohini..
On Thu, Nov 1, 2012 at 10:13 AM, Aniket Mokashi wrote:
> Congrats Rohini...
>
>
> On Mon, Oct 29, 2012 at 11:31 AM, Julien Le Dem
> wrote:
>
> > Congrats Rohini !
> >
> >
> > On Sun, Oct 28, 2012 at 9:42 AM, Bill Graham
> wrote:
> > > Congrats Rohini! Great news indeed.
> > >
20 matches
Mail list logo