Hi Youngwoo,
I appreciate your help. It worked!
I re-installed everything freshly, and it started working. Dunno what was going
wrong earlier. This feature is amazing.
Regards,
Deepak
From: 김영우 [mailto:warwit...@gmail.com]
Sent: Thursday, January 13, 2011 8:41 A
Soren,
Adding to the 'oozie' alternative ...
With Oozie you can can do something like:
$ oozie pig -file SCRIPT
The command line options are aligned with Pig ones (you can do a direct
passthrough of options). You'll get a JOB ID (like it would be a PIG server)
and later you can monitor the prog
Hi Deepak,
I just build the pig snapshot from my PC and then I deploy a distribution to
server. Also I drop required jars into $PIG_HOME/lib directory.
After all, Seems it works fine.
Hopes this helps.
- Youngwoo
*My env for Hadoop:*
$ env | grep HADOOP
HADOOP_HOME=/usr/lib/hadoop-0.20
*My
The functions need to be part of an expression in a relational operator, for
example -
f = foreach l generate REGEX_EXTRACT_ALL('192.168.1.5:8020', '(.*)\\:(.*)');
(the above example does not make much sense as none of the columns in input
relation are being used.)
-Thejas
On 1/12/11 5:43 P
I see there are some builtin string functions, but I don't know how to use
them. I got this error when I follow the examples:
grunt> REGEX_EXTRACT_ALL('192.168.1.5:8020', '(.*)\:(.*)');
2011-01-12 19:34:23,773 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 1000: Error during parsing. Encoun
Yeah, that works great. Thanks Jonathan and Alan. I can see that all fields
in between feature will be totally useful for some cases.
On Wed, Jan 12, 2011 at 3:33 PM, Alan Gates wrote:
> Jonathan is right, you can do all fields in a tuple with *. I was thinking
> of doing all fields in between
Jonathan is right, you can do all fields in a tuple with *. I was
thinking of doing all fields in between two fields, which you can't do
yet.
Alan.
On Jan 12, 2011, at 3:18 PM, Alan Gates wrote:
There isn't a way to do that yet. See
https://issues.apache.org/jira/browse/PIG-1693
for ou
There isn't a way to do that yet. See https://issues.apache.org/jira/browse/PIG-1693
for our plans on adding it in the next release.
Alan.
On Jan 12, 2011, at 2:51 PM, Dexin Wang wrote:
Hi,
Hope there is some simple answer to this. I have bunch of rows, for
each
row, I want to add a colu
Foreach a generate function(thing), *; should do what yopu want. * just throws
on all the columns
Sent via BlackBerry
-Original Message-
From: Dexin Wang
Date: Wed, 12 Jan 2011 14:51:58
To:
Reply-To: user@pig.apache.org
Subject: wild card for all fields in a tuple
Hi,
Hope there is
Hi,
Hope there is some simple answer to this. I have bunch of rows, for each
row, I want to add a column which is derived from some existing columns. And
I have large number of columns in my input tuple so I don't want to repeat
the name using "AS" when I generate. Is there an easy way just to app
P.S. Thrift 0.2 and 0.5 are binary-compatible, so you can read messages
generated with 0.5 using files compiled with thrift 0.2, and vice versa. We
have some projects that use 0.5 and some that are still on 0.2, and all that
means is that you install both versions of the compilers on your dev box a
I am working on the pig 08 compatibility layer; it mostly works, fwiw.
Converting to Thrift 0.5 would be fairly straightforward; unfortunately the
signatures of Thrift messages changed so the code is not entirely backwards
compatible. I don't think the changes for what we do with Pig are material.
Thanks to Joe and Daniel, I was able to fix this issue.
It was a combination of ambiguity about file paths (which Joe's message
helped me confirm) and an error in my Java that wasn't causing an exception
and failing silently.
Thanks,
Geoff
On Wed, Jan 12, 2011 at 7:43 AM, Joe Crobak wrote:
> A
There's a fork of elephant-bird where pig-8 support is being worked on:
https://github.com/dvryaboy/elephant-bird/tree/pig-08
I haven't given it a shot yet.
-Original Message-
From: Evert Lammerts [mailto:evert.lamme...@sara.nl]
Sent: January 12, 2011 15:10
To: 'user@pig.apache.org'
Subje
Hello list,
I've installed the LZO codecs (https://github.com/kevinweil/hadoop-lzo) and
now I'm looking into using LZO in Pig. Elephant Bird
(https://github.com/kevinweil/elephant-bird) seems to provide some nice
prefab loaders, but it's requirements do not fit out Hadoop installation
(we're on CD
A = LOAD 'file://home/geoffeg/test.json' will try to load using a relative
path. Pig will understand file:/home/geoffeg/test.json or
file:///home/geoffeg/test.json to load the absolute path. Same goes for a
file in hdfs://
HTH,
Joe
On Sun, Jan 9, 2011 at 11:47 PM, Geoffrey Gallaway wrote:
> He
Hi Youngwoo,
Yes, I downloaded Pig Snapshot from Hudson. Is there some other Pig-0.9.0 that
comes bundled with Jython.jar? Please point me to it.
With the snapshot version, I tried your advice.
Putting jython.jar in $PIG_HOME/lib did not help. I'm getting the same error.
The Java command does
Hi Deepak,
Did you download pig distribution from Apache Hudson?
IIt seems that the snapshot build does not include jython.jar
Drop the jython.jar into $PIG_HOME/lib directory and then try it again.
Also you can specify classpath in java command line. E.g., java -cp
pig.jar:/path/jython.jar --em
Hi,
I am not able to import Pig.
The following is throwing up import errors
>>> from org.apache.pig.scripting import Pig
Traceback (most recent call last):
File "", line 1, in
ImportError: No module named apache
Any ideas? I checked my classpath, and things look alright.
-Original Messa
19 matches
Mail list logo