RE: Iterative MapReduce with PIG

2011-01-12 Thread deepak.n85
Hi Youngwoo, I appreciate your help. It worked! I re-installed everything freshly, and it started working. Dunno what was going wrong earlier. This feature is amazing. Regards, Deepak From: 김영우 [mailto:warwit...@gmail.com] Sent: Thursday, January 13, 2011 8:41 A

Re: calling pig from a web app

2011-01-12 Thread Alejandro Abdelnur
Soren, Adding to the 'oozie' alternative ... With Oozie you can can do something like: $ oozie pig -file SCRIPT The command line options are aligned with Pig ones (you can do a direct passthrough of options). You'll get a JOB ID (like it would be a PIG server) and later you can monitor the prog

Re: Iterative MapReduce with PIG

2011-01-12 Thread 김영우
Hi Deepak, I just build the pig snapshot from my PC and then I deploy a distribution to server. Also I drop required jars into $PIG_HOME/lib directory. After all, Seems it works fine. Hopes this helps. - Youngwoo *My env for Hadoop:* $ env | grep HADOOP HADOOP_HOME=/usr/lib/hadoop-0.20 *My

Re: how to use builtin String functions

2011-01-12 Thread Thejas M Nair
The functions need to be part of an expression in a relational operator, for example - f = foreach l generate REGEX_EXTRACT_ALL('192.168.1.5:8020', '(.*)\\:(.*)'); (the above example does not make much sense as none of the columns in input relation are being used.) -Thejas On 1/12/11 5:43 P

how to use builtin String functions

2011-01-12 Thread Dexin Wang
I see there are some builtin string functions, but I don't know how to use them. I got this error when I follow the examples: grunt> REGEX_EXTRACT_ALL('192.168.1.5:8020', '(.*)\:(.*)'); 2011-01-12 19:34:23,773 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1000: Error during parsing. Encoun

Re: wild card for all fields in a tuple

2011-01-12 Thread Dexin Wang
Yeah, that works great. Thanks Jonathan and Alan. I can see that all fields in between feature will be totally useful for some cases. On Wed, Jan 12, 2011 at 3:33 PM, Alan Gates wrote: > Jonathan is right, you can do all fields in a tuple with *. I was thinking > of doing all fields in between

Re: wild card for all fields in a tuple

2011-01-12 Thread Alan Gates
Jonathan is right, you can do all fields in a tuple with *. I was thinking of doing all fields in between two fields, which you can't do yet. Alan. On Jan 12, 2011, at 3:18 PM, Alan Gates wrote: There isn't a way to do that yet. See https://issues.apache.org/jira/browse/PIG-1693 for ou

Re: wild card for all fields in a tuple

2011-01-12 Thread Alan Gates
There isn't a way to do that yet. See https://issues.apache.org/jira/browse/PIG-1693 for our plans on adding it in the next release. Alan. On Jan 12, 2011, at 2:51 PM, Dexin Wang wrote: Hi, Hope there is some simple answer to this. I have bunch of rows, for each row, I want to add a colu

Re: wild card for all fields in a tuple

2011-01-12 Thread Jonathan Coveney
Foreach a generate function(thing), *; should do what yopu want. * just throws on all the columns Sent via BlackBerry -Original Message- From: Dexin Wang Date: Wed, 12 Jan 2011 14:51:58 To: Reply-To: user@pig.apache.org Subject: wild card for all fields in a tuple Hi, Hope there is

wild card for all fields in a tuple

2011-01-12 Thread Dexin Wang
Hi, Hope there is some simple answer to this. I have bunch of rows, for each row, I want to add a column which is derived from some existing columns. And I have large number of columns in my input tuple so I don't want to repeat the name using "AS" when I generate. Is there an easy way just to app

Re: LZO & Pig (Elephantbird?)

2011-01-12 Thread Dmitriy Ryaboy
P.S. Thrift 0.2 and 0.5 are binary-compatible, so you can read messages generated with 0.5 using files compiled with thrift 0.2, and vice versa. We have some projects that use 0.5 and some that are still on 0.2, and all that means is that you install both versions of the compilers on your dev box a

Re: LZO & Pig (Elephantbird?)

2011-01-12 Thread Dmitriy Ryaboy
I am working on the pig 08 compatibility layer; it mostly works, fwiw. Converting to Thrift 0.5 would be fairly straightforward; unfortunately the signatures of Thrift messages changed so the code is not entirely backwards compatible. I don't think the changes for what we do with Pig are material.

Re: Pig error: Unable to create input splits

2011-01-12 Thread Geoffrey Gallaway
Thanks to Joe and Daniel, I was able to fix this issue. It was a combination of ambiguity about file paths (which Joe's message helped me confirm) and an error in my Java that wasn't causing an exception and failing silently. Thanks, Geoff On Wed, Jan 12, 2011 at 7:43 AM, Joe Crobak wrote: > A

RE: LZO & Pig (Elephantbird?)

2011-01-12 Thread Tyler Coffin
There's a fork of elephant-bird where pig-8 support is being worked on: https://github.com/dvryaboy/elephant-bird/tree/pig-08 I haven't given it a shot yet. -Original Message- From: Evert Lammerts [mailto:evert.lamme...@sara.nl] Sent: January 12, 2011 15:10 To: 'user@pig.apache.org' Subje

LZO & Pig (Elephantbird?)

2011-01-12 Thread Evert Lammerts
Hello list, I've installed the LZO codecs (https://github.com/kevinweil/hadoop-lzo) and now I'm looking into using LZO in Pig. Elephant Bird (https://github.com/kevinweil/elephant-bird) seems to provide some nice prefab loaders, but it's requirements do not fit out Hadoop installation (we're on CD

Re: Pig error: Unable to create input splits

2011-01-12 Thread Joe Crobak
A = LOAD 'file://home/geoffeg/test.json' will try to load using a relative path. Pig will understand file:/home/geoffeg/test.json or file:///home/geoffeg/test.json to load the absolute path. Same goes for a file in hdfs:// HTH, Joe On Sun, Jan 9, 2011 at 11:47 PM, Geoffrey Gallaway wrote: > He

RE: Iterative MapReduce with PIG

2011-01-12 Thread deepak.n85
Hi Youngwoo, Yes, I downloaded Pig Snapshot from Hudson. Is there some other Pig-0.9.0 that comes bundled with Jython.jar? Please point me to it. With the snapshot version, I tried your advice. Putting jython.jar in $PIG_HOME/lib did not help. I'm getting the same error. The Java command does

Re: Iterative MapReduce with PIG

2011-01-12 Thread 김영우
Hi Deepak, Did you download pig distribution from Apache Hudson? IIt seems that the snapshot build does not include jython.jar Drop the jython.jar into $PIG_HOME/lib directory and then try it again. Also you can specify classpath in java command line. E.g., java -cp pig.jar:/path/jython.jar --em

RE: Iterative MapReduce with PIG

2011-01-12 Thread deepak.n85
Hi, I am not able to import Pig. The following is throwing up import errors >>> from org.apache.pig.scripting import Pig Traceback (most recent call last): File "", line 1, in ImportError: No module named apache Any ideas? I checked my classpath, and things look alright. -Original Messa