Re: elephantbird JsonLoader doesn't like gz?

2011-05-19 Thread Eric Lubow
Eric Lubow e: eric.lu...@gmail.com w: eric.lubow.org

Re: Loader UDF with variable schema

2011-03-13 Thread Eric Lubow
in both the udf and the pig script, and generate all final tuples in the udf, and then just use a split in pig. Thanks, Marko Eric Lubow e: eric.lu...@gmail.com w: eric.lubow.org

Re: Limting output

2011-03-09 Thread Eric Lubow
a regular expression I want to get 'x' number of urls matching the regex pattern. I have written a UDF to filter out urls based on regular expression. Is there a way in Pig script to limit the number of results to 'x' ? ( 'x' is some configurable value) Thanks, Souri Eric Lubow e: eric.lu

Re: [DISCUSSION] Pig.next

2011-03-03 Thread Eric Lubow
this conversation and see what others think before deciding if it is worth while to call a vote. Olga Eric Lubow e: eric.lu...@gmail.com w: eric.lubow.org

Re: Reading Gzip Files

2011-02-22 Thread Eric Lubow
I apologize for the double mailing: grunt Y = LOAD 'hdfs:///mnt/test.log.gz' AS (line:chararray); grunt foo = LIMIT Y 5; grunt dump foo 0\Mtest.log?]?o?H??}?) It didn't work out of HDFS. -e On Tue, Feb 22, 2011 at 08:18, Eric Lubow eric.lu...@gmail.com wrote: I'm not sure what you mean

Re: Reading Gzip Files

2011-02-22 Thread Eric Lubow
@thedatachef On Tue, 2011-02-22 at 08:21 -0500, Eric Lubow wrote: I apologize for the double mailing: grunt Y = LOAD 'hdfs:///mnt/test.log.gz' AS (line:chararray); grunt foo = LIMIT Y 5; grunt dump foo 0\Mtest.log?]?o?H??}?) It didn't work out of HDFS. -e On Tue, Feb 22

JSON Loading on EMR

2011-02-17 Thread Eric Lubow
help that would push me in the right direction would be greatly appreciated. -e -- Eric Lubow e: eric.lu...@gmail.com w: eric.lubow.org