Re: pig 0.8 releases

2012-02-17 Thread Jeremy Hanna
Hmmm, okay. I'm not sure where those artifacts in the apache maven repo came from then - there's no 0.8.2 in there, but there is a 0.8.3 version in there. I just didn't know if people would get confused by that or if maven would silently pull down the "latest" 0.8 jars from apache's repo and g

Re: pig 0.8 releases

2012-02-17 Thread Dmitriy Ryaboy
Do you mean the snapshot of current 0.8 branch? Once 8.2 is released, the version in the branch is bumped up. There has been no 8.3 release. On Feb 17, 2012, at 12:06 PM, Jeremy Hanna wrote: > So the current releases of pig are 0.8.1 and 0.9.2. However, in the apache > mvn repo (and mirrored

Re: State of Pig / EMR & S3

2012-02-17 Thread Russell Jurney
Thanks, looks like my issues are Avro related. Russell Jurney twitter.com/rjurney russell.jur...@gmail.com datasyndrome.com On Feb 17, 2012, at 9:45 AM, Grig Gheorghiu wrote: > If you're talking about Elastic MapReduce, I am able to LOAD from and > STORE into S3. Something like this works for m

pig 0.8 releases

2012-02-17 Thread Jeremy Hanna
So the current releases of pig are 0.8.1 and 0.9.2. However, in the apache mvn repo (and mirrored repos) there is a pig 0.8.3. I find no release on it, no svn tag for it, and no user mailing list announcement for it. Where does 0.8.3 come from? it's in https://repository.apache.org/index.ht

Re: State of Pig / EMR & S3

2012-02-17 Thread Grig Gheorghiu
If you're talking about Elastic MapReduce, I am able to LOAD from and STORE into S3. Something like this works for me: concept = LOAD 's3://mybucket/test_files/geowordnet/concept.csv.gz' USING PigStorage(',') as ( con_id: int, name: chararray, gloss:chararray,

Re: how to issue command inside command.

2012-02-17 Thread Dmitriy Ryaboy
This is not currently possible. It's also ill-defined since 'result' is a relation, not a scalar. What you really want is to call an hbase lookup udf, to ensure you get a single record back, not to use the result of a table scan in another table scan. The way to get this done 'properly' is to

store the result with PigStorage

2012-02-17 Thread Haitao Yao
hi while using PigStorage , there will have _logs and _SUCCESS , which are useless . How can store the result without _logs and _SUCCESS? thanks.

how to issue command inside command.

2012-02-17 Thread chethan
Hi, I want to issue command inside command in PIG Script. command 1 : result = load 'hbase://sample_names' using org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf:fname','-loadKey true -gt 1 -lt 3') as (id:chararray); command 2 : user_links = load 'hbase://sample_names' using org.apache.pig.ba