Fwd: First International Mapreduce Workshop 2010: Paper submission deadline July 15 , 2010

2010-06-18 Thread Alan Gates
Begin forwarded message: From: Milind A Bhandarkar Date: May 31, 2010 9:16:38 PM PDT To: "common-u...@hadoop.apache.org" , "mapreduce-u...@hadoop.apache.org" u...@hadoop.apache.org>, "gene...@hadoop.apache.org" > Subject: First International Mapreduce Workshop 2010: Paper submission deadl

Re: Pig loader 0.6 to 0.7 migration guide

2010-06-18 Thread Alan Gates
I've created https://issues.apache.org/jira/browse/PIG-1459 to capture the need for a standard serialization method. Regarding required field list, it is the last option. I believe the name was included since some loaders may think in terms of names instead of positions. I created https:/

Re: simple way to REPLACE on various columns

2010-06-18 Thread hc busy
yeah, that'd be really cool. The other way that we can say this, (to make map reduce interface available in pig), is to allow FOREACH to be nested: TRIMED_TABLE = FOREACH TABLE { stripped = FOREACH TABLE.SOME_BAG GENERATE String.Trim(value); GENERATE k1,k2,k3, stripped; } On Wed, Ju

Re: a way to mash aliases together

2010-06-18 Thread Andrey Stepachev
Another way to make tuples from other tuples or fields: use special UDF https://issues.apache.org/jira/browse/PIG-1385 It depends mostly on what you need. 2010/6/18 Corbin Hoenes > Need to have a bunch of non related aliases into a single alias (so I can > pass this alias into my UDF). Is it

Re: a way to mash aliases together

2010-06-18 Thread Dmitriy Ryaboy
Tuples can contain tuples. The arguments to a udf get wrapped in a tuple. So, MyUDF('foo', 'bar', 24) will invoke MyUDF.exec(Tuple input), and this input tuple will have 3 fields -- String foo, String bar, and int 24. If you call a udf on a list of tuples -- MyUDF(tuple1, tuple2) -- this will res

a way to mash aliases together

2010-06-18 Thread Corbin Hoenes
Need to have a bunch of non related aliases into a single alias (so I can pass this alias into my UDF). Is it possible to do this? Or is it possible to pass a number of Tuple objects into an EvalFunc? exec(Tuple input1, Tuple input2, ...)