Re: Join Multiple Relations by Different Fields

2012-12-14 Thread Jonathan Coveney
It's a little confusing, but the following is a tuple:

(key1,foo,)

It's just not the tuple you want. It is a tuple where the first field is "key1,foo" and the second field is null. The printing makes this ambiguous.

2012/12/14 Thomas Bach
> (key1,foo,)

Re: Join Multiple Relations by Different Fields

2012-12-14 Thread Thomas Bach
Hi all,

I got a hint via StackOverflow[1]: the problem was the missing delimiter definition.

On Fri, Dec 14, 2012 at 11:11:43AM +0100, Thomas Bach wrote:
> grunt> data1 = load 'data1' as (key: chararray, val: chararray);
> grunt> data2 = load 'data2' as (key: chararray, val: chararray);
> grunt
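
For readers following along, here is a minimal sketch of what a delimiter definition could look like for the first two loads. It assumes the files are comma-separated, as in the sample data from the original post, and that the built-in PigStorage loader is used; the exact statements from the StackOverflow hint are not reproduced here. Without a `using` clause, PigStorage splits on tabs, so a line such as key1,foo ends up entirely in the first field and the second field is null.

```pig
-- Sketch only: assumes comma-separated input files.
-- PigStorage(',') tells the loader to split each line on commas
-- instead of the default tab character.
data1 = load 'data1' using PigStorage(',') as (key: chararray, val: chararray);
data2 = load 'data2' using PigStorage(',') as (key: chararray, val: chararray);

-- Print the relation to check that key and val are now separate fields.
dump data1;
```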

Join Multiple Relations by Different Fields

2012-12-14 Thread Thomas Bach
Hi,

Say I have three files `data1`, `data2` and `assocs`:

$ cat data1
key1,foo
key2,bar
$ cat data2
key3,braz
key4,froz
$ cat assoc
key1,key3
key2,key4

I load these files via

$ pig -b -p debug=WARN -x local
Warning: $HADOOP_HOME is deprecated.
Apache Pig version 0.10.0 (r1328203) compiled Apr