Can't we just change the built-in CONCAT to accept additional fields? This would be totally backward compatible. I know it won't help now.

Alan.

On May 12, 2010, at 4:15 PM, Russell Jurney wrote:

The CONCAT in the oink project (LinkedIn's UDFs) does concatenation of
any number of string arguments:
http://github.com/criccomini/oink/blob/master/src/java/oink/udf/CONCAT.java

We're going to merge this with elephant-bird when we get a chance, and
this UDF could use a new name like MULTI_CONCAT, but it should work
for you.

Russell Jurney
russell.jur...@gmail.com
(404) 317-3620
http://twitter.com/rjurney
http://linkedin.com/in/russelljurney

On May 12, 2010, at 3:59 PM, Scott Wine <sco...@whitepages.com> wrote:

Hello,

I am trying to create a full address and full location field in Pig by combining multiple fields.

file = LOAD 'file.txt' USING PigStorage() AS
(house:chararray,
predir:chararray,
street:chararray,
streettype:chararray,
postdir:chararray,
city:chararray,
state:chararray,
zip:chararray);

I need an output that is full address and full location:

full_address == house + ' ' + predir + ' ' + street + ' ' + streettype + ' ' + postdir
full_location == city + ' ' + state + ' ' + zip

I can get two fields to merge using CONCAT, but I am not able to add more fields or the spaces in between.

Temp1 = FOREACH file GENERATE CONCAT(house, street);

Any ideas?

Thanks
Scott
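
One possible workaround that needs no extra UDF is to nest calls to the built-in CONCAT. This is a minimal sketch, assuming CONCAT accepts exactly two chararray arguments and reusing the relation and field names from the LOAD statement above. The alias "addresses" is made up for illustration.

```pig
-- Sketch only: assumes the two-argument built-in CONCAT.
-- Each inner CONCAT appends either a space or the next field.
addresses = FOREACH file GENERATE
    CONCAT(house, CONCAT(' ', CONCAT(predir, CONCAT(' ', CONCAT(street, CONCAT(' ', CONCAT(streettype, CONCAT(' ', postdir)))))))) AS full_address,
    CONCAT(city, CONCAT(' ', CONCAT(state, CONCAT(' ', zip)))) AS full_location;
```

Note that CONCAT should return null when either argument is null, so a row with an empty predir or postdir would likely produce a null full_address. A multi-argument UDF such as the oink CONCAT mentioned above is probably easier to read once more than a few fields are involved.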


