Re: Pig storage and load functions and Cache

2012-10-07 Thread Dmitriy Ryaboy
Pig has a built-in multi-query execution optimization. If your script computes multiple relations that share parent relations, those parent relations are computed only once. You don't have to do anything to make that happen. If you prefer to handle your own caching, you would have to handl
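
To illustrate the shared-parent case described in the message above, here is a minimal sketch. The paths, relation names, and schema are made up for illustration, and the input is assumed to be tab-separated since no loader is specified. Both STORE statements depend on the same parent relation, so when the script runs as a whole, multi-query execution should read and parse the input once rather than once per output.

```pig
-- Hypothetical input path and schema, for illustration only.
-- No USING clause, so the default loader (PigStorage, tab-delimited) is used.
raw = LOAD 'input/events' AS (user:chararray, action:chararray, ts:long);

-- Two child relations that share the same parent, 'raw'.
clicks = FILTER raw BY action == 'click';
views  = FILTER raw BY action == 'view';

-- Both stores are planned together when the script is run in batch mode,
-- so 'raw' does not need to be loaded separately for each output.
STORE clicks INTO 'output/clicks';
STORE views  INTO 'output/views';
```

Note this applies when the script is run in batch mode. Entering the statements one at a time in the grunt shell, or using DUMP between them, triggers separate executions.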

Re: A question of pig default load function

2012-10-07 Thread Prashant Kommireddi
The default loader is PigStorage, which uses tab ('\t') as its delimiter. In your second example, you need to explicitly specify a comma as the delimiter (load 'foo' using PigStorage(',') as ...). On Oct 7, 2012, at 12:00 PM, yonghu wrote: > Dear all, > > When I load the data stored in txt file i
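
For the delimiter point in the reply above, a small sketch of the two cases. The file name 'foo' comes from the reply; the schema and the assumption that 'foo' holds comma-separated lines such as `alice,30` are made up for illustration.

```pig
-- Default loader: PigStorage splits each line on tab.
-- A comma-separated line contains no tabs, so it is treated as a single field:
-- the whole line ends up in 'name' and 'age' comes back null.
A = LOAD 'foo' AS (name:chararray, age:int);

-- Explicit comma delimiter: each line is split into the two declared fields.
B = LOAD 'foo' USING PigStorage(',') AS (name:chararray, age:int);

DUMP B;
```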