Re: Using Distributed Cache in PIG

2012-08-13 Thread Dmitriy Ryaboy
You are talking about changing the way hadoop works; something like this would be transparent to Pig. Note that Hadoop Distributed Cache != "distributed memory cache". I suppose you could replace the value of fs.file.impl from org.apache.hadoop.fs.LocalFileSystem to something else.. might be qui

Using Distributed Cache in PIG

2012-08-13 Thread kapil bhosale
Hello Can we use Distributed Cache to store intermediate results after the Map Phase so that these can be used in Reduce phase from cache. So as to improve performance of Map-Reduce Job. I found a Paper regarding usage of Cache in Map-Reduce, http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=5