Exception in DFSOutputStream.checkClosed: when 39 mapper tasks

2014-12-15 Thread Amit Mittal
Hi All, We are seeing an exception in DFSOutputStream.checkClosed(DFSOutputStream.java:1317) when running a MapReduce program with 39 input files. Could you please have a quick look and advise? The cluster has 22 data nodes plus dedicated name nodes and a job tracker. Hadoop 2.2, using new

Filter data set by applying many rules

2014-08-26 Thread Amit Mittal
Hi All, I have a data set in gzip-compressed CSV text files. Each record has around 100 fields. I need to filter the data by applying various checks such as "1. type of field", "2. nullable?", "3. min & max length", "4. value belongs to predefined list", "5. value subs
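The first four checks listed in this message can be sketched as a small table of per-field rules applied to each split record. This is only an illustrative sketch, not the poster's code: the names `RecordFilter`, `FieldRule`, and `passes` are hypothetical, "type of field" is reduced to an integer-or-not check, and a plain `split(",", -1)` is assumed to be enough (it does not handle quoted commas).

```java
import java.util.List;
import java.util.Set;

// Hypothetical sketch of a rule-driven record filter; all names are assumptions.
final class RecordFilter {

    /** One rule per checked field: type, nullability, length bounds, allowed values. */
    static final class FieldRule {
        final int index;           // position of the field in the record
        final boolean numeric;     // simplified "type of field" check: integer or free text
        final boolean nullable;    // whether an empty value is acceptable
        final int minLen;
        final int maxLen;
        final Set<String> allowed; // null means "any value"

        FieldRule(int index, boolean numeric, boolean nullable,
                  int minLen, int maxLen, Set<String> allowed) {
            this.index = index;
            this.numeric = numeric;
            this.nullable = nullable;
            this.minLen = minLen;
            this.maxLen = maxLen;
            this.allowed = allowed;
        }

        boolean accepts(String value) {
            // An empty field is decided by the nullable flag alone.
            if (value == null || value.isEmpty()) return nullable;
            if (value.length() < minLen || value.length() > maxLen) return false;
            if (numeric && !value.matches("-?\\d+")) return false;
            return allowed == null || allowed.contains(value);
        }
    }

    /** Returns true only if every rule accepts its field; a missing field fails. */
    static boolean passes(String[] fields, List<FieldRule> rules) {
        for (FieldRule rule : rules) {
            if (rule.index >= fields.length) return false;
            if (!rule.accepts(fields[rule.index])) return false;
        }
        return true;
    }
}
```

In a MapReduce job, a mapper could build the rule list once in its setup step and emit only the lines for which `passes(line.split(",", -1), rules)` is true; whether that fits the original data set is an assumption.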

Re: DistributedCache deprecated

2014-01-30 Thread Amit Mittal
if you understand > MRv1, I don't see any reasons why can't you just move around a bit in API > and find your relevant classes that you want to use by yourself. Again, as > I said, I don't have any valid statements of what I am saying, they are > just the results of my own exp

Re: DistributedCache deprecated

2014-01-30 Thread Amit Mittal
s still the same way.. Its just that > you use the new Job class to use distributed cache features. > > I am not sure what entries you are exactly pointing to. Am I missing > anything here ? > > > Regards > Prav > > > On Thu, Jan 30, 2014 at 6:12 AM, Amit Mittal wrote:

Re: DistributedCache deprecated

2014-01-29 Thread Amit Mittal
Hi Mike & Prav, Although I am new to Hadoop, I would like to add my 2 cents in case it helps. There are 2 ways to distribute shared data: one is the Job configuration and the other is DistributedCache. As job configuration is read by the JT, TT and child JVMs, and each time the configuratio

Re: Does all reducer take input from all NodeManager/Tasktrackers of Map tasks

2014-01-27 Thread Amit Mittal
PM, Vinod Kumar Vavilapalli < vino...@apache.org> wrote: > > > On Jan 27, 2014, at 4:17 AM, Amit Mittal wrote: > > *Question 1:* I believe the TaskTracker and then JobTracker/AppMaster > will receive the updates through call to > Task.statusUpdate(TaskUmbilicalProt

Does all reducer take input from all NodeManager/Tasktrackers of Map tasks

2014-01-27 Thread Amit Mittal
nly connects and takes output related to the keys partitioned for that particular reducer. Thanks Amit Mittal
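The idea that each reducer fetches only the map output for its own keys follows from how keys are assigned to partitions. A minimal sketch, assuming hash partitioning of the kind Hadoop's default `HashPartitioner` performs; the class and method names below are hypothetical and no Hadoop classes are used:

```java
// Hypothetical stand-alone sketch of hash partitioning; not a Hadoop class.
final class PartitionSketch {

    /**
     * Maps a key to a reducer index in [0, numReduceTasks).
     * Masking with Integer.MAX_VALUE clears the sign bit so the result is never negative.
     */
    static int partitionFor(Object key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }
}
```

Because every map task applies the same function, all values for a given key land in the same partition number on every mapper, and reducer `i` only needs to pull partition `i` from each map output.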

Processing steps of NameNode & Secondary NameNode

2014-01-27 Thread Amit Mittal
will be merged. Is this understanding correct? *Question 3:* Sorry, I did not get "For namenodes that *write to multiple directories*, the write must be flushed and synced to *every copy* before returning successfully.", especially the text in bold. Thanks Amit Mittal