Re: 1 job with Input data from 2 HDFS?

2015-02-27 Thread xeon Mailinglist
Hi, I don't understand this part of your answer: read the other as a side-input directly by creating a client.. If I consider both inputs through the InputFormat, this means that a job will contain both input path in its configuration, and this is enough to work. So, what is the other? Is is the

1 job with Input data from 2 HDFS?

2015-02-27 Thread xeonmailinglist
Hi, I would like to have a mapreduce job that reads input data from 2 HDFS. Is this possible? Thanks,

Re: 1 job with Input data from 2 HDFS?

2015-02-27 Thread Vinod Kumar Vavilapalli
It is entirely possible. You should treat one of them as the primary inputs through the InputFormat/Mapper and read the other as a side-input directly by creating a client. +Vinod On Feb 27, 2015, at 7:22 AM, xeonmailinglist xeonmailingl...@gmail.com wrote: Hi, I would like to have a