Hi,
Do we have a patch to support splitting with the bzip2 format for the current
stable version 0.20.2 ? Please refer to JIRA below :
https://issues.apache.org/jira/browse/HADOOP-4012
Thanks,
Deepika
Hi Koji,
Thank you for your reply.
I'll try what you wrote and see if it works as expected.
By the way, what does the `client-side config' mean?
dfs.replication and dfs.block.size are written in conf/hdfs-site.xml.
Where should I put them into?
On Tue, May 18, 2010 at 3:01 AM, Koji Noguchi wro
I don't think that the workcount example uses FileSplit class. Only the
MultithreadedMapper class uses FileSplit and I can't find an example where
it's invoked.
Where is the setup() method?
On Tue, May 18, 2010 at 6:50 PM, Wilkes, Chris wrote:
> In your setup() look at context.getInputSplit()
In your setup() look at context.getInputSplit(), this will be a
FileSplit in your case. From there you can do a getPath() to see the
both the directory structure and the split value.
On May 18, 2010, at 10:01 AM, psdc1978 wrote:
Hi,
I'm study the MapReduce code, and I've the following qu
Hi,
I'm study the MapReduce code, and I've the following questions:
1 - I'm running the wordcount example. I've 3 txt files as input. Each txt
file is about 120Mb.
During the execution of the map tasks, a number of map tasks will read the
txt files. Each file is divided in split files. I would l
Thanks for arranging that.
James
On Tue, May 18, 2010 at 4:07 AM, Amareshwari Sri Ramadasu <
amar...@yahoo-inc.com> wrote:
> The issue is being tracked at
> https://issues.apache.org/jira/browse/MAPREDUCE-1541.
>
> Thanks
> Amareshwari
>
>
>
> On 5/18/10 7:14 AM, "Hemanth Yamijala" wrote:
>
>