If you want to take a look o.a.c.hadoop.ColumnFamilyRecordReader.getSplits() is 
the function that gets the splits. 


Cheers 
-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 3 Aug 2011, at 16:18, Patrik Modesto wrote:

> On Tue, Aug 2, 2011 at 23:10, Jeremiah Jordan
> <jeremiah.jor...@morningstar.com> wrote:
>> If you have RF=1, taking one node down is going to cause 25% of your
>> data to be unavailable.  If you want to tolerate a machines going down
>> you need to have at least RF=2, if you want to use quorum and have a
>> machine go down, you need at least RF=3.
> 
> I know I can have RF > 1 but I have limited resources and I don't care
> lossing 25% of the data. RF > 1 basicaly means if a node goes down I
> have the data elsewhere, but what I need is if node goes down just
> ignore its range. I can handle it in my applications using thrift, but
> the hadoop-mapreduce can't handle it. It just fails with "Exception in
> thread "main" java.io.IOException: Could not get input splits". Is
> there a way to say ignore this range to hadoop?
> 
> Regards,
> P.

Reply via email to