Re: is there a SSTableInput for Map/Reduce instead of ColumnFamily?

2013-09-06 Thread Jim Ancona
Unfortunately, Netflix doesn't seem to have released Aegisthus as open
source.

Jim


On Fri, Aug 30, 2013 at 1:44 PM, Jeremiah D Jordan 
jeremiah.jor...@gmail.com wrote:

 FYI:
 http://techblog.netflix.com/2012/02/aegisthus-bulk-data-pipeline-out-of.html

 -Jeremiah

 On Aug 30, 2013, at 9:21 AM, Hiller, Dean dean.hil...@nrel.gov wrote:

  Is there an SSTableInput for Map/Reduce instead of ColumnFamily (which
  uses Thrift)?
  
  We are not worried about repeated reads, since our jobs are idempotent,
  but we would rather have the speed of reading the SSTables directly
  (reading from a snapshot would be fine).
  
  (We would most likely run our M/R on 4 of our 12 nodes, since we have
  RF=3 right now.)
 
  Thanks,
  Dean




is there a SSTableInput for Map/Reduce instead of ColumnFamily?

2013-08-30 Thread Hiller, Dean
Is there an SSTableInput for Map/Reduce instead of ColumnFamily (which uses
Thrift)?

We are not worried about repeated reads, since our jobs are idempotent, but we
would rather have the speed of reading the SSTables directly (reading from a
snapshot would be fine).

(We would most likely run our M/R on 4 of our 12 nodes, since we have RF=3
right now.)

Thanks,
Dean
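
The 4-of-12 figure follows from RF=3 only if the four nodes are picked so that every token range has a replica on at least one of them. A minimal sketch of that check, assuming SimpleStrategy-style placement (each range lives on its owner plus the next RF-1 nodes on the ring), one token per node, and a single datacenter. The function names are hypothetical and are not part of Cassandra or Hadoop:

```python
def replicas(owner, num_nodes, rf):
    """Ring positions assumed to hold the range owned by `owner`:
    the owner itself plus the next rf-1 nodes, wrapping around."""
    return {(owner + k) % num_nodes for k in range(rf)}


def pick_every_nth(num_nodes, rf):
    """Choose every rf-th node on the ring as a job node."""
    return list(range(0, num_nodes, rf))


def covers_all_ranges(chosen, num_nodes, rf):
    """True if every range has at least one replica on a chosen node."""
    chosen = set(chosen)
    return all(replicas(owner, num_nodes, rf) & chosen
               for owner in range(num_nodes))


chosen = pick_every_nth(12, 3)
print(chosen, covers_all_ranges(chosen, 12, 3))
print([0, 1, 2, 3], covers_all_ranges([0, 1, 2, 3], 12, 3))
```

Under these assumptions, every third node sees all of the data, while four adjacent nodes do not. With vnodes or NetworkTopologyStrategy the placement differs, so the check would need the real ring layout.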


Re: is there a SSTableInput for Map/Reduce instead of ColumnFamily?

2013-08-30 Thread Jeremiah D Jordan
FYI: 
http://techblog.netflix.com/2012/02/aegisthus-bulk-data-pipeline-out-of.html

-Jeremiah

On Aug 30, 2013, at 9:21 AM, Hiller, Dean dean.hil...@nrel.gov wrote:

 Is there an SSTableInput for Map/Reduce instead of ColumnFamily (which uses
 Thrift)?
 
 We are not worried about repeated reads, since our jobs are idempotent, but
 we would rather have the speed of reading the SSTables directly (reading
 from a snapshot would be fine).
 
 (We would most likely run our M/R on 4 of our 12 nodes, since we have RF=3
 right now.)
 
 Thanks,
 Dean