map/reduce on Cassandra

2010-01-25 Thread Phillip Michalak
Multiple people have expressed an interest in 'hadoop integration' and 'map/reduce functionality' within Cassandra. I'd like to get a feel for what that means to different people. As a starting point for discussion, Jeff Hodges undertook a prototype effort last summer which was the subject

'large' node configuration question

2010-01-20 Thread Phillip Michalak
Does anyone have a recommendation for configuring cassandra on a cluster with 'large' nodes? i.e. multiple nodes, each >64G RAM, 8 cores, >4 disks Would it be better to run multiple cassandra processes per node (configured on separate IP addresses, separate data directories, etc) or just a

Re: Cassandra users survey

2009-11-21 Thread Phillip Michalak
We're using Cassandra in development to store custom index information on large document sets. Also considered HBase and Voldemort. Cassandra's data model and performance tradeoffs seemed to best fit our needs. Features that we're looking forward to seeing: * map/reduce integration * built-

Re: quorum read timeout

2009-08-20 Thread Phillip Michalak
, but have left it open for someone with more experience to close it out. Cheers, Phil On Aug 19, 2009, at 5:14 PM, Phillip Michalak wrote: Sure thing. Filed as https://issues.apache.org/jira/browse/CASSANDRA-381 Thanks, Phil On Aug 19, 2009, at 4:54 PM, Jonathan Ellis wrote: Looks lik

Re: quorum read timeout

2009-08-20 Thread Phillip Michalak
Jun, That was my suspicion too. I'm trying to figure out whether anything in my configuration might be causing that. Actually, seeing two or more storage-conf.xml files from a working configuration would be useful for identifying or ruling out configuration issues. Can anyone point me to

Re: quorum read timeout

2009-08-19 Thread Phillip Michalak
Sure thing. Filed as https://issues.apache.org/jira/browse/CASSANDRA-381 Thanks, Phil On Aug 19, 2009, at 4:54 PM, Jonathan Ellis wrote: Looks like a bug in TcpConnectionManager. Can you file a ticket? thanks, -Jonathan On Wed, Aug 19, 2009 at 2:49 PM, Phillip Michalak wrote: It&#

Re: quorum read timeout

2009-08-19 Thread Phillip Michalak
It's cassandra-0.4-beta1. Thanks! Phil On Aug 19, 2009, at 4:43 PM, Jonathan Ellis wrote: Is this 0.3 or 0.4/trunk? On Wed, Aug 19, 2009 at 2:36 PM, Phillip Michalak wrote: I'm running three Cassandra nodes in virtual machines. During a 'get' operation from Cassandra-re

quorum read timeout

2009-08-19 Thread Phillip Michalak
I'm running three Cassandra nodes in virtual machines. During a 'get' operation from Cassandra-remote directed at one of these nodes, I'm receiving the following output vad...@vadmin:~/cassandra$ interface/gen-py/cassandra/Cassandra-remote -h 192.168.133.130:9160 get 'MockElementLibrary'