Re: random partitioner and key scan

2012-07-21 Thread prasenjit mukherjee
On Sat, Jul 21, 2012 at 3:37 PM, Sylvain Lebresne wrote: >> I don't know much about Cassandra internals, but from a user point of >> view, a scan for a range of tokens is not a common use-case. > > All of boostrap/move/decommission/repair rely heavily on being able to > scan efficiently a range of

Re: random partitioner and key scan

2012-07-20 Thread prasenjit mukherjee
gt; would have to be scanned to find the ones whose token was in the required > token range. > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 20/07/2012, at 4:46 AM, prasenjit mukherjee wrote:

Re: random partitioner and key scan

2012-07-19 Thread prasenjit mukherjee
wrote: > Hi Prasenjit, > > I don't see the need to recalculate anything. One key has a one MD5 > hash, it doesn't change. Just use the hash to select a node, than just > the plain key. Can you elaborate on the redistribution please? > > Regards, > P. > > On

Re: random partitioner and key scan

2012-07-19 Thread prasenjit mukherjee
I agree. The probem could be while redistributing the tokens. In that case the hashes has to be recalculated on each fo the candidate node. -Thanks, Prasenjit On Thu, Jul 19, 2012 at 12:19 PM, Patrik Modesto wrote: > Hi, > > I know that RandomPartitioner does MD5 of a key and the MD5 is then >

Re: SSTable format

2012-07-15 Thread prasenjit mukherjee
Appreciate the insightful replies. Understood Sylvain's argument that having different partitioning locally and globally could create problem in data movement. Edward, for a given sstable in a node, why having lexicographically closer rows clumped together should matter ? Anyways the lookups for

Re: SSTable format

2012-07-13 Thread prasenjit mukherjee
> > It depends on what partitioner you use. You should be using the > RandomPartitioner, and if so, the rows are sorted by the hash of the row > key. there are partitioners that sort based on the raw key value but these > partitioners shouldn't be used as they have problems due to uneven > partitio

Re: How to come up with a predefined topology

2012-07-12 Thread prasenjit mukherjee
--- > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 12/07/2012, at 8:05 PM, prasenjit mukherjee wrote: > > Thanks. Some follow up questions : > > 1. How do the reads use strategy/snitch information ? I am assuming > the reads can go to any of t

Re: How to come up with a predefined topology

2012-07-12 Thread prasenjit mukherjee
ood reason not too > (and you probably don't). The way that replicas are chosen when multiple > racks are in play can be fairly confusing and lead to a data imbalance if > you don't catch it. > > > On Wed, Jul 11, 2012 at 10:53 PM, prasenjit mukherjee > wrote: >>

Re: How to come up with a predefined topology

2012-07-11 Thread prasenjit mukherjee
ault ordering of > same-node < same-rack < same-datacenter < different-datacenter. Each snitch > has methods to tell Cassandra which rack and DC a node is in, so it always > knows which node is closest. Used with the Bloom filters this can tell us > where the nearest

CompositeType support for keynames

2012-07-09 Thread prasenjit mukherjee
Any reason we dont have CompositeType data structure for key_validation_class ( ref: http://www.datastax.com/docs/0.8/configuration/storage_configuration#key-validation-class) ? I would like to create row_names in the form username:mmddhhmm ( e.g. joe:201206092312 ). I can still do that key_va

Re: Effect of rangequeries with RandomPartitioner

2012-07-09 Thread prasenjit mukherjee
Thanks for the response. Further questions inline.. On Mon, Jul 9, 2012 at 11:50 AM, samal wrote: >> 1. With RandomPartitioner, on a given node, are the keys sorted by >> their hash_values or original/unhashed keys ? > > hash value, 1. Based on the second answer in http://stackoverflow.com/que

Re: Effect of rangequeries with RandomPartitioner

2012-07-08 Thread prasenjit mukherjee
n Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 7/07/2012, at 11:52 PM, prasenjit mukherjee wrote: > > Wondering how a rangequery request is handled if RP is used. Will the > receiving node do a fan-out to all the nodes in the ring or

Re: Effect of RangeQuery with RandomPartitioner

2012-07-07 Thread prasenjit mukherjee
ge is distributed across the ring, so essentially either it send has to send the request to all nodes in the ring or just do a local processing. On Sat, Jul 7, 2012 at 7:47 PM, Edward Capriolo wrote: > On Sat, Jul 7, 2012 at 9:26 AM, prasenjit mukherjee > wrote: >> Wondering how a rang

Effect of RangeQuery with RandomPartitioner

2012-07-07 Thread prasenjit mukherjee
Wondering how a rangequery request is handled if RP is used. Will the receiving node do a fan-out to all the nodes in the ring or it will just execute the rangequery on its own local partition ? -Prasenjit

Effect of rangequeries with RandomPartitioner

2012-07-07 Thread prasenjit mukherjee
Wondering how a rangequery request is handled if RP is used. Will the receiving node do a fan-out to all the nodes in the ring or it will just execute the rangequery on its own local partition ? -- Sent from my mobile device

steps to add node in cassandra 1.1.2

2012-07-05 Thread prasenjit mukherjee
I am using cassandar version 1.1.2. I got the document to add node for version 0.7 : http://www.datastax.com/docs/0.7/getting_started/configuring Is it still valid ? Is there a documentation on this topic from cassandra twiki/docs ? -Prasenjit