Re: pig_cassandra problem - Incompatible field schema error

2011-10-17 Thread Pete Warden
I've dug deeper into this, since this got my script running but still left me at sea when dealing with the actual data. It's looking like there may be a mismatch between the schema that's being reported by CassandraStorage.java, and the data that's actually returned. Here's an example: rows =

Re: Storing pre-sorted data

2011-10-17 Thread Matthias Pfau
Thanks for that hint! However, it seems like soundex is a very language specific algorithm (US English). We have to get into this topic further... Kind regards Matthias On 10/13/2011 10:43 PM, Stephen Connolly wrote: Then just use a soundex function on the first word in the text... that will

Re: pig_cassandra problem - Incompatible field schema error

2011-10-17 Thread Pete Warden
JIRA filed, with a messy patch too: https://issues.apache.org/jira/browse/CASSANDRA-3371 cheers, Pete On Mon, Oct 17, 2011 at 2:27 AM, Pete Warden p...@jetpac.com wrote: I've dug deeper into this, since this got my script running but still left me at sea when dealing with the

Re: Cassandra server log continuously the same message

2011-10-17 Thread Thibaut Détrée
Ok thank you for your answer, however is it possible to use an xml log4j configuration file instead of the properties one (deprecated) ? Thanks, Thibaut Détrée Message du 17/10/11 15:22 De : Jonathan Ellis A : user@cassandra.apache.org, Thibaut Détrée Copie à : Objet : Re:

Re: Cassandra server log continuously the same message

2011-10-17 Thread Jonathan Ellis
Right now AbstractCassandraDaemon only uses PropertyConfigurator but we'd be happy to review a patch to add xml support. 2011/10/17 Thibaut Détrée thibaut.det...@orange.fr: Ok thank you for your answer, however is it possible to use an xml log4j configuration file instead of the properties one

Re: CassandraDaemon deactivate doesn't shutdown Cassandra

2011-10-17 Thread aaron morton
What measure are you using to say Cassandra does not shut down ? Can you get a thread dump to see what's still running ? Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 16/10/2011, at 8:50 AM, Shimi Kiviti wrote: The problem

Re: Storing pre-sorted data

2011-10-17 Thread aaron morton
Sort order is determined by the Comparator, which is an implementation of the o.a.c.db.marshal.AbstractTypeT class. If you wish to order column (names) in a row based on an opaque (to cassandra) byte value you can create your own implementation. You would then need to decrypt and compare

Re: One CF vs several CFs

2011-10-17 Thread aaron morton
It depends on what your workload is and how you want to read the data. If you want to get all the data for an article every time, and the number of comments is not huge go with option 1. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton

Re: Massive writes when only reading from Cassandra

2011-10-17 Thread Jeremy Hanna
Even after disabling hinted handoff and setting read_repair_chance to 0 on all our column families, we were still experiencing massive writes. Apparently the read_repair_chance is completely ignored at any CL higher than CL.ONE. So we were doing CL.QUORUM on reads and writes and seeing

Using elasticsearch on cassandra nodes

2011-10-17 Thread Anthony Ikeda
I've already posted to the elasticsearch groups and thought it prudent to also ask here. We are looking at using elastic search to index our data that we currently store to Cassandra. I was wondering if there are any concerns running elastic search on the same nodes that we use for Cassandra? We

Re: how to reduce disk read? (and bloom filter performance)

2011-10-17 Thread Mohit Anchlia
On Sun, Oct 16, 2011 at 2:20 AM, Radim Kolar h...@sendmail.cz wrote: Dne 10.10.2011 18:53, Mohit Anchlia napsal(a): Does it mean you are not updating a row or deleting them? yes. i have 350m rows and only about 100k of them are updated.  Can you look at JMX values of BloomFilter* ? i

Re: show schema fails

2011-10-17 Thread Radim Kolar
Dne 17.10.2011 22:06, aaron morton napsal(a): Hi there, If you start cassandra-cli with --debug it will output a stack trace if the error is client side. A long is exactly 8 bytes: 5 java.lang.RuntimeException: A long is exactly 8 bytes: 5 at

Re: how to reduce disk read? (and bloom filter performance)

2011-10-17 Thread Radim Kolar
Look in jconcole - org.apache.cassandra.db - ColumnFamilies bloom filter false ratio is on this server 0.0018 and 0,06% reads hits more than 1 sstable. From cassandra point of view, it looks good.