My best advice is to insert a bit of data into the tree and then take a
heap dump to measure the extra overhead. From our testing, it is
unfortunately more than you would like.
On Tue, Oct 18, 2011 at 8:14 PM, Todd Nine wrote:
> Hi guys,
> We've just built a K tree implementation in
Hi guys,
We've just built a K tree implementation in Cassandra. We're going
for relatively "wide" nodes in our tree to minimize our tree depth and
reduce our search times. Most of the links between parent/child
nodes are longs. We're ready to start tuning the size of K so that our
most acce
On Tue, Oct 18, 2011 at 4:30 PM, Jonathan Ellis wrote:
> On Tue, Oct 18, 2011 at 4:10 PM, Tyler Hobbs wrote:
> > * You'll get range ghosts
> > (http://wiki.apache.org/cassandra/FAQ#range_ghosts) with column_count=0.
> > You can avoid them if you set column_count=1.
>
> What heuristic do you us
jstack shows that all the mutation stages are blocked on a synchronization:
"MutationStage:1" prio=10 tid=0x2aaabc01b000 nid=0x27b8 waiting
for monitor entry [0x42048000]
java.lang.Thread.State: BLOCKED (on object monitor)
at org.apache.cassandra.db.Table.apply(Table.java:4
I updated to 71ba6998b504966690e099c03e04f9876dc1060e, the HEAD of the
1.0.0 branch on GitHub.
The new code seems to be very slow in commitlog replay: it takes
20 minutes to recover about 500MB of commit logs, while
previously this took roughly 1-2 minutes.
Is the official 1.0.0 release built on
4f39d
David,
thanks for sharing these ideas. We basically implemented it by using
longs and dividing the long-namespace into multiple parts on each
insert. As you already described, the biggest problem with this approach
is that we are not able to simply invoke get(x) because the indexes of
the lis
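
A minimal sketch of the "divide the long namespace on each insert" idea
described above, assuming each list entry is stored under a signed 64-bit
position key and a new entry takes the midpoint between its two
neighbours. The names position_between and insert_at are hypothetical and
are not taken from the thread.

```python
# Bounds of a signed 64-bit long; used only as outer sentinels.
LONG_MIN = -(2 ** 63)
LONG_MAX = 2 ** 63 - 1


def position_between(left, right):
    """Return a position strictly between left and right."""
    if right - left < 2:
        # The gap is exhausted; the neighbours would have to be renumbered.
        raise ValueError("no free position between neighbours")
    return left + (right - left) // 2


def insert_at(positions, offset):
    """Insert a new position key so that it ends up at list index `offset`.

    `positions` is a sorted list of position keys already in use.
    """
    left = positions[offset - 1] if offset > 0 else LONG_MIN
    right = positions[offset] if offset < len(positions) else LONG_MAX
    new_position = position_between(left, right)
    positions.insert(offset, new_position)
    return new_position


positions = []
insert_at(positions, 0)  # first entry, placed mid-namespace
insert_at(positions, 0)  # new head of the list
insert_at(positions, 1)  # lands between the two existing entries
print(positions)
```

The stored key is a position in the long namespace, not the list index,
which is why a plain get(x) by index does not work with this layout.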
Hi David,
Encrypting and decrypting data in Cassandra is not an option for us.
However, I read your elaboration on best practices for adding a custom
feature to cassandra with a lot of interest. Thank you!
Please see below for the answers to your questions.
Kind regards
Matthias
On 10/18/2011
On Tue, Oct 18, 2011 at 4:10 PM, Tyler Hobbs wrote:
> * You'll get range ghosts
> (http://wiki.apache.org/cassandra/FAQ#range_ghosts) with column_count=0.
> You can avoid them if you set column_count=1.
What heuristic do you use for skipping empty rows?
--
Jonathan Ellis
Project Chair, Apache
At the moment we are only prototyping so we haven't bridged the two at all.
We had planned on creating a write-through operation that allowed us to
filter the calls (AOP perhaps?) to manage the indexing as we stored it in
Cassandra.
We are still trying to work out if we go the elastic search route
A couple of notes:
* You need to use the master branch (not the most recent release) if you
want to use get_range() with column_count=0 and get any results.
* You'll get range ghosts (
http://wiki.apache.org/cassandra/FAQ#range_ghosts) with column_count=0. You
can avoid them if you set column_
There is no first-class support for just getting row keys; you will always
want to get a column.
You can fake it by requesting zero columns.
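
A minimal sketch of pulling just the keys out of a range scan. It assumes
the scan yields (key, columns) pairs, as pycassa's get_range() is
understood to do, and that one column per row was requested so that
deleted rows ("range ghosts") come back with no columns. The helper
row_keys and the sample data are hypothetical.

```python
def row_keys(rows):
    """Yield the key of every row that came back with at least one column.

    `rows` is any iterable of (key, columns) pairs.
    """
    for key, columns in rows:
        if columns:  # an empty mapping is treated as a range ghost
            yield key


# Stand-in for the result of a range scan limited to one column per row.
sample = [
    ("alice", {"name": "A"}),
    ("ghost", {}),
    ("bob", {"name": "B"}),
]
print(list(row_keys(sample)))
```

Against a live cluster the same helper would be fed the range scan
generator instead of the sample list.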
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 19/10/2011, at 3:53 AM, David Fischer(Gtalk) wrote:
At that scale of data, and given that it's a batch job, I would go with the
bulk loading tool.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 19/10/2011, at 3:32 AM, Mike Rapuano wrote:
> We are not currently live but testing with Cas
Thanks all.
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 19/10/2011, at 2:45 AM, Jonathan Ellis wrote:
> Multiple concurrent schema modifications are not supported in 1.0.
> https://issues.apache.org/jira/browse/CASSANDRA-1391 is open to add
>
Looks like the column meta for the CF specifies a column name that is not a
valid Long. I seem to remember a bug like this some time in the past.
You should be able to work around this by running ALTER COLUMN FAMILY and only
specifying valid column meta data.
Cheers
-
Aaron
Is it possible to do this?
I have a counter CF, and over time, some rows are no longer useful, so
I want to delete them in batches, like once a week.
(Doing the delete by specifying all the row keys would probably be too slow.)
I looked over the .thrift definition of Cassandra interface, it seems
Hello all.
New to Cassandra, and I am using pycassa to access data. I was
wondering if someone knows how to pull just the row keys instead of using
get_range? This question may be more about pycassa, but I'm not sure. If
someone has a Java snippet to do it, that would be OK also.
Thanks
We are not currently live but testing with Cassandra. I'm looking for
recommendations on the most efficient way to load text files over 25 GB in
size into Cassandra (version 0.8.6). Our application may require us to load
2-3 text files of 25-40 GB each a few times a week to our 3 node
cluster.
Multiple concurrent schema modifications are not supported in 1.0.
https://issues.apache.org/jira/browse/CASSANDRA-1391 is open to add
this in 1.1.
On Tue, Oct 18, 2011 at 8:27 AM, Dikang Gu wrote:
> Congrats!
> In 0.8, the schema disagreement occurs sometimes when I create
> keyspaces/column fam
The Mojo team is pleased to announce the release of Mojo's Cassandra
Maven Plugin version 1.0.0-1.
Mojo's Cassandra Plugin is used when you want to install and control a
test instance of Apache Cassandra from within your Apache Maven build.
The Cassandra Plugin has the following goals.
* cassan
Congrats!
In 0.8, schema disagreement sometimes occurs when I create
keyspaces/column families dynamically. Is this also fixed?
Regards.
On Tue, Oct 18, 2011 at 9:20 PM, Jonathan Ellis wrote:
> Short version: yes, 1.0 addresses the known repair problems.
>
> On Tue, Oct 18, 2011 at 7:29 AM
Short version: yes, 1.0 addresses the known repair problems.
On Tue, Oct 18, 2011 at 7:29 AM, Maxim Potekhin wrote:
> There was a problem in early 0.8 where the repair was taking
> forever -- am I right to assume this was fixed in 1.0?
>
> Many thanks to you guys,
>
> Maxim
>
>
> On 10/18/2011 2:
Anthony,
We've been looking at elastic search as well. Presently we have SOLR in
place, but it is cumbersome dealing with SOLR schemas when indexing
information out of Cassandra (since you can't anticipate all the columns
ahead of time).
What are you using as your bridge between Cassandra and ES
1.0 FTW!
Nice work.
-brian
On Tue, Oct 18, 2011 at 8:32 AM, Viktor Jevdokimov <
viktor.jevdoki...@adform.com> wrote:
> Congrats!!!
>
>
> Best regards/ Pagarbiai
>
> Viktor Jevdokimov
> Senior Developer
>
> Email: viktor.jevdoki...@adform.com
> Phone: +370 5 212 3063
> Fax: +370 5 261 0453
>
> J.
Congrats!!!
Best regards/ Pagarbiai
Viktor Jevdokimov
Senior Developer
Email: viktor.jevdoki...@adform.com
Phone: +370 5 212 3063
Fax: +370 5 261 0453
J. Jasinskio 16C,
LT-01112 Vilnius,
Lithuania
Disclaimer: The information contained in this message and attachments is
intended solely for
There was a problem in early 0.8 where the repair was taking
forever -- am I right to assume this was fixed in 1.0?
Many thanks to you guys,
Maxim
On 10/18/2011 2:25 PM, Thibaut Britz wrote:
Great news!
Especially the improved read performance and compactions are great!
Thanks,
Thibaut
On
Great news!
Especially the improved read performance and compactions are great!
Thanks,
Thibaut
On Tue, Oct 18, 2011 at 2:11 PM, Jonathan Ellis wrote:
> Thanks for the help, everyone! This is a great milestone for Cassandra.
>
> On Tue, Oct 18, 2011 at 7:01 AM, Sylvain Lebresne
> wrote:
>>
Thanks for the help, everyone! This is a great milestone for Cassandra.
On Tue, Oct 18, 2011 at 7:01 AM, Sylvain Lebresne wrote:
> The Cassandra team is very pleased to announce the release of Apache Cassandra
> version 1.0.0. Cassandra 1.0.0 is a new major release that builds upon the
> awesomen
The Cassandra team is very pleased to announce the release of Apache Cassandra
version 1.0.0. Cassandra 1.0.0 is a new major release that builds upon the
awesomeness of previous versions and adds numerous improvements[1,2], amongst
which:
- Compression of on-disk data files (SSTables), with checks
Nobody objects, so I will publish the artifacts as Cassandra 1.0.0 is being
released
On 12 October 2011 23:44, Stephen Connolly
wrote:
> Hi,
>
> I'd like to release version 1.0.0-1 of Mojo's Cassandra Maven Plugin
> to sync up with the pending 1.0.0 release of Apache Cassandra.
>
> This version n
On Tue, Oct 18, 2011 at 12:14 AM, Matthias Pfau wrote:
> we want to sort completely on the client-side (where the data is
> encrypted). But that requires an "insert at offset X" operation. We would
> always use CL QUORUM and client side synchronisation.
>
You can do "insert at offset X"... just
Aaron,
we want to sort completely on the client-side (where the data is
encrypted). But that requires an "insert at offset X" operation. We
would always use CL QUORUM and client side synchronisation.
However, it seems not to be a good idea to add such a feature to
Cassandra as everyone usi
31 matches