Re: Cassandra not suitable?

2011-12-07 Thread Jake Luciani
Where do you see the timeout exceptions? in the mappers? How many mappers reducers slots are you using? What does your disk setup look like? do you have HDFS on same disk as cassandra data dir? -Jake On Tue, Dec 6, 2011 at 4:50 AM, Patrik Modesto patrik.mode...@gmail.comwrote: Hi, I'm

Cassandra as a LinkedIn Skill

2011-11-14 Thread Jake Luciani
Hi, Sorry for the intrusion. I was speaking to some of the LinkedIn engineers at ApacheCon last week about to see how to get Cassandra into the linkedin skills page [1]. They claim if more people add Cassandra as a skill in their profile then it will show up. So my request is if you use

Re: Second Cassandra users survey

2011-11-14 Thread Jake Luciani
Re Simpler elasticity: Latest opscenter will now rebalance cluster optimally http://www.datastax.com/dev/blog/whats-new-in-opscenter-1-3 /plug -Jake On Mon, Nov 14, 2011 at 7:27 PM, Chris Burroughs chris.burrou...@gmail.comwrote: - It would be super cool if all of that counter work made it

Re: Second Cassandra users survey

2011-11-09 Thread Jake Luciani
Hi Todd, Entity Groups : https://issues.apache.org/jira/browse/CASSANDRA-1684 -Jake On Wed, Nov 9, 2011 at 6:44 AM, Todd Burruss bburr...@expedia.com wrote: I believe I heard someone talk at Cassandra SF conference about creating a partitioner that was a derivation of RandomPartitioner. It

Re: Second Cassandra users survey

2011-11-09 Thread Jake Luciani
at the conference that had already implemented what I mentioned. It didn't offer any atomicity, just co-locating a family of data on the same node. From: Jake Luciani jak...@gmail.com Reply-To: user@cassandra.apache.org user@cassandra.apache.org Date: Wed, 9 Nov 2011 02:53:20 -0800 To: user

Re: Secondary index issue, unable to query for records that should be there

2011-11-08 Thread Jake Luciani
Hi Nate, Could you try running it with debug enabled on the logs? it will give more insite into what's going on. -Jake On Tue, Nov 8, 2011 at 3:45 PM, Nate Sammons nsamm...@ften.com wrote: This is against a single server, not a cluster. Replication factor for the keyspace is set to 1, CL

Re: Reminder: Cassandra Meetup, Thursday Nov. 10th in Vancouver

2011-11-07 Thread Jake Luciani
I'll be there! On Mon, Nov 7, 2011 at 5:23 PM, Eric Evans eev...@acunu.com wrote: Just a reminder; If you're planning to be at ApacheCon, or are otherwise able to be in Vancouver on the 10th, we're having a Cassandra Meetup. There is no cost to attend (you don't even need to be registered

Re: Counter Experience (Performance)?

2011-10-27 Thread Jake Luciani
What's your bottleneck? http://spyced.blogspot.com/2010/01/linux-performance-basics.html On Thu, Oct 27, 2011 at 9:37 AM, Joe Stein crypt...@gmail.com wrote: Hey folks, I am interested in what others have seen in regards to their experience in the amount of depth and width (CF, Rows Columns)

Re: 2 node cluster, 1 node down, overall failure

2011-10-27 Thread Jake Luciani
You are unable to connect? or you are getting an UnavailableException? On Thu, Oct 27, 2011 at 11:14 AM, RobinUs2 ro...@us2.nl wrote: I currently run a 2-node cluster with version cassandra 1.0 (stable). With replication factor 2 on the keyspace which I'm testing. When I shutdown node B,

Re: 2 node cluster, 1 node down, overall failure

2011-10-27 Thread Jake Luciani
What consistency level are you using? With RF=2 your only option is CL.ONE when a node is down. On Thu, Oct 27, 2011 at 11:47 AM, RobinUs2 ro...@us2.nl wrote: The error I currently see when I take down node B: Error performing get_indexed_slices on NODE A IP:9160: exception

Re: Operator on secondary indexes in 0.8.x (GTE/LTE)

2011-10-11 Thread Jake Luciani
This hasn't changed in AFAIK, In Brisk we had the same problem in CFS so we created a sentinel value that all rows shared then it works. CASSANDRA-2915 should fix it. On Tue, Oct 11, 2011 at 4:48 PM, Sasha Dolgy sdo...@gmail.com wrote: I was trying to get a range of rows based on a

Re: Solandra distributed search

2011-10-06 Thread Jake Luciani
the default setting of 4 for this property affect the distribution of data across my nodes? From: Jake Luciani jak...@gmail.com Reply-To: user@cassandra.apache.org user@cassandra.apache.org Date: Mon, 15 Aug 2011 12:03:22 -0700 To: user@cassandra.apache.org user@cassandra.apache.org Subject: Re

Re: Possibility of going OOM using get_count

2011-09-18 Thread Jake Luciani
This is fixed in 1.0 https://issues.apache.org/jira/browse/CASSANDRA-2894 On Sun, Sep 18, 2011 at 2:16 PM, Tharindu Mathew mcclou...@gmail.comwrote: Hi everyone, I noticed this line in the API docs, The method is not O(1). It takes all the columns from disk to calculate the answer. The

Re: Solandra error - spaces in search

2011-08-23 Thread Jake Luciani
Thx for the info I'll try to reproduce On Aug 23, 2011, at 9:28 PM, Ashley Martens amart...@ngmoco.com wrote: INFO [769787724@qtp-311722089-9825] 2011-08-23 22:07:53,750 SolrCore.java (line 1370) [users] webapp=/solandra path=/select

Re: Reg File upload to cassandra - connection refused error

2011-08-18 Thread Jake Luciani
What is rpc_address set to in cassandra.yaml? Try setting these to 0.0.0.0 to be sure it's listening to external traffic. On Thu, Aug 18, 2011 at 8:37 AM, Thamizh tceg...@yahoo.co.in wrote: Hi All, This is regarding help to resolve connection refused error on Cassandra client API. I have

Re: Suggested settings for number crunching

2011-08-18 Thread Jake Luciani
Are you writing lots of tiny rows or a few very large rows, are you batching mutations? is the loading disk or cpu or network bound? -Jake On Thu, Aug 18, 2011 at 7:08 AM, Paul Loy ketera...@gmail.com wrote: Hi All, I have a program that crunches through around 3 billion calculations. We

Re: Suggested settings for number crunching

2011-08-18 Thread Jake Luciani
no network traffic so I think it's disk access. Will find out for sure tomorrow after the current test runs. Thanks, Paul. On Thu, Aug 18, 2011 at 2:23 PM, Jake Luciani jak...@gmail.com wrote: Are you writing lots of tiny rows or a few very large rows, are you batching mutations

Re: Solandra multiple schemas

2011-08-15 Thread Jake Luciani
You want the solandra data stored under two keyspaces? Or you just want two different logical indexes. The former requires changing the keyspace name located in solandra.properties but you can only access one per process. The latter would involve creating two different solr cores at different

Re: Solandra distributed search

2011-08-15 Thread Jake Luciani
Solandra manages the shard parameters for you. you don't need to specify anything. On Mon, Aug 15, 2011 at 3:00 PM, Jeremiah Jordan jeremiah.jor...@morningstar.com wrote: When using Solandra, do I need to use the Solr sharding synxtax in my queries? I don't think I do because Cassandra is

Re: Solandra/Cassandra ring issue

2011-08-11 Thread Jake Luciani
seriously, If you change the cluster name in cassandra.yaml they won't join. On Thu, Aug 11, 2011 at 12:31 PM, Ashley Martens amart...@ngmoco.comwrote: No shared seeds. Downright freaky. -- http://twitter.com/tjake

Re: Setup Cassandra0.8 in Eclipse

2011-08-08 Thread Jake Luciani
you can simply run: ant generate-eclipse-files then import the project On Sun, Aug 7, 2011 at 5:39 PM, Alvin UW alvi...@gmail.com wrote: Hello, I am trying to Setup Cassandra0.8 in Eclipse following http://wiki.apache.org/cassandra/RunningCassandraInEclipse After right clicking on the

Re: Cassandra Pig with network topology and data centers.

2011-07-29 Thread Jake Luciani
Yes it's read repair you can lower the read repair chance to tune this. On Jul 29, 2011, at 6:31 PM, Aaron Griffith aaron.c.griff...@gmail.com wrote: I currently have a 9 node cassandra cluster setup as follows: DC1: Six nodes DC2: Three nodes The tokens alternate between the two

Re: Slow Reads

2011-07-27 Thread Jake Luciani
The philosophy in no-sql is to store the data as you plan to access it. that means duplicating the data many time possibly. Disk is cheap, writes are fast. On Wed, Jul 27, 2011 at 2:22 PM, Priyanka priya...@gmail.com wrote: Thank you Indra for your suggestion. But the thing is apart from

Re: Slow Reads

2011-07-26 Thread Jake Luciani
It doesn't read the entire row, but it does read a section of the row from disk... How big is each supercolumn? If you re-read the data does the query time get faster? On Tue, Jul 26, 2011 at 11:59 AM, Philippe watche...@gmail.com wrote: i believe it's because it needs to read the whole row

Re: Little problems with Solandra

2011-07-23 Thread Jake Luciani
Sounds like you forgot to start solandra after you built it. cd solandra-app; ./bin/solandra You can verify it's running with jps look for SolandraServer. On Jul 23, 2011, at 10:52 AM, Jean-Nicolas Boulay Desjardins jnbdzjn...@gmail.com wrote: Hi, I have a server on RackSpace and it

Re: solandra or pig or....?

2011-06-22 Thread Jake Luciani
be required... -sd On Tue, Jun 21, 2011 at 9:50 PM, Jake Luciani jak...@gmail.com wrote: Right, Solr will not do anything other than basic aggregations (facets) and range queries. On Tue, Jun 21, 2011 at 3:16 PM, Dan Kuebrich dan.kuebr...@gmail.com wrote: Solandra is indeed distributed search

Re: solandra or pig or....?

2011-06-21 Thread Jake Luciani
Solandra can answer the question you used as an example and it's more of a fit for low-latency ad-hoc reporting then PIG. Pig queries will take minutes not seconds. On Tue, Jun 21, 2011 at 12:12 PM, Sasha Dolgy sdo...@gmail.com wrote: Folks, Simple question ... Assuming my current use case

Re: solandra or pig or....?

2011-06-21 Thread Jake Luciani
i had a quick look at https://github.com/tjake/Solandra/wiki/Solandra-Wiki and it wasn't dead obvious to me On Tue, Jun 21, 2011 at 8:19 PM, Jake Luciani jak...@gmail.com wrote: Solandra can answer the question you used as an example and it's more of a fit for low-latency ad-hoc

Re: solandra or pig or....?

2011-06-21 Thread Jake Luciani
Right, Solr will not do anything other than basic aggregations (facets) and range queries. On Tue, Jun 21, 2011 at 3:16 PM, Dan Kuebrich dan.kuebr...@gmail.comwrote: Solandra is indeed distributed search, not distributed number-crunching. As a previous poster said, you could imagine

Re: What's the best approach to search in Cassandra

2011-06-16 Thread Jake Luciani
that if I read all there is on GitHub, I can probably start using it. * Thank you, Mark On Fri, Jun 3, 2011 at 8:07 PM, Jake Luciani jak...@gmail.com wrote: Mark, Check out Solandra. http://github.com/tjake/Solandra On Fri, Jun 3, 2011 at 7:56 PM, Mark Kerzner markkerz

Re: Is there a way from a running Cassandra node to determine whether or not itself is up?

2011-06-15 Thread Jake Luciani
No force a node down you can use nodetool disablegossip On Wed, Jun 15, 2011 at 6:42 PM, Suan Aik Yeo yeosuan...@gmail.com wrote: Thanks, Aaron, but we determined that adding Java into the equation just brings in too much complexity for something that's called out of an Nginx Perl module.

Re: Cassandra HDFS question

2011-06-09 Thread Jake Luciani
Hi JKnight, Yes. The Brisk project adds a HDFS compatible layer for Cassandra see http://github.com/riptano/brisk -Jake On Thu, Jun 9, 2011 at 11:05 PM, JKnight JKnight beukni...@gmail.comwrote: Dear all, Does Cassandra support HDFS storage? Thank a lot for support. -- Best regards,

Re: Installing Thrift with Solandra

2011-06-07 Thread Jake Luciani
to build a Thrift interface for Cassandra: ./compiler/cpp/thrift -gen php ../PATH-TO-CASSANDRA/interface/cassandra.thrift How do I do this? Where is the interface folder? Again, tjake thanks allot for your time and help. On Mon, Jun 6, 2011 at 11:13 PM, Jake Luciani jak...@gmail.com wrote

Re: Installing Thrift with Solandra

2011-06-07 Thread Jake Luciani
-gen php ../PATH-TO-CASSANDRA/interface/cassandra.thrift How do I do this? Where is the interface folder? Again, tjake thanks allot for your time and help. On Mon, Jun 6, 2011 at 11:13 PM, Jake Luciani jak...@gmail.com wrote: To access Cassandra in Solandra it's the same as regular cassandra

Re: Installing Thrift with Solandra

2011-06-06 Thread Jake Luciani
To access Cassandra in Solandra it's the same as regular cassandra. To access Solr you use one of the Php Solr libraries http://wiki.apache.org/solr/SolPHP On Mon, Jun 6, 2011 at 11:04 PM, Jean-Nicolas Boulay Desjardins jnbdzjn...@gmail.com wrote: I am trying to install Thrift with

Re: What's the best approach to search in Cassandra

2011-06-03 Thread Jake Luciani
Mark, Check out Solandra. http://github.com/tjake/Solandra On Fri, Jun 3, 2011 at 7:56 PM, Mark Kerzner markkerz...@gmail.com wrote: Hi, I need to store, say, 10M-100M documents, with each document having say 100 fields, like author, creation date, access date, etc., and then I want to

Re: Forcing Cassandra to free up some space

2011-05-26 Thread Jake Luciani
Is there a way for me to make (or even gently suggest to) Cassandra that it may be a good time to free up some space? Disregarding what's been said and until ref-counting is implemented this is a useful tool to gently suggest cleanup: https://github.com/ceocoder/jmxgc On Thu, May 26, 2011 at

Re: Migrating all rows from 0.6.13 to 0.7.5 over thrift?

2011-05-07 Thread Jake Luciani
I know thrift and python and Unicode don't mix. On May 7, 2011, at 4:21 PM, aaron morton aa...@thelastpickle.com wrote: I've been able to reproduce the fault using python on my mac book see https://github.com/amorton/cassandra-unicode-bug When we try to find the unicode key in the

Re: Combining all CFs into one big one

2011-05-01 Thread Jake Luciani
If you have N column families you need N * memtable size of RAM to support this. If that's not an option you can merge them into one as you suggest but then you will have much larger SSTables, slower compactions, etc. I don't necessarily agree with Tyler that the OS cache will be less

Re: Endless minor compactions after heavy inserts

2011-04-01 Thread Jake Luciani
nodetool compactionstats On Fri, Apr 1, 2011 at 12:14 PM, mcasandra mohitanch...@gmail.com wrote: Is there a way to monitor the compactions using nodetools? I don't see it in tpstats. -- View this message in context:

Re: Ditching Cassandra

2011-03-29 Thread Jake Luciani
Hi Gregori, What language *were* you using to interact with cassandra? were you unable to find a wrapper API that you found We have discussed adopting the best of client api's in cassandra but we decided it's better for the community to naturally develop them. I think this has also motivated

Re: 0.7.2 slow memtables flushing

2011-02-19 Thread Jake Luciani
Are you running with JNA enabled? If so could you try disabling it? On Sat, Feb 19, 2011 at 11:32 AM, Ivan Georgiev yngw...@bk.ru wrote: On 19.2.2011 г. 16:43 ч., Jonathan Ellis wrote: Flush code didn't change between 0.7.0 and 0.7.2. There must be some other variable here. Memory pressure

Re: [RELEASE] 0.7.2

2011-02-17 Thread Jake Luciani
https://issues.apache.org/jira/browse/CASSANDRA-2174 Yes, just clear the cache On Thu, Feb 17, 2011 at 1:06 PM, Damick, Jeffrey jeffrey.dam...@neustar.biz wrote: So after upgrade to 0.7.2, I see this on startup – should I just blow away these cache files? WARN [main] 2011-02-17

Re: Possible EOFException regression in 0.7.1

2011-02-15 Thread Jake Luciani
Have you made any changes to the cassandra config? 2011/2/15 Jonas Borgström jonas.borgst...@trioptima.com Hi all, While testing the new 0.7.1 release I got the following exception: ERROR [ReadStage:11] 2011-02-15 16:39:18,105 DebuggableThreadPoolExecutor.java (line 103) Error in

Re: [RELEASE] 0.7.1

2011-02-14 Thread Jake Luciani
It can take some time for the files to propagate to the mirrors. It's Eventually Consistent though :) On Mon, Feb 14, 2011 at 4:20 PM, Frank LoVecchio fr...@isidorey.com wrote: Ah, I meant quite a few of the mirror links keep showing up as links to gossip sites and whatnot. On Feb 14, 2011

Re: [0.7.1] more exceptions: Illegal mode

2011-02-07 Thread Jake Luciani
This sounds like a possible bug since the BRAF was re-written in 0.7.1. Could you open a ticket? On Mon, Feb 7, 2011 at 10:32 AM, Patrik Modesto patrik.mode...@gmail.comwrote: On Mon, Feb 7, 2011 at 15:42, Thibaut Britz thibaut.br...@trendiction.com wrote: I think this is related to a faulty

Re: Secondary Index information

2011-01-28 Thread Jake Luciani
http://www.datastax.com/blog/whats-new-cassandra-07-secondary-indexes On Fri, Jan 28, 2011 at 7:15 AM, Sasha Dolgy sasha.do...@gmail.com wrote: Hi there, Where can I find information regarding secondary indexes? Spent the past 2 days looking for some good details.

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Jake Luciani
Are you using a row cache? if so what is it set too? in general it should not be a percentage. On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs chris.burrou...@gmail.com wrote: We have a 6 node Cassandra 0.6.8 cluster running on boxes with 4 GB of RAM. Over the course of several weeks

Re: Lucandra Limitations

2011-01-27 Thread Jake Luciani
Yes, but that's also the lucene limit http://lucene.apache.org/java/3_0_1/fileformats.html#Limitations Lucene uses a Java int to refer to document numbers, and the index file format uses an Int32 On Thu, Jan 27, 2011 at 1:40 PM, David G. Boney dbon...@semanticartifacts.com wrote: I was

Re: Lucandra Limitations

2011-01-27 Thread Jake Luciani
? Lucene supports the ability to create multiple IndexSearchers and stick them in a MultiSearcher. Is this the right way to view the problem? - Sincerely, David G. Boney dbon...@semanticartifacts.com http://www.semanticartifacts.com On Jan 27, 2011, at 12:45 PM, Jake Luciani

Re: Ring up but read fails ...

2011-01-23 Thread Jake Luciani
I've seen this when you leave a socket open and idle for a long time. The connection times out. On Jan 23, 2011, at 8:42 AM, ruslan usifov ruslan.usi...@gmail.com wrote: 2011/1/23 cbert...@libero.it cbert...@libero.it ERROR UserNameCmd:38 - java.net.SocketException: Broken pipe

Re: R: Re: Ring up but read fails ...

2011-01-23 Thread Jake Luciani
Reconnect and try again? On Jan 23, 2011, at 10:47 AM, cbert...@libero.it cbert...@libero.it wrote: I've seen this when you leave a socket open and idle for a long time. The connection times out. It could be the situation ... any idea about the solution? I create the pool once at

Re: GeoIndexing in Cassandra, Open Sourced?

2011-01-21 Thread Jake Luciani
One possible open source approach would be to use the Solr 1.4 spatial plugin[1] along with Solandra[2] What kind of spatial searches are you looking for? basic bounding box/radius? [1] https://github.com/outoftime/solr-spatial-light [2] https://github.com/tjake/lucandra On Fri, Jan 21, 2011

Re: Welcome committer Jake Luciani

2011-01-13 Thread Jake Luciani
Thanks Jonathan and Cassandra PMC! Happy to help Cassandra take over the world! -Jake On Thu, Jan 13, 2011 at 1:41 PM, Jonathan Ellis jbel...@gmail.com wrote: The Cassandra PMC has voted to add Jake as a committer. (Jake is also a committer on Thrift.) Welcome, Jake, and thanks for the

Re: Bootstrapping taking long

2011-01-05 Thread Jake Luciani
...@gmail.com wrote: I haven't tried repair. Should I? On Jan 5, 2011 3:48 PM, Jake Luciani jak...@gmail.com wrote: Have you tried not bootstrapping but setting the token and manually calling repair? On Wed, Jan 5, 2011 at 7:07 AM, Ran Tavory ran...@gmail.com wrote: My conclusion is lame: I

Re: Bootstrapping taking long

2011-01-04 Thread Jake Luciani
In 0.6, locate the node doing anti-compaction and look in the streams subdirectory in the keyspace data dir to monitor the anti-compaction progress (it puts new SSTables for bootstrapping node in there) On Tue, Jan 4, 2011 at 8:01 AM, Ran Tavory ran...@gmail.com wrote: Running nodetool

Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Jake Luciani
Some relevant information here: https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/ On Tue, Jan 4, 2011 at 10:09 PM, Dave Viner davevi...@gmail.com wrote: Hi Peter, Thanks. These are great ideas. One comment tho. I'm actually not as worried about the logging into the

Re: Too many open files Exception + java.lang.ArithmeticException: / by zero

2010-12-16 Thread Jake Luciani
(SSTable.java:233) * Thanks. * * 2010/12/15 Jake Luciani jak...@gmail.com http://www.riptano.com/docs/0.6/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files On Wed, Dec 15, 2010 at 11:13 AM, Amin Sakka, Novapost amin.sa...@novapost.fr wrote: *Hello,* *I'm

Re: Too many open files Exception + java.lang.ArithmeticException: / by zero

2010-12-15 Thread Jake Luciani
http://www.riptano.com/docs/0.6/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files On Wed, Dec 15, 2010 at 11:13 AM, Amin Sakka, Novapost amin.sa...@novapost.fr wrote: *Hello,* *I'm using cassandra 0.7.0 rc1, a single node configuration, replication factor 1,

Re: Re: Re: Cassandra 0.7 beta 3 outOfMemory (OOM)

2010-12-07 Thread Jake Luciani
Max this was a bug fixed recently in 0.7 branch https://issues.apache.org/jira/browse/CASSANDRA-1801 fixed now in RC2 -Jake On Tue, Dec 7, 2010 at 8:11 AM, Max cassan...@ajowa.de wrote: As far as i can see, Lucandra already uses batch_mutations.

Re: Data Model Question

2010-12-02 Thread Jake Luciani
You can also run Solr with Cassandra as the backend: https://github.com/tjake/Lucandra/tree/solandra /shameless_plug -Jake On Thu, Dec 2, 2010 at 6:27 AM, aaron morton aa...@thelastpickle.comwrote: Have you considered using Solr / lucene for the search? It has a lot more search features,

Re: Taking down a node in a 3-node cluster, RF=2

2010-11-28 Thread Jake Luciani
are writing with CL.ANY If you never write with CL.ANY then you can turn off hinted handoff. How do I reconcile this? On Sun, Nov 28, 2010 at 7:11 PM, Jake Luciani jak...@gmail.com wrote: If you read/write data with quorum then you can safely take a node down in this scenario. Subsequent

Re: Taking down a node in a 3-node cluster, RF=2

2010-11-28 Thread Jake Luciani
Right. On Sun, Nov 28, 2010 at 1:03 PM, David Boxenhorn da...@lookin2.com wrote: OK. To sum up: RF=2 and QUORUM are incompatible (if you want to be able to take a node down). Right? On Sun, Nov 28, 2010 at 7:59 PM, Jake Luciani jak...@gmail.com wrote: I was wrong on this scenario

Re: Facebook messaging and choice of HBase over Cassandra - what can we learn?

2010-11-21 Thread Jake Luciani
+1 Ed On Nov 21, 2010, at 12:13 PM, Edward Capriolo edlinuxg...@gmail.com wrote: On Sun, Nov 21, 2010 at 12:10 PM, André Fiedler fiedler.an...@googlemail.com wrote: Facebook Messaging – HBase Comes of Age http://facility9.com/2010/11/18/facebook-messaging-hbase-comes-of-age

Re: Cassandra 0.7-beta3 - IndexOutOfBoundsException when create column family

2010-11-16 Thread Jake Luciani
This is a bug in beta3, if you checkout the cassandra-0.7 branch it should work for you. On Tue, Nov 16, 2010 at 3:38 PM, André Fiedler fiedler.an...@googlemail.com wrote: I try to perform the following action after a clean startup. And get the log below. How to fix this?

Re: Not overwriting values

2010-09-21 Thread Jake Luciani
In theory you could use timestamps that go back in time for this CF. That way the first write will persist over future writes. On Sep 21, 2010, at 6:58 AM, Christian Decker decker.christ...@gmail.com wrote: Hi all, I have a rather strange problem I'd like to address. As I understand it

Re: indexing methods

2010-09-03 Thread Jake Luciani
Hi Courtney, You can take a look at lucandra http://github.com/tjake/Lucandra which uses the lucene api to maintain a inverted index in cassandra. There are a couple articles and presentations in the readme that give more info on how this is done. -Jake On Fri, Sep 3, 2010 at 6:26 AM, Courtney

Re: Coke Products at Digg?

2010-07-07 Thread Jake Luciani
Coke sucks! Only drink it if you want to work hard for 20 minutes then crash. I started a new cola that's already way better than Coke and it will solve all your problems. I'm finalizing my results but so far I only need one drink per WEEK! On Jul 7, 2010, at 12:10 PM, Mike Malone

Re: Lucandra issues

2010-06-18 Thread Jake Luciani
Hi Maxim, Lucandra doesn't support numeric queries quite yet. A workaround would be to load your numbers and convert them to strings. I'll eventually add support for this. Please feel free to help out if you can :) Jake On Jun 17, 2010, at 1:16 PM, Maxim Kramarenko

Re: Continuously increasing RAM usage

2010-06-02 Thread Jake Luciani
I've started seeing this issue as well. Running 0.6.2. One interesting thing I happened upon, I explicitly called the GC via jconsole and the heap dropped completely fixing the issue. When you explicitly call System.gc() it does a full sweep. I'm wondering if this issue is to do with the GC

Re: Thoughts on adding complex queries to Cassandra

2010-05-27 Thread Jake Luciani
I've secretly started working on this but nothing to show yet :( I'm calling it SliceDiceReduce or SliceReduce. The plan is to use the js thrift bindings I've added for 0.3 release of thrift (out very soon?) This will allow the supplied js to access the results like any other thrift

Re: Pooling Question

2010-05-20 Thread Jake Luciani
Look in /contrib it's already there. On May 20, 2010, at 6:23 PM, Mark Robson mar...@gmail.com wrote: On 20 May 2010 23:16, Ryan Daum r...@thimbleware.com wrote: I personally would love to see Cassandra add the concept of a read- only 'proxy' node which acts like the embedded ready only

Re: BinaryMemtable and collisions

2010-05-07 Thread Jake Luciani
by the work of Jake Luciani in Lucandra. I've successfully loaded nearly a million documents over a 3-node cluster, and initial query tests look promising. The problem is that our target use case has hundreds of millions of documents (each document is very small however). Loading time

Re: BinaryMemtable and collisions

2010-05-07 Thread Jake Luciani
at 12:09 AM, Jake Luciani jak...@gmail.com wrote: Any reason why you aren't using Lucandra directly? On Fri, May 7, 2010 at 8:21 PM, Tobias Jungen tobias.jun...@gmail.comwrote: Greetings, Started getting my feet wet with Cassandra in earnest this week. I'm building a custom inverted index

Re: Lucandra or some way to query

2010-04-14 Thread Jake Luciani
Hi, What doesn't work with lucandra exactly? Feel free to msg me. -Jake On Wed, Apr 14, 2010 at 9:30 PM, Jesus Ibanez jesusiba...@gmail.com wrote: I will explore Lucandra a little more and if I can't get it to work today, I will go for Option 2. Using SQL will not be efficient in the

Re: Lucandra or some way to query

2010-04-14 Thread Jake Luciani
Lucandra spreads the data randomly by index + field combination so you do get some distribution for free. Otherwise you can use nodetool loadbalance to alter the token ring to alleviate hotspots. On Thu, Apr 15, 2010 at 2:04 AM, HubertChang hui...@gmail.com wrote: If you worked with Lucandra

<    1   2