Where do you see the timeout exceptions? in the mappers?
How many mappers reducers slots are you using? What does your disk setup
look like? do you have HDFS on same disk as cassandra data dir?
-Jake
On Tue, Dec 6, 2011 at 4:50 AM, Patrik Modesto patrik.mode...@gmail.comwrote:
Hi,
I'm
Hi,
Sorry for the intrusion.
I was speaking to some of the LinkedIn engineers at ApacheCon last week
about to see how to get
Cassandra into the linkedin skills page [1].
They claim if more people add Cassandra as a skill in their profile then it
will show up. So my request
is if you use
Re Simpler elasticity:
Latest opscenter will now rebalance cluster optimally
http://www.datastax.com/dev/blog/whats-new-in-opscenter-1-3
/plug
-Jake
On Mon, Nov 14, 2011 at 7:27 PM, Chris Burroughs
chris.burrou...@gmail.comwrote:
- It would be super cool if all of that counter work made it
Hi Todd,
Entity Groups : https://issues.apache.org/jira/browse/CASSANDRA-1684
-Jake
On Wed, Nov 9, 2011 at 6:44 AM, Todd Burruss bburr...@expedia.com wrote:
I believe I heard someone talk at Cassandra SF conference about creating a
partitioner that was a derivation of RandomPartitioner. It
at the conference that had
already implemented what I mentioned. It didn't offer any atomicity, just
co-locating a family of data on the same node.
From: Jake Luciani jak...@gmail.com
Reply-To: user@cassandra.apache.org user@cassandra.apache.org
Date: Wed, 9 Nov 2011 02:53:20 -0800
To: user
Hi Nate,
Could you try running it with debug enabled on the logs? it will give more
insite into what's going on.
-Jake
On Tue, Nov 8, 2011 at 3:45 PM, Nate Sammons nsamm...@ften.com wrote:
This is against a single server, not a cluster. Replication factor for
the keyspace is set to 1, CL
I'll be there!
On Mon, Nov 7, 2011 at 5:23 PM, Eric Evans eev...@acunu.com wrote:
Just a reminder; If you're planning to be at ApacheCon, or are
otherwise able to be in Vancouver on the 10th, we're having a
Cassandra Meetup. There is no cost to attend (you don't even need to
be registered
What's your bottleneck?
http://spyced.blogspot.com/2010/01/linux-performance-basics.html
On Thu, Oct 27, 2011 at 9:37 AM, Joe Stein crypt...@gmail.com wrote:
Hey folks, I am interested in what others have seen in regards to their
experience in the amount of depth and width (CF, Rows Columns)
You are unable to connect? or you are getting an UnavailableException?
On Thu, Oct 27, 2011 at 11:14 AM, RobinUs2 ro...@us2.nl wrote:
I currently run a 2-node cluster with version cassandra 1.0 (stable). With
replication factor 2 on the keyspace which I'm testing. When I shutdown
node
B,
What consistency level are you using? With RF=2 your only option is CL.ONE
when a node is down.
On Thu, Oct 27, 2011 at 11:47 AM, RobinUs2 ro...@us2.nl wrote:
The error I currently see when I take down node B:
Error performing get_indexed_slices on NODE A IP:9160: exception
This hasn't changed in AFAIK, In Brisk we had the same problem in CFS so we
created a sentinel value that all rows shared then it works. CASSANDRA-2915
should fix it.
On Tue, Oct 11, 2011 at 4:48 PM, Sasha Dolgy sdo...@gmail.com wrote:
I was trying to get a range of rows based on a
the default setting of 4 for this property
affect the distribution of data across my nodes?
From: Jake Luciani jak...@gmail.com
Reply-To: user@cassandra.apache.org user@cassandra.apache.org
Date: Mon, 15 Aug 2011 12:03:22 -0700
To: user@cassandra.apache.org user@cassandra.apache.org
Subject: Re
This is fixed in 1.0
https://issues.apache.org/jira/browse/CASSANDRA-2894
On Sun, Sep 18, 2011 at 2:16 PM, Tharindu Mathew mcclou...@gmail.comwrote:
Hi everyone,
I noticed this line in the API docs,
The method is not O(1). It takes all the columns from disk to calculate the
answer. The
Thx for the info I'll try to reproduce
On Aug 23, 2011, at 9:28 PM, Ashley Martens amart...@ngmoco.com wrote:
INFO [769787724@qtp-311722089-9825] 2011-08-23 22:07:53,750 SolrCore.java
(line 1370) [users] webapp=/solandra path=/select
What is rpc_address set to in cassandra.yaml?
Try setting these to 0.0.0.0 to be sure it's listening to external traffic.
On Thu, Aug 18, 2011 at 8:37 AM, Thamizh tceg...@yahoo.co.in wrote:
Hi All,
This is regarding help to resolve connection refused error on Cassandra
client API.
I have
Are you writing lots of tiny rows or a few very large rows, are you batching
mutations? is the loading disk or cpu or network bound?
-Jake
On Thu, Aug 18, 2011 at 7:08 AM, Paul Loy ketera...@gmail.com wrote:
Hi All,
I have a program that crunches through around 3 billion calculations. We
no network
traffic
so I think it's disk access. Will find out for sure tomorrow after the
current test runs.
Thanks,
Paul.
On Thu, Aug 18, 2011 at 2:23 PM, Jake Luciani jak...@gmail.com wrote:
Are you writing lots of tiny rows or a few very large rows, are you
batching mutations
You want the solandra data stored under two keyspaces? Or you just want two
different logical indexes.
The former requires changing the keyspace name located in
solandra.properties but you can only access one per process.
The latter would involve creating two different solr cores at different
Solandra manages the shard parameters for you. you don't need to specify
anything.
On Mon, Aug 15, 2011 at 3:00 PM, Jeremiah Jordan
jeremiah.jor...@morningstar.com wrote:
When using Solandra, do I need to use the Solr sharding synxtax in my
queries? I don't think I do because Cassandra is
seriously, If you change the cluster name in cassandra.yaml they won't
join.
On Thu, Aug 11, 2011 at 12:31 PM, Ashley Martens amart...@ngmoco.comwrote:
No shared seeds. Downright freaky.
--
http://twitter.com/tjake
you can simply run:
ant generate-eclipse-files
then import the project
On Sun, Aug 7, 2011 at 5:39 PM, Alvin UW alvi...@gmail.com wrote:
Hello,
I am trying to Setup Cassandra0.8 in Eclipse following
http://wiki.apache.org/cassandra/RunningCassandraInEclipse
After right clicking on the
Yes it's read repair you can lower the read repair chance to tune this.
On Jul 29, 2011, at 6:31 PM, Aaron Griffith aaron.c.griff...@gmail.com wrote:
I currently have a 9 node cassandra cluster setup as follows:
DC1: Six nodes
DC2: Three nodes
The tokens alternate between the two
The philosophy in no-sql is to store the data as you plan to access it. that
means duplicating the data many time possibly. Disk is cheap, writes are
fast.
On Wed, Jul 27, 2011 at 2:22 PM, Priyanka priya...@gmail.com wrote:
Thank you Indra for your suggestion.
But the thing is apart from
It doesn't read the entire row, but it does read a section of the row from
disk...
How big is each supercolumn? If you re-read the data does the query time
get faster?
On Tue, Jul 26, 2011 at 11:59 AM, Philippe watche...@gmail.com wrote:
i believe it's because it needs to read the whole row
Sounds like you forgot to start solandra after you built it.
cd solandra-app; ./bin/solandra
You can verify it's running with jps look for SolandraServer.
On Jul 23, 2011, at 10:52 AM, Jean-Nicolas Boulay Desjardins
jnbdzjn...@gmail.com wrote:
Hi,
I have a server on RackSpace and it
be required...
-sd
On Tue, Jun 21, 2011 at 9:50 PM, Jake Luciani jak...@gmail.com wrote:
Right, Solr will not do anything other than basic aggregations (facets) and
range queries.
On Tue, Jun 21, 2011 at 3:16 PM, Dan Kuebrich dan.kuebr...@gmail.com
wrote:
Solandra is indeed distributed search
Solandra can answer the question you used as an example and it's more of a
fit for low-latency ad-hoc reporting then PIG. Pig queries will take
minutes not seconds.
On Tue, Jun 21, 2011 at 12:12 PM, Sasha Dolgy sdo...@gmail.com wrote:
Folks,
Simple question ... Assuming my current use case
i had a quick look at
https://github.com/tjake/Solandra/wiki/Solandra-Wiki and it wasn't
dead obvious to me
On Tue, Jun 21, 2011 at 8:19 PM, Jake Luciani jak...@gmail.com wrote:
Solandra can answer the question you used as an example and it's more of
a
fit for low-latency ad-hoc
Right, Solr will not do anything other than basic aggregations (facets) and
range queries.
On Tue, Jun 21, 2011 at 3:16 PM, Dan Kuebrich dan.kuebr...@gmail.comwrote:
Solandra is indeed distributed search, not distributed number-crunching.
As a previous poster said, you could imagine
that if I read
all there is on GitHub, I can probably start using it.
*
Thank you,
Mark
On Fri, Jun 3, 2011 at 8:07 PM, Jake Luciani jak...@gmail.com wrote:
Mark,
Check out Solandra. http://github.com/tjake/Solandra
On Fri, Jun 3, 2011 at 7:56 PM, Mark Kerzner markkerz
No force a node down you can use nodetool disablegossip
On Wed, Jun 15, 2011 at 6:42 PM, Suan Aik Yeo yeosuan...@gmail.com wrote:
Thanks, Aaron, but we determined that adding Java into the equation just
brings in too much complexity for something that's called out of an Nginx
Perl module.
Hi JKnight,
Yes. The Brisk project adds a HDFS compatible layer for Cassandra see
http://github.com/riptano/brisk
-Jake
On Thu, Jun 9, 2011 at 11:05 PM, JKnight JKnight beukni...@gmail.comwrote:
Dear all,
Does Cassandra support HDFS storage?
Thank a lot for support.
--
Best regards,
to build a Thrift interface for
Cassandra:
./compiler/cpp/thrift -gen php ../PATH-TO-CASSANDRA/interface/cassandra.thrift
How do I do this?
Where is the interface folder?
Again, tjake thanks allot for your time and help.
On Mon, Jun 6, 2011 at 11:13 PM, Jake Luciani jak...@gmail.com wrote
-gen php ../PATH-TO-CASSANDRA/interface/cassandra.thrift
How do I do this?
Where is the interface folder?
Again, tjake thanks allot for your time and help.
On Mon, Jun 6, 2011 at 11:13 PM, Jake Luciani jak...@gmail.com wrote:
To access Cassandra in Solandra it's the same as regular cassandra
To access Cassandra in Solandra it's the same as regular cassandra. To
access Solr you use one of the Php Solr libraries
http://wiki.apache.org/solr/SolPHP
On Mon, Jun 6, 2011 at 11:04 PM, Jean-Nicolas Boulay Desjardins
jnbdzjn...@gmail.com wrote:
I am trying to install Thrift with
Mark,
Check out Solandra. http://github.com/tjake/Solandra
On Fri, Jun 3, 2011 at 7:56 PM, Mark Kerzner markkerz...@gmail.com wrote:
Hi,
I need to store, say, 10M-100M documents, with each document having say 100
fields, like author, creation date, access date, etc., and then I want to
Is there a way for me to make (or even gently suggest to) Cassandra that it
may be a good time to free up some space?
Disregarding what's been said and until ref-counting is implemented this is
a useful tool to gently suggest cleanup:
https://github.com/ceocoder/jmxgc
On Thu, May 26, 2011 at
I know thrift and python and Unicode don't mix.
On May 7, 2011, at 4:21 PM, aaron morton aa...@thelastpickle.com wrote:
I've been able to reproduce the fault using python on my mac book see
https://github.com/amorton/cassandra-unicode-bug
When we try to find the unicode key in the
If you have N column families you need N * memtable size of RAM to support
this. If that's not an option you can merge them into one as you suggest
but then you will have much larger SSTables, slower compactions, etc. I
don't necessarily agree with Tyler that the OS cache will be less
nodetool compactionstats
On Fri, Apr 1, 2011 at 12:14 PM, mcasandra mohitanch...@gmail.com wrote:
Is there a way to monitor the compactions using nodetools? I don't see it
in
tpstats.
--
View this message in context:
Hi Gregori,
What language *were* you using to interact with cassandra? were you unable
to find a wrapper API that you found
We have discussed adopting the best of client api's in cassandra but we
decided it's better for the community to naturally develop them. I think
this has also motivated
Are you running with JNA enabled? If so could you try disabling it?
On Sat, Feb 19, 2011 at 11:32 AM, Ivan Georgiev yngw...@bk.ru wrote:
On 19.2.2011 г. 16:43 ч., Jonathan Ellis wrote:
Flush code didn't change between 0.7.0 and 0.7.2. There must be some
other variable here. Memory pressure
https://issues.apache.org/jira/browse/CASSANDRA-2174
Yes, just clear the cache
On Thu, Feb 17, 2011 at 1:06 PM, Damick, Jeffrey jeffrey.dam...@neustar.biz
wrote:
So after upgrade to 0.7.2, I see this on startup – should I just blow
away these cache files?
WARN [main] 2011-02-17
Have you made any changes to the cassandra config?
2011/2/15 Jonas Borgström jonas.borgst...@trioptima.com
Hi all,
While testing the new 0.7.1 release I got the following exception:
ERROR [ReadStage:11] 2011-02-15 16:39:18,105
DebuggableThreadPoolExecutor.java (line 103) Error in
It can take some time for the files to propagate to the mirrors. It's
Eventually Consistent though :)
On Mon, Feb 14, 2011 at 4:20 PM, Frank LoVecchio fr...@isidorey.com wrote:
Ah, I meant quite a few of the mirror links keep showing up as links to
gossip sites and whatnot.
On Feb 14, 2011
This sounds like a possible bug since the BRAF was re-written in 0.7.1.
Could you open a ticket?
On Mon, Feb 7, 2011 at 10:32 AM, Patrik Modesto patrik.mode...@gmail.comwrote:
On Mon, Feb 7, 2011 at 15:42, Thibaut Britz
thibaut.br...@trendiction.com wrote:
I think this is related to a faulty
http://www.datastax.com/blog/whats-new-cassandra-07-secondary-indexes
On Fri, Jan 28, 2011 at 7:15 AM, Sasha Dolgy sasha.do...@gmail.com wrote:
Hi there,
Where can I find information regarding secondary indexes? Spent the
past 2 days looking for some good details.
Are you using a row cache? if so what is it set too? in general it should
not be a percentage.
On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs chris.burrou...@gmail.com
wrote:
We have a 6 node Cassandra 0.6.8 cluster running on boxes with 4 GB of
RAM. Over the course of several weeks
Yes, but that's also the lucene limit
http://lucene.apache.org/java/3_0_1/fileformats.html#Limitations
Lucene uses a Java int to refer to document numbers, and the index file
format uses an Int32
On Thu, Jan 27, 2011 at 1:40 PM, David G. Boney
dbon...@semanticartifacts.com wrote:
I was
? Lucene supports the ability to create multiple
IndexSearchers and stick them in a MultiSearcher.
Is this the right way to view the problem?
-
Sincerely,
David G. Boney
dbon...@semanticartifacts.com
http://www.semanticartifacts.com
On Jan 27, 2011, at 12:45 PM, Jake Luciani
I've seen this when you leave a socket open and idle for a long time. The
connection times out.
On Jan 23, 2011, at 8:42 AM, ruslan usifov ruslan.usi...@gmail.com wrote:
2011/1/23 cbert...@libero.it cbert...@libero.it
ERROR UserNameCmd:38 - java.net.SocketException: Broken pipe
Reconnect and try again?
On Jan 23, 2011, at 10:47 AM, cbert...@libero.it cbert...@libero.it wrote:
I've seen this when you leave a socket open and idle for a long time. The
connection times out.
It could be the situation ... any idea about the solution?
I create the pool once at
One possible open source approach would be to use the Solr 1.4 spatial
plugin[1] along with Solandra[2]
What kind of spatial searches are you looking for? basic bounding
box/radius?
[1] https://github.com/outoftime/solr-spatial-light
[2] https://github.com/tjake/lucandra
On Fri, Jan 21, 2011
Thanks Jonathan and Cassandra PMC!
Happy to help Cassandra take over the world!
-Jake
On Thu, Jan 13, 2011 at 1:41 PM, Jonathan Ellis jbel...@gmail.com wrote:
The Cassandra PMC has voted to add Jake as a committer. (Jake is also
a committer on Thrift.)
Welcome, Jake, and thanks for the
...@gmail.com wrote:
I haven't tried repair. Should I?
On Jan 5, 2011 3:48 PM, Jake Luciani jak...@gmail.com wrote:
Have you tried not bootstrapping but setting the token and manually
calling
repair?
On Wed, Jan 5, 2011 at 7:07 AM, Ran Tavory ran...@gmail.com wrote:
My conclusion is lame: I
In 0.6, locate the node doing anti-compaction and look in the streams
subdirectory in the keyspace data dir to monitor the anti-compaction
progress (it puts new SSTables for bootstrapping node in there)
On Tue, Jan 4, 2011 at 8:01 AM, Ran Tavory ran...@gmail.com wrote:
Running nodetool
Some relevant information here:
https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/
On Tue, Jan 4, 2011 at 10:09 PM, Dave Viner davevi...@gmail.com wrote:
Hi Peter,
Thanks. These are great ideas. One comment tho. I'm actually not as
worried about the logging into the
(SSTable.java:233)
*
Thanks.
*
*
2010/12/15 Jake Luciani jak...@gmail.com
http://www.riptano.com/docs/0.6/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files
On Wed, Dec 15, 2010 at 11:13 AM, Amin Sakka, Novapost
amin.sa...@novapost.fr wrote:
*Hello,*
*I'm
http://www.riptano.com/docs/0.6/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files
On Wed, Dec 15, 2010 at 11:13 AM, Amin Sakka, Novapost
amin.sa...@novapost.fr wrote:
*Hello,*
*I'm using cassandra 0.7.0 rc1, a single node configuration, replication
factor 1,
Max this was a bug fixed recently in 0.7 branch
https://issues.apache.org/jira/browse/CASSANDRA-1801
fixed now in RC2
-Jake
On Tue, Dec 7, 2010 at 8:11 AM, Max cassan...@ajowa.de wrote:
As far as i can see, Lucandra already uses batch_mutations.
You can also run Solr with Cassandra as the backend:
https://github.com/tjake/Lucandra/tree/solandra
/shameless_plug
-Jake
On Thu, Dec 2, 2010 at 6:27 AM, aaron morton aa...@thelastpickle.comwrote:
Have you considered using Solr / lucene for the search? It has a lot more
search features,
are writing with
CL.ANY
If you never write with CL.ANY then you can turn off hinted handoff.
How do I reconcile this?
On Sun, Nov 28, 2010 at 7:11 PM, Jake Luciani jak...@gmail.com wrote:
If you read/write data with quorum then you can safely take a node down in
this scenario. Subsequent
Right.
On Sun, Nov 28, 2010 at 1:03 PM, David Boxenhorn da...@lookin2.com wrote:
OK. To sum up: RF=2 and QUORUM are incompatible (if you want to be able to
take a node down).
Right?
On Sun, Nov 28, 2010 at 7:59 PM, Jake Luciani jak...@gmail.com wrote:
I was wrong on this scenario
+1 Ed
On Nov 21, 2010, at 12:13 PM, Edward Capriolo edlinuxg...@gmail.com wrote:
On Sun, Nov 21, 2010 at 12:10 PM, André Fiedler
fiedler.an...@googlemail.com wrote:
Facebook Messaging – HBase Comes of Age
http://facility9.com/2010/11/18/facebook-messaging-hbase-comes-of-age
This is a bug in beta3, if you checkout the cassandra-0.7 branch it should
work for you.
On Tue, Nov 16, 2010 at 3:38 PM, André Fiedler fiedler.an...@googlemail.com
wrote:
I try to perform the following action after a clean startup. And get the
log below. How to fix this?
In theory you could use timestamps that go back in time for this CF. That way
the first write will persist over future writes.
On Sep 21, 2010, at 6:58 AM, Christian Decker decker.christ...@gmail.com
wrote:
Hi all,
I have a rather strange problem I'd like to address. As I understand it
Hi Courtney,
You can take a look at lucandra http://github.com/tjake/Lucandra which uses
the lucene api to maintain a inverted index in cassandra. There are a couple
articles and presentations in the readme that give more info on how this is
done.
-Jake
On Fri, Sep 3, 2010 at 6:26 AM, Courtney
Coke sucks! Only drink it if you want to work hard for 20 minutes then crash.
I started a new cola that's already way better than Coke and it will solve all
your problems. I'm finalizing my results but so far I only need one drink per
WEEK!
On Jul 7, 2010, at 12:10 PM, Mike Malone
Hi Maxim,
Lucandra doesn't support numeric queries quite yet. A workaround would
be to load your numbers and convert them to strings.
I'll eventually add support for this. Please feel free to help out if
you can :)
Jake
On Jun 17, 2010, at 1:16 PM, Maxim Kramarenko
I've started seeing this issue as well. Running 0.6.2.
One interesting thing I happened upon, I explicitly called the GC via
jconsole and the heap dropped completely fixing the issue. When you
explicitly call System.gc() it does a full sweep. I'm wondering if this
issue is to do with the GC
I've secretly started working on this but nothing to show yet :( I'm
calling it SliceDiceReduce or SliceReduce.
The plan is to use the js thrift bindings I've added for 0.3 release
of thrift (out very soon?)
This will allow the supplied js to access the results like any other
thrift
Look in /contrib it's already there.
On May 20, 2010, at 6:23 PM, Mark Robson mar...@gmail.com wrote:
On 20 May 2010 23:16, Ryan Daum r...@thimbleware.com wrote:
I personally would love to see Cassandra add the concept of a read-
only 'proxy' node which acts like the embedded ready only
by the work of Jake Luciani in Lucandra. I've successfully loaded
nearly a million documents over a 3-node cluster, and initial query tests
look promising.
The problem is that our target use case has hundreds of millions of
documents (each document is very small however). Loading time
at 12:09 AM, Jake Luciani jak...@gmail.com wrote:
Any reason why you aren't using Lucandra directly?
On Fri, May 7, 2010 at 8:21 PM, Tobias Jungen tobias.jun...@gmail.comwrote:
Greetings,
Started getting my feet wet with Cassandra in earnest this week. I'm
building a custom inverted index
Hi,
What doesn't work with lucandra exactly? Feel free to msg me.
-Jake
On Wed, Apr 14, 2010 at 9:30 PM, Jesus Ibanez jesusiba...@gmail.com wrote:
I will explore Lucandra a little more and if I can't get it to work today,
I will go for Option 2.
Using SQL will not be efficient in the
Lucandra spreads the data randomly by index + field combination so you do
get some distribution for free. Otherwise you can use nodetool
loadbalance to alter the token ring to alleviate hotspots.
On Thu, Apr 15, 2010 at 2:04 AM, HubertChang hui...@gmail.com wrote:
If you worked with Lucandra
101 - 176 of 176 matches
Mail list logo