Re: Status of deploy to maven central patch

2011-03-31 Thread Stu Hood
I would be +1 on this. We need a good way to develop against
snapshots. Thanks for your work here Stephen.

Stu

On Fri, Mar 25, 2011 at 5:35 AM, Stephen Connolly 
stephen.alan.conno...@gmail.com wrote:

 Just an FYI.

 I have the required fixes for Maven ANT Tasks in place:

 http://jira.codehaus.org/browse/MANTTASKS-217
 http://jira.codehaus.org/browse/MANTTASKS-211
 http://jira.codehaus.org/browse/MANTTASKS-210

 I plan to run a release of Maven ANT Tasks early next week.

 Once the release is place I should have the patches for 0.8.x and
 0.7.x shortly after.

 Once the 0.7.x patch has landed it would be great if we could spin a
 0.7.5 just to trial the patches to verify that they do actually result
 in a valid release getting pushed all the way to central, and to
 verify that I have the wiki page for making the release in place

 -Stephen



Re: plugins/triggers/coprocessors

2011-02-11 Thread Stu Hood
Honestly, I think we should just mark 1016 a dupe and move forward with
1311: we won't be hurting anyone's feelings, and the implementation from
1016 is: 1. much, much less complete, 2. abandoned.


On Fri, Feb 11, 2011 at 9:23 AM, Jeremy Hanna jeremy.hanna1...@gmail.comwrote:

 Thanks Maxim - I'll just go ahead and BCC you and Hentschel and move the
 discussion to the dev list.

 Based on the comments on 1311 - did you have anything else to add to that -
 could we unify around 1016 or 1311 and work on getting that to a general
 state of acceptance?  Were there any that were able to do some work on
 either these days?  Or are we not at that point?

 On Feb 11, 2011, at 10:36 AM, Maxim Grinev wrote:

  Hi all,
 
  Jeremy, thanks for starting the discussion! We don't follow the dev list
 closely so it was a good idea to email it directly.
 
  It really seems to be about the same. To unify the discussions, we
 propose to list the features of each solution and compare them feature by
 feature. Here is the feature list for triggers:
• Triggers are set on a column family. Triggers are executed for
 each mutation to the column family and parametrized by the mutation.
• The mutation, which is the trigger parameter, is the new value.
 The trigger cannot see the old value.
• Triggers are executed asynchronously some time after the write
 which fired it is acknowledged to the client.
• Triggers are executed once during normal execution. We guarantee
 at least once execution in case of node failures.
  Cheers,
  Maxim
 
 
  On Thu, Feb 10, 2011 at 8:45 AM, Jeremy Hanna
  jeremy.hanna1...@gmail.com wrote:
   Hey guys,
  
   I was just wondering if it would be a good time to unify discussions on
 plugins/triggers/coprocessors?
  
   https://issues.apache.org/jira/browse/CASSANDRA-1016
   https://issues.apache.org/jira/browse/CASSANDRA-1311
  
   I was going to email the dev list but since I don't know if all of you
 follow the dev list and you guys are the ones that expressed the most
 interest, I thought I would start here.
 
  Yeah, they're all tackling basically the same problem. For which we
  should have a single solution.
 
  -ryan
 




Re: [VOTE] 0.7.1 (what are we at now, 4?)

2011-02-11 Thread Stu Hood
+1

Passes the distributed tests.

On Thu, Feb 10, 2011 at 9:52 AM, Eric Evans eev...@rackspace.com wrote:


 I propose the following for release as 0.7.1.

 SVN:
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-0.7@r1069461
 0.7.1 artifacts: http://people.apache.org/~eevans

 The vote will be open for 72 hours.

 [1]: http://goo.gl/5VAPP (CHANGES.txt)
 [2]: http://goo.gl/C9M5W (NEWS.txt)
 [3]: http://goo.gl/8dZUr

 --
 Eric Evans
 eev...@rackspace.com




Re: [VOTE] 0.7.1 (attempt #2)

2011-02-01 Thread Stu Hood
-1
Kelvin was kind enough to confirm that ALL is broken in this release and
trunk. See https://issues.apache.org/jira/browse/CASSANDRA-2094

On Sun, Jan 30, 2011 at 5:24 PM, Stu Hood stuh...@gmail.com wrote:

 -0
 Upgrading from 0.7.0 to these artifacts was fine, but the write ONE read
 ALL distributed test times out in an unexpected location, with no error
 messages on the server. The test looks valid, but is also failing in
 0.8/trunk.

 I'll try and bisect it tomorrow from CASSANDRA-1964 (which passed
 consistently) to the breakage.


 On Sun, Jan 30, 2011 at 1:14 AM, Stephen Connolly 
 stephen.alan.conno...@gmail.com wrote:

 I'm getting

 Bad Gateway

 The proxy server received an invalid response from an upstream server.

 From repository.apache.org.

 So the Maven central artifacts will probably be staged tomorrow AM (as
 my wife will kill me if I waste Sunday working on this! and she'd be
 right too!) ;-)

 -Stephen

 On 28 January 2011 20:34, Stephen Connolly
 stephen.alan.conno...@gmail.com wrote:
  I'll drop and restage the artifacts for maven central when I get a
 chance
 
  - Stephen
 
  ---
  Sent from my Android phone, so random spelling mistakes, random nonsense
  words and other nonsense are a direct result of using swype to type on
 the
  screen
 
  On 28 Jan 2011 20:30, Eric Evans eev...@rackspace.com wrote:
 
  CASSANDRA-2058[1] has landed in 0.7, so let's give this another shot. I
  propose the following for release as 0.7.1.
 
  SVN:
 
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-0.7@r1064845
  0.7.1 artifacts: http://people.apache.org/~eevans
 
  The vote will be open for 72 hours.
 
 
  [1]: https://issues.apache.org/jira/browse/CASSANDRA-2058
  [2]: http://goo.gl/5Tafg (CHANGES.txt)
  [3]: http://goo.gl/PkreZ (NEWS.txt)
 
  --
  Eric Evans
  eev...@rackspace.com
 
 
 
 





Re: [VOTE] 0.7.1 (attempt #2)

2011-01-30 Thread Stu Hood
-0
Upgrading from 0.7.0 to these artifacts was fine, but the write ONE read
ALL distributed test times out in an unexpected location, with no error
messages on the server. The test looks valid, but is also failing in
0.8/trunk.

I'll try and bisect it tomorrow from CASSANDRA-1964 (which passed
consistently) to the breakage.


On Sun, Jan 30, 2011 at 1:14 AM, Stephen Connolly 
stephen.alan.conno...@gmail.com wrote:

 I'm getting

 Bad Gateway

 The proxy server received an invalid response from an upstream server.

 From repository.apache.org.

 So the Maven central artifacts will probably be staged tomorrow AM (as
 my wife will kill me if I waste Sunday working on this! and she'd be
 right too!) ;-)

 -Stephen

 On 28 January 2011 20:34, Stephen Connolly
 stephen.alan.conno...@gmail.com wrote:
  I'll drop and restage the artifacts for maven central when I get a chance
 
  - Stephen
 
  ---
  Sent from my Android phone, so random spelling mistakes, random nonsense
  words and other nonsense are a direct result of using swype to type on
 the
  screen
 
  On 28 Jan 2011 20:30, Eric Evans eev...@rackspace.com wrote:
 
  CASSANDRA-2058[1] has landed in 0.7, so let's give this another shot. I
  propose the following for release as 0.7.1.
 
  SVN:
 
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-0.7@r1064845
  0.7.1 artifacts: http://people.apache.org/~eevans
 
  The vote will be open for 72 hours.
 
 
  [1]: https://issues.apache.org/jira/browse/CASSANDRA-2058
  [2]: http://goo.gl/5Tafg (CHANGES.txt)
  [3]: http://goo.gl/PkreZ (NEWS.txt)
 
  --
  Eric Evans
  eev...@rackspace.com
 
 
 
 



Re: Time for 1.0

2011-01-13 Thread Stu Hood
 In that environment, I think the production grade validation is important.
A bump in version number does not give you production grade validation: in
fact, it is the other way around.

I'm -1 on going to 1.0 for the next release.

On Thu, Jan 13, 2011 at 9:06 AM, Eric Evans eev...@rackspace.com wrote:

 On Thu, 2011-01-13 at 16:34 +, Nick Telford wrote:
  ...or the Ubuntu route and call it Apache Cassandra 11.08 (or whatever
  month the release occurs in). The number itself is relatively
  unimportant.

 And while we're at it, how about a codename in adjective-animal form?
 Some suggestions:

 * Funky Monkey
 * Flatulent Platypus
 * Leaky Walrus
 * Ballsy Beaver
 * Salty Porcupine

 I could do this all day. :)

 --
 Eric Evans
 eev...@rackspace.com




Re: [VOTE RESULTS] was: [VOTE] 7.0

2011-01-09 Thread Stu Hood
Congratulations everyone!

On Sun, Jan 9, 2011 at 4:21 PM, Eric Evans eev...@rackspace.com wrote:

 On Thu, 2011-01-06 at 11:17 -0600, Eric Evans wrote:
  SVN:
 
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-0@r1055934
  0.7.0 artifacts: http://people.apache.org/~eevans
 
  The vote will be open for at least 72 hours.

 I count 4 binding +1s, 5 others, and no -1s.  The vote passes.  An ASF
 press release is scheduled for tomorrow so I'll get the artifacts up
 tonight, but I'm going to hold off on the announcement until tomorrow.

 --
 Eric Evans
 eev...@rackspace.com




Re: [VOTE] 0.7.0 rc3

2010-12-22 Thread Stu Hood
Nick and I discovered
https://issues.apache.org/jira/browse/CASSANDRA-1895last night. If
someone can reproduce it, it is likely worth delaying this RC
for.

On Tue, Dec 21, 2010 at 5:22 PM, Eric Evans eev...@rackspace.com wrote:


 We've had lots of changes[1] since RC2, too many to move to release
 without wider testing.  I propose the following artifacts as 0.7.0 RC3.

 SVN:
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-...@r1051698
 0.7.0-rc3 artifacts: http://people.apache.org/~eevans

 The vote will be open for 72 hours.

 [1]: http://goo.gl/VIhDZ (CHANGES.txt)
 [2]: http://goo.gl/MrxXI (NEWS.txt)
 [3]: http://goo.gl/HLKlW

 --
 Eric Evans
 eev...@rackspace.com




Re: CASSANDRA-1472 (bitmap indexes)

2010-11-09 Thread Stu Hood
Hmm, 500 sstables is definitely a degenerate case: did you disable compaction? 
By default, Cassandra strives to keep the sstable count below ~32, since 
accesses to separate sstables require seeks.

In this case, the query will seek 500 times to check the secondary index for 
each sstable: if it finds matches it will need to seek to find them in the 
primary index, and seek again for the data file.

-Original Message-
From: dragos cernahoschi dragos.cernahos...@gmail.com
Sent: Tuesday, November 9, 2010 5:33am
To: dev@cassandra.apache.org
Subject: Re: CASSANDRA-1472 (bitmap indexes)

There are about 500 SSTables (12GB of data including index data,
statistics...) The source data file had about 3GB/26 million rows.

I only test with EQ expressions for now.

Increasing the file limit resolved the problem, but now I'm getting
TimedOutException(s) from thrift when querying even with slice size of 1.
Is my machine too small (core 2 duo 2.93 2GB RAM Ubuntu 10.04) for such a
test?

I really have some interesting sets of data to test indexes with and I want
to make a comparison between ordinary indexes and bitmap indexes.

Thank you,
Dragos

On Mon, Nov 8, 2010 at 6:42 PM, Stu Hood stu.h...@rackspace.com wrote:

 Dragos,

 How many SSTables did you have on disk, and were any of your index
 expressions GT(E)/LT(E)?

 I expect that you are bumping into a limitation of the current
 implementation: it opens up to 128 file-handles per SSTable in the worst
 case for a GT/LT query (one per index bucket).

 A future version might remove that requirement, but for now, you should
 probably bump the file handle limit on your machine to at least 2^16.

 Thanks,
 Stu


 -Original Message-
 From: dragos cernahoschi dragos.cernahos...@gmail.com
 Sent: Monday, November 8, 2010 10:05am
 To: dev@cassandra.apache.org
 Subject: CASSANDRA-1472 (bitmap indexes)

 Hi,

 I've got an exception during the following test:

 test machine: core 2 duo 2.93 2GB RAM Ubuntu 10.04

 test scenario:
 - 1 column family
 - about 15 columns
 - 7 indexed columns (bitmap)
 - 26 million rows (insert operation went fine)
 - thrift query on 3 of the indexed columns with get_indexed_slices
 (count:
 100)
 - got the following exception:

 10/11/08 17:52:40 ERROR service.AbstractCassandraDaemon: Fatal exception in
 thread Thread[ReadStage:3,5,main]
 java.io.IOError: java.io.FileNotFoundException:
 /home/dragos/cassandra/data/keyspace/visit-e-814-4-Bitidx.db (Too many open
 files)
at

 org.apache.cassandra.io.sstable.bitidx.SegmentIterator.open(SegmentIterator.java:78)
at

 org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.openBin(BitmapIndexReader.java:226)
at

 org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.iterator(BitmapIndexReader.java:214)
at
 org.apache.cassandra.io.sstable.SSTableReader.scan(SSTableReader.java:523)
at

 org.apache.cassandra.db.secindex.KeysBitmapIndex.iterator(KeysBitmapIndex.java:103)
at
 org.apache.cassandra.db.ColumnFamilyStore.scan(ColumnFamilyStore.java:1371)
at

 org.apache.cassandra.service.IndexScanVerbHandler.doVerb(IndexScanVerbHandler.java:41)
at

 org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:51)
at

 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at

 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.FileNotFoundException:
 /home/dragos/cassandra/data/keyspace/visit-e-814-4-Bitidx.db (Too many open
 files)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.init(FileInputStream.java:106)
at
 org.apache.avro.file.SeekableFileInput.init(SeekableFileInput.java:29)
at org.apache.avro.file.DataFileReader.init(DataFileReader.java:38)
at

 org.apache.cassandra.io.sstable.bitidx.SegmentIterator.open(SegmentIterator.java:72)
... 10 more
 10/11/08 17:52:40 ERROR service.AbstractCassandraDaemon: Fatal exception in
 thread Thread[ReadStage:2,5,main]
 java.io.IOError: java.io.FileNotFoundException:
 /home/dragos/cassandra/data/keyspace/visit-e-1018-Index.db (Too many open
 files)
at

 org.apache.cassandra.io.util.BufferedSegmentedFile.getSegment(BufferedSegmentedFile.java:68)
at

 org.apache.cassandra.io.util.SegmentedFile$SegmentIterator.next(SegmentedFile.java:129)
at

 org.apache.cassandra.io.util.SegmentedFile$SegmentIterator.next(SegmentedFile.java:1)
at

 org.apache.cassandra.io.sstable.SSTableReader.getPosition(SSTableReader.java:455)
at

 org.apache.cassandra.io.sstable.SSTableReader.getFileDataInput(SSTableReader.java:572)
at

 org.apache.cassandra.db.columniterator.SSTableSliceIterator.init(SSTableSliceIterator.java:49)
at

 org.apache.cassandra.db.filter.SliceQueryFilter.getSSTableColumnIterator(SliceQueryFilter.java:72)
at

 org.apache.cassandra.db.filter.QueryFilter.getSSTableColumnIterator(QueryFilter.java:84

Re: CASSANDRA-1472 (bitmap indexes)

2010-11-09 Thread Stu Hood
Can you tell me a little bit about your key distribution? How many unique 
values are indexed (the cardinality)?

Until the OrBiC projection I mention on 1472 is implemented, the bitmap 
secondary indexes will perform terribly for high cardinality datasets.

Thanks!


-Original Message-
From: dragos cernahoschi dragos.cernahos...@gmail.com
Sent: Tuesday, November 9, 2010 10:14am
To: dev@cassandra.apache.org
Subject: Re: CASSANDRA-1472 (bitmap indexes)

Meantime the number of SSTable(s) reduced to just 7. Initially the
compaction thread suffered the same problem of too many open files and
couldn't do any compaction.

But I'm still not able to run my tests: TimedOutException :(

On Tue, Nov 9, 2010 at 5:51 PM, Stu Hood stu.h...@rackspace.com wrote:

 Hmm, 500 sstables is definitely a degenerate case: did you disable
 compaction? By default, Cassandra strives to keep the sstable count below
 ~32, since accesses to separate sstables require seeks.

 In this case, the query will seek 500 times to check the secondary index
 for each sstable: if it finds matches it will need to seek to find them in
 the primary index, and seek again for the data file.

 -Original Message-
 From: dragos cernahoschi dragos.cernahos...@gmail.com
 Sent: Tuesday, November 9, 2010 5:33am
 To: dev@cassandra.apache.org
 Subject: Re: CASSANDRA-1472 (bitmap indexes)

 There are about 500 SSTables (12GB of data including index data,
 statistics...) The source data file had about 3GB/26 million rows.

 I only test with EQ expressions for now.

 Increasing the file limit resolved the problem, but now I'm getting
 TimedOutException(s) from thrift when querying even with slice size of 1.
 Is my machine too small (core 2 duo 2.93 2GB RAM Ubuntu 10.04) for such a
 test?

 I really have some interesting sets of data to test indexes with and I want
 to make a comparison between ordinary indexes and bitmap indexes.

 Thank you,
 Dragos

 On Mon, Nov 8, 2010 at 6:42 PM, Stu Hood stu.h...@rackspace.com wrote:

  Dragos,
 
  How many SSTables did you have on disk, and were any of your index
  expressions GT(E)/LT(E)?
 
  I expect that you are bumping into a limitation of the current
  implementation: it opens up to 128 file-handles per SSTable in the worst
  case for a GT/LT query (one per index bucket).
 
  A future version might remove that requirement, but for now, you should
  probably bump the file handle limit on your machine to at least 2^16.
 
  Thanks,
  Stu
 
 
  -Original Message-
  From: dragos cernahoschi dragos.cernahos...@gmail.com
  Sent: Monday, November 8, 2010 10:05am
  To: dev@cassandra.apache.org
  Subject: CASSANDRA-1472 (bitmap indexes)
 
  Hi,
 
  I've got an exception during the following test:
 
  test machine: core 2 duo 2.93 2GB RAM Ubuntu 10.04
 
  test scenario:
  - 1 column family
  - about 15 columns
  - 7 indexed columns (bitmap)
  - 26 million rows (insert operation went fine)
  - thrift query on 3 of the indexed columns with get_indexed_slices
  (count:
  100)
  - got the following exception:
 
  10/11/08 17:52:40 ERROR service.AbstractCassandraDaemon: Fatal exception
 in
  thread Thread[ReadStage:3,5,main]
  java.io.IOError: java.io.FileNotFoundException:
  /home/dragos/cassandra/data/keyspace/visit-e-814-4-Bitidx.db (Too many
 open
  files)
 at
 
 
 org.apache.cassandra.io.sstable.bitidx.SegmentIterator.open(SegmentIterator.java:78)
 at
 
 
 org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.openBin(BitmapIndexReader.java:226)
 at
 
 
 org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.iterator(BitmapIndexReader.java:214)
 at
 
 org.apache.cassandra.io.sstable.SSTableReader.scan(SSTableReader.java:523)
 at
 
 
 org.apache.cassandra.db.secindex.KeysBitmapIndex.iterator(KeysBitmapIndex.java:103)
 at
 
 org.apache.cassandra.db.ColumnFamilyStore.scan(ColumnFamilyStore.java:1371)
 at
 
 
 org.apache.cassandra.service.IndexScanVerbHandler.doVerb(IndexScanVerbHandler.java:41)
 at
 
 
 org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:51)
 at
 
 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at
 
 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
  Caused by: java.io.FileNotFoundException:
  /home/dragos/cassandra/data/keyspace/visit-e-814-4-Bitidx.db (Too many
 open
  files)
 at java.io.FileInputStream.open(Native Method)
 at java.io.FileInputStream.init(FileInputStream.java:106)
 at
  org.apache.avro.file.SeekableFileInput.init(SeekableFileInput.java:29)
 at org.apache.avro.file.DataFileReader.init(DataFileReader.java:38)
 at
 
 
 org.apache.cassandra.io.sstable.bitidx.SegmentIterator.open(SegmentIterator.java:72)
 ... 10 more
  10/11/08 17:52:40 ERROR service.AbstractCassandraDaemon: Fatal exception
 in
  thread Thread[ReadStage:2,5,main]
  java.io.IOError

RE: CASSANDRA-1472 (bitmap indexes)

2010-11-08 Thread Stu Hood
Dragos,

How many SSTables did you have on disk, and were any of your index expressions 
GT(E)/LT(E)?

I expect that you are bumping into a limitation of the current implementation: 
it opens up to 128 file-handles per SSTable in the worst case for a GT/LT query 
(one per index bucket).

A future version might remove that requirement, but for now, you should 
probably bump the file handle limit on your machine to at least 2^16.

Thanks,
Stu


-Original Message-
From: dragos cernahoschi dragos.cernahos...@gmail.com
Sent: Monday, November 8, 2010 10:05am
To: dev@cassandra.apache.org
Subject: CASSANDRA-1472 (bitmap indexes)

Hi,

I've got an exception during the following test:

test machine: core 2 duo 2.93 2GB RAM Ubuntu 10.04

test scenario:
- 1 column family
- about 15 columns
- 7 indexed columns (bitmap)
- 26 million rows (insert operation went fine)
- thrift query on 3 of the indexed columns with get_indexed_slices (count:
100)
- got the following exception:

10/11/08 17:52:40 ERROR service.AbstractCassandraDaemon: Fatal exception in
thread Thread[ReadStage:3,5,main]
java.io.IOError: java.io.FileNotFoundException:
/home/dragos/cassandra/data/keyspace/visit-e-814-4-Bitidx.db (Too many open
files)
at
org.apache.cassandra.io.sstable.bitidx.SegmentIterator.open(SegmentIterator.java:78)
at
org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.openBin(BitmapIndexReader.java:226)
at
org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.iterator(BitmapIndexReader.java:214)
at
org.apache.cassandra.io.sstable.SSTableReader.scan(SSTableReader.java:523)
at
org.apache.cassandra.db.secindex.KeysBitmapIndex.iterator(KeysBitmapIndex.java:103)
at
org.apache.cassandra.db.ColumnFamilyStore.scan(ColumnFamilyStore.java:1371)
at
org.apache.cassandra.service.IndexScanVerbHandler.doVerb(IndexScanVerbHandler.java:41)
at
org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:51)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: java.io.FileNotFoundException:
/home/dragos/cassandra/data/keyspace/visit-e-814-4-Bitidx.db (Too many open
files)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.init(FileInputStream.java:106)
at
org.apache.avro.file.SeekableFileInput.init(SeekableFileInput.java:29)
at org.apache.avro.file.DataFileReader.init(DataFileReader.java:38)
at
org.apache.cassandra.io.sstable.bitidx.SegmentIterator.open(SegmentIterator.java:72)
... 10 more
10/11/08 17:52:40 ERROR service.AbstractCassandraDaemon: Fatal exception in
thread Thread[ReadStage:2,5,main]
java.io.IOError: java.io.FileNotFoundException:
/home/dragos/cassandra/data/keyspace/visit-e-1018-Index.db (Too many open
files)
at
org.apache.cassandra.io.util.BufferedSegmentedFile.getSegment(BufferedSegmentedFile.java:68)
at
org.apache.cassandra.io.util.SegmentedFile$SegmentIterator.next(SegmentedFile.java:129)
at
org.apache.cassandra.io.util.SegmentedFile$SegmentIterator.next(SegmentedFile.java:1)
at
org.apache.cassandra.io.sstable.SSTableReader.getPosition(SSTableReader.java:455)
at
org.apache.cassandra.io.sstable.SSTableReader.getFileDataInput(SSTableReader.java:572)
at
org.apache.cassandra.db.columniterator.SSTableSliceIterator.init(SSTableSliceIterator.java:49)
at
org.apache.cassandra.db.filter.SliceQueryFilter.getSSTableColumnIterator(SliceQueryFilter.java:72)
at
org.apache.cassandra.db.filter.QueryFilter.getSSTableColumnIterator(QueryFilter.java:84)
at
org.apache.cassandra.db.ColumnFamilyStore.getTopLevelColumns(ColumnFamilyStore.java:1190)
at
org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1082)
at
org.apache.cassandra.db.ColumnFamilyStore.getColumnFamily(ColumnFamilyStore.java:1052)
at
org.apache.cassandra.db.ColumnFamilyStore.scan(ColumnFamilyStore.java:1378)
at
org.apache.cassandra.service.IndexScanVerbHandler.doVerb(IndexScanVerbHandler.java:41)
at
org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:51)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: java.io.FileNotFoundException:
/home/dragos/cassandra/data/keyspace/visit-e-1018-Index.db (Too many open
files)
at java.io.RandomAccessFile.open(Native Method)
at java.io.RandomAccessFile.init(RandomAccessFile.java:212)
at java.io.RandomAccessFile.init(RandomAccessFile.java:98)
at
org.apache.cassandra.io.util.BufferedRandomAccessFile.init(BufferedRandomAccessFile.java:142)
at
org.apache.cassandra.io.util.BufferedSegmentedFile.getSegment(BufferedSegmentedFile.java:62)
... 16 more

The same test worked 

Re: Calling all library maintainers

2010-11-05 Thread Stu Hood
 Java you serialize a type to a byte[] whereas with the query
 language you'd serialize to a string term
The serializing to a byte[] part is what the RPC libraries exist for. With a 
string serialization format, you are setting all of your clients up to become 
string concatenation engines with an ad-hoc format defined by your spec: 
essentially, duplicating Avro and Thrift.

 TIMEUUID(timestamp)
Note that this same approach is possible in Avro by adding a union type: it is 
not dependent on String serialization.

 to serialize that to a string like
 10L, than it would be to pack a binary string in network-order
I don't think you are giving client library devs enough credit: this only needs 
to be implemented once, and I'm sure they're capable.


-Original Message-
From: Eric Evans eev...@rackspace.com
Sent: Thursday, November 4, 2010 2:59pm
To: client-dev@cassandra.apache.org
Subject: Re: Calling all library maintainers

On Thu, 2010-11-04 at 21:28 +0200, Ran Tavory wrote:
 A QL can shield clients from a class of changes, but OTOH will make
 clients have to compose the query strings, where with type safe
 libraries this job is somewhat easier. IMO in the near term
 introducing a query language will make client dev somewhat harder b/c
 of the (somewhat negligible) work of composing query strings and
 mostly b/c I don't expect the QL to be stable at v1 so still a moving
 target, but easier in the the long term mainly due to the hope that
 the QL will stabilize.

I think you could argue that it makes all of this easier.  Right now
from Java you serialize a type to a byte[] whereas with the query
language you'd serialize to a string term.  That's a bit more effort out
of the gate for primitives like long for example, but consider the
venerable TimeUUID that causes so much frustration.  I think it would be
much easier to take a timestamp and construct a term like
TIMEUUID(timestamp) (or whatever), especially since that would work
identically across all clients.

And it's also worth pointing out that not all languages in use are
statically typed, so even in the case of an int, or a long, it'd be
easier (or as easy at least), to serialize that to a string like
10L, than it would be to pack a binary string in network-order.

As for not being stable, well, yeah it's going to need to bake a bit
before being suitable for widespread use, but I raise it here not to
encourage everyone to transition now, but so that you can help shape the
outcome (if you're interested, of course).

 One other benefit of query languages is that they make tooling a
 little easier, one does not have to come up with a specific CLI
 interpreter or a web interface with a set of input fields, you just
 have to type your QL into a text box or a terminal like you do with
 sql.
 Long term I think I'm in for a QL (although I have to think about the
 syntax you suggested) but I don't expect it to benefit client devs in
 the near term even if it was ready today as an alternative to thrift.
 
 One small question, does this language tunnel through avro or thrift
 calls? (Is  conn.execute() an avro or thrift call) 

It's avro for the simple reason that that's still sort of an
experimental code path and seemed a less controverial sandbox.  When the
spec and implementation are complete, and if it gains suitable traction,
I'd actually like to explore a customized transport and serialization.

-- 
Eric Evans
eev...@rackspace.com





Re: more info CASSANDRA-1472

2010-11-01 Thread Stu Hood
trunk @ r1029546

-Original Message-
From: dragos cernahoschi dragos.cernahos...@gmail.com
Sent: Monday, November 1, 2010 12:20pm
To: dev@cassandra.apache.org
Subject: Re: more info CASSANDRA-1472

I can't apply the patches on the latest trunk. Which trunk version
(checksum) did you rebased?

Thank you!

On Mon, Nov 1, 2010 at 6:50 AM, Stu Hood stu.h...@rackspace.com wrote:

 I can't reproduce this, but I've posted a rebased version on #1472 that you
 can try out. Thanks for trying it out!


 -Original Message-
 From: dragos cernahoschi dragos.cernahos...@gmail.com
 Sent: Friday, October 29, 2010 10:38am
 To: dev@cassandra.apache.org
 Subject: more info CASSANDRA-1472

 The stress.py command:

 1. python contrib/py_stress/stress.py -C 32 -x keys_bitmap

 2. more info from stack trace:

 10/10/29 18:15:28 INFO db.Memtable: Writing
 memtable-standa...@23048841(17806140
 bytes, 349140 operations)
 10/10/29 18:15:29 INFO service.GCInspector: GC for PS MarkSweep: 789 ms,
 7178512 reclaimed leaving 320182640 used; max is 1383202816
 10/10/29 18:15:31 ERROR service.AbstractCassandraDaemon: Fatal exception in
 thread Thread[FlushWriter:1,5,main]
 java.lang.RuntimeException: java.lang.ArrayIndexOutOfBoundsException: 1
at
 org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34)
at
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at

 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at

 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
 Caused by: java.lang.ArrayIndexOutOfBoundsException: 1
at
 org.apache.avro.io.ResolvingDecoder.readEnum(ResolvingDecoder.java:177)
at

 org.apache.avro.generic.GenericDatumReader.readEnum(GenericDatumReader.java:172)
at

 org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:115)
at

 org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:118)
at

 org.apache.avro.generic.GenericDatumReader.readRecord(GenericDatumReader.java:142)
at

 org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:114)
at

 org.apache.avro.generic.GenericDatumReader.readRecord(GenericDatumReader.java:142)
at

 org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:114)
at

 org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:105)
at

 org.apache.cassandra.io.SerDeUtils.deserializeWithSchema(SerDeUtils.java:112)
at

 org.apache.cassandra.io.sstable.bitidx.BitmapIndexReader.open(BitmapIndexReader.java:87)
at

 org.apache.cassandra.io.sstable.SSTableWriter.closeAndOpenReader(SSTableWriter.java:196)
at

 org.apache.cassandra.io.sstable.SSTableWriter.closeAndOpenReader(SSTableWriter.java:178)
at
 org.apache.cassandra.db.Memtable.writeSortedContents(Memtable.java:160)
at org.apache.cassandra.db.Memtable.access$1(Memtable.java:152)
at org.apache.cassandra.db.Memtable$1.runMayThrow(Memtable.java:172)
at
 org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
... 6 more
 10/10/29 18:15:38 INFO service.GCInspector: GC for PS MarkSweep: 896 ms,
 72939816 reclaimed leaving 341851792 used; max is 1359740928
 10/10/29 18:15:38 INFO service.GCInspector: GC for PS Scavenge: 312 ms,
 188201560 reclaimed leaving 414791608 used; max is 1359740928
 10/10/29 18:15:40 INFO db.ColumnFamilyStore: switching in a fresh Memtable
 for Standard1 at

 CommitLogContext(file='/home/dragos/cassandra/commitlog/CommitLog-1288365280350.log',
 position=111458032)
 10/10/29 18:15:40 INFO db.ColumnFamilyStore: Enqueuing flush of
 memtable-standa...@25762909(17806140 bytes, 349140 operations)
 10/10/29 18:15:40 INFO db.Memtable: Writing
 memtable-standa...@25762909(17806140
 bytes, 349140 operations)
 10/10/29 18:15:43 INFO service.GCInspector: GC for PS MarkSweep: 1309 ms,
 293768 reclaimed leaving 456751520 used; max is 1349451776
 10/10/29 18:15:43 INFO service.GCInspector: Pool Name
 Active   Pending
 10/10/29 18:15:43 INFO service.GCInspector:
 ResponseStage 0 0
 10/10/29 18:15:43 INFO service.GCInspector:
 ReadStage 0 0
 10/10/29 18:15:43 INFO service.GCInspector:
 ReadRepair0 0
 10/10/29 18:15:43 INFO service.GCInspector: MutationStage
 3249
 10/10/29 18:15:43 INFO service.GCInspector:
 GossipStage   0 0
 10/10/29 18:15:43 INFO service.GCInspector:
 AntientropyStage  0 0
 10/10/29 18:15:43 INFO service.GCInspector:
 MigrationStage0 0
 10/10/29 18:15:43 INFO service.GCInspector:
 StreamStage   0 0
 10/10/29 18:15:43 INFO service.GCInspector

RE: [VOTE] 0.7.0 beta3

2010-10-31 Thread Stu Hood
+1

-Original Message-
From: Eric Evans eev...@rackspace.com
Sent: Thursday, October 28, 2010 10:55am
To: dev@cassandra.apache.org
Subject: [VOTE] 0.7.0 beta3


The RC1 vote[1] was vetoed due to, (among other things), the desire to
see more testing after the somewhat disruptive changes made in
CASSANDRA-1367[2].  Thus, I propose the follow artifacts for release as
beta3.

SVN:
https://svn.apache.org/repos/asf/cassandra/branches/cassandra-...@r1028349
0.7.0-beta3 artifacts: http://people.apache.org/~eevans

The vote will be open for the usual 72 hours.

[1]: http://article.gmane.org/gmane.comp.db.cassandra.devel/2346
[2]: https://issues.apache.org/jira/browse/CASSANDRA-1367
[3]: http://goo.gl/o0lQ (CHANGES.txt)
[4]: http://goo.gl/wFax (NEWS.txt)

-- 
Eric Evans
eev...@rackspace.com







Re: NoSQL, YesCQL?

2010-10-29 Thread Stu Hood
Let me preface this by saying: the fact that it looks like SQL doesn't bother 
me. But: I think this would be a terrible thing to _focus_ on. Something for 
contrib? Sure.

Most reasonable languages these days have a way to define what looks like a 
DSL: giving people a text DSL which is subject to injection attacks and can't 
be type checked without support from a client driver anyway is brain dead. 
There is a reason that 99% of clients that interact with a SQL server do so 
programmatically via a library: because that is an easier way to interact than 
via string concatenation. At least that way the library is performing the 
concatenation for you.

Regarding performance: assuming optimized RPC libraries (which we do not yet 
have in Avro, and which Thrift is getting better at), serializing to a string 
and back will never be as performant as using a pre-parsed representation of 
the statement on both sides. Oh but we can add prepared statements! Poppycock.



 Truth is we've seen quite a bit of proliferation, and very little critical 
 mass.
 forced to move in lock-step with Cassandra releases, that are never backward 
 compatible
Expecting stable libraries or API pre-1.0 is not reasonable: neither the devs 
nor the users expect stability, and yet we are still able to guarantee 
compatibility within major versions.

The stated problem is that backwards compatibility is hard to provide: if that 
is the core complaint, then changing to a text based serialization format with 
a sexy name in order to add backwards compatibility is a severe overreaction to 
the problem. Instead, I would propose evolving the API in a manner that 
simplifies it.

Thanks,
Stu


-Original Message-
From: Eric Evans eev...@rackspace.com
Sent: Thursday, October 28, 2010 4:56pm
To: dev@cassandra.apache.org
Subject: Re: NoSQL, YesCQL?

On Thu, 2010-10-28 at 14:46 -0700, Chip Salzenberg wrote:
 Short answer: YES Please, but we will still want a side channel for
 minimum overhead.

Ok.  Though I'm not sure I agree with the minimum overhead argument.

I've only done preliminary tests so far, but this seems on par (if not a
little faster) than Thrift is, and no effort to optimize has been made
(using a socket transport instead of HTTP for example would make a big
difference).

 Long answer: Query languages only work reliably when you have data
 binding assistance (insert Bobby Tables xkcd here).  However, they
 do
 have the wonderful property of evolving aggressively without requiring
 upgrades of the driver plumbing.  This is, of course, emphatically
 *not*
 true of anything like the current Thrift and Avro interfaces.  So
 that's
 why I say Yes.  On the other hand, a very simple interface for very
 simple queries has a lot of value, too; see, for example,
 http://yoshinorimatsunobu.blogspot.com/2010/10/using-mysql-as-nosql-story-for.html
   
 So that's why I think we will still want to bypass the full language
 for minimum latency in some circumstances. 

I'm also not sure I follow here either.  Do you mean simple for the
application developer?  CQL-using code should be considerably simpler
than the equivalent Thrift/Avro.

Thanks for the feedback!

-- 
Eric Evans
eev...@rackspace.com





RE: [VOTE] 0.7.0 RC1

2010-10-23 Thread Stu Hood
-1

bin/config-converter failed: see 
https://issues.apache.org/jira/browse/CASSANDRA-1655

-Original Message-
From: Eric Evans eev...@rackspace.com
Sent: Friday, October 22, 2010 10:28pm
To: dev@cassandra.apache.org
Subject: [VOTE] 0.7.0 RC1


We're closing in on a final; it shouldn't be long now.  I propose the
following for 0.7.0 Release Candidate 1.

SVN:
https://svn.apache.org/repos/asf/cassandra/branches/cassandra-...@r1026517
0.7.0-rc1 artifacts: http://people.apache.org/~eevans

The vote will be open for the usual 72 hours.


[1]: http://goo.gl/zLY0 (CHANGES.txt)
[2]: http://goo.gl/U5Fo (NEWS.txt)

-- 
Eric Evans
eev...@rackspace.com





RE: [VOTE] 0.7.0 RC1

2010-10-23 Thread Stu Hood
I was able to reproduce it with:
 bin/cassandra -f
 bin/cassandra-cli --host localhost --port 9160

Client prints:
 Exception retrieving information about the cassandra node, check you
 have connected to the thrift port.

Server prints:
 TProtocolException: Missing version in readMessageBegin, old client?

-Original Message-
From: Eric Evans eev...@rackspace.com
Sent: Saturday, October 23, 2010 11:49am
To: dev@cassandra.apache.org
Subject: RE: [VOTE] 0.7.0 RC1

On Sat, 2010-10-23 at 11:44 -0500, Stu Hood wrote:
 Also, client connections fail immediately with a version mismatch:
 something is not kosher with the Thrift build. Poor cats.

Not seeing this, the system tests pass here.

-- 
Eric Evans
eev...@rackspace.com





Re: [VOTE] 0.7.0 RC1

2010-10-23 Thread Stu Hood
And as the last variable, the src artifact builds and does not exhibit the 
problem, so this is very likely a problem with the classes in the bin artifact.

-Original Message-
From: Stu Hood stu.h...@rackspace.com
Sent: Saturday, October 23, 2010 1:59pm
To: dev@cassandra.apache.org
Subject: Re: [VOTE] 0.7.0 RC1

I'm only seeing the problem with the artifacts.

-Original Message-
From: Jonathan Ellis jbel...@gmail.com
Sent: Saturday, October 23, 2010 1:51pm
To: dev dev@cassandra.apache.org
Subject: Re: [VOTE] 0.7.0 RC1

Are you seeing this just on the artifacts or on trunk?

Trunk gives me

$ bin/cassandra-cli --host localhost
Connected to: Test Cluster on localhost/9160
[defa...@unknown] create keyspace Keyspace1
80379cab-ded6-11df-b235-e700f669bcfc


On Sat, Oct 23, 2010 at 11:44 AM, Stu Hood stu.h...@rackspace.com wrote:
 Also, client connections fail immediately with a version mismatch: something 
 is not kosher with the Thrift build. Poor cats.

 -Original Message-
 From: Stu Hood stu.h...@rackspace.com
 Sent: Saturday, October 23, 2010 11:28am
 To: dev@cassandra.apache.org
 Subject: RE: [VOTE] 0.7.0 RC1

 -1

 bin/config-converter failed: see 
 https://issues.apache.org/jira/browse/CASSANDRA-1655

 -Original Message-
 From: Eric Evans eev...@rackspace.com
 Sent: Friday, October 22, 2010 10:28pm
 To: dev@cassandra.apache.org
 Subject: [VOTE] 0.7.0 RC1


 We're closing in on a final; it shouldn't be long now.  I propose the
 following for 0.7.0 Release Candidate 1.

 SVN:
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-...@r1026517
 0.7.0-rc1 artifacts: http://people.apache.org/~eevans

 The vote will be open for the usual 72 hours.


 [1]: http://goo.gl/zLY0 (CHANGES.txt)
 [2]: http://goo.gl/U5Fo (NEWS.txt)

 --
 Eric Evans
 eev...@rackspace.com









-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com






Re: [VOTE] 0.7.0 RC1

2010-10-23 Thread Stu Hood
Ack. This was a framed transport problem... the config converter will leave 
people in a state where the cli won't connect by default. I guess it's a known 
issue.

Sorry! 

-Original Message-
From: Stu Hood stu.h...@rackspace.com
Sent: Saturday, October 23, 2010 2:03pm
To: dev@cassandra.apache.org
Subject: Re: [VOTE] 0.7.0 RC1

And as the last variable, the src artifact builds and does not exhibit the 
problem, so this is very likely a problem with the classes in the bin artifact.

-Original Message-
From: Stu Hood stu.h...@rackspace.com
Sent: Saturday, October 23, 2010 1:59pm
To: dev@cassandra.apache.org
Subject: Re: [VOTE] 0.7.0 RC1

I'm only seeing the problem with the artifacts.

-Original Message-
From: Jonathan Ellis jbel...@gmail.com
Sent: Saturday, October 23, 2010 1:51pm
To: dev dev@cassandra.apache.org
Subject: Re: [VOTE] 0.7.0 RC1

Are you seeing this just on the artifacts or on trunk?

Trunk gives me

$ bin/cassandra-cli --host localhost
Connected to: Test Cluster on localhost/9160
[defa...@unknown] create keyspace Keyspace1
80379cab-ded6-11df-b235-e700f669bcfc


On Sat, Oct 23, 2010 at 11:44 AM, Stu Hood stu.h...@rackspace.com wrote:
 Also, client connections fail immediately with a version mismatch: something 
 is not kosher with the Thrift build. Poor cats.

 -Original Message-
 From: Stu Hood stu.h...@rackspace.com
 Sent: Saturday, October 23, 2010 11:28am
 To: dev@cassandra.apache.org
 Subject: RE: [VOTE] 0.7.0 RC1

 -1

 bin/config-converter failed: see 
 https://issues.apache.org/jira/browse/CASSANDRA-1655

 -Original Message-
 From: Eric Evans eev...@rackspace.com
 Sent: Friday, October 22, 2010 10:28pm
 To: dev@cassandra.apache.org
 Subject: [VOTE] 0.7.0 RC1


 We're closing in on a final; it shouldn't be long now.  I propose the
 following for 0.7.0 Release Candidate 1.

 SVN:
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-...@r1026517
 0.7.0-rc1 artifacts: http://people.apache.org/~eevans

 The vote will be open for the usual 72 hours.


 [1]: http://goo.gl/zLY0 (CHANGES.txt)
 [2]: http://goo.gl/U5Fo (NEWS.txt)

 --
 Eric Evans
 eev...@rackspace.com









-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com








Re: [VOTE] 0.7.0 RC1

2010-10-23 Thread Stu Hood
Sounds reasonable: https://issues.apache.org/jira/browse/CASSANDRA-1659

-Original Message-
From: Jonathan Ellis jbel...@gmail.com
Sent: Saturday, October 23, 2010 5:08pm
To: dev dev@cassandra.apache.org
Subject: Re: [VOTE] 0.7.0 RC1

Maybe we should set the new value to unframed always and include a
warning if that's not what their 0.6 value was.

On Sat, Oct 23, 2010 at 2:08 PM, Stu Hood stu.h...@rackspace.com wrote:
 Ack. This was a framed transport problem... the config converter will leave 
 people in a state where the cli won't connect by default. I guess it's a 
 known issue.

 Sorry!

 -Original Message-
 From: Stu Hood stu.h...@rackspace.com
 Sent: Saturday, October 23, 2010 2:03pm
 To: dev@cassandra.apache.org
 Subject: Re: [VOTE] 0.7.0 RC1

 And as the last variable, the src artifact builds and does not exhibit the 
 problem, so this is very likely a problem with the classes in the bin 
 artifact.

 -Original Message-
 From: Stu Hood stu.h...@rackspace.com
 Sent: Saturday, October 23, 2010 1:59pm
 To: dev@cassandra.apache.org
 Subject: Re: [VOTE] 0.7.0 RC1

 I'm only seeing the problem with the artifacts.

 -Original Message-
 From: Jonathan Ellis jbel...@gmail.com
 Sent: Saturday, October 23, 2010 1:51pm
 To: dev dev@cassandra.apache.org
 Subject: Re: [VOTE] 0.7.0 RC1

 Are you seeing this just on the artifacts or on trunk?

 Trunk gives me

 $ bin/cassandra-cli --host localhost
 Connected to: Test Cluster on localhost/9160
 [defa...@unknown] create keyspace Keyspace1
 80379cab-ded6-11df-b235-e700f669bcfc


 On Sat, Oct 23, 2010 at 11:44 AM, Stu Hood stu.h...@rackspace.com wrote:
 Also, client connections fail immediately with a version mismatch: something 
 is not kosher with the Thrift build. Poor cats.

 -Original Message-
 From: Stu Hood stu.h...@rackspace.com
 Sent: Saturday, October 23, 2010 11:28am
 To: dev@cassandra.apache.org
 Subject: RE: [VOTE] 0.7.0 RC1

 -1

 bin/config-converter failed: see 
 https://issues.apache.org/jira/browse/CASSANDRA-1655

 -Original Message-
 From: Eric Evans eev...@rackspace.com
 Sent: Friday, October 22, 2010 10:28pm
 To: dev@cassandra.apache.org
 Subject: [VOTE] 0.7.0 RC1


 We're closing in on a final; it shouldn't be long now.  I propose the
 following for 0.7.0 Release Candidate 1.

 SVN:
 https://svn.apache.org/repos/asf/cassandra/branches/cassandra-...@r1026517
 0.7.0-rc1 artifacts: http://people.apache.org/~eevans

 The vote will be open for the usual 72 hours.


 [1]: http://goo.gl/zLY0 (CHANGES.txt)
 [2]: http://goo.gl/U5Fo (NEWS.txt)

 --
 Eric Evans
 eev...@rackspace.com









 --
 Jonathan Ellis
 Project Chair, Apache Cassandra
 co-founder of Riptano, the source for professional Cassandra support
 http://riptano.com










-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com




RE: [VOTE] 0.7.0 beta2 (attempt #3)

2010-09-28 Thread Stu Hood
+1
Tested upgrades from 0.6.5 and 0.7-beta1.

-Original Message-
From: Eric Evans eev...@rackspace.com
Sent: Tuesday, September 28, 2010 12:37pm
To: dev@cassandra.apache.org
Subject: [VOTE] 0.7.0 beta2 (attempt #3)

 It feels like 0.7.0-beta1 is becoming too distant a spec in
 the rear-view, and the delta[1] is becoming quite large.  I
 propose we vote to release a new beta.

 The previous vote was waved off, so here is a new one.

3rd times the charm?  We shall see.

SVN: https://svn.apache.org/repos/asf/cassandra/tr...@r1002257
0.7.0-beta2 artifacts: http://people.apache.org/~eevans

The vote will be open for 72 hours (less if there is another
veto).

[1]: https://svn.apache.org/repos/asf/cassandra/trunk/CHANGES.txt

-- 
Eric Evans
eev...@rackspace.com







Re: launch failure with current git (svn?)

2010-08-26 Thread Stu Hood
Hey Chip,

Sorry for the trouble! Jon has a patch on 
https://issues.apache.org/jira/browse/CASSANDRA-891 that fixes this issue, and 
should make it in tomorrow.

Thanks,
Stu

-Original Message-
From: Jonathan Ellis jbel...@gmail.com
Sent: Wednesday, August 25, 2010 9:33pm
To: dev@cassandra.apache.org
Subject: Re: launch failure with current git (svn?)

maybe adding the new default validator is breaking it?
https://issues.apache.org/jira/browse/CASSANDRA-891 in which case
blowing away system keyspace and re-initializing your schema should
fix it

adding new optional fields isn't supposed to break avro serialization tho :(

On Wed, Aug 25, 2010 at 8:39 PM, Chip Salzenberg c...@pobox.com wrote:
  I was using svn cassandra from about a week ago, and it worked for me; but
 then I updated to today's and got the below error after rebuild.
 Help, please: Should I apply some known config file change, or just revert,
 or ?
 The version failing is
 https://svn.apache.org/repos/asf/cassandra/tr...@989392
 PS: I'm working on Perl client code, so this really is a dev@ question.  :-)

 + exec /usr/bin/java -ea -XX:+UseThreadPriorities
 -XX:ThreadPriorityPolicy=42 -Xms2G -Xmx2G -XX:+HeapDumpOnOutOfMemoryError
 -Xss128k -XX:+UseParNewGC -XX:+UseConcMarkSweepGC
 -XX:+CMSParallelRemarkEnabled -XX:SurvivorRatio=8 -XX:MaxTenuringThreshold=1
 -XX:CMSInitiatingOccupancyFraction=80
 -Dcom.sun.management.jmxremote.port=8089
 -Dcom.sun.management.jmxremote.ssl=false
 -Dcom.sun.management.jmxremote.authenticate=false
 -Dlog4j.configuration=log4j-server.properties -Dcassandra-foreground=yes -cp
 /home/chip/cassandra/conf:bin/../build/classes:bin/../lib/antlr-3.1.3.jar:bin/../lib/avro-1.4.0-sources~r986959.jar:bin/../lib/avro-1.4.0~r986959.jar:bin/../lib/clhm-production.jar:bin/../lib/commons-cli-1.1.jar:bin/../lib/commons-codec-1.2.jar:bin/../lib/commons-collections-3.2.1.jar:bin/../lib/commons-lang-2.4.jar:bin/../lib/guava-r05.jar:bin/../lib/hadoop-core-0.20.1.jar:bin/../lib/high-scale-lib.jar:bin/../lib/jackson-core-asl-1.4.0.jar:bin/../lib/jackson-mapper-asl-1.4.0.jar:bin/../lib/jetty-6.1.21.jar:bin/../lib/jetty-util-6.1.21.jar:bin/../lib/jline-0.9.94.jar:bin/../lib/json-simple-1.1.jar:bin/../lib/jug-2.0.0.jar:bin/../lib/libthrift-r959516.jar:bin/../lib/log4j-1.2.16.jar:bin/../lib/servlet-api-2.5-20081211.jar:bin/../lib/slf4j-api-1.5.8.jar:bin/../lib/slf4j-log4j12-1.5.8.jar:bin/../lib/snakeyaml-1.6.jar
 org.apache.cassandra.thrift.CassandraDaemon
  INFO 18:35:49,194 JNA not found. Native methods will be disabled.
  INFO 18:35:49,299 DiskAccessMode 'auto' determined to be mmap,
 indexAccessMode is mmap
  INFO 18:35:49,403 Sampling index for
 /var/lib/cassandra/precious_roy/data/system/Statistics-e-1-
  INFO 18:35:49,417 Sampling index for
 /var/lib/cassandra/precious_roy/data/system/Statistics-e-2-
  INFO 18:35:49,432 Sampling index for
 /var/lib/cassandra/precious_roy/data/system/Schema-e-5-
  INFO 18:35:49,437 Sampling index for
 /var/lib/cassandra/precious_roy/data/system/Migrations-e-5-
  INFO 18:35:49,441 Sampling index for
 /var/lib/cassandra/precious_roy/data/system/LocationInfo-e-1-
  INFO 18:35:49,442 Sampling index for
 /var/lib/cassandra/precious_roy/data/system/LocationInfo-e-2-
  INFO 18:35:49,470 Loading schema version
 29946d4a-afec-11df-ba37-e700f669bcfc
 ERROR 18:35:49,636 Exception encountered during startup.
 org.apache.avro.AvroTypeException: Found
 {type:record,name:CfDef,namespace:org.apache.cassandra.config.avro,fields:[{name:keyspace,type:string},{name:name,type:string},{name:column_type,type:[string,null]},{name:clock_type,type:[string,null]},{name:comparator_type,type:[string,null]},{name:subcomparator_type,type:[string,null]},{name:reconciler,type:[string,null]},{name:comment,type:[string,null]},{name:row_cache_size,type:[double,null]},{name:preload_row_cache,type:[boolean,null]},{name:key_cache_size,type:[double,null]},{name:read_repair_chance,type:[double,null]},{name:gc_grace_seconds,type:[int,null]},{name:column_metadata,type:[{type:array,items:{type:record,name:ColumnDef,fields:[{name:name,type:bytes},{name:validation_class,type:string},{name:index_type,type:[{type:enum,name:IndexType,symbols:[KEYS]},null]},{name:index_name,type:[string,null]}]}},null]},{name:id,type:[int,null]}]},
 expecting
 

RE: network compatibility from 0.6 to 0.7

2010-07-22 Thread Stu Hood
 So I think we should probably try to preserve compatibility in 0.7.
We could probably limit the scope of our network compatibility to 
steady-state messages, aka, nothing having to do with bootstrap, repair, etc.

-Original Message-
From: Stu Hood stu.h...@rackspace.com
Sent: Thursday, July 22, 2010 12:35pm
To: dev@cassandra.apache.org
Subject: RE: network compatibility from 0.6 to 0.7

I feel like the next time we break network compatibility should be the last 
time, aka, the release when we introduce a backwards compatible RPC layer 
(Avro?), and implement support for dropping messages that a node can't handle.

So I think we should probably try to preserve compatibility in 0.7.


-Original Message-
From: Jonathan Ellis jbel...@gmail.com
Sent: Thursday, July 22, 2010 11:49am
To: cassandra-...@incubator.apache.org
Subject: network compatibility from 0.6 to 0.7

How useful is this to insist on, given that 0.7 thrift api is fairly
incompatible with 0.6's?  (timestamp - Clock change being the biggest
problem there)

-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com






Re: Is SuperColumn necessary?

2010-05-05 Thread Stu Hood
Hey Ed,

I've been working on a similar approach for arbitarily nested/compound column 
names in #998. See: 
http://github.com/stuhood/cassandra/blob/998/src/java/org/apache/cassandra/db/ColumnKey.java

The goal is to provide native support and potentially (in the very long term), 
API support for nested/compound names. The difference between our approaches 
boils down to needing to define a comparator for every level in #998, versus 
having dynamic types per name in your approach.

Thanks,
Stu


-Original Message-
From: Ed Anuff e...@anuff.com
Sent: Wednesday, May 5, 2010 1:31pm
To: u...@cassandra.apache.org
Subject: Re: Is SuperColumn necessary?

Follow-up from last weeks discussion, I've been playing around with a simple
column comparator for composite column names that I put up on github.  I'd
be interested to hear what people think of this approach.

http://github.com/edanuff/CassandraCompositeType

Ed

On Wed, Apr 28, 2010 at 12:52 PM, Ed Anuff e...@anuff.com wrote:

 It might make sense to create a CompositeType subclass of AbstractType for
 the purpose of constructing and comparing these types of composite column
 names so that if you could more easily do that sort of thing rather than
 having to concatenate into one big string.


 On Wed, Apr 28, 2010 at 10:25 AM, Mike Malone m...@simplegeo.com wrote:

 The only thing SuperColumns appear to buy you (as someone pointed out to
 me at the Cassandra meetup - I think it was Eric Florenzano) is that you can
 use different comparator types for the Super/SubColumns, I guess..? But you
 should be able to do the same thing by creating your own Column comparator.
 I guess my point is that SuperColumns are mostly a convenience mechanism, as
 far as I can tell.

 Mike







Re: Bootstrap source code

2010-05-02 Thread Stu Hood
Nodes declare that they will be joining the ring at a particular position, 
which makes them a member of the 'pending ranges' set. Nodes with pending 
ranges are supposed to receive writes for those ranges, despite not officially 
owning them yet.


-Original Message-
From: Roger Schildmeijer schildmei...@gmail.com
Sent: Sunday, May 2, 2010 11:50am
To: dev@cassandra.apache.org
Subject: Re: Bootstrap source code

s/hinted handoff/read repair

 
 
 
 (Moved to developers mailing list)
 
 Without delve to deep into to that part of the code my educated(?) guess is 
 that this will (eventually) be solved/repaired thanks to hinted handoff and 
 anti entropy service.
 
 
 On 2 maj 2010, at 18.29em, Bill Hastings wrote:
 
 Hi
 
 I have looking at the bootstrap source and seem to understand it for the 
 most part. This is what I do not follow:
 
 (1) New node joins and doesn't advertise its token.
 (2) Requests nodes to send it data. The nodes that need to send it data 
 first flush memtables and then transfer SSTables.
 
 Once the streaming is over the new node advertises its token and starts 
 handling reads and writes correct? But what happens to keys that are being 
 written to the old nodes after the memtables have been dumped. Looks like 
 there is a window where keys would be going to the old nodes and not making 
 it to the new nodes. Am I missing something here.
 
 -- 
 Cheers
 Bill
 





RE: Grep for what?

2010-04-05 Thread Stu Hood
The actual process is `java`, which should have 'cassandra' as an argument. 
Make sure to use a variant of `ps` that shows the full command.


-Original Message-
From: JEFFERY SCHMITZ jefferyschm...@mac.com
Sent: Monday, April 5, 2010 12:20pm
To: dev@cassandra.apache.org
Subject: Grep for what?

Warning this is a newbie question - 

On startup I get

[r...@marduk bin]# ./cassandra -f
Listening for transport dt_socket at address: 
INFO - Sampling index for /var/lib/cassandra/data/system/LocationInfo-1-Data.db
INFO - Replaying /var/lib/cassandra/commitlog/CommitLog-1270047913578.log
INFO - LocationInfo has reached its threshold; switching in a fresh Memtable
INFO - Enqueuing flush of Memtable(LocationInfo)@2132679615
INFO - Sorting Memtable(LocationInfo)@2132679615
INFO - Writing Memtable(LocationInfo)@2132679615
INFO - Completed flushing /var/lib/cassandra/data/system/LocationInfo-2-Data.db
INFO - Log replay complete
INFO - Saved Token found: 75598148438183751486026363636316999593
INFO - Starting up server gossip
INFO - Cassandra starting up...

Okay so its running I can't figure out what the PID is?? - grepping for 
cassandra or apache turns up zip - 

Thanks - 

Jeffery