date:20120425

Sylvain Lebresne created CASSANDRA-4184:
---

 Summary: Make identifier and value grammar for CQL3 stricter
 Key: CASSANDRA-4184
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4184
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
 Fix For: 1.1.1


The current grammar for CQL3 allows:
# uuid and integer constants as identifiers
# identifier as value (aka term in the grammar)

I think both of those should be removed.

For 1, mostly because this feels useless and slightly complicates the grammar 
which is annoying for the documentation of CQL3 for instance (note that this 
doesn't mean forbidding integer or uuid as identifier, but means they have to 
be double-quoted when used as such).
For 2, I think that allowing identifier as value is actually misleading, 
typically if you write things like {{SELECT foo WHERE foo=foo}}. It suggests we 
support JOIN when we do not.

Also, if both are done, then one will always be able to distinguish between 
identifier and value even without any context, which is a nice property.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4184) Make identifier and value grammar for CQL3 stricter


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-4184:


Attachment: 0002-Disallow-identifier-as-value.txt
0001-Disallow-integer-and-uuid-constants-as-identifier.txt

 Make identifier and value grammar for CQL3 stricter
 ---

 Key: CASSANDRA-4184
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4184
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 
 0001-Disallow-integer-and-uuid-constants-as-identifier.txt, 
 0002-Disallow-identifier-as-value.txt


 The current grammar for CQL3 allows:
 # uuid and integer constants as identifiers
 # identifier as value (aka term in the grammar)
 I think both of those should be removed.
 For 1, mostly because this feels useless and slightly complicates the grammar 
 which is annoying for the documentation of CQL3 for instance (note that this 
 doesn't mean forbidding integer or uuid as identifier, but means they have to 
 be double-quoted when used as such).
 For 2, I think that allowing identifier as value is actually misleading, 
 typically if you write things like {{SELECT foo WHERE foo=foo}}. It suggests 
 we support JOIN when we do not.
 Also, if both are done, then one will always be able to distinguish between 
 identifier and value even without any context, which is a nice property.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (CASSANDRA-4185) Minor CQL3 fixes

Sylvain Lebresne created CASSANDRA-4185:
---

 Summary: Minor CQL3 fixes
 Key: CASSANDRA-4185
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4185
 Project: Cassandra
  Issue Type: Bug
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 1.1.1


The goal of this ticket is to be the home for a number of minor 
fixes/improvements in CQL3 that I didn't felt warranted a ticket each. It 
includes 4 patches:
* The first one fixes the grammar for float constants, so as to not recognize 
3.-3, but to actually allow 3. (i.e, with radix point but with the fractional 
part left blank)
* The second one correctly detect the (invalid) case where a table is created 
with COMPACT STORAGE but without any 'clustering keys'.
* The third one fixes COUNT, first by making sure both COUNT(*) and COUNT(1) 
are correctly recognized and also by processing the internal row before 
counting, are there isn't a 1-to-1 correspondence between internal rows and CQL 
rows in CQL3. The grammar change in this patch actually rely on CASSANDRA-4184
* The fourth and last patch disallows the counter type for keys (i.e. any 
column part of the PRIMARY KEY) as it is completely non-sensical and will only 
led to confusion.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4185) Minor CQL3 fixes


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-4185:


Attachment: 0004-Disallow-counters-for-PRIMARY-KEY-part.txt
0003-Fix-COUNT-in-select.txt
0002-Fix-compact-storage-validation.txt
0001-Fix-float-parsing.txt

 Minor CQL3 fixes
 

 Key: CASSANDRA-4185
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4185
 Project: Cassandra
  Issue Type: Bug
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 0001-Fix-float-parsing.txt, 
 0002-Fix-compact-storage-validation.txt, 0003-Fix-COUNT-in-select.txt, 
 0004-Disallow-counters-for-PRIMARY-KEY-part.txt


 The goal of this ticket is to be the home for a number of minor 
 fixes/improvements in CQL3 that I didn't felt warranted a ticket each. It 
 includes 4 patches:
 * The first one fixes the grammar for float constants, so as to not recognize 
 3.-3, but to actually allow 3. (i.e, with radix point but with the fractional 
 part left blank)
 * The second one correctly detect the (invalid) case where a table is created 
 with COMPACT STORAGE but without any 'clustering keys'.
 * The third one fixes COUNT, first by making sure both COUNT(*) and COUNT(1) 
 are correctly recognized and also by processing the internal row before 
 counting, are there isn't a 1-to-1 correspondence between internal rows and 
 CQL rows in CQL3. The grammar change in this patch actually rely on 
 CASSANDRA-4184
 * The fourth and last patch disallows the counter type for keys (i.e. any 
 column part of the PRIMARY KEY) as it is completely non-sensical and will 
 only led to confusion.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4185) Minor CQL3 fixes


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-4185:


Attachment: (was: 0004-Disallow-counters-for-PRIMARY-KEY-part.txt)

 Minor CQL3 fixes
 

 Key: CASSANDRA-4185
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4185
 Project: Cassandra
  Issue Type: Bug
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 0001-Fix-float-parsing.txt, 
 0002-Fix-compact-storage-validation.txt, 0003-Fix-COUNT-in-select.txt, 
 0004-Disallow-counters-for-PRIMARY-KEY-part-v2.txt


 The goal of this ticket is to be the home for a number of minor 
 fixes/improvements in CQL3 that I didn't felt warranted a ticket each. It 
 includes 4 patches:
 * The first one fixes the grammar for float constants, so as to not recognize 
 3.-3, but to actually allow 3. (i.e, with radix point but with the fractional 
 part left blank)
 * The second one correctly detect the (invalid) case where a table is created 
 with COMPACT STORAGE but without any 'clustering keys'.
 * The third one fixes COUNT, first by making sure both COUNT(*) and COUNT(1) 
 are correctly recognized and also by processing the internal row before 
 counting, are there isn't a 1-to-1 correspondence between internal rows and 
 CQL rows in CQL3. The grammar change in this patch actually rely on 
 CASSANDRA-4184
 * The fourth and last patch disallows the counter type for keys (i.e. any 
 column part of the PRIMARY KEY) as it is completely non-sensical and will 
 only led to confusion.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4185) Minor CQL3 fixes


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-4185:


Attachment: 0004-Disallow-counters-for-PRIMARY-KEY-part-v2.txt

Realized the last patch was missing some parts. v2 attached with those missing 
parts.

 Minor CQL3 fixes
 

 Key: CASSANDRA-4185
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4185
 Project: Cassandra
  Issue Type: Bug
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 0001-Fix-float-parsing.txt, 
 0002-Fix-compact-storage-validation.txt, 0003-Fix-COUNT-in-select.txt, 
 0004-Disallow-counters-for-PRIMARY-KEY-part-v2.txt


 The goal of this ticket is to be the home for a number of minor 
 fixes/improvements in CQL3 that I didn't felt warranted a ticket each. It 
 includes 4 patches:
 * The first one fixes the grammar for float constants, so as to not recognize 
 3.-3, but to actually allow 3. (i.e, with radix point but with the fractional 
 part left blank)
 * The second one correctly detect the (invalid) case where a table is created 
 with COMPACT STORAGE but without any 'clustering keys'.
 * The third one fixes COUNT, first by making sure both COUNT(*) and COUNT(1) 
 are correctly recognized and also by processing the internal row before 
 counting, are there isn't a 1-to-1 correspondence between internal rows and 
 CQL rows in CQL3. The grammar change in this patch actually rely on 
 CASSANDRA-4184
 * The fourth and last patch disallows the counter type for keys (i.e. any 
 column part of the PRIMARY KEY) as it is completely non-sensical and will 
 only led to confusion.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4184) Make identifier and value grammar for CQL3 stricter

2012-04-25 Thread Eric Evans (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13261682#comment-13261682
 ] 

Eric Evans commented on CASSANDRA-4184:
---

For what it's worth, allowing IDENT here was done to optimize for user 
experience in interactive sessions; To avoid having to quote something if it 
could be unambiguously parsed.  I believe it was Jonathan that championed this.

I wasn't in favor of it initially (like you, I was trying to simplify the 
grammar), but it's grown on me.



 Make identifier and value grammar for CQL3 stricter
 ---

 Key: CASSANDRA-4184
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4184
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 
 0001-Disallow-integer-and-uuid-constants-as-identifier.txt, 
 0002-Disallow-identifier-as-value.txt


 The current grammar for CQL3 allows:
 # uuid and integer constants as identifiers
 # identifier as value (aka term in the grammar)
 I think both of those should be removed.
 For 1, mostly because this feels useless and slightly complicates the grammar 
 which is annoying for the documentation of CQL3 for instance (note that this 
 doesn't mean forbidding integer or uuid as identifier, but means they have to 
 be double-quoted when used as such).
 For 2, I think that allowing identifier as value is actually misleading, 
 typically if you write things like {{SELECT foo WHERE foo=foo}}. It suggests 
 we support JOIN when we do not.
 Also, if both are done, then one will always be able to distinguish between 
 identifier and value even without any context, which is a nice property.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[1/7] git commit: Merge branch 'cassandra-1.1' into trunk

Updated Branches:
  refs/heads/cassandra-1.1 7daa6625a - 58dceaa6a
  refs/heads/trunk edf9e0e16 - ea117154f


Merge branch 'cassandra-1.1' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/ea117154
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/ea117154
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/ea117154

Branch: refs/heads/trunk
Commit: ea117154f15c423e5ee9de2c920b9f69444ac164
Parents: edf9e0e 58dceaa
Author: Jonathan Ellis jbel...@apache.org
Authored: Wed Apr 25 10:29:31 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Wed Apr 25 10:29:31 2012 -0500

--
 CHANGES.txt |1 +
 NEWS.txt|9 +
 2 files changed, 10 insertions(+), 0 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/ea117154/CHANGES.txt
--
diff --cc CHANGES.txt
index a316a49,6a7a67b..00aa405
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -1,14 -1,5 +1,15 @@@
 +1.2-dev
 + * Track tombstone expiration and compact when tombstone content is
 +   higher than a configurable threshold, default 20% (CASSANDRA-3442)
 + * update MurmurHash to version 3 (CASSANDRA-2975)
 + * (CLI) track elapsed time for `delete' operation (CASSANDRA-4060)
 + * (CLI) jline version is bumped to 1.0 to properly  support
 +   'delete' key function (CASSANDRA-4132)
 + * Save IndexSummary into new SSTable 'Summary' component (CASSANDRA-2392)
 +
 +
  1.1.1-dev
+  * streaming commitlog backup + pitr (CASSANDRA-3690)
   * avoid generating redundant compaction tasks during streaming
 (CASSANDRA-4174)
   * add -cf option to nodetool snapshot, and takeColumnFamilySnapshot to

[3/7] git commit: update CHANGES and NEWS for 3690

update CHANGES and NEWS for 3690


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/58dceaa6
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/58dceaa6
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/58dceaa6

Branch: refs/heads/trunk
Commit: 58dceaa6a8ef31db5fc7e7c7afd83891c9cf8d14
Parents: 7daa662
Author: Jonathan Ellis jbel...@apache.org
Authored: Wed Apr 25 10:29:20 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Wed Apr 25 10:29:20 2012 -0500

--
 CHANGES.txt |1 +
 NEWS.txt|9 +
 2 files changed, 10 insertions(+), 0 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/58dceaa6/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index 47f95ce..6a7a67b 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 1.1.1-dev
+ * streaming commitlog backup + pitr (CASSANDRA-3690)
  * avoid generating redundant compaction tasks during streaming
(CASSANDRA-4174)
  * add -cf option to nodetool snapshot, and takeColumnFamilySnapshot to

http://git-wip-us.apache.org/repos/asf/cassandra/blob/58dceaa6/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 1c8344d..89af1ab 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -9,6 +9,15 @@ upgrade, just in case you need to roll back to the previous 
version.
 by version X, but the inverse is not necessarily the case.)
 
 
+1.1.1
+=
+
+Features
+
+- Continuous commitlog archiving and point-in-time recovery.
+  See conf/commitlog_archiving.properties
+
+
 1.1
 ===

[6/7] git commit: Remove other obsolete NEW entry

Remove other obsolete NEW entry


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/5276e837
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/5276e837
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/5276e837

Branch: refs/heads/trunk
Commit: 5276e8375f602980bda6232085f89c0131ea985b
Parents: 706edf9
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Fri Apr 20 17:32:13 2012 +0200
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Apr 24 20:21:24 2012 +0200

--
 NEWS.txt |3 ---
 1 files changed, 0 insertions(+), 3 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/5276e837/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 2c5f7b4..4522e03 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -16,9 +16,6 @@ Upgrading
 -
 - Compression is enabled by default on newly created ColumnFamilies
   (and unchanged for ColumnFamilies created prior to upgrading).
-- The KsDef.replication_factor field (deprecated since 0.8) has
-  been removed.  Older clients will need to be updated to be able
-  to continue to created and update keyspaces.
 - If you are running a multi datacenter setup, you should upgrade to
   the latest 1.0.x (or 0.8.x) release before upgrading.  Versions
   0.8.8 and 1.0.3-1.0.5 generate cross-dc forwarding that is incompatible

[4/7] git commit: Fix wide row counter deserialization. Patch by Marco Cova, reviewed by brandonwilliams for CASSANDRA-4181

Fix wide row counter deserialization.
Patch by Marco Cova, reviewed by brandonwilliams for CASSANDRA-4181


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/7daa6625
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/7daa6625
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/7daa6625

Branch: refs/heads/trunk
Commit: 7daa6625a19e7556e0514e7c46efe6dab0b9de14
Parents: 3c4fa12
Author: Brandon Williams brandonwilli...@apache.org
Authored: Tue Apr 24 13:26:01 2012 -0500
Committer: Brandon Williams brandonwilli...@apache.org
Committed: Tue Apr 24 13:26:01 2012 -0500

--
 .../cassandra/hadoop/ColumnFamilyRecordReader.java |3 ++-
 1 files changed, 2 insertions(+), 1 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/7daa6625/src/java/org/apache/cassandra/hadoop/ColumnFamilyRecordReader.java
--
diff --git a/src/java/org/apache/cassandra/hadoop/ColumnFamilyRecordReader.java 
b/src/java/org/apache/cassandra/hadoop/ColumnFamilyRecordReader.java
index 9459d5a..f4a2a7b 100644
--- a/src/java/org/apache/cassandra/hadoop/ColumnFamilyRecordReader.java
+++ b/src/java/org/apache/cassandra/hadoop/ColumnFamilyRecordReader.java
@@ -501,7 +501,8 @@ public class ColumnFamilyRecordReader extends 
RecordReaderByteBuffer, SortedMap
 if (columns.hasNext())
 {
 ColumnOrSuperColumn cosc = columns.next();
-ImmutableSortedMapByteBuffer, IColumn map = 
ImmutableSortedMap.of(cosc.column.name, unthriftifySimple(cosc.column));
+IColumn column = unthriftify(cosc);
+ImmutableSortedMapByteBuffer, IColumn map = 
ImmutableSortedMap.of(column.name(), column);
 return Pair.ByteBuffer, SortedMapByteBuffer, 
IColumncreate(currentRow.key, map);
 }

[2/7] git commit: update CHANGES and NEWS for 3690

update CHANGES and NEWS for 3690


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/58dceaa6
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/58dceaa6
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/58dceaa6

Branch: refs/heads/cassandra-1.1
Commit: 58dceaa6a8ef31db5fc7e7c7afd83891c9cf8d14
Parents: 7daa662
Author: Jonathan Ellis jbel...@apache.org
Authored: Wed Apr 25 10:29:20 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Wed Apr 25 10:29:20 2012 -0500

--
 CHANGES.txt |1 +
 NEWS.txt|9 +
 2 files changed, 10 insertions(+), 0 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/58dceaa6/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index 47f95ce..6a7a67b 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 1.1.1-dev
+ * streaming commitlog backup + pitr (CASSANDRA-3690)
  * avoid generating redundant compaction tasks during streaming
(CASSANDRA-4174)
  * add -cf option to nodetool snapshot, and takeColumnFamilySnapshot to

http://git-wip-us.apache.org/repos/asf/cassandra/blob/58dceaa6/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 1c8344d..89af1ab 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -9,6 +9,15 @@ upgrade, just in case you need to roll back to the previous 
version.
 by version X, but the inverse is not necessarily the case.)
 
 
+1.1.1
+=
+
+Features
+
+- Continuous commitlog archiving and point-in-time recovery.
+  See conf/commitlog_archiving.properties
+
+
 1.1
 ===

[7/7] git commit: add concurrent schema, cql3, and CFRR wide rows to NEWS. clarify that KeyRange.filter allows Hadoop to take advantage of C* indexes

add concurrent schema, cql3, and CFRR wide rows to NEWS.  clarify that 
KeyRange.filter allows Hadoop to take advantage of C* indexes


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/3c4fa12e
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/3c4fa12e
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/3c4fa12e

Branch: refs/heads/trunk
Commit: 3c4fa12e15f26f5ddd5f9f8322dc1495f0c761e0
Parents: 5276e83
Author: Jonathan Ellis jbel...@apache.org
Authored: Fri Apr 20 10:59:01 2012 -0500
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Apr 24 20:21:24 2012 +0200

--
 NEWS.txt |2 +-
 1 files changed, 1 insertions(+), 1 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/3c4fa12e/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 4522e03..1c8344d 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -81,7 +81,7 @@ Features
 - Streaming is now multithreaded.
 - Compactions may now be aborted via JMX or nodetool.
 - The stress tool is not new in 1.1, but it is newly included in
-  binary builds now, as well as the source tree.
+  binary builds as well as the source tree
 - Hadoop: a new BulkOutputFormat is included which will directly write
   SSTables locally and then stream them into the cluster.
   YOU SHOULD USE BulkOutputFormat BY DEFAULT.  ColumnFamilyOutputFormat

[5/7] git commit: Remove obsolete line from the NEWS file

Remove obsolete line from the NEWS file


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/706edf94
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/706edf94
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/706edf94

Branch: refs/heads/trunk
Commit: 706edf94f809e4def86ea04fc5eb61b71ad699da
Parents: 808d7c9
Author: Sylvain Lebresne sylv...@datastax.com
Authored: Fri Apr 20 17:30:57 2012 +0200
Committer: Sylvain Lebresne sylv...@datastax.com
Committed: Tue Apr 24 20:21:24 2012 +0200

--
 NEWS.txt |3 ---
 1 files changed, 0 insertions(+), 3 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/706edf94/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 8593ee5..2c5f7b4 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -19,9 +19,6 @@ Upgrading
 - The KsDef.replication_factor field (deprecated since 0.8) has
   been removed.  Older clients will need to be updated to be able
   to continue to created and update keyspaces.
-- The KsDef.replication_factor field (deprecated since 0.8) has
-  been removed.  Older clients will need to be updated to be able
-  to continue to created and update keyspaces.
 - If you are running a multi datacenter setup, you should upgrade to
   the latest 1.0.x (or 0.8.x) release before upgrading.  Versions
   0.8.8 and 1.0.3-1.0.5 generate cross-dc forwarding that is incompatible

[jira] [Created] (CASSANDRA-4186) CQL3: make some keywords unreserved

Sylvain Lebresne created CASSANDRA-4186:
---

 Summary: CQL3: make some keywords unreserved
 Key: CASSANDRA-4186
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4186
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
 Fix For: 1.1.1


CQL has quite a few keywords. Currently all of them are reserved, but this is 
not always necessary. PostreSQL for instance distinguish between reserved 
keywords and non-reserved ones, and allow things like {{key}}, {{timestamp}} or 
{{type}} as identifiers. I suggest we do the same as convenience for the user.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4186) CQL3: make some keywords unreserved

[
https://issues.apache.org/jira/browse/CASSANDRA-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sylvain Lebresne updated CASSANDRA-4186:

Attachment: 0001-Add-unreserved-words.txt

Patch attached to have some keywords not reserved. I'll admit I've use the list
of reserved/non-reserver keywords of postgreSQL as inspiration for which to
reserve or not, but I'm open to a few changes if someone has good reason for it.

The patch also adds the list of native types directly in the grammar so as to
only parse those (or a string for custom ones) and to make them case
insensitive. They are not reserved.

CQL3: make some keywords unreserved
---

Key: CASSANDRA-4186
URL: https://issues.apache.org/jira/browse/CASSANDRA-4186
Project: Cassandra
Issue Type: Improvement
Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Minor
Labels: cql3
Fix For: 1.1.1

Attachments: 0001-Add-unreserved-words.txt

CQL has quite a few keywords. Currently all of them are reserved, but this is
not always necessary. PostreSQL for instance distinguish between reserved
keywords and non-reserved ones, and allow things like {{key}}, {{timestamp}}
or {{type}} as identifiers. I suggest we do the same as convenience for the
user.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4184) Make identifier and value grammar for CQL3 stricter


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13261772#comment-13261772
 ] 

Sylvain Lebresne commented on CASSANDRA-4184:
-

bq. Is the main point of ident is to say this can be a column name?

Yes.

bq. I still do like the idea of saying uuids are a native data type and don't 
need quoting, like ints or doubles though.

Sure, I'm not proposing to remove that. They are uuid constants and are 
special in the grammar. Currently a value is number | string | uuid | 
identifier. I'm proposing to remove identifier. Because, if we allow it and 
have a table with columns {{foo}} and {{bar}}, then the behavior of {{SELECT * 
WHERE foo=bar}} is *very* unintuitive, as it does not return the records who 
have the same value for {{foo}} and {{bar}}, but rather {{bar}} is silently 
accepted as an equivalent for {{'bar'}}.

 Make identifier and value grammar for CQL3 stricter
 ---

 Key: CASSANDRA-4184
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4184
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 
 0001-Disallow-integer-and-uuid-constants-as-identifier.txt, 
 0002-Disallow-identifier-as-value.txt


 The current grammar for CQL3 allows:
 # uuid and integer constants as identifiers
 # identifier as value (aka term in the grammar)
 I think both of those should be removed.
 For 1, mostly because this feels useless and slightly complicates the grammar 
 which is annoying for the documentation of CQL3 for instance (note that this 
 doesn't mean forbidding integer or uuid as identifier, but means they have to 
 be double-quoted when used as such).
 For 2, I think that allowing identifier as value is actually misleading, 
 typically if you write things like {{SELECT foo WHERE foo=foo}}. It suggests 
 we support JOIN when we do not.
 Also, if both are done, then one will always be able to distinguish between 
 identifier and value even without any context, which is a nice property.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[2/3] git commit: add 3912 to CHANGES, NEWS

add 3912 to CHANGES, NEWS


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/cc68c49f
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/cc68c49f
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/cc68c49f

Branch: refs/heads/trunk
Commit: cc68c49f47426dbd89bf869d9b3d10de6e78d2ca
Parents: 58dceaa
Author: Jonathan Ellis jbel...@apache.org
Authored: Wed Apr 25 11:22:19 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Wed Apr 25 11:22:19 2012 -0500

--
 CHANGES.txt |1 +
 NEWS.txt|1 +
 2 files changed, 2 insertions(+), 0 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/cc68c49f/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index 6a7a67b..ad2e12e 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 1.1.1-dev
+ * incremental repair by token range (CASSANDRA-3912)
  * streaming commitlog backup + pitr (CASSANDRA-3690)
  * avoid generating redundant compaction tasks during streaming
(CASSANDRA-4174)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/cc68c49f/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 89af1ab..8c65101 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -16,6 +16,7 @@ Features
 
 - Continuous commitlog archiving and point-in-time recovery.
   See conf/commitlog_archiving.properties
+- Incremental repair by token range, exposed over JMX
 
 
 1.1

[1/3] git commit: Merge branch 'cassandra-1.1' into trunk

Updated Branches:
  refs/heads/cassandra-1.1 58dceaa6a - cc68c49f4
  refs/heads/trunk ea117154f - a05a5641f


Merge branch 'cassandra-1.1' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/a05a5641
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/a05a5641
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/a05a5641

Branch: refs/heads/trunk
Commit: a05a5641fffe0b44c32658843df8acee2765c13d
Parents: ea11715 cc68c49
Author: Jonathan Ellis jbel...@apache.org
Authored: Wed Apr 25 11:22:25 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Wed Apr 25 11:22:25 2012 -0500

--
 CHANGES.txt |1 +
 NEWS.txt|1 +
 2 files changed, 2 insertions(+), 0 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/a05a5641/CHANGES.txt
--
diff --cc CHANGES.txt
index 00aa405,ad2e12e..3d0b64a
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -1,14 -1,5 +1,15 @@@
 +1.2-dev
 + * Track tombstone expiration and compact when tombstone content is
 +   higher than a configurable threshold, default 20% (CASSANDRA-3442)
 + * update MurmurHash to version 3 (CASSANDRA-2975)
 + * (CLI) track elapsed time for `delete' operation (CASSANDRA-4060)
 + * (CLI) jline version is bumped to 1.0 to properly  support
 +   'delete' key function (CASSANDRA-4132)
 + * Save IndexSummary into new SSTable 'Summary' component (CASSANDRA-2392)
 +
 +
  1.1.1-dev
+  * incremental repair by token range (CASSANDRA-3912)
   * streaming commitlog backup + pitr (CASSANDRA-3690)
   * avoid generating redundant compaction tasks during streaming
 (CASSANDRA-4174)

[3/3] git commit: add 3912 to CHANGES, NEWS

add 3912 to CHANGES, NEWS


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/cc68c49f
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/cc68c49f
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/cc68c49f

Branch: refs/heads/cassandra-1.1
Commit: cc68c49f47426dbd89bf869d9b3d10de6e78d2ca
Parents: 58dceaa
Author: Jonathan Ellis jbel...@apache.org
Authored: Wed Apr 25 11:22:19 2012 -0500
Committer: Jonathan Ellis jbel...@apache.org
Committed: Wed Apr 25 11:22:19 2012 -0500

--
 CHANGES.txt |1 +
 NEWS.txt|1 +
 2 files changed, 2 insertions(+), 0 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/cc68c49f/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index 6a7a67b..ad2e12e 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 1.1.1-dev
+ * incremental repair by token range (CASSANDRA-3912)
  * streaming commitlog backup + pitr (CASSANDRA-3690)
  * avoid generating redundant compaction tasks during streaming
(CASSANDRA-4174)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/cc68c49f/NEWS.txt
--
diff --git a/NEWS.txt b/NEWS.txt
index 89af1ab..8c65101 100644
--- a/NEWS.txt
+++ b/NEWS.txt
@@ -16,6 +16,7 @@ Features
 
 - Continuous commitlog archiving and point-in-time recovery.
   See conf/commitlog_archiving.properties
+- Incremental repair by token range, exposed over JMX
 
 
 1.1

[jira] [Updated] (CASSANDRA-3912) support incremental repair controlled by external agent

[
https://issues.apache.org/jira/browse/CASSANDRA-3912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jonathan Ellis updated CASSANDRA-3912:
--

Labels: jmx (was: )
Issue Type: New Feature (was: Improvement)

support incremental repair controlled by external agent
---

Key: CASSANDRA-3912
URL: https://issues.apache.org/jira/browse/CASSANDRA-3912
Project: Cassandra
Issue Type: New Feature
Components: Core
Reporter: Peter Schuller
Assignee: Peter Schuller
Labels: jmx
Fix For: 1.1.1

Attachments: 3912_v2.txt, CASSANDRA-3912-trunk-v1.txt,
CASSANDRA-3912-v2-001-add-nodetool-commands.txt,
CASSANDRA-3912-v2-002-fix-antientropyservice.txt

As a poor man's pre-cursor to CASSANDRA-2699, exposing the ability to repair
small parts of a range is extremely useful because it allows (with external
scripting logic) to slowly repair a node's content over time. Other than
avoiding the bulkyness of complete repairs, it means that you can safely do
repairs even if you absolutely cannot afford e.g. disk spaces spikes (see
CASSANDRA-2699 for what the issues are).
Attaching a patch that exposes a repairincremental command to nodetool,
where you specify a step and the number of total steps. Incrementally
performing a repair in 100 steps, for example, would be done by:
{code}
nodetool repairincremental 0 100
nodetool repairincremental 1 100
...
nodetool repairincremental 99 100
{code}
An external script can be used to keep track of what has been repaired and
when. This should allow (1) allow incremental repair to happen now/soon, and
(2) allow experimentation and evaluation for an implementation of
CASSANDRA-2699 which I still think is a good idea. This patch does nothing to
help the average deployment, but at least makes incremental repair possible
given sufficient effort spent on external scripting.
The big no-no about the patch is that it is entirely specific to
RandomPartitioner and BigIntegerToken. If someone can suggest a way to
implement this command generically using the Range/Token abstractions, I'd be
happy to hear suggestions.
An alternative would be to provide a nodetool command that allows you to
simply specify the specific token ranges on the command line. It makes using
it a bit more difficult, but would mean that it works for any partitioner and
token type.
Unless someone can suggest a better way to do this, I think I'll provide a
patch that does this. I'm still leaning towards supporting the simple step N
out of M form though.

[jira] [Commented] (CASSANDRA-4184) Make identifier and value grammar for CQL3 stricter


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13261778#comment-13261778
 ] 

Jonathan Ellis commented on CASSANDRA-4184:
---

+1 from me then.  How about you, Eric?

 Make identifier and value grammar for CQL3 stricter
 ---

 Key: CASSANDRA-4184
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4184
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 
 0001-Disallow-integer-and-uuid-constants-as-identifier.txt, 
 0002-Disallow-identifier-as-value.txt


 The current grammar for CQL3 allows:
 # uuid and integer constants as identifiers
 # identifier as value (aka term in the grammar)
 I think both of those should be removed.
 For 1, mostly because this feels useless and slightly complicates the grammar 
 which is annoying for the documentation of CQL3 for instance (note that this 
 doesn't mean forbidding integer or uuid as identifier, but means they have to 
 be double-quoted when used as such).
 For 2, I think that allowing identifier as value is actually misleading, 
 typically if you write things like {{SELECT foo WHERE foo=foo}}. It suggests 
 we support JOIN when we do not.
 Also, if both are done, then one will always be able to distinguish between 
 identifier and value even without any context, which is a nice property.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (CASSANDRA-4187) CQL3: move {max/min}_compaction_thresholds to compaction options

Sylvain Lebresne created CASSANDRA-4187:
---

 Summary: CQL3: move {max/min}_compaction_thresholds to compaction 
options
 Key: CASSANDRA-4187
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4187
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Trivial
 Fix For: 1.1.1
 Attachments: 0001-Refactor-CFPropDefs-to-avoid-duplicate-code.txt, 
0002-Move-min-max-compaction-threshold-settings-to-compacti.txt

It makes way more sense to have min_compaction_threshold and 
max_compaction_threshold be parts of the compaction_strategy_options. They are 
not in thrift (and CQL2) only for historical reasons, but there is no reason 
not to fix it. Especially given that they don't make sense for all compaction 
strategy (Leveled compaction ignores them).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4187) CQL3: move {max/min}_compaction_thresholds to compaction options


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sylvain Lebresne updated CASSANDRA-4187:


Attachment: 0002-Move-min-max-compaction-threshold-settings-to-compacti.txt
0001-Refactor-CFPropDefs-to-avoid-duplicate-code.txt

Attached patches. The first is a small refactor to avoid code duplication, the 
second is the actual change.

 CQL3: move {max/min}_compaction_thresholds to compaction options
 

 Key: CASSANDRA-4187
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4187
 Project: Cassandra
  Issue Type: Improvement
  Components: API
Affects Versions: 1.1.0
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
Priority: Trivial
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 0001-Refactor-CFPropDefs-to-avoid-duplicate-code.txt, 
 0002-Move-min-max-compaction-threshold-settings-to-compacti.txt


 It makes way more sense to have min_compaction_threshold and 
 max_compaction_threshold be parts of the compaction_strategy_options. They 
 are not in thrift (and CQL2) only for historical reasons, but there is no 
 reason not to fix it. Especially given that they don't make sense for all 
 compaction strategy (Leveled compaction ignores them).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (CASSANDRA-4188) stress tool return a non-zero value when it fails

2012-04-25 Thread Tyler Patterson (JIRA)

Tyler Patterson created CASSANDRA-4188:
--

 Summary: stress tool return a non-zero value when it fails
 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Priority: Minor


I'm scripting the use of stress in the dtests, and it would be great if I could 
tell based on the return code if it succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (CASSANDRA-4189) Improve hints replay

2012-04-25 Thread Vijay (JIRA)

Vijay created CASSANDRA-4189:


 Summary: Improve hints replay
 Key: CASSANDRA-4189
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4189
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Affects Versions: 1.2
Reporter: Vijay
Assignee: Vijay
Priority: Minor
 Fix For: 1.2


Problem: Hints are stored in one row.
when there are a lot of hints stored and we store Tombstones for the ones which 
has been replayed.
It might be worth shading the hints based on Hour at which the hints are 
stored. This can reduce the complexity of the scanning for hints.

Problem: Hints replay is too slow and single threaded.
There are use-case where the hints needs to be replayed ASAP to make the 
cluster more consistent.
In Multi region cluster, the throttle is already done due to the latency which 
is in the order of 100's of millisecond.
It might be worth trying to replay the hints in parallel and throttle on the 
number of bytes read from the disk or use the existing setting of throttle 
based on sleep interval on all the threads.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (CASSANDRA-4190) Apparent data loss using super columns and row cache via ConcurrentLinkedHashCacheProvider

2012-04-25 Thread Mina Naguib (JIRA)

Mina Naguib created CASSANDRA-4190:
--

 Summary: Apparent data loss using super columns and row cache via 
ConcurrentLinkedHashCacheProvider
 Key: CASSANDRA-4190
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4190
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.0.9
 Environment: Linux 2.6.27
Reporter: Mina Naguib


Tested on a vanilla single-node cassandra 1.0.9 installation.

When using super columns along with row caching via 
ConcurrentLinkedHashCacheProvider (default if no JNA available, or explicitly 
configured even if JNA available), there's what appears as transient data loss.

Given this script executed in cassandra-cli:
{quote}
create keyspace Test;
use Test;

create column family Users with column_type='Super' and 
key_validation_class='UTF8Type' and comparator='UTF8Type' and 
subcomparator='UTF8Type' and default_validation_class='UTF8Type' and 
rows_cached=75000 and row_cache_provider='ConcurrentLinkedHashCacheProvider';

set Users['mina']['attrs']['name'] = 'Mina';
get Users['mina'];

set Users['mina']['attrs']['country'] = 'Canada';
get Users['mina'];

set Users['mina']['attrs']['region'] = 'Quebec';
get Users['mina'];
{quote}

The output from the 3 gets above is as follows:

{quote}
= (super_column=attrs,
 (column=name, value=Mina, timestamp=1335377788441000))
Returned 1 results.
{quote}

{quote}
= (super_column=attrs,
 (column=name, value=Mina, timestamp=1335377788441000))
Returned 1 results.
{quote}

{quote}
= (super_column=attrs,
 (column=name, value=Mina, timestamp=1335377788441000))
Returned 1 results.
{quote}

It's clear that the second and third set commands (country, region) are missing 
in the returned results.

If the row cache is explicitly invalidated (in a second terminal, via `nodetool 
-h localhost invalidaterowcache Test Users`), the missing data springs to life 
on next 'get':
{quote}
[default@Test] get Users['mina'];
= (super_column=attrs,
 (column=country, value=Canada, timestamp=1335377839592000)
 (column=name, value=Mina, timestamp=1335377788441000)
 (column=region, value=Quebec, timestamp=1335377871353000))
Returned 1 results.
{quote}

From cursory checks, this does not to appear to happen with regular columns, 
nor with JNA enabled + SerializingCacheProvider.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4189) Improve hints replay

[
https://issues.apache.org/jira/browse/CASSANDRA-4189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13261975#comment-13261975
]

Brandon Williams commented on CASSANDRA-4189:
-

bq. It might be worth shading the hints based on Hour at which the hints are
stored. This can reduce the complexity of the scanning for hints.

It's not clear to me that the extra code complexity (and IO) is worth this kind
of tradeoff. Also, on (mis)completion we flush and force a compaction that
should clear out the tombstones (see CASSANDRA-3733) so I'm skeptical this is a
real problem.

bq. Problem: Hints replay is too slow and single threaded.

I disagree, historically our problem with hints has always been *overload*,
which I think we finally got right in CASSANDRA-3554. Sure, in a two node
cluster maybe the single threaded nature is a problem, but in any cluster of
appreciable size it's always overload that's an issue, so I don't see much to
be gained by multithreading it.

Improve hints replay

Key: CASSANDRA-4189
URL: https://issues.apache.org/jira/browse/CASSANDRA-4189
Project: Cassandra
Issue Type: Improvement
Components: Core
Affects Versions: 1.2
Reporter: Vijay
Assignee: Vijay
Priority: Minor
Fix For: 1.2

Problem: Hints are stored in one row.
when there are a lot of hints stored and we store Tombstones for the ones
which has been replayed.
It might be worth shading the hints based on Hour at which the hints are
stored. This can reduce the complexity of the scanning for hints.
Problem: Hints replay is too slow and single threaded.
There are use-case where the hints needs to be replayed ASAP to make the
cluster more consistent.
In Multi region cluster, the throttle is already done due to the latency
which is in the order of 100's of millisecond.
It might be worth trying to replay the hints in parallel and throttle on the
number of bytes read from the disk or use the existing setting of throttle
based on sleep interval on all the threads.

[jira] [Commented] (CASSANDRA-4189) Improve hints replay

2012-04-25 Thread Vijay (JIRA)

[
https://issues.apache.org/jira/browse/CASSANDRA-4189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13261991#comment-13261991
]

Vijay commented on CASSANDRA-4189:
--

{quote}
Also, on (mis)completion we flush and force a compaction that should clear out
the tombstones (see CASSANDRA-3733) so I'm skeptical this is a real problem.
{quote}
May be the above will fix it, the hints CF (about 10GB) is too large for the
node in question... so i have to do more tests.

{quote}
Sure, in a two node cluster maybe the single threaded nature is a problem, but
in any cluster of appreciable size it's always overload that's an issue, so I
don't see much to be gained by multithreading it.
{quote}
No the problem is when you have 10's of nodes and they are all in different
DC's, it is naturally throttled by the latency of 100's of milliseconds. Now
while replaying hints, the thread gets stuck replaying the hints to the remote
node, no other node gets the hints. What i am suggesting is to throttle but in
a multi threaded way.

Improve hints replay

[Cassandra Wiki] Update of ClientOptions by ShawnBissell

2012-04-25 Thread Apache Wiki

Dear Wiki user,

You have subscribed to a wiki page or wiki category on Cassandra Wiki for 
change notification.

The ClientOptions page has been changed by ShawnBissell:
http://wiki.apache.org/cassandra/ClientOptions?action=diffrev1=153rev2=154

* clj-hector: https://github.com/pingles/clj-hector
   * .NET
* Aquiles: http://aquiles.codeplex.com/
+   * Cassandraemon: http://cassandraemon.codeplex.com/
* cassandra-sharp: http://code.google.com/p/cassandra-sharp/
* FluentCassandra: https://github.com/managedfusion/fluentcassandra
   * Ruby:

[jira] [Commented] (CASSANDRA-4190) Apparent data loss using super columns and row cache via ConcurrentLinkedHashCacheProvider

2012-04-25 Thread Mina Naguib (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262005#comment-13262005
 ] 

Mina Naguib commented on CASSANDRA-4190:


I can confirm that testing with cassandra 1.1.0 has the same conclusion.

To reproduce against cassandra 1.1.0, edit cassandra.yaml and set:
{quote}
row_cache_provider: ConcurrentLinkedHashCacheProvider   

  
row_cache_size_in_mb: 200   

  
{quote}

And use this slightly updated script to accomodate for 1.1.0 changes:
{quote}
create keyspace Test;
use Test;

create column family Users with column_type='Super' and 
key_validation_class='UTF8Type' and comparator='UTF8Type' and 
subcomparator='UTF8Type' and default_validation_class='UTF8Type' and 
caching='ALL';

set Users['mina']['attrs']['name'] = 'Mina';
get Users['mina'];

set Users['mina']['attrs']['country'] = 'Canada';
get Users['mina'];

set Users['mina']['attrs']['region'] = 'Quebec';
get Users['mina'];
{quote}

The rest of the observations are the same as with the cassandra 1.0.9 test.

 Apparent data loss using super columns and row cache via 
 ConcurrentLinkedHashCacheProvider
 --

 Key: CASSANDRA-4190
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4190
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.0.9
 Environment: Linux 2.6.27
Reporter: Mina Naguib
  Labels: ConcurrentLinkedHashCacheProvider, cache, supercolumns

 Tested on a vanilla single-node cassandra 1.0.9 installation.
 When using super columns along with row caching via 
 ConcurrentLinkedHashCacheProvider (default if no JNA available, or explicitly 
 configured even if JNA available), there's what appears as transient data 
 loss.
 Given this script executed in cassandra-cli:
 {quote}
 create keyspace Test;
 use Test;
 create column family Users with column_type='Super' and 
 key_validation_class='UTF8Type' and comparator='UTF8Type' and 
 subcomparator='UTF8Type' and default_validation_class='UTF8Type' and 
 rows_cached=75000 and row_cache_provider='ConcurrentLinkedHashCacheProvider';
 set Users['mina']['attrs']['name'] = 'Mina';
 get Users['mina'];
 set Users['mina']['attrs']['country'] = 'Canada';
 get Users['mina'];
 set Users['mina']['attrs']['region'] = 'Quebec';
 get Users['mina'];
 {quote}
 The output from the 3 gets above is as follows:
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 It's clear that the second and third set commands (country, region) are 
 missing in the returned results.
 If the row cache is explicitly invalidated (in a second terminal, via 
 `nodetool -h localhost invalidaterowcache Test Users`), the missing data 
 springs to life on next 'get':
 {quote}
 [default@Test] get Users['mina'];
 = (super_column=attrs,
  (column=country, value=Canada, timestamp=1335377839592000)
  (column=name, value=Mina, timestamp=1335377788441000)
  (column=region, value=Quebec, timestamp=1335377871353000))
 Returned 1 results.
 {quote}
 From cursory checks, this does not to appear to happen with regular columns, 
 nor with JNA enabled + SerializingCacheProvider.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4188) stress tool return a non-zero value when it fails

2012-04-25 Thread Tyler Patterson (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tyler Patterson updated CASSANDRA-4188:
---

Description: I'm using of stress in the dtests, and it would be great if I 
could tell based on the return code if stress succeeded or failed.  (was: I'm 
scripting the use of stress in the dtests, and it would be great if I could 
tell based on the return code if it succeeded or failed.)

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Priority: Minor
  Labels: stress

 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4164) Cqlsh should support DESCRIBE on cql3-style composite CFs

2012-04-25 Thread paul cannon (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

paul cannon updated CASSANDRA-4164:
---

Attachment: 4164.patch.txt

teach cqlsh how to understand system.schema_columnfamilies so that it can get 
information on composite primary key tables.

patch attached, or, as usual, in my 4164 branch on my github. tagged here:

https://github.com/thepaul/cassandra/tree/pending/4164

Note that this still won't work for describing composite CFs in the system 
keyspace; system CFs don't show up in schema_columnfamilies, and there's no 
other way to get key-component column info.

 Cqlsh should support DESCRIBE on cql3-style composite CFs
 -

 Key: CASSANDRA-4164
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4164
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.1.0
Reporter: Nick Bailey
Assignee: paul cannon
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 4164.patch.txt


 There is a discrepancy between create column family commands and then the 
 output of the describe command:
 {noformat}
 cqlsh:test CREATE TABLE timeline (
 ... user_id varchar,
 ... tweet_id uuid,
 ... author varchar,
 ... body varchar,
 ... PRIMARY KEY (user_id, tweet_id)
 ... );
 cqlsh:test describe columnfamily timeline;
 CREATE COLUMNFAMILY timeline (
   user_id text PRIMARY KEY
 ) WITH
   comment='' AND
   
 comparator='CompositeType(org.apache.cassandra.db.marshal.UUIDType,org.apache.cassandra.db.marshal.UTF8Type)'
  AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write=True AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   
 compression_parameters:sstable_compression='org.apache.cassandra.io.compress.SnappyCompressor';
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4173) cqlsh: in cql3 mode, use cql3 quoting when outputting cql

2012-04-25 Thread paul cannon (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

paul cannon updated CASSANDRA-4173:
---

Attachment: 4173.patch.txt

break out calls to cql_escape so that they're separated by the sort of thing 
being quoted (column/cf/keyspace name vs. value) and go through the Shell 
object, so it can route to cql3 quoting or cql2 quoting as appropriate.

This should fix all uses of quoting except in tab completion areas.

Fix attached here and in the 4173 branch in my github, tagged here:

https://github.com/thepaul/cassandra/tree/pending/4173

The patch is meant to go on top of 4164.patch.txt from CASSANDRA-4164.

 cqlsh: in cql3 mode, use cql3 quoting when outputting cql
 -

 Key: CASSANDRA-4173
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4173
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Affects Versions: 1.1.0
Reporter: paul cannon
Assignee: paul cannon
Priority: Minor
  Labels: cql3, cqlsh
 Fix For: 1.1.1

 Attachments: 4173.patch.txt


 when cqlsh needs to output a column name or other term which needs quoting 
 (say, if you run DESCRIBE KEYSPACE and some column name has a space in it), 
 it currently only knows how to quote in the cql2 way. That is,
 {noformat}
 cqlsh:foo describe columnfamily bar
 CREATE COLUMNFAMILY bar (
   a int PRIMARY KEY,
   'b c' text
 ) WITH
 ...
 {noformat}
 cql3 does not recognize single quotes around column names, or columnfamily or 
 keyspace names either. cqlsh ought to learn how to use double-quotes instead 
 when in cql3 mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4189) Improve hints replay

[
https://issues.apache.org/jira/browse/CASSANDRA-4189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262046#comment-13262046
]

Brandon Williams commented on CASSANDRA-4189:
-

bq. Now while replaying hints, the thread gets stuck replaying the hints to the
remote node, no other node gets the hints.

Ok, that makes sense.

Improve hints replay

[jira] [Updated] (CASSANDRA-4188) stress tool return a non-zero value when it fails


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams updated CASSANDRA-4188:


Fix Version/s: 1.0.10

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Assignee: Pavel Yaskevich
Priority: Minor
  Labels: stress
 Fix For: 1.0.10


 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (CASSANDRA-4188) stress tool return a non-zero value when it fails


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brandon Williams reassigned CASSANDRA-4188:
---

Assignee: Pavel Yaskevich

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Assignee: Pavel Yaskevich
Priority: Minor
  Labels: stress
 Fix For: 1.0.10


 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (CASSANDRA-4190) Apparent data loss using super columns and row cache via ConcurrentLinkedHashCacheProvider


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis reassigned CASSANDRA-4190:
-

Assignee: Sylvain Lebresne

 Apparent data loss using super columns and row cache via 
 ConcurrentLinkedHashCacheProvider
 --

 Key: CASSANDRA-4190
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4190
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.0.9
 Environment: Linux 2.6.27
Reporter: Mina Naguib
Assignee: Sylvain Lebresne
  Labels: ConcurrentLinkedHashCacheProvider, cache, supercolumns

 Tested on a vanilla single-node cassandra 1.0.9 installation.
 When using super columns along with row caching via 
 ConcurrentLinkedHashCacheProvider (default if no JNA available, or explicitly 
 configured even if JNA available), there's what appears as transient data 
 loss.
 Given this script executed in cassandra-cli:
 {quote}
 create keyspace Test;
 use Test;
 create column family Users with column_type='Super' and 
 key_validation_class='UTF8Type' and comparator='UTF8Type' and 
 subcomparator='UTF8Type' and default_validation_class='UTF8Type' and 
 rows_cached=75000 and row_cache_provider='ConcurrentLinkedHashCacheProvider';
 set Users['mina']['attrs']['name'] = 'Mina';
 get Users['mina'];
 set Users['mina']['attrs']['country'] = 'Canada';
 get Users['mina'];
 set Users['mina']['attrs']['region'] = 'Quebec';
 get Users['mina'];
 {quote}
 The output from the 3 gets above is as follows:
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 It's clear that the second and third set commands (country, region) are 
 missing in the returned results.
 If the row cache is explicitly invalidated (in a second terminal, via 
 `nodetool -h localhost invalidaterowcache Test Users`), the missing data 
 springs to life on next 'get':
 {quote}
 [default@Test] get Users['mina'];
 = (super_column=attrs,
  (column=country, value=Canada, timestamp=1335377839592000)
  (column=name, value=Mina, timestamp=1335377788441000)
  (column=region, value=Quebec, timestamp=1335377871353000))
 Returned 1 results.
 {quote}
 From cursory checks, this does not to appear to happen with regular columns, 
 nor with JNA enabled + SerializingCacheProvider.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4188) stress tool return a non-zero value when it fails


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262051#comment-13262051
 ] 

Jonathan Ellis commented on CASSANDRA-4188:
---

What is succeeded for stress?  Is timing out once a failure?  Or taking 
longer than X minutes?

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Assignee: Pavel Yaskevich
Priority: Minor
  Labels: stress
 Fix For: 1.0.10


 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4188) stress tool return a non-zero value when it fails

2012-04-25 Thread Pavel Yaskevich (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262053#comment-13262053
 ] 

Pavel Yaskevich commented on CASSANDRA-4188:


No, it just returns normally but you will see the error messages in the log 
(with number of retries and other additional info such as exception 
type/message).

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Assignee: Pavel Yaskevich
Priority: Minor
  Labels: stress
 Fix For: 1.0.10


 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4189) Improve hints replay

[
https://issues.apache.org/jira/browse/CASSANDRA-4189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262066#comment-13262066
]

Jonathan Ellis commented on CASSANDRA-4189:
---

bq. It might be worth shading the hints based on Hour at which the hints are
stored. This can reduce the complexity of the scanning for hints.

Not sure this is a problem now that we drop the tombstones after each handoff.
Since there is only one handoff per target at a time, we don't need to worry
about having to skip tombstones in a meaningful volume.

bq. What i am suggesting is to throttle but in a multi threaded way

Improve hints replay

[jira] [Updated] (CASSANDRA-4164) Cqlsh should support DESCRIBE on cql3-style composite CFs


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4164:
--

Reviewer: brandon.williams

 Cqlsh should support DESCRIBE on cql3-style composite CFs
 -

 Key: CASSANDRA-4164
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4164
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.1.0
Reporter: Nick Bailey
Assignee: paul cannon
  Labels: cql3
 Fix For: 1.1.1

 Attachments: 4164.patch.txt


 There is a discrepancy between create column family commands and then the 
 output of the describe command:
 {noformat}
 cqlsh:test CREATE TABLE timeline (
 ... user_id varchar,
 ... tweet_id uuid,
 ... author varchar,
 ... body varchar,
 ... PRIMARY KEY (user_id, tweet_id)
 ... );
 cqlsh:test describe columnfamily timeline;
 CREATE COLUMNFAMILY timeline (
   user_id text PRIMARY KEY
 ) WITH
   comment='' AND
   
 comparator='CompositeType(org.apache.cassandra.db.marshal.UUIDType,org.apache.cassandra.db.marshal.UTF8Type)'
  AND
   read_repair_chance=0.10 AND
   gc_grace_seconds=864000 AND
   default_validation=text AND
   min_compaction_threshold=4 AND
   max_compaction_threshold=32 AND
   replicate_on_write=True AND
   compaction_strategy_class='SizeTieredCompactionStrategy' AND
   
 compression_parameters:sstable_compression='org.apache.cassandra.io.compress.SnappyCompressor';
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4173) cqlsh: in cql3 mode, use cql3 quoting when outputting cql


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4173:
--

Reviewer: brandon.williams

 cqlsh: in cql3 mode, use cql3 quoting when outputting cql
 -

 Key: CASSANDRA-4173
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4173
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Affects Versions: 1.1.0
Reporter: paul cannon
Assignee: paul cannon
Priority: Minor
  Labels: cql3, cqlsh
 Fix For: 1.1.1

 Attachments: 4173.patch.txt


 when cqlsh needs to output a column name or other term which needs quoting 
 (say, if you run DESCRIBE KEYSPACE and some column name has a space in it), 
 it currently only knows how to quote in the cql2 way. That is,
 {noformat}
 cqlsh:foo describe columnfamily bar
 CREATE COLUMNFAMILY bar (
   a int PRIMARY KEY,
   'b c' text
 ) WITH
 ...
 {noformat}
 cql3 does not recognize single quotes around column names, or columnfamily or 
 keyspace names either. cqlsh ought to learn how to use double-quotes instead 
 when in cql3 mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4188) stress tool return a non-zero value when it fails

2012-04-25 Thread Tyler Patterson (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262089#comment-13262089
 ] 

Tyler Patterson commented on CASSANDRA-4188:


{quote}What is succeeded for stress? Is timing out once a failure? Or taking 
longer than X minutes?{quote}
I was thinking of capturing obvious failures, like when it hits the max number 
of retries. Pretty much any time that it used to hang indefinitely.

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Assignee: Pavel Yaskevich
Priority: Minor
  Labels: stress
 Fix For: 1.0.10


 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4188) stress tool return a non-zero value when it fails

2012-04-25 Thread Pavel Yaskevich (JIRA)


[ 
https://issues.apache.org/jira/browse/CASSANDRA-4188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262094#comment-13262094
 ] 

Pavel Yaskevich commented on CASSANDRA-4188:


bq. I was thinking of capturing obvious failures, like when it hits the max 
number of retries. Pretty much any time that it used to hang indefinitely.

That was fixed by CASSANDRA-4128

 stress tool return a non-zero value when it fails
 -

 Key: CASSANDRA-4188
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4188
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Tyler Patterson
Assignee: Pavel Yaskevich
Priority: Minor
  Labels: stress
 Fix For: 1.0.10


 I'm using of stress in the dtests, and it would be great if I could tell 
 based on the return code if stress succeeded or failed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (CASSANDRA-4190) Apparent data loss using super columns and row cache via ConcurrentLinkedHashCacheProvider

2012-04-25 Thread Mina Naguib (JIRA)


 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mina Naguib updated CASSANDRA-4190:
---

Affects Version/s: 1.1.0

 Apparent data loss using super columns and row cache via 
 ConcurrentLinkedHashCacheProvider
 --

 Key: CASSANDRA-4190
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4190
 Project: Cassandra
  Issue Type: Bug
Affects Versions: 1.0.9, 1.1.0
 Environment: Linux 2.6.27
Reporter: Mina Naguib
Assignee: Sylvain Lebresne
  Labels: ConcurrentLinkedHashCacheProvider, cache, supercolumns

 Tested on a vanilla single-node cassandra 1.0.9 installation.
 When using super columns along with row caching via 
 ConcurrentLinkedHashCacheProvider (default if no JNA available, or explicitly 
 configured even if JNA available), there's what appears as transient data 
 loss.
 Given this script executed in cassandra-cli:
 {quote}
 create keyspace Test;
 use Test;
 create column family Users with column_type='Super' and 
 key_validation_class='UTF8Type' and comparator='UTF8Type' and 
 subcomparator='UTF8Type' and default_validation_class='UTF8Type' and 
 rows_cached=75000 and row_cache_provider='ConcurrentLinkedHashCacheProvider';
 set Users['mina']['attrs']['name'] = 'Mina';
 get Users['mina'];
 set Users['mina']['attrs']['country'] = 'Canada';
 get Users['mina'];
 set Users['mina']['attrs']['region'] = 'Quebec';
 get Users['mina'];
 {quote}
 The output from the 3 gets above is as follows:
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 {quote}
 = (super_column=attrs,
  (column=name, value=Mina, timestamp=1335377788441000))
 Returned 1 results.
 {quote}
 It's clear that the second and third set commands (country, region) are 
 missing in the returned results.
 If the row cache is explicitly invalidated (in a second terminal, via 
 `nodetool -h localhost invalidaterowcache Test Users`), the missing data 
 springs to life on next 'get':
 {quote}
 [default@Test] get Users['mina'];
 = (super_column=attrs,
  (column=country, value=Canada, timestamp=1335377839592000)
  (column=name, value=Mina, timestamp=1335377788441000)
  (column=region, value=Quebec, timestamp=1335377871353000))
 Returned 1 results.
 {quote}
 From cursory checks, this does not to appear to happen with regular columns, 
 nor with JNA enabled + SerializingCacheProvider.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (CASSANDRA-4191) Add `nodetool cfstats ks cf` abilities

2012-04-25 Thread Joaquin Casares (JIRA)

Joaquin Casares created CASSANDRA-4191:
--

 Summary: Add `nodetool cfstats ks cf` abilities
 Key: CASSANDRA-4191
 URL: https://issues.apache.org/jira/browse/CASSANDRA-4191
 Project: Cassandra
  Issue Type: New Feature
Affects Versions: 1.0.0
Reporter: Joaquin Casares
Priority: Minor


This way cfstats will only print information per keyspace/column family 
combinations.

Another related proposal as an alternative to this ticket:

Allow for `nodetool cfstats` to use --excludes or --includes to accept keyspace 
and column family arguments.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4191) Add `nodetool cfstats ks cf` abilities