[jira] [Commented] (CASSANDRA-3909) Pig should handle wide rows

2012-04-17 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13255956#comment-13255956
 ] 

Matthew F. Dennis commented on CASSANDRA-3909:
--

+1 on inclusion in 1.1.0 (and if not, ASAP after 1.1.0)

> Pig should handle wide rows
> ---
>
> Key: CASSANDRA-3909
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3909
> Project: Cassandra
>  Issue Type: Bug
>  Components: Hadoop
>Reporter: Brandon Williams
>Assignee: Brandon Williams
> Fix For: 1.1.1
>
> Attachments: 3909.txt
>
>
> Pig should be able to use the wide row support in CFIF.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-4156) CQL should support CL.TWO

2012-04-16 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13254945#comment-13254945
 ] 

Matthew F. Dennis commented on CASSANDRA-4156:
--

tested on EC2 with multiple nodes

> CQL should support CL.TWO
> -
>
> Key: CASSANDRA-4156
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4156
> Project: Cassandra
>  Issue Type: Bug
>  Components: API
>Affects Versions: 1.0.1
>Reporter: paul cannon
>Assignee: Matthew F. Dennis
>Priority: Minor
>  Labels: cql, cql3
> Attachments: cassandra-trunk-4156.txt
>
>
> CL.TWO was overlooked in creating the CQL language spec. It should probably 
> be added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-4109) nodetool snapshot should allow specifying list of CFs to snapshot

2012-04-02 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-4109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13244635#comment-13244635
 ] 

Matthew F. Dennis commented on CASSANDRA-4109:
--

probably, I didn't bother looking that far back ...

> nodetool snapshot should allow specifying list of CFs to snapshot
> -
>
> Key: CASSANDRA-4109
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4109
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Matthew F. Dennis
>Priority: Minor
>
> nodetool snapshot allows the keyspace to be specified, but not the column 
> families

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-3283) node is sending streams to itself during move operation

2012-02-02 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13199026#comment-13199026
 ] 

Matthew F. Dennis commented on CASSANDRA-3283:
--

observed this on 0.8.9 when moving a node.

moving a node from 3 node cluster with RF=3 resulted in the node just streaming 
it's entire dataset to itself 

> node is sending streams to itself during move operation
> ---
>
> Key: CASSANDRA-3283
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3283
> Project: Cassandra
>  Issue Type: Bug
>  Components: Core
>Affects Versions: 0.8.6
> Environment: linux debian stable (squeeze), cassandra 0.8.6, java 6 
> Update 26
>Reporter: Zenek Kraweznik
>Assignee: Nick Bailey
>Priority: Minor
> Fix For: 0.8.10
>
>
> I'm moving node 10.10.10.231 from 113427455640312821154458202477256070485 to 
> 0.
> ring:
> Address DC  RackStatus State   LoadOwns   
>  Token
>   
>  127605887595351923798765477786913079296
> 10.10.10.232  datacenter1 rack1   Up Normal  148.7 GB50.00%  
> 42535295865117307932921825928971026432
> 10.10.10.233  datacenter1 rack1   Up Normal  156.77 GB   25.00%  
> 85070591730234615865843651857942052864
> 10.10.10.231  datacenter1 rack1   Up Moving  188.75 GB   16.67%  
> 113427455640312821154458202477256070485
> 10.10.10.234  datacenter1 rack1   Up Normal  94.89 GB8.33%   
> 127605887595351923798765477786913079296
> netstats from node1:
> Streaming to: /10.10.10.234
>/var/lib/cassandra/data/testdb2/ChangeLog-g-30-Data.db sections=2 
> progress=0/6859992288 - 0%
>/var/lib/cassandra/data/testdb2/ChangeLogIndex-g-10-Data.db sections=1 
> progress=0/15286 - 0%
>/var/lib/cassandra/data/testdb/ChangeLogIndex-g-48-Data.db sections=1 
> progress=0/90 - 0%
>/var/lib/cassandra/data/testdb/Testcoll-g-107-Data.db sections=2 
> progress=0/5276 - 0%
>/var/lib/cassandra/data/testdb/Testcoll2-g-74-Data.db sections=2 
> progress=0/470 - 0%
>/var/lib/cassandra/data/testdb/ChangeLogIndex-g-47-Data.db sections=1 
> progress=0/1156 - 0%
>/var/lib/cassandra/data/testdb/Testcoll-g-106-Data.db sections=2 
> progress=0/329027714 - 0%
>/var/lib/cassandra/data/testdb/ChangeLog-g-61-Data.db sections=2 
> progress=0/30212596 - 0%
>/var/lib/cassandra/data/testdb3/Testcoll2-g-42-Data.db sections=2 
> progress=0/774117 - 0%
>/var/lib/cassandra/data/testdb3/ChangeLogIndex-g-10-Data.db sections=1 
> progress=0/90 - 0%
> Streaming to: /10.10.10.231
>/var/lib/cassandra/data/testdb2/Testcoll-g-30-Data.db sections=2 
> progress=39059456000/87950308260 - 44%
>/var/lib/cassandra/data/testdb2/ChangeLog-g-30-Data.db sections=2 
> progress=0/7806077255 - 0%
>/var/lib/cassandra/data/testdb2/ChangeLogIndex-g-10-Data.db sections=1 
> progress=0/15286 - 0%
>/var/lib/cassandra/data/testdb3/Testcoll2-g-42-Data.db sections=2 
> progress=0/784033 - 0%
>/var/lib/cassandra/data/testdb3/ChangeLogIndex-g-10-Data.db sections=1 
> progress=0/90 - 0%
>/var/lib/cassandra/data/testdb/ChangeLogIndex-g-48-Data.db sections=1 
> progress=0/90 - 0%
>/var/lib/cassandra/data/testdb/Testcoll-g-107-Data.db sections=2 
> progress=0/10499 - 0%
>/var/lib/cassandra/data/testdb/Testcoll2-g-74-Data.db sections=2 
> progress=0/1042 - 0%
>/var/lib/cassandra/data/testdb/ChangeLogIndex-g-47-Data.db sections=1 
> progress=0/1156 - 0%
>/var/lib/cassandra/data/testdb/Testcoll-g-106-Data.db sections=2 
> progress=0/329965993 - 0%
>/var/lib/cassandra/data/testdb/ChangeLog-g-61-Data.db sections=2 
> progress=0/24633913 - 0%
> Streaming from: /10.10.10.231
>testdb: /var/lib/cassandra/data/testdb/ChangeLogIndex-g-47-Data.db 
> sections=1 progress=0/1156 - 0%
>testdb: /var/lib/cassandra/data/testdb/ChangeLogIndex-g-48-Data.db 
> sections=1 progress=0/90 - 0%
>testdb: /var/lib/cassandra/data/testdb/ChangeLog-g-61-Data.db sections=2 
> progress=0/24633913 - 0%
>testdb3: /var/lib/cassandra/data/testdb3/Testcoll2-g-42-Data.db sections=2 
> progress=0/784033 - 0%
>testdb: /var/lib/cassandra/data/testdb/Testcoll2-g-74-Data.db sections=2 
> progress=0/1042 - 0%
>testdb3: /var/lib/cassandra/data/testdb3/ChangeLogIndex-g-10-Data.db 
> sections=1 progress=0/90 - 0%
>testdb2: /var/lib/cassandra/data/testdb2/ChangeLog-g-30-Data.db sections=2 
> progress=0/7806077255 - 0%
>testdb2: /var/lib/cassandra/data/testdb2/ChangeLogIndex-g-10-Data.db 
> sections=1 progress=0/15286 - 0%
>testdb: /var/lib/cassandra/data/testdb/Testcoll-g-106-Data.db sections=2 
> progress=0/329965993 - 0%
>testdb2: /var/lib/cassandra/data/t

[jira] [Commented] (CASSANDRA-3705) Don't default the datacenter name in replication_strategies when the datacenter does not exist

2012-01-05 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13181068#comment-13181068
 ] 

Matthew F. Dennis commented on CASSANDRA-3705:
--

I'm in favor of always requiring a that the RF for at least one DC be set when 
creating a keyspace.

We should not allow a RF to be set for a DC that doesn't exist, or at the very 
least not default it to a DC that doesn't actually exist.

> Don't default the datacenter name in replication_strategies when the 
> datacenter does not exist
> --
>
> Key: CASSANDRA-3705
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3705
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.0.5
>Reporter: Joaquin Casares
>
> When using the AMI, which is currently set to use the EC2 snitch and the 
> NetworkTopologyStrategy is set to default by the cli, all keyspaces default 
> to datacenter1 being the datacenter name.
> So when running: 
> {noformat}
> create keyspace test;
> {noformat}
> we get this created:
> {noformat}
> Keyspace: test:
>   Replication Strategy: org.apache.cassandra.locator.NetworkTopologyStrategy
>   Durable Writes: true
> Options: [datacenter1:1]
> {noformat}
> This should error out immediately rather than letting the user go on to 
> discover the error later:
> {noformat}
> [default@test] set User['jsmith']['first'] = 'John';
> null
> UnavailableException()
> at 
> org.apache.cassandra.thrift.Cassandra$insert_result.read(Cassandra.java:15206)
> at 
> org.apache.cassandra.thrift.Cassandra$Client.recv_insert(Cassandra.java:858)
> at org.apache.cassandra.thrift.Cassandra$Client.insert(Cassandra.java:830)
> at org.apache.cassandra.cli.CliClient.executeSet(CliClient.java:902)
> at org.apache.cassandra.cli.CliClient.executeCLIStatement(CliClient.java:216)
> at 
> org.apache.cassandra.cli.CliMain.processStatementInteractive(CliMain.java:220)
> at org.apache.cassandra.cli.CliMain.main(CliMain.java:346)
> {noformat}
> Related link:
> http://www.datastax.com/support-forums/topic/new-datastax-ami-ami-fd23ec94-is-not-functionnal

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-3643) Cassandra C CQL driver

2011-12-16 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13171435#comment-13171435
 ] 

Matthew F. Dennis commented on CASSANDRA-3643:
--

I agree that C (and C++) access to Cassandra is pretty lacking at the moment.  
The problems you mention with thrift+c is part of the reason for that: any C 
driver for Cassandra would have to run over thrift (though the burden would be 
all the driver devs, not everyone who wants to use C and Cassandra together).

Even for CQL though, a C driver will still be communicating via thrift and 
having to deal with all the ... eccentricities that come along with it.

> Cassandra C CQL driver
> --
>
> Key: CASSANDRA-3643
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3643
> Project: Cassandra
>  Issue Type: Wish
>  Components: Drivers
>Affects Versions: 1.0.6
> Environment: all
>Reporter: Vlad Paiu
>Priority: Blocker
>  Labels: C, cql, driver
>
> It's really a shame that such a great project like Cassandra doesn't support 
> a way for it to be used from within a C application.
> Thrift has never worked for C or it is very poorly documented. Either way, 
> integrating Cassandra with an application written in C is just not possible 
> in an elegant manner at the current moment in time.
> With the development of CQL, it would really be great if one could run CQL 
> commands from within a C library, very much like libmysqlclient.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-2786) After a minor compaction, deleted key-slices are visible again

2011-11-21 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13154786#comment-13154786
 ] 

Matthew F. Dennis commented on CASSANDRA-2786:
--

I should mention that I did change one line in the test.  I changed 
{noformat}column.timestamp = getTimestamp();{noformat} to 
{noformat}column.setTimestamp(getTimestamp());{noformat} because otherwise 
thrift complained that the timestamp wasn't set.

> After a minor compaction, deleted key-slices are visible again
> --
>
> Key: CASSANDRA-2786
> URL: https://issues.apache.org/jira/browse/CASSANDRA-2786
> Project: Cassandra
>  Issue Type: Bug
>  Components: Core
>Affects Versions: 0.8.0, 0.8.7
> Environment: Reproduced on single Cassandra node (CentOS 5.5)
> Reproduced on single Cassandra node (Windows Server 2008)
>Reporter: rene kochen
>Assignee: Sylvain Lebresne
> Fix For: 0.8.1, 0.8.2
>
> Attachments: 0001-Fix-wrong-purge-of-deleted-cf.patch, 
> 2786_part2.patch, CassandraIssue.zip, CassandraIssueJava.zip
>
>
> After a minor compaction, deleted key-slices are visible again.
> Steps to reproduce:
> 1) Insert a row named "test".
> 2) Insert 50 rows. During this step, row "test" is included in a major 
> compaction:
>file-1, file-2, file-3 and file-4 compacted to file-5 (includes "test").
> 3) Delete row named "test".
> 4) Insert 50 rows. During this step, row "test" is included in a minor 
> compaction:
>file-6, file-7, file-8 and file-9 compacted to file-10 (should include 
> tombstoned "test").
> After step 4, row "test" is live again.
> Test environment:
> Single node with empty database.
> Standard configured super-column-family (I see this behavior with several 
> gc_grace settings (big and small values):
> create column family Customers with column_type = 'Super' and comparator = 
> 'BytesType;
> In Cassandra 0.7.6 I observe the expected behavior, i.e. after step 4, the 
> row is still deleted.
> I've included a .NET program to reproduce the problem. I will add a Java 
> version later on.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-2786) After a minor compaction, deleted key-slices are visible again

2011-11-21 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13154784#comment-13154784
 ] 

Matthew F. Dennis commented on CASSANDRA-2786:
--

Per previous phone conversation, I tested this against 0.8.7 twice.

Both times the attached java test ran to completion against a single node and 
output "Done".

When was this last reproduced?

> After a minor compaction, deleted key-slices are visible again
> --
>
> Key: CASSANDRA-2786
> URL: https://issues.apache.org/jira/browse/CASSANDRA-2786
> Project: Cassandra
>  Issue Type: Bug
>  Components: Core
>Affects Versions: 0.8.0, 0.8.7
> Environment: Reproduced on single Cassandra node (CentOS 5.5)
> Reproduced on single Cassandra node (Windows Server 2008)
>Reporter: rene kochen
>Assignee: Sylvain Lebresne
> Fix For: 0.8.1, 0.8.2
>
> Attachments: 0001-Fix-wrong-purge-of-deleted-cf.patch, 
> 2786_part2.patch, CassandraIssue.zip, CassandraIssueJava.zip
>
>
> After a minor compaction, deleted key-slices are visible again.
> Steps to reproduce:
> 1) Insert a row named "test".
> 2) Insert 50 rows. During this step, row "test" is included in a major 
> compaction:
>file-1, file-2, file-3 and file-4 compacted to file-5 (includes "test").
> 3) Delete row named "test".
> 4) Insert 50 rows. During this step, row "test" is included in a minor 
> compaction:
>file-6, file-7, file-8 and file-9 compacted to file-10 (should include 
> tombstoned "test").
> After step 4, row "test" is live again.
> Test environment:
> Single node with empty database.
> Standard configured super-column-family (I see this behavior with several 
> gc_grace settings (big and small values):
> create column family Customers with column_type = 'Super' and comparator = 
> 'BytesType;
> In Cassandra 0.7.6 I observe the expected behavior, i.e. after step 4, the 
> row is still deleted.
> I've included a .NET program to reproduce the problem. I will add a Java 
> version later on.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-3453) cassandra.yaml needs a snapshot_directory setting

2011-11-04 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13144423#comment-13144423
 ] 

Matthew F. Dennis commented on CASSANDRA-3453:
--

we provide a setting for controlling the data directories, which is currently a 
parent of all the snapshots directories, so you already have the referenced 
problems (i.e. checking a setting to find out where your snapshots are and 
potentially forgetting to update it on a new machine).

Furthermore, if I have N out of M keyspaces I want to snapshot and backup (or 
bulk load to another cluster or ...) after the snapshot I have to remember each 
keyspace so I can then construct a path to each snapshot (e.g. 
/path/from/data_dir/setting/$ks_name/snapshots/$snapshot_name) in order to do 
what I wanted to do with the snapshots instead of knowing everything I care 
about is in one place (e.g. /path/from/snapshot_dir/setting/$snapshot_name).



> cassandra.yaml needs a snapshot_directory setting
> -
>
> Key: CASSANDRA-3453
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3453
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Matthew F. Dennis
>
> cassandra.yaml really needs a config option for a snapshot directory. by 
> default it should be a peer folder with saved_caches, commitlog and data.
> assuming the value of that setting is /some/path/snapshot_dir then snapshots 
> would go into: /some/path/snapshot_dir/snapshot_name/keyspace_name/
> this makes automation, particularly around backups, much easier and gives 
> more control to ops

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (CASSANDRA-3453) cassandra.yaml needs a directory_setting

2011-11-04 Thread Matthew F. Dennis (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13144384#comment-13144384
 ] 

Matthew F. Dennis commented on CASSANDRA-3453:
--

1) It is easier to backup/copy all your snapshots for all keyspaces if they are 
in one place instead of having to iterate over all the keyspace directories 
looking for the snapshot directories.

2) it is safer when deleting snapshots (less chance of scripts/admins 
accidentally deleting the keyspace instead of all the snapshots. (e.g. rm -rf 
/some/path/$ksname/snapshots/* where $ksname ends up a blank space because of a 
script bug)
 
besides, checking a setting isn't very complex code wise ;)

> cassandra.yaml needs a directory_setting
> 
>
> Key: CASSANDRA-3453
> URL: https://issues.apache.org/jira/browse/CASSANDRA-3453
> Project: Cassandra
>  Issue Type: Improvement
>Reporter: Matthew F. Dennis
>
> cassandra.yaml really needs a config option for a snapshot directory. by 
> default it should be a peer folder with saved_caches, commitlog and data.
> assuming the value of that setting is /some/path/snapshot_dir then snapshots 
> would go into: /some/path/snapshot_dir/snapshot_name/keyspace_name/
> this makes automation, particularly around backups, much easier and gives 
> more control to ops

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira