Sylvain Lebresne created CASSANDRA-16069:
--------------------------------------------

             Summary: Loss of functionality around null clustering when 
dropping compact storage
                 Key: CASSANDRA-16069
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-16069
             Project: Cassandra
          Issue Type: Bug
          Components: Legacy/CQL
            Reporter: Sylvain Lebresne


For backward compatibility reasons[1], it is allowed to insert rows where some 
of the clustering columns are {{null}} for compact tables. That support is a 
tad limited/inconsistent[2] but essentially you can do:
{noformat}
cqlsh:ks> CREATE TABLE t (k int, c1 int, c2 int, v int, PRIMARY KEY (k, c1, 
c2)) WITH COMPACT STORAGE;
cqlsh:ks> INSERT INTO t(k, c1, v) VALUES (1, 1, 1);
cqlsh:ks> SELECT * FROM t;

 k | c1 | c2   | v
---+----+------+---
 1 |  1 | null | 1

(1 rows)
cqlsh:ks> UPDATE t SET v = 2 WHERE k = 1 AND c1 = 1;
cqlsh:ks> SELECT * FROM t;

 k | c1 | c2   | v
---+----+------+---
 1 |  1 | null | 2

(1 rows)
{noformat}
This is not allowed on non-compact tables however:
{noformat}
cqlsh:ks> CREATE TABLE t2 (k int, c1 int, c2 int, v int, PRIMARY KEY (k, c1, 
c2));
cqlsh:ks> INSERT INTO t2(k, c1, v) VALUES (1, 1, 1);
InvalidRequest: Error from server: code=2200 [Invalid query] message="Some 
clustering keys are missing: c2"
cqlsh:ks> UPDATE t2 SET v = 2 WHERE k = 1 AND c1 = 1;
InvalidRequest: Error from server: code=2200 [Invalid query] message="Some 
clustering keys are missing: c2"
{noformat}
Which means that a user with a compact table that rely on this will not be able 
to use {{DROP COMPACT STORAGE}}.

Which is a problem for the 4.0 upgrade story. Problem to which we need an 
answer.

 
----
[1]: the underlying {{CompositeType}} used by such tables allows to provide 
only a prefix of components, so thrift users could have used such 
functionality. We thus had to support it in CQL, or those users wouldn't have 
been able to upgrade to CQL easily.

[2]: building on the example above, the value for {{c2}} is essentially 
{{null}}, yet none of the following is currently allowed:
{noformat}
cqlsh:ks> INSERT INTO t(k, c1, c2, v) VALUES (1, 1, null, 1);
InvalidRequest: Error from server: code=2200 [Invalid query] message="Invalid 
null value in condition for column c2"
cqlsh:ks> UPDATE t SET v = 2 WHERE k = 1 AND c1 = 1 AND c2 = null;
InvalidRequest: Error from server: code=2200 [Invalid query] message="Invalid 
null value in condition for column c2"
cqlsh:ks> SELECT * FROM c WHERE k = 1 AND c1 = 1 AND c2 = null;
InvalidRequest: Error from server: code=2200 [Invalid query] message="Invalid 
null value in condition for column c2"
{noformat}
Not only is that unintuitive/inconsistent, but the {{SELECT}} one means there 
is no way to select only the row. You can skip specifying {{c2}} in the 
{{SELECT}}, but this become a slice selection essentially, as shown below:
{noformat}
cqlsh:ks> INSERT INTO ct(k, c1, c2, v) VALUES (1, 1, 1, 1);
cqlsh:ks> SELECT * FROM ct WHERE k = 1 AND c1 = 1;

 k | c1 | c2   | v
---+----+------+---
 1 |  1 | null | 1
 1 |  1 |    1 | 1

(2 rows)
{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to