Re: [sqlite] BETWEEN and explicit collation assignment

Igor Tandetnik Thu, 22 Aug 2013 14:27:42 -0700

On 8/22/2013 4:06 PM, Roman Fleysher wrote:

Logically, I agree with Simon, collate modifier in table definition describes 
how indices should be built, nothing more.

But for what purpose are indexes built? Isn't it for the purpose of being used to speed up queries? Wouldn't you expect a query like


select count(*) from t where x = 'a';

to actually use an index on t(x), when available? And if you do, doesn't that mean that the comparison (x = 'a') should use the same collation as an index on t(x), that is, the collation associated with column x? And if so, then you are right back to the idea that the collation used by the comparison operator should, at least sometimes, be inferred from the collation associated with its operands. Therefore, the COLLATE clause in the table definition should apply not just to building indexes, but also to the behavior of comparisons. QED.


Where is the flaw in this chain of reasoning, in your opinion?
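
To make the argument concrete, here is a minimal sketch using Python's built-in sqlite3 module. The NOCASE collation on x, the index name t_x, and the sample rows are assumptions chosen for illustration; the thread does not show the actual schema. The exact wording of the query plan varies between SQLite versions, so the sketch only prints it.

```python
import sqlite3

# Hypothetical schema: column x carries NOCASE, and so does its index.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (x TEXT COLLATE NOCASE);
    CREATE INDEX t_x ON t(x);
    INSERT INTO t VALUES ('a'), ('A'), ('b');
""")

query = "SELECT count(*) FROM t WHERE x = 'a'"

# No explicit COLLATE: the comparison picks up NOCASE from column x,
# so both 'a' and 'A' are counted.
count = conn.execute(query).fetchone()[0]

# The last column of each EXPLAIN QUERY PLAN row is the readable detail text.
plan = [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + query)]

print(count)  # 2
print(plan)   # expected to describe a SEARCH that uses index t_x
```

Because the comparison and the index agree on the collation, the planner can look the value up in the index instead of scanning every row.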

Comparisons, as in these examples, should specify which collation is to be used 
and if it is different from the one used for indexing, then yes, Igor, the index 
cannot be used. I believe this is the present behavior already.

Obviously, if you explicitly specify a collation for a comparison that doesn't match that of an index, then the index cannot be used. The question is, if you do *not* explicitly specify a collation for a comparison, as in example above - would you expect an index to be used? Personally, I would, and I rather like the fact that SQLite currently does use an index in this case. Simon, on the other hand, appears to be saying that he would rather the index not be used, that the comparison be done using BINARY collation (though he never answered my direct question about it).
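
The two cases can be compared side by side with a short sketch in Python's sqlite3 module. The schema (x declared COLLATE NOCASE, an index named t_x) and the helper run() are hypothetical, introduced only for this illustration. Plan text differs across SQLite versions, so the sketch distinguishes only an index lookup (SEARCH) from a full pass (SCAN).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (x TEXT COLLATE NOCASE);
    CREATE INDEX t_x ON t(x);
    INSERT INTO t VALUES ('a'), ('A'), ('b');
""")

def run(where):
    """Return (matching row count, query plan detail lines) for a WHERE clause."""
    sql = "SELECT count(*) FROM t WHERE " + where
    count = conn.execute(sql).fetchone()[0]
    details = [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]
    return count, details

# Implicit collation: NOCASE comes from the column and matches the index.
implicit_count, implicit_plan = run("x = 'a'")

# Explicit BINARY collation: it no longer matches the NOCASE index,
# so the index cannot be used to look up the value.
explicit_count, explicit_plan = run("x = 'a' COLLATE BINARY")

print(implicit_count, implicit_plan)
print(explicit_count, explicit_plan)
```

In this sketch the implicit comparison counts both 'a' and 'A' via an index search, while the explicit BINARY comparison counts only 'a' and has to visit every entry.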

I do not think Simon asks to change the way collations work. He is asking to make clear 
how they work. I think Simon is asking for clarity so that an error like "(x collate A) 
= (y collate B)" cannot take place.

What to do with the case of (x = y) where x and y are two columns with different collations, then? How do you ensure that it cannot take place? And if you can't prevent it, then you need to do *something* in this case, and then you can do the exact same thing with "(x collate A) = (y collate B)" case - no need to invent new syntax or machinery to try and avoid it.
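
For reference, the "something" SQLite does in these cases is documented: an explicit COLLATE on either operand wins, with precedence to the left operand; otherwise a column's declared collation is used, again with precedence to the left operand. A small sketch with Python's sqlite3 module shows the effect. The two-column table and the helper matches() are assumptions made up for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (x TEXT COLLATE NOCASE, y TEXT COLLATE BINARY);
    INSERT INTO t VALUES ('a', 'A');
""")

def matches(where):
    """Return 1 if the single row satisfies the WHERE clause, else 0."""
    return conn.execute("SELECT count(*) FROM t WHERE " + where).fetchone()[0]

results = {
    # Two columns, no explicit collation: the left operand's collation is used.
    "x = y": matches("x = y"),  # NOCASE from x
    "y = x": matches("y = x"),  # BINARY from y
    # Explicit collations on both sides: the leftmost explicit one is used.
    "(x COLLATE BINARY) = (y COLLATE NOCASE)":
        matches("(x COLLATE BINARY) = (y COLLATE NOCASE)"),
    "(x COLLATE NOCASE) = (y COLLATE BINARY)":
        matches("(x COLLATE NOCASE) = (y COLLATE BINARY)"),
    # One explicit collation overrides the declared column collations.
    "y = x COLLATE NOCASE": matches("y = x COLLATE NOCASE"),
}

for expr, hit in results.items():
    print(expr, "->", hit)
```

Note that "x = y" and "y = x" give different answers here, which is exactly the kind of case that has to be resolved by some rule regardless of which syntax is used to attach collations.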

This is similar to the JOIN operation, which requires two tables to have a common 
column.


JOIN operation requires no such thing. I don't understand this analogy.

Using this analogy, I would write:

select * from t WHERE x = y USING NOCASE


How is this different from

select * from t WHERE x = y COLLATE NOCASE

I only see a superficial difference in the keyword used. What's the improvement?

I *do* understand what you and Simon are suggesting. I *do* understand that you want some kind of syntax that would hang a collation onto the operator, rather than its operands, for certain philosophical and/or aesthetic reasons. My point is, this would have to be done *in addition to*, not *instead of*, the existing mechanism (because of the whole "comparison must automatically pick the collation off the column in order to use the index" thing). So it's strictly an added complexity.

The burden is then on the authors of the proposal to demonstrate the benefits of the feature you are proposing, and to show that said benefits do in fact outweigh the cost of this added complexity. So I would ask you the same question I asked Simon: what exactly are the benefits of your approach? Is there something that can be done your way but can't be done the current way? Is there some bad outcome that's possible the current way but prevented your way? Do you just prefer your (as yet unspecified) syntax on purely aesthetic grounds?

--
Igor Tandetnik

_______________________________________________
sqlite-users mailing list
sqlite-users@sqlite.org
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users
