Re: [HACKERS] SQL feature requests

Michael Glaesemann Thu, 23 Aug 2007 09:51:39 -0700


On Aug 23, 2007, at 10:47 , Ben Tilly wrote:

On 8/22/07, Michael Glaesemann <[EMAIL PROTECTED]> wrote:


*This* seems like a bug:
test=# select record_id
        , count(observation_id) as bar
        from observation
        group by record_id
            , case when true
                   then 'foo'
              end;
record_id | bar
-----------+-----
          1 |   4
          2 |   4
          3 |   2
(3 rows)


Why does it seem like a bug to you?

Turn it around, and tell me in what way is its behaviour surprising to
someone who knows SQL.  You asked to group on something that is the
same for all rows.  That group by condition did nothing.  (Except
rendered the syntax valid when it might not have been.)  As I would
expect.

Considering that I expect the GROUP BY clause to include only columnreferences (or other groupings of column references), notexpressions. Whether or not the value is the same, it surprises methat something other than a column reference is accepted at all. Ihadn't realized this behavior was accepted in PostgreSQL, but I learnsomething new every day.

My non-rigorous way of thinking about GROUP BY is that it groups thislisted columns when the values of the listed columns are the same. Anexpression that evaluates to anything other than a column namedoesn't provide any information about which column to considergrouped, and expressions don't evaluate to column names, oridentifiers in general. If I understand you correctly, a GROUP BYitem that isn't a column name would be a value that's applied to allcolumns, and the actual value is irrelevant—different values don'tchange the result.

So the only purpose it would serve would be to prevent a trailingcomma from raising a syntax error: you'd still need to explicitlylist the other columns (unless the implementation behavior is changedto extend the spec there as well). What this does is allow you to usesomething like this (psuedocode):


group_columns = [ 'foo', 'bar', 'baz' ]

group_column_list = ''

for col in group_columns { group_column_list += col + ',' } #group_column_list = "foo,bar,baz,"

group_by_clause = "GROUP BY $group_column_list CASE WHEN TRUE THEN'quux' END"


rather than

group_column_list = join group_columns, ',' # group_column_list ="foo,bar,baz"

group_by_clause = "GROUP BY $group_column_list"

I still feel I'm missing something. If that's it, it seems likesomething easy enough to handle in middleware. Sorry if it appearsI'm being dense. I've definitely learned things in this thread.

Furthermore ask yourself whether anyone who wrote that would likely
have written it by accident.

I don't see what that has to do with anything. There are plenty ofthings I can write on purpose that would be nonsense. You might evenconsider my posts as prime examples :)


Michael Glaesemann
grzm seespotcode net



---------------------------(end of broadcast)---------------------------
TIP 9: In versions below 8.0, the planner will ignore your desire to
      choose an index scan if your joining column's datatypes do not
      match

Re: [HACKERS] SQL feature requests

Reply via email to