[SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Tomas Doran

I'm not sure if this is a bug, or not - but it looks like one to me.

if you say:

CREATE TABLE testtable (
col1   char(1),
   data   text
);

INSERT INTO testtable (col1, data) VALUES ('1', 'foobar');
INSERT INTO testtable (col1, data) VALUES ('2', 'foobarbaz');

The following queries all work:
INSERT INTO testtable (col1, data) VALUES (3::int, 'foobarbazquux');
SELECT * FROM testtable WHERE col1 = 3::int;
SELECT * FROM testtable WHERE col1 IN (1);
SELECT * FROM testtable WHERE col1 IN (1::int);

However these querys fail on 8.2.4, but work correctly on 8.1:
SELECT * FROM testtable WHERE col1 IN (1::int, 2::int);
SELECT * FROM testtable WHERE col1 IN (1, 2);

I could understand if the behavior was the same for single element IN  
clauses, and multiple element IN clauses - however as their behavior  
is different, and it used to work in 8.1


Cheers
Tom

---(end of broadcast)---
TIP 3: Have you checked our extensive FAQ?

  http://www.postgresql.org/docs/faq


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Marcin Stępnicki
Dnia Thu, 24 May 2007 12:20:54 +0100, Tomas Doran napisał(a):

> CREATE TABLE testtable (
>  col1   char(1),
> data   text
> );
> 
> INSERT INTO testtable (col1, data) VALUES ('1', 'foobar'); INSERT INTO
> testtable (col1, data) VALUES ('2', 'foobarbaz');
> 
> The following queries all work:
> INSERT INTO testtable (col1, data) VALUES (3::int, 'foobarbazquux');
> SELECT * FROM testtable WHERE col1 = 3::int; SELECT * FROM testtable WHERE
> col1 IN (1); SELECT * FROM testtable WHERE col1 IN (1::int);
> 
> However these querys fail on 8.2.4, but work correctly on 8.1: SELECT *
> FROM testtable WHERE col1 IN (1::int, 2::int); SELECT * FROM testtable
> WHERE col1 IN (1, 2);
> 
> I could understand if the behavior was the same for single element IN
> clauses, and multiple element IN clauses - however as their behavior is
> different, and it used to work in 8.1

I'm not sure if I understand you correctly, but it seems that you are
comparing apples to oranges here (integer and character values). I am a
big fan of weakly typed languages like Python myself, but this situation
is different. I'd say that PostgreSQL 8.1 did a cast somewhere "behind the
scenes" but personally I think it is a bad idea. Consider:

SELECT * FROM testtable WHERE col1::int IN (1, 2);

instead. 

-- 
| And Do What You Will be the challenge | http://apcoln.linuxpl.org
|So be it in love that harms none   | http://biznes.linux.pl
|   For this is the only commandment.   | http://www.juanperon.info
`---*  JID: [EMAIL PROTECTED] *---' http://www.naszedzieci.org 



---(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Peter Eisentraut
Am Donnerstag, 24. Mai 2007 13:20 schrieb Tomas Doran:
> CREATE TABLE testtable (
>      col1   char(1),
>     data   text
> );

> The following queries all work:
> INSERT INTO testtable (col1, data) VALUES (3::int, 'foobarbazquux');
> SELECT * FROM testtable WHERE col1 = 3::int;
> SELECT * FROM testtable WHERE col1 IN (1);
> SELECT * FROM testtable WHERE col1 IN (1::int);

> However these querys fail on 8.2.4, but work correctly on 8.1:
> SELECT * FROM testtable WHERE col1 IN (1::int, 2::int);
> SELECT * FROM testtable WHERE col1 IN (1, 2);

All of this is strictly speaking incorrect anyway.  And the queries that do 
work will most likely start not working in a future version.  All of this is 
a gradual effort to reduce excessive automatic type casting.

I suggest you fix your application.

-- 
Peter Eisentraut
http://developer.postgresql.org/~petere/

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Tomas Doran


On 24 May 2007, at 12:34, Marcin Stępnicki wrote:


Dnia Thu, 24 May 2007 12:20:54 +0100, Tomas Doran napisał(a):


CREATE TABLE testtable (
 col1   char(1),
data   text
);

INSERT INTO testtable (col1, data) VALUES ('1', 'foobar'); INSERT  
INTO

testtable (col1, data) VALUES ('2', 'foobarbaz');

The following queries all work:
INSERT INTO testtable (col1, data) VALUES (3::int, 'foobarbazquux');
SELECT * FROM testtable WHERE col1 = 3::int; SELECT * FROM  
testtable WHERE

col1 IN (1); SELECT * FROM testtable WHERE col1 IN (1::int);

However these querys fail on 8.2.4, but work correctly on 8.1:  
SELECT *
FROM testtable WHERE col1 IN (1::int, 2::int); SELECT * FROM  
testtable

WHERE col1 IN (1, 2);

I could understand if the behavior was the same for single element IN
clauses, and multiple element IN clauses - however as their  
behavior is

different, and it used to work in 8.1


I'm not sure if I understand you correctly, but it seems that you are
comparing apples to oranges here (integer and character values).


Yep, totally - it's not nice, but we need to do it at $ork for  
hysterical raisins..


In the short term, adding the appropriate cast (in our code) isn't an  
option...


If I can do something to make it work in the postgres backend, then  
that'd be acceptable, and I'm investigating that..



I am a
big fan of weakly typed languages like Python myself, but this  
situation
is different. I'd say that PostgreSQL 8.1 did a cast somewhere  
"behind the

scenes" but personally I think it is a bad idea. Consider:

SELECT * FROM testtable WHERE col1::int IN (1, 2);

instead.


Yes, indeed - however I think it's a bug as 'SELECT * FROM testtable  
WHERE col1 IN (1)' DOES work, but 'SELECT * FROM testtable WHERE col1  
IN (1, 2)' does NOT work..


This is, at the very least, is a glaring inconsistency around how IN  
clauses are handled in different situations.


If this was a deliberate tightning of the behavior, is there a  
changelog entry/link to come docs about when this change happened  
that anyone can point me to?


Cheers
Tom

---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
  choose an index scan if your joining column's datatypes do not
  match


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Richard Huxton

Tomas Doran wrote:


On 24 May 2007, at 12:34, Marcin Stępnicki wrote:



I'm not sure if I understand you correctly, but it seems that you are
comparing apples to oranges here (integer and character values).


Yep, totally - it's not nice, but we need to do it at $ork for 
hysterical raisins..


In the short term, adding the appropriate cast (in our code) isn't an 
option...


If I can do something to make it work in the postgres backend, then 
that'd be acceptable, and I'm investigating that..


Well, if I were you, I'd just stick with 8.1 until you can fix the 
application.



I am a
big fan of weakly typed languages like Python myself, but this situation
is different. I'd say that PostgreSQL 8.1 did a cast somewhere "behind 
the

scenes" but personally I think it is a bad idea. Consider:

SELECT * FROM testtable WHERE col1::int IN (1, 2);

instead.


Yes, indeed - however I think it's a bug as 'SELECT * FROM testtable 
WHERE col1 IN (1)' DOES work, but 'SELECT * FROM testtable WHERE col1 IN 
(1, 2)' does NOT work..


This is, at the very least, is a glaring inconsistency around how IN 
clauses are handled in different situations.


What's biting you is the overly-loose matching against a single item (or 
all in 8.1). Most of the problems with PG seem to be where checks 
weren't strict enough in a previous version.


If this was a deliberate tightning of the behavior, is there a changelog 
entry/link to come docs about when this change happened that anyone can 
point me to?


My guess is that 8.2 is planning this by converting your IN into an 
array and testing against that. Actually, I can test that:


EXPLAIN ANALYSE SELECT * FROM foo WHERE a IN (1::char,2::char);
  QUERY PLAN
---
 Seq Scan on foo  (cost=0.00..36.12 rows=21 width=5) (actual 
time=0.029..0.033 rows=2 loops=1)

   Filter: (a = ANY ('{1,2}'::bpchar[]))
 Total runtime: 0.085 ms
(3 rows)

Yep. I don't think you can work round this by adding an implicit cast - 
only solution would be to hack the ANY code I suspect.


--
  Richard Huxton
  Archonet Ltd

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Marcin Stępnicki
Dnia Thu, 24 May 2007 12:54:48 +0100, Tomas Doran napisał(a):

> If I can do something to make it work in the postgres backend, then that'd
> be acceptable, and I'm investigating that..

>From what I know it's impossible without touching the source. 

> This is, at the very least, is a glaring inconsistency around how IN
> clauses are handled in different situations.

Yes, I think you are right.
 
> If this was a deliberate tightning of the behavior, is there a changelog
> entry/link to come docs about when this change happened that anyone can
> point me to?

I am not able to trace this particular change right now
(http://www.postgresql.org/docs/8.2/static/release.html). While you are
right that these changes should be perhaps better documented, 
such comparisions were a bad thing to do in the first place (I've learned
my lesson while upgrading from I think 7.1b3 to 7.1.3). Unfortunately I
see no other option than fixing them in your application.

-- 
| And Do What You Will be the challenge | http://apcoln.linuxpl.org
|So be it in love that harms none   | http://biznes.linux.pl
|   For this is the only commandment.   | http://www.juanperon.info
`---*  JID: [EMAIL PROTECTED] *---' http://www.naszedzieci.org 



---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Tomas Doran


On 24 May 2007, at 13:19, Richard Huxton wrote:


Tomas Doran wrote:

On 24 May 2007, at 12:34, Marcin Stępnicki wrote:


I'm not sure if I understand you correctly, but it seems that you  
are

comparing apples to oranges here (integer and character values).
Yep, totally - it's not nice, but we need to do it at $ork for  
hysterical raisins..
In the short term, adding the appropriate cast (in our code) isn't  
an option...
If I can do something to make it work in the postgres backend,  
then that'd be acceptable, and I'm investigating that..


Well, if I were you, I'd just stick with 8.1 until you can fix the  
application.


That would be a great idea, however we have several live clients who  
have been upgraded (with entire QA and customer QA phases of testing)  
before we found this. So we're now stuffed :)


Yes, indeed - however I think it's a bug as 'SELECT * FROM  
testtable WHERE col1 IN (1)' DOES work, but 'SELECT * FROM  
testtable WHERE col1 IN (1, 2)' does NOT work..
This is, at the very least, is a glaring inconsistency around how  
IN clauses are handled in different situations.


What's biting you is the overly-loose matching against a single  
item (or all in 8.1). Most of the problems with PG seem to be where  
checks weren't strict enough in a previous version.


The tightening in general is biting me, but if the answer was 'it was  
deliberate tightening', and the behavior was consistent, then we'd  
have just dealt with it - it's the in-consistent behavior that makes  
me think this is a bug (or at least a gotcha, as it's not what you  
expect)...




If this was a deliberate tightning of the behavior, is there a  
changelog entry/link to come docs about when this change happened  
that anyone can point me to?


My guess is that 8.2 is planning this by converting your IN into an  
array and testing against that. Actually, I can test that:


That was my guess too - but I'm having a bad day and haven't got any  
further in playing with it than posted, thanks.


I'll be looking through the source / changelogs this afternoon and  
work out when/why this started happening.



EXPLAIN ANALYSE SELECT * FROM foo WHERE a IN (1::char,2::char);
  QUERY PLAN
-- 
-
 Seq Scan on foo  (cost=0.00..36.12 rows=21 width=5) (actual  
time=0.029..0.033 rows=2 loops=1)

   Filter: (a = ANY ('{1,2}'::bpchar[]))
 Total runtime: 0.085 ms
(3 rows)

Yep. I don't think you can work round this by adding an implicit  
cast - only solution would be to hack the ANY code I suspect.


Our DB driver does the right thing with quoting the values for us if  
we use a later version than the one we're running. This may be the  
solution we take..


The idea of hacking in the ANY code and then running the server in  
our production environment scares me ;)


Cheers
Tom


---(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Tom Lane
Tomas Doran <[EMAIL PROTECTED]> writes:
> The tightening in general is biting me, but if the answer was 'it was  
> deliberate tightening', and the behavior was consistent, then we'd  
> have just dealt with it - it's the in-consistent behavior that makes  
> me think this is a bug (or at least a gotcha, as it's not what you  
> expect)...

The direction of the future is that *all* those queries are going to
fail, because they're relying on an implicit integer-to-text conversion,
and its days are numbered.  That might happen as soon as 8.3:
http://archives.postgresql.org/pgsql-hackers/2007-04/msg00017.php
but it's been on the radar screen for a very long time, eg
http://archives.postgresql.org/pgsql-bugs/2001-10/msg00108.php
http://archives.postgresql.org/pgsql-hackers/2002-04/msg00450.php
http://archives.postgresql.org/pgsql-sql/2004-01/msg00064.php
http://archives.postgresql.org/pgsql-hackers/2005-11/msg00510.php

I think the reason this particular behavior changed in 8.2 is the
re-implementation of multi-element IN tests as ScalarArrayOps;
but it's part of an intentional long-term tightening of SQL semantics,
and you're not going to get far with a proposal to revert it.
Fix your code.

regards, tom lane

---(end of broadcast)---
TIP 3: Have you checked our extensive FAQ?

   http://www.postgresql.org/docs/faq


Re: [SQL] Potential bug in postgres 8.2.4

2007-05-24 Thread Tomas Doran


On 24 May 2007, at 15:51, Tom Lane wrote:


Tomas Doran <[EMAIL PROTECTED]> writes:

The tightening in general is biting me, but if the answer was 'it was
deliberate tightening', and the behavior was consistent, then we'd
have just dealt with it - it's the in-consistent behavior that makes
me think this is a bug (or at least a gotcha, as it's not what you
expect)...


The direction of the future is that *all* those queries are going to
fail, because they're relying on an implicit integer-to-text  
conversion,

and its days are numbered.  That might happen as soon as 8.3:




That's no bad thing.


I think the reason this particular behavior changed in 8.2 is the
re-implementation of multi-element IN tests as ScalarArrayOps;
but it's part of an intentional long-term tightening of SQL semantics,
and you're not going to get far with a proposal to revert it.


I wasn't suggesting reverting it - just that lists of one element  
being treated differently to lists of >1 element is not what I  
expected :)



Fix your code.


Easier said than done, but thankfully also not strictly my problem.

We have found that a newer database driver version does 'the right  
thing' for us by quoting the values in the IN () list.


Our reason for not upgrading is that this driver connects using the  
v8 protocol, and ergo logs an error when connecting to our legacy  
postgres 7.2 databases (the error is logged in the DB backend as it  
doesn't understand the v8 protocol). Yes, we are a million years  
behind in upgrading - it's underway currently...


The number of machines using the db / making connections causes the  
volumes of errors seen in the server logs to go totally mental, so we  
can't use the new driver with the legacy DBs


The current plan is to rebuild our 7.2 server to just remove this  
error message, and upgrade the database driver - as that wins us a  
lot of other things too.


Thanks for the swift and comprehensive response guys!

Cheers
Tom


---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
  subscribe-nomail command to [EMAIL PROTECTED] so that your
  message can get through to the mailing list cleanly


[SQL] off topic

2007-05-24 Thread chester c young
on lwn I read that pg is having problems releasing because of a want of
reviewers.

although my C is far too rusty I'd like to help out, perhaps with doc
or testing.

can someone direct me to the appropriate site?


  
Fussy?
 Opinionated? Impossible to please? Perfect.  Join Yahoo!'s user panel and lay 
it on us. http://surveylink.yahoo.com/gmrs/yahoo_panel_invite.asp?a=7 


---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [SQL] off topic

2007-05-24 Thread Rodrigo De León

On 5/24/07, chester c young <[EMAIL PROTECTED]> wrote:

on lwn I read that pg is having problems releasing because of a want of
reviewers.

although my C is far too rusty I'd like to help out, perhaps with doc
or testing.

can someone direct me to the appropriate site?


See:

http://www.postgresql.org/docs/faqs.FAQ_DEV.html

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings


[SQL] group by day

2007-05-24 Thread Edward W. Rouse
I have an audit table that I am trying to get a count of the number of distinct 
entries per day by the external table key field. I
can do a 
 
select count(distinct(id)) from audit where timestamp >= '01-may-2007'
 
and get a total count. What I need is a way to group on each day and get a 
count per day such that the result would be something
like
 
datecount
01-may-2007107
02-may-2007215
03-may-200796
04-may-20070
 
 
I would prefer the 0 entries be included but can live without them. Thanks.
 
Oh, postgres 7.4 by the way.
 
Edward W. Rouse

ComSquared Systems, Inc.

770-734-5301

 

 


Re: [SQL] group by day

2007-05-24 Thread A. Kretschmer
am  Thu, dem 24.05.2007, um 14:49:47 -0400 mailte Edward W. Rouse folgendes:
> I have an audit table that I am trying to get a count of the number of 
> distinct
> entries per day by the external table key field. I can do a
>  
> select count(distinct(id)) from audit where timestamp >= '01-may-2007'
>  
> and get a total count. What I need is a way to group on each day and get a
> count per day such that the result would be something like
>  
> datecount
> 01-may-2007107
> 02-may-2007215
> 03-may-200796
> 04-may-20070
>  
>  
> I would prefer the 0 entries be included but can live without them. Thanks.

You are searching for GROUP BY.

A simple example:

test=*# select * from foo;
 ts  | val
-+-
 2007-05-01 08:00:00 |  10
 2007-05-01 08:00:00 |  20
 2007-05-02 10:00:00 |  20
 2007-05-02 11:00:00 |  30
(4 rows)

Time: 1.079 ms
test=*# select ts::date, sum(val) from foo group by 1;
 ts | sum
+-
 2007-05-02 |  50
 2007-05-01 |  30
(2 rows)


Andreas
-- 
Andreas Kretschmer
Kontakt:  Heynitz: 035242/47150,   D1: 0160/7141639 (mehr: -> Header)
GnuPG-ID:   0x3FFF606C, privat 0x7F4584DA   http://wwwkeys.de.pgp.net

---(end of broadcast)---
TIP 6: explain analyze is your friend


Re: [SQL] group by day

2007-05-24 Thread Rodrigo De León

On 5/24/07, Edward W. Rouse <[EMAIL PROTECTED]> wrote:



I have an audit table that I am trying to get a count of the number of
distinct entries per day by the external table key field. I can do a

select count(distinct(id)) from audit where timestamp >= '01-may-2007'

and get a total count. What I need is a way to group on each day and get a
count per day such that the result would be something like

datecount
01-may-2007107
02-may-2007215
03-may-200796
04-may-20070


I would prefer the 0 entries be included but can live without them. Thanks.

Oh, postgres 7.4 by the way.



Edward W. Rouse

ComSquared Systems, Inc.

770-734-5301


SELECT   TIMESTAMP, COUNT(DISTINCT(ID))
   FROM AUDIT
GROUP BY TIMESTAMP
ORDER BY TIMESTAMP

---(end of broadcast)---
TIP 5: don't forget to increase your free space map settings