Re: [HACKERS] IN joining

2004-03-08 Thread Dennis Haney




Tom Lane wrote:

  Dennis Haney [EMAIL PROTECTED] writes:
  
  

  Joining {b,c} to {a} does not meet any of those four allowed cases.

  

Exactly my point... So why ever bother creating the {b,c} node which is 
legal by the above definition?

  
  
We don't, because there is no such join clause.

  

No, but we create the equality via the implied equality mechanism...

select * from a, b where a.id = b.id3 and a.id in (select c.id2 from c);

rtable is (after in-optimization):
resno refname relid inFromCl
- - - 
1 a  17143 inFromCl
2 b  17151 inFromCl
3 IN_subquery [subquery]
4 c  17147 inFromCl

in gdb:
break joinrels.c:563
commands
call bms_is_subset(ininfo-lefthand, rel1-relids)
call bms_equal(ininfo-righthand, rel2-relids)
call bms_is_subset(ininfo-lefthand, rel2-relids)
call bms_equal(ininfo-righthand, rel1-relids)
x/t rel1-relids.words
x/t rel2-relids.words
x/t joinrelids.words
p jointype
printf "%s\n",
pretty_format_node_dump(nodeToString(((RestrictInfo*)((RestrictInfo*)restrictlist)-clause)-clause))
end

then we get this join:

Breakpoint 4, make_join_rel (root=0x8307bc8, rel1=0x8316920,
rel2=0x8316b10, jointype=JOIN_UNIQUE_INNER)
 at joinrels.c:563
563 switch (jointype)
$92 = 0 '\0'
$93 = 1 '\001'
$94 = 0 '\0'
$95 = 0 '\0'
0x83169ac: 0100
0x8316b9c: 0001
0x832670c: 00010100
$96 = JOIN_UNIQUE_INNER
 {OPEXPR
 :opno 96
 :opfuncid 0
 :opresulttype 16
 :opretset false
 :args (
 {VAR
 :varno 4
 :varattno 1
 :vartype 23
 :vartypmod -1
 :varlevelsup 0
 :varnoold 4
 :varoattno 1
 }

 {VAR
 :varno 2
 :varattno 1
 :vartype 23
 :vartypmod -1
 :varlevelsup 0
 :varnoold 2
 :varoattno 1
 }
 )
 }


-- 
Dennis





Re: [HACKERS] IN joining

2004-03-08 Thread Tom Lane
Dennis Haney [EMAIL PROTECTED] writes:
 Exactly my point... So why ever bother creating the {b,c} node which is 
 legal by the above definition?
 
 We don't, because there is no such join clause.
 
 No, but we create the equality via the implied equality mechanism...

 select * from a, b where a.id = b.id3 and a.id in (select c.id2 from c);

Oh, I had forgotten that your original example involved an implied
equality.  I don't see that anything is wrong though.  The join path
that will result from considering the implied equality will be like

((UNIQUE-ified subselect) INNER JOIN b) INNER JOIN a

which is perfectly legal and perhaps even a winner.  Once you stick a
UNIQUE on top of the IN's subselect, you can treat the IN as exactly
like a plain equality join.

[ thinks a bit... ]  Actually I guess there is a problem here: we won't
actually generate that plan, because this test is too strict:

/*
 * If we already joined IN's RHS to any part of its LHS in
 * either input path, then this join is not constrained (the
 * necessary work was done at a lower level).
 */
if (bms_overlap(ininfo-lefthand, rel1-relids) 
bms_is_subset(ininfo-righthand, rel1-relids))
continue;
if (bms_overlap(ininfo-lefthand, rel2-relids) 
bms_is_subset(ininfo-righthand, rel2-relids))
continue;

I think it should be

/*
 * If we already joined IN's RHS to anything else in
 * either input path, then this join is not constrained (the
 * necessary work was done at a lower level).
 */
if (bms_is_subset(ininfo-righthand, rel1-relids) 
!bms_equal(ininfo-righthand, rel1-relids))
continue;
if (bms_is_subset(ininfo-righthand, rel2-relids) 
!bms_equal(ininfo-righthand, rel2-relids))
continue;

Comments?

regards, tom lane

---(end of broadcast)---
TIP 9: the planner will ignore your desire to choose an index scan if your
  joining column's datatypes do not match


Re: [HACKERS] IN joining

2004-03-08 Thread Dennis Haney
Tom Lane wrote:

[SNIP: a repetion of my first post ;) ]

I think it should be

   /*
* If we already joined IN's RHS to anything else in
* either input path, then this join is not constrained (the
* necessary work was done at a lower level).
*/
   if (bms_is_subset(ininfo-righthand, rel1-relids) 
   !bms_equal(ininfo-righthand, rel1-relids))
   continue;
   if (bms_is_subset(ininfo-righthand, rel2-relids) 
   !bms_equal(ininfo-righthand, rel2-relids))
   continue;
Comments?
 

It's good.
It was pretty much what I was thinking was wrong to begin with.
Whether the generated plans are valid is a different issue ;)
--
Dennis
---(end of broadcast)---
TIP 9: the planner will ignore your desire to choose an index scan if your
 joining column's datatypes do not match


[HACKERS] IN joining

2004-03-05 Thread Dennis Haney
Hi

I have a problem understanding the code to make certain in join are 
performed properly. Specifically I have problems understading when 
IN_UNIQUE_{INNER,OUTER} is a valid jointype.
Its in joinrels.c:make_join_rel.

Consider this example:

SELECT * FROM a,b WHERE a.id = b.id AND (a.id) IN (SELECT c.id FROM c)

the possible execution trees are {{a,b}, {c}}, {{a,c},{b}} and the code 
seems to also permit {{b,c},{a}}. It is the latter I'm having problems with.

When joining {b} and {c} it will fall through and suggest a 
IN_UNIQUE_{INNER,OUTER} jointype.

My logic is this: {c} \in {c,b} so it is a valid plan according to the 
first check.
We have an issue according to the second check and we haven't done the 
work before according to the 3rd and 4th checks.
Since the lefthand of the IN {a} is not in either {b} or {c} we skip the 
IN_JOIN{_REVERSE}.
But since one of the relations is equal to the right side {c} of the IN 
we determine that IN_UNIQUE_{INNER,OUTER} is a valid jointype.

Now, the next join between {a} and {b,c} is the one I fail to understand 
when it can ever happen...

{c} \in {a,b,c} so it is a valid plan according to the first check.
We have an issue according to the second check.
Since we have no trace of the IN's left hand {a} in {b,c} 3rd and 4th 
check says we have not done the work?!?
The final checks fail because {c} != {b,c}, thus we determine it is an 
invalid plan.

My question is: When is it ever a valid jointype to use 
IN_UNIQUE_{INNER,OUTER}? Or am I missing something?

--
Dennis
---(end of broadcast)---
TIP 5: Have you checked our extensive FAQ?
  http://www.postgresql.org/docs/faqs/FAQ.html


Re: [HACKERS] IN joining

2004-03-05 Thread Tom Lane
Dennis Haney [EMAIL PROTECTED] writes:
 Consider this example:
 SELECT * FROM a,b WHERE a.id = b.id AND (a.id) IN (SELECT c.id FROM c)
 the possible execution trees are {{a,b}, {c}}, {{a,c},{b}} and the code 
 seems to also permit {{b,c},{a}}.

No, it does not --- as you say, that would give wrong answers.  That
case is eliminated by the tests following this comment:

 * JOIN_IN technique will work if outerrel includes LHS and
 * innerrel is exactly RHS; conversely JOIN_REVERSE_IN handles
 * RHS/LHS.
 *
 * JOIN_UNIQUE_OUTER will work if outerrel is exactly RHS;
 * conversely JOIN_UNIQUE_INNER will work if innerrel is
 * exactly RHS.

Joining {b,c} to {a} does not meet any of those four allowed cases.

regards, tom lane

---(end of broadcast)---
TIP 1: subscribe and unsubscribe commands go to [EMAIL PROTECTED]


Re: [HACKERS] IN joining

2004-03-05 Thread Dennis Haney






Tom Lane wrote:

  Dennis Haney [EMAIL PROTECTED] writes:
  
  
Consider this example:
SELECT * FROM a,b WHERE a.id = b.id AND (a.id) IN (SELECT c.id FROM c)
the possible execution trees are {{a,b}, {c}}, {{a,c},{b}} and the code 
seems to also permit {{b,c},{a}}.

  
  
No, it does not --- as you say, that would give wrong answers.  That
case is eliminated by the tests following this comment:

 * JOIN_IN technique will work if outerrel includes LHS and
 * innerrel is exactly RHS; conversely JOIN_REVERSE_IN handles
 * RHS/LHS.
 *
 * JOIN_UNIQUE_OUTER will work if outerrel is exactly RHS;
 * conversely JOIN_UNIQUE_INNER will work if innerrel is
 * exactly RHS.

Joining {b,c} to {a} does not meet any of those four allowed cases.
  

Exactly my point... So why ever bother creating the {b,c} node which is
legal by the above definition?


-- 
Dennis
use Inline C = q{void p(char*g){
printf("Just Another %s Hacker\n",g);}};p("Perl");





Re: [HACKERS] IN joining

2004-03-05 Thread Tom Lane
Dennis Haney [EMAIL PROTECTED] writes:
 Joining {b,c} to {a} does not meet any of those four allowed cases.
 
 Exactly my point... So why ever bother creating the {b,c} node which is 
 legal by the above definition?

We don't, because there is no such join clause.

regards, tom lane

---(end of broadcast)---
TIP 7: don't forget to increase your free space map settings