Re: [HACKERS] like/ilike improvements

Andrew Dunstan Tue, 22 May 2007 09:54:52 -0700


Andrew Dunstan wrote:

Tom Lane wrote:
Andrew Dunstan <[EMAIL PROTECTED]> writes:
... It turns out (according to the analysis) that the only time weactually need to use NextChar is when we are matching an "_" in alike/ilike pattern.
I thought we'd determined that advancing bytewise for "%" was alsorisky,
in two cases:

1. Multibyte character set that is not UTF8 (more specifically, does not
have a guarantee that first bytes and not-first bytes are distinct)

I thought we disposed of the idea that there was a problem with charsetsthat didn't do first byte special.


And Dennis said:

Tom Lane skrev:
You could imagine trying to do
% a byte at a time (and indeed that's what I'd been thinking it did)
but that gets you out of sync which breaks the _ case.
It is only when you have a pattern like '%_' when this is a problemand we could detect this and do byte by byte when it's not. Now wecheck (*p == '\\') || (*p == '_') in each iteration when we scan overcharacters for '%', and we could do it once and have different loopsfor the two cases.

That's pretty much what the patch does now - It never tries to match asingle byte when it sees "_", whether or not preceeded by "%".


cheers

andrew




---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq

Re: [HACKERS] like/ilike improvements

Reply via email to