ITAGAKI Takahiro wrote:
Andrew Dunstan <[EMAIL PROTECTED]> wrote:

OK, here is a patch that I think incorporates all the ideas discussed (including part of Mark Mielke's suggestion about optimising %_). There is now no special treatment of UTF8 other than its use of a faster NextChar macro.

This is a benchmark result of 1000 loops of SELECT count(*) INTO cnt FROM item WHERE i_title LIKE '%BABABABABARIBA%'
on the table with 10000 rows.

| SQL_ASCII | LATIN1 | UTF8 | EUC_JP ---------+-----------+--------+-------+--------- HEAD | 8017 | 8029 | 16928 | 18213 Patched | 7899 | 7887 | 9985 | 10370 [ms]

It improved the performance not only for UTF8, but also for other
multi-byte encodings and a bit for single-byte encodings.



Interesting. I infer from these results that the biggest bang here comes from abandoning CHAREQ and doing all comparisons byte-wise.

cheers

andrew

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

              http://archives.postgresql.org

Reply via email to