Oleg Bartunov wrote:

your polish_english, polish configurations uses ispell language
and slow, while english configuration doesn't contains ispell.
So, what's your complains ? Try add ispell dictionary to english
configuration and see timings.

Oh, so this is not anomalous ? These are the expected speeds for an
ispell dictionary ? I didn't realize that. Sorry for the bother
then. It just seemed way too slow to be practical.

You can see real timings using ts_lexize() function for different dictionaries (try several time to avoid cold-start problem) instead
of ts_headline(), which involves other factors.

On my test machine I see no real difference between very simple
dictionary and french ispell, snowball dictionaries:

ts_lexize seems to be just as fast for simple, polish_ispell and english_stem with the 'voila' argument.

polish_ispell is in fact *faster* for the lorem ipsum text repeated a couple times (10 ?). Which suggests that the issue is with ts_headline iteself.

I see no big difference in ts_headline as well:

dev-oleg=# select ts_headline('english','I can do voila', 'voila'::tsquery);
      ts_headline
-----------------------
 I can do <b>voila</b>
(1 row)

Time: 0.265 ms

Yes, for 4-word texts the results are similar.
Try that with a longer text and the difference becomes more and more significant. For the lorem ipsum text, 'polish' is about 4 times slower, than 'english'. For 5 repetitions of the text, it's 6 times, for 10 repetitions - 7.5 times...

This is 8.4.1 version of PostgreSQL.

An that was 8.3.8/OSX.

cheers,
Wojciech Knapik

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Reply via email to