Re: [HACKERS] english parser in text search: support for multiple words in the same position

Markus Wanner Mon, 02 Aug 2010 06:27:57 -0700

Hi,

On 08/02/2010 03:12 PM, Sushant Sinha wrote:

The current text parser already returns url and url_path. That already
increases the number of unique tokens.

Well, I think I simply turned that off to be able to search for plainwords. It still works for complete URLs, those are just treated liketext, then.

Earlier people have expressed the need to index urls/emails and
currently the text parser already does so. Reverting that would be a
regression of functionality. Further, a ranking function can take
advantage of direct match of a token.

That's a point, yes. However, simply making the same string turn uptwice in the tokenizer's output doesn't sound like the right solution tome. Especially considering that the query parser uses the very sametokenizer.


Regards

Markus Wanner

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] english parser in text search: support for multiple words in the same position

Reply via email to