https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729

--- Comment #8 from David Cook <dc...@prosentient.com.au> ---
(In reply to David Cook from comment #5)
> With words in ICU, it
> looks like we might only be removing punctuation that is preceded by
> whitespace (which seems to imitate the example chain.xml from
> https://software.indexdata.com/yaz/doc/yaz.pdf). 

Ooops. I misread that. Actually, it just strips both whitespace and punctuation
regardless of position.

Yeah, I don't think this will work with Zebra, since both the word and phrase
index registers are supposed to be normalized.

I am curious how Google does it. Historically, Google used to normalize its
queries and strip out symbols, but the past few years it seems to be more
intelligent (it will even convert "+" into "plus" so there is some good AI at
work there I reckon).

-- 
You are receiving this mail because:
You are watching all bug changes.
_______________________________________________
Koha-bugs mailing list
Koha-bugs@lists.koha-community.org
https://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-bugs
website : http://www.koha-community.org/
git : http://git.koha-community.org/
bugs : http://bugs.koha-community.org/

Reply via email to