Re: matching both accented and non-accented character for non-accented characters?

2022-08-08 Thread David Bremner
Bence Ferdinandy writes: > Thanks! I didn't know unicode equivalence existed, but it seems to be the > feature I want, so at least now I have a name for it :) And yes, actually > setting the stemmer would also be cool, I saw that Xapian has a Hungarian > stemmer but I kind of assumed all

Re: matching both accented and non-accented character for non-accented characters?

2022-08-08 Thread Bence Ferdinandy
Thanks! I didn't know unicode equivalence existed, but it seems to be the feature I want, so at least now I have a name for it :) And yes, actually setting the stemmer would also be cool, I saw that Xapian has a Hungarian stemmer but I kind of assumed all stemmers are applied somehow (although it

Re: matching both accented and non-accented character for non-accented characters?

2022-08-08 Thread David Bremner
Bence Ferdinandy writes: > Hi, > > I'm in the process of trying to set up reading email in the terminal and > just installed notmuch, which looks like a pretty awesome tool. I currently > have one question nagging me: > > I have a lot of mail in my native Hungarian, which properly written is

matching both accented and non-accented character for non-accented characters?

2022-08-07 Thread Bence Ferdinandy
Hi, I'm in the process of trying to set up reading email in the terminal and just installed notmuch, which looks like a pretty awesome tool. I currently have one question nagging me: I have a lot of mail in my native Hungarian, which properly written is full of characters like éáűúü, but if

Re: accented characters

2017-11-13 Thread David Bremner
Stefano Zacchiroli <z...@debian.org> writes: > > Unicode has a notion of canonical form that rearrange accented > characters in a sequence of non-accented characters + modifiers > https://en.wikipedia.org/wiki/Unicode_equivalence . A bunch of libraries > use that stuff to n

Re: accented characters

2017-11-13 Thread Stefano Zacchiroli
lt to stripping accents). As a random example/data point, chromium does that and when you search unaccented strings in a web page will find any combination of them with accents. Is, by far, my best UX experience w.r.t. accents on GNU/Linux. Unicode has a notion of canonical form that rearrange acce

Re: accented characters

2017-11-13 Thread David Bremner
Bruno Deremble writes: > A way to handle this could be to only index non accented words which > requires to add a filter before the indexing process. I looked at the code > and it seems that this should be handled by gmime? > there are also libraries that are supposed to

accented characters

2017-11-12 Thread Bruno Deremble
Hi, I am still new to notmuch and keep experimenting it; a lot of very interesting features. I realized that searching "été" and "ete" do not give the same answer which may be confusing in some situation (in case the sender has an accented name and may or may not sign his email with his accented