> On 19/10/2020 08:28 Aki Tuomi <aki.tu...@open-xchange.com> wrote: > > > > On 19/10/2020 02:49 PGNet Dev <pgnet....@gmail.com> wrote: > > > > > > I've since rebuilt/reconfig'd all parts of my setup from scratch; some good > > cleanup along the way. > > > > Atm, my entire system for send/recv, store/retrieve, + rules & search is > > working as I intend. Ok, mostly ... > > > > Except for this accented-character search mystery. I've got a _lot_ of > > mail with various languages in bodies, so _do_ need to get this sorted. > > > > > On 10/18/20 2:58 PM, John Fawcett wrote: > > ... > > > silly question > > ... > > > > hardly! > > > > > > creating 2 messages > > > > (1) > > Subject: tambien > > Body: tambien > > > > (2) > > Subject: también > > Body: también > > > > and two more, two avoid known stop words > > > > (3) > > Subject: aausdfrhyetdwgyatrdf > > Body: aausdfrhyetdwgyatrdf > > > > (4) > > Subject: aausdfrhyétdwgyatrdf > > Body: aausdfrhyétdwgyatrdf > > > > > > 1st, > > > > doveadm fts rescan -u myu...@example.com > > doveadm index -u myu...@example.com -q '*' > > > > TBird/solr searches, > > > > Subject: tambien ==> FOUND > > Subject: también ==> FOUND > > Subject: aausdfrhyetdwgyatrdf ==> FOUND > > Subject: aausdfrhyétdwgyatrdf ==> FOUND > > > > Body: tambien ==> FOUND > > Body: también ==> (empty) > > Body: aausdfrhyetdwgyatrdf ==> FOUND > > Body: aausdfrhyétdwgyatrdf ==> (empty) > > > > suggests it's _not_ (just) an existing-stopword problem > > > > notable/odd that subject searches are OK, but not body. > > > > > > > > On 10/18/20 2:58 PM, Shawn Heisey wrote: > > ... > > > If you are using something like the following schema: > > > https://raw.githubusercontent.com/dovecot/core/master/doc/solr-schema-7.7.0.xml > > > > I am > > > > > Solr does have a set of ICU filters, which I would recommend using rather > > > than the lowercase filter > > > > I'll give that a try ; haven't used solr outside of the dovecot context -- > > so need to find a doc/example on how, exactly, that's done correctly. > > > > > I cannot say much about the panic you're getting when using the doveadm > > > command. The stacktrace says it is happening in > > > dovecot code, not Solr code. And it looks like the panic had nothing to > > > do with FTS or Solr ... what I see points to > > > mailbox storage code. > > > > again/still > > > > doveadm fts lookup -u myu...@example.com <any key> "<any str>" > > > > _all_ panic, as above, > > > > > > doveadm(myu...@example.com): Panic: file mail-storage.c: line 2112 > > (mailbox_get_open_status): assertion failed: (box->opened) > > doveadm(myu...@example.com): Error: Raw backtrace: > > /usr/lib64/dovecot/libdovecot.so.0(backtrace_append+0x46) [0x7f61bba4ecc6] > > -> /usr/lib64/dovecot/libdovecot.so.0(backtrace_get+0x22) > > [0x7f61bba4ede2] > > -> /usr/lib64/dovecot/libdovecot.so.0(+0x10025b) [0x7f61bba5825b] > > -> /usr/lib64/dovecot/libdovecot.so.0(+0x100297) [0x7f61bba58297] > > -> /usr/lib64/dovecot/libdovecot.so.0(+0x59bc6) [0x7f61bb9b1bc6] > > -> /usr/lib64/dovecot/libdovecot-storage.so.0(+0x4779e) [0x7f61bbb6579e] > > -> /usr/lib64/dovecot/lib21_fts_solr_plugin.so(+0x5849) [0x7f61bb5b7849] > > -> /usr/lib64/dovecot/lib20_fts_plugin.so(fts_backend_lookup+0x51) > > [0x7f61bb1d9491] > > -> /usr/lib64/dovecot/doveadm/lib20_doveadm_fts_plugin.so(+0x3280) > > [0x7f61bb14b280] > > -> doveadm(+0x343cd) [0x55f5def873cd] > > -> doveadm(+0x34fe0) [0x55f5def87fe0] > > -> doveadm(doveadm_cmd_ver2_to_mail_cmd_wrapper+0x22d) [0x55f5def88e2d] > > -> doveadm(doveadm_cmd_run_ver2+0x4e8) [0x55f5def998d8] > > -> doveadm(doveadm_cmd_try_run_ver2+0x3e) [0x55f5def9992e] > > -> doveadm(main+0x1d4) [0x55f5def77cf4] > > -> /lib64/libc.so.6(__libc_start_main+0xf2) [0x7f61bb613042] > > -> doveadm(_start+0x2e) [0x55f5def781ce] > > Aborted > > > > Hopefully dovecot devs might comment further. > > > > I'll see what I find with using the ICU filters -- if perhaps anything > > changes > > Hi! > > I can reproduce your problem with the `fts lookup` command. Luckily it's > equivalent to running `doveadm search`. I'll open a bug about this. > > Dovecot FTS tokenization is not done, unless you have `use_libfts` in > fts_solr setting, in your case > > fts_solr = url=https://solr.example.com:8984/solr/dovecot/ use_libfts > > Without this, everything is sent to to solr as-is, which is then expected to > do all the work. > > Aki
Also, I noticed you had soft_commit=yes on separate line, that also needs to be on *same* line as fts_solr, as these settings are passed along to solr plugin for parsing. Aki