Re: Truncated text during Xapian indexing

2018-02-20 Thread Sebastian Hagedorn
Thanks for your reply, that was very interesting and helpful! --On 15. Februar 2018 um 16:12:23 +0100 Robert Stepanek wrote: Just out of curiosity, how is the mapping between a Xapian docid and a message file on disk achieved? I played around with xapian-delve and the Perl example simplesear

Re: Truncated text during Xapian indexing

2018-02-15 Thread Robert Stepanek
On Thu, Feb 15, 2018, at 13:08, Sebastian Hagedorn wrote: > Is the setting "search_skipdiacrit" in imapd.conf honored during the > indexing or is that only relevant while searching? Given your comment > regarding search normalization above I take it Umlaut characters are not > considered diacrit

Re: Truncated text during Xapian indexing

2018-02-15 Thread Sebastian Hagedorn
--On 15. Februar 2018 um 11:20:32 +0100 Robert Stepanek wrote: On Thu, Feb 15, 2018, at 10:44, Sebastian Hagedorn wrote: ^Simon^: Is that the first 4Mb of the text/html and/or text/plain parts, or first 4Mb of the entire message body, ignoring any mime parts? This limit defines the maximu

Re: Truncated text during Xapian indexing

2018-02-15 Thread Robert Stepanek
On Thu, Feb 15, 2018, at 10:44, Sebastian Hagedorn wrote: > ^Simon^: Is that the first 4Mb of the text/html and/or text/plain parts, or > first 4Mb of the entire message body, ignoring any mime parts? This limit defines the maximum byte length per MIME body-part of type "text". The byte length

Truncated text during Xapian indexing

2018-02-15 Thread Sebastian Hagedorn
Hi, as a follow-up to a discussion on IRC, I noticed the following diagnostic log line while replicating mailboxes to a new server with Xapian: "Xapian: truncating text from message mailbox …" Nicola replied that to her knowledge only the first 4 MB of a message are indexed, which led to the