Re: [bug] dovecot passes zero byte input stream when passing email with .eml attachment to apache tika parser, causes 'SEVERE' error

2023-08-18 Thread Tim Allison
I think with the soon-to-be-next-release of Tika, you can turn off throwing zero-byte file exceptions via the config. The exceptions should be harmless and you can safely ignore them. For some users, they need to know that there's a zero-byte file, hence the default behavior. It can also be

Re: tika 2.4.1 'Text extraction failed' errors when dovecot+fts 2.3.19.1 passes embedded *.eml (message/rfc822) files ; org.apache.tika.parser.mail.RFC822Parser or dovecot ?

2022-08-01 Thread Tim Allison
This looks like zero-bytes are getting passed to Tika via dovecot. I don't know enough about dovecot to figure out what's going on. On Sat, Jul 30, 2022 at 7:51 PM PGNet Dev wrote: > i'm running > > dovecot 2.3.19.1 + fts > tika-server-standard 2.4.1 > > dovecot is feeding tika