Content-Type":"message/rfc822"}]
On Thu, Jan 25, 2024 at 2:11 PM Gerardo Hernandez
wrote:
> Hi Ken,
>
> Unfortunately enforcing Tika to use TXTParser does not solve our problem
> at all, I mean it would work for very simple emails, but we also want to be
> able to par
From: Ken Krugler
Sent: Wednesday, January 24, 2024 02:40 PM
To: user@tika.apache.org
Cc: Tim Allison
Subject: Re: Parser removes file content and treats it as Metadata
You don't often get email from kkrugler_li...@transpac.com. Learn why this is
important&
ary 18, 2024 10:39 PMTo: user@tika.apache.org <user@tika.apache.org>Cc: Mikhail Gushinets <mikhail.gushin...@aparavi.com>Subject: Parser removes file content and treats it as Metadata Hi, We are using Tika parser to obtain files' contents and then we do some post processing on them, unfortu
On 23.01.2024 20:27, Gerardo Hernandez wrote:
Btw we are currently working on 2.7.0 version
Please retry with the current version (2.9.1) and tell if that is better.
Tilman
Btw we are currently working on 2.7.0 version
From: Gerardo Hernandez
Sent: Tuesday, January 23, 2024 01:26 PM
To: user@tika.apache.org
Subject: Re: Parser removes file content and treats it as Metadata
You don't often get email from g.hernan...@aparav
, January 20, 2024 11:54 AM
To: user@tika.apache.org
Cc: Mikhail Gushinets
Subject: Re: Parser removes file content and treats it as Metadata
You don't often get email from kkrugler_li...@transpac.com. Learn why this is
important<https://aka.ms/LearnAboutSenderIdentification>
I ass
to:user@tika.apache.org>>
> Cc: Mikhail Gushinets <mailto:mikhail.gushin...@aparavi.com>>
> Subject: Parser removes file content and treats it as Metadata
>
> Hi,
>
> We are using Tika parser to obtain files' contents and then we do some post
> proce
This is the input file; I think it was not uploaded correctly.
Best regards,
Gerardo
From: Gerardo Hernandez
Sent: Thursday, January 18, 2024 10:39 PM
To: user@tika.apache.org
Cc: Mikhail Gushinets
Subject: Parser removes file content and treats it as Metadata
Hi,
We are using Tika parser to obtain files' contents and then we do some post
processing on them, unfortunately we recently got some unexpected results from
the AutoDectectParser using the attached text file
[https://res.cdn.office.net/assets/mail/file-icon/png/txt_16x16.png]
SampleFile_M_00