Re: Parser removes file content and treats it as Metadata

2024-01-25 Thread Tim Allison
Content-Type":"message/rfc822"}] On Thu, Jan 25, 2024 at 2:11 PM Gerardo Hernandez wrote: > Hi Ken, > > Unfortunately enforcing Tika to use TXTParser does not solve our problem > at all, I mean it would work for very simple emails, but we also want to be > able to par

Re: Parser removes file content and treats it as Metadata

2024-01-25 Thread Gerardo Hernandez
From: Ken Krugler Sent: Wednesday, January 24, 2024 02:40 PM To: user@tika.apache.org Cc: Tim Allison Subject: Re: Parser removes file content and treats it as Metadata You don't often get email from kkrugler_li...@transpac.com. Learn why this is important&

Re: Parser removes file content and treats it as Metadata

2024-01-24 Thread Ken Krugler
ary 18, 2024 10:39 PMTo: user@tika.apache.org <user@tika.apache.org>Cc: Mikhail Gushinets <mikhail.gushin...@aparavi.com>Subject: Parser removes file content and treats it as Metadata Hi, We are using Tika parser to obtain files' contents and then we do some post processing on them, unfortu

Re: Parser removes file content and treats it as Metadata

2024-01-23 Thread Tilman Hausherr
On 23.01.2024 20:27, Gerardo Hernandez wrote: Btw we are currently working on 2.7.0 version Please retry with the current version (2.9.1) and tell if that is better. Tilman

Re: Parser removes file content and treats it as Metadata

2024-01-23 Thread Gerardo Hernandez
Btw we are currently working on 2.7.0 version From: Gerardo Hernandez Sent: Tuesday, January 23, 2024 01:26 PM To: user@tika.apache.org Subject: Re: Parser removes file content and treats it as Metadata You don't often get email from g.hernan...@aparav

Re: Parser removes file content and treats it as Metadata

2024-01-23 Thread Gerardo Hernandez
, January 20, 2024 11:54 AM To: user@tika.apache.org Cc: Mikhail Gushinets Subject: Re: Parser removes file content and treats it as Metadata You don't often get email from kkrugler_li...@transpac.com. Learn why this is important<https://aka.ms/LearnAboutSenderIdentification> I ass

Re: Parser removes file content and treats it as Metadata

2024-01-20 Thread Ken Krugler
to:user@tika.apache.org>> > Cc: Mikhail Gushinets <mailto:mikhail.gushin...@aparavi.com>> > Subject: Parser removes file content and treats it as Metadata > > Hi, > > We are using Tika parser to obtain files' contents and then we do some post > proce

Re: Parser removes file content and treats it as Metadata

2024-01-18 Thread Gerardo Hernandez
This is the input file; I think it was not uploaded correctly. Best regards, Gerardo From: Gerardo Hernandez Sent: Thursday, January 18, 2024 10:39 PM To: user@tika.apache.org Cc: Mikhail Gushinets Subject: Parser removes file content and treats it as Metadata

Parser removes file content and treats it as Metadata

2024-01-18 Thread Gerardo Hernandez
Hi, We are using Tika parser to obtain files' contents and then we do some post processing on them, unfortunately we recently got some unexpected results from the AutoDectectParser using the attached text file [https://res.cdn.office.net/assets/mail/file-icon/png/txt_16x16.png] SampleFile_M_00