Re: Issue related to file mime type detection

2022-09-09 Thread Nick Burch

On Fri, 9 Sep 2022, Sindhu Mahadevappa wrote:

We are using tika-parsers 1.23


Tika 1.23 was released in December 2019! You should really use something 
much more recent


for comparing uploaded file mime type from file name as well as from 
file content for security purpose.


Apache Tika's detection is not recommended for security purposes. We try 
our best to give an answer. Our detection does not defend against 
specially crafted files which look like one type but is actually a 
different one.


mime type from file name as audio/mp4 and mine type from file content as 
video/mp4 so it is validating as file type not supported.


Try with a more recent version of Apache Tika. Make sure you include the 
Tika Parsers jar and dependencies for container aware detection within MP4 
files. If you still have an issue with Tika 2.4.1, raise a bug and upload 
a triggering file so we can investigate


Nick


RE: Issue related to file mime type detection

2022-09-15 Thread Sindhu Mahadevappa
Hi Team,

Thanks for the quick response.
We have been looking for the latest Tika 2.4.1 jar file, looks like it is not 
available anywhere.

Can you please share the link where we can get the latest 2.4.1 jar file, it 
will be very helpful.

Thanks & Regards
Sindhu Mahadevappa

> -Original Message-
> From: Nick Burch 
> Sent: Friday, September 9, 2022 3:48 PM
> To: Sindhu Mahadevappa 
> Cc: dev@tika.apache.org
> Subject: Re: Issue related to file mime type detection
>
> [EXTERNAL] This message originated from outside of ArisGlobal. Please treat 
> hyperlinks, attachments, and instructions in this email with caution. 
> ArisGlobal will not ask for you for credentials in any email.
>
> On Fri, 9 Sep 2022, Sindhu Mahadevappa wrote:
>> We are using tika-parsers 1.23
>
> Tika 1.23 was released in December 2019! You should really use
> something much more recent
>
>> for comparing uploaded file mime type from file name as well as from
>> file content for security purpose.
>
> Apache Tika's detection is not recommended for security purposes. We try our 
> best to give an answer. Our detection does not defend against specially 
> crafted files which look like one type but is actually a different one.
>
>> mime type from file name as audio/mp4 and mine type from file content
>> as
>> video/mp4 so it is validating as file type not supported.
>
> Try with a more recent version of Apache Tika. Make sure you include
> the Tika Parsers jar and dependencies for container aware detection
> within MP4 files. If you still have an issue with Tika 2.4.1, raise a
> bug and upload a triggering file so we can investigate
>
> Nick
> This email and any files transmitted with it are confidential and intended 
> solely for the use of the individual or entity to whom they are addressed. If 
> you are not the named addressee you should not disseminate, distribute or 
> copy this e-mail. Please notify the sender or system manager by email 
> immediately if you have received this e-mail by mistake and delete this 
> e-mail from your system. If you are not the intended recipient you are 
> notified that disclosing, copying, distributing or taking any action in 
> reliance on the contents of this information is strictly prohibited and 
> against the law.
>
This email and any files transmitted with it are confidential and intended 
solely for the use of the individual or entity to whom they are addressed. If 
you are not the named addressee you should not disseminate, distribute or copy 
this e-mail. Please notify the sender or system manager by email immediately if 
you have received this e-mail by mistake and delete this e-mail from your 
system. If you are not the intended recipient you are notified that disclosing, 
copying, distributing or taking any action in reliance on the contents of this 
information is strictly prohibited and against the law.


RE: Issue related to file mime type detection

2022-09-15 Thread Nick Burch

On Thu, 15 Sep 2022, Sindhu Mahadevappa wrote:
We have been looking for the latest Tika 2.4.1 jar file, looks like it 
is not available anywhere.


You can get the Tika App and Tika Server jars for 2.4.1 from
https://tika.apache.org/download.html

For the core and parser jars, manually downloading is not recommended as 
you risk missing dependencies. Just ask Maven or Gradle and they'll pull 
the latest jars for you


Nick