Hi Devs,
When using processors included from "nifi-media-nar", like 
ExtractMediaMetadata, to handle files with .ts or .m2ts  extensions and MIME 
Type: video/mpeg or MIME Type: video/m2ts, the metadata isn't processed or 
offloaded to external tools exiftool or ffmpeg. Instead it returns metadata as 
shown below. This output is from the file attributes processed by 
AttributesToJSON.

{  "file.group" : "root",  "file.lastModifiedTime" : 
"2024-07-30T13:37:03+1000",  "X-TIKA:Parsed-By-Full-Set" : 
"org.apache.tika.parser.EmptyParser",  "file.permissions" : "rw-r--r--",  
"uuid" : "50296a00-363a-404f-a9ab-24af547e1db2",  "absolute.path" : 
"/cluster/nifi-1/testdata/",  "path" : "/",  "filename" : 
"sample_960x400_ocean_with_audio.ts",  "X-TIKA:Parsed-By" : 
"org.apache.tika.parser.EmptyParser",  "file.creationTime" : 
"2024-07-30T13:37:03+1000",  "file.lastAccessTime" : 
"2024-07-31T20:45:33+1000",  "file.owner" : "root",  "Content-Type" : 
"application/octet-stream"}
{  "file.group" : "root",  "file.lastModifiedTime" : 
"2024-07-30T13:37:03+1000",  "X-TIKA:Parsed-By-Full-Set" : 
"org.apache.tika.parser.EmptyParser",  "file.permissions" : "rw-r--r--",  
"uuid" : "5b804484-5ced-40c7-b266-0415a67f7c07",  "absolute.path" : 
"/cluster/nifi-1/testdata/",  "path" : "/",  "filename" : 
"sample_1280x720_surfing_with_audio.m2ts",  "X-TIKA:Parsed-By" : 
"org.apache.tika.parser.EmptyParser",  "file.creationTime" : 
"2024-07-30T13:37:03+1000",  "file.lastAccessTime" : 
"2024-07-31T21:34:38+1000",  "file.owner" : "root",  "Content-Type" : 
"application/octet-stream"}

The sample files used for testing are from https://filesamples.com/formats/m2ts 
and https://filesamples.com/formats/ts 

Have tried this in NiFi versions 1.25 through to 2.0.0-M2 to no avail.
 Is it possible to fix this issue by adding files to 
"work/nar/extensions/nifi-media-nar-1.25.0.nar-unpacked/NAR-INF/bundled-dependencies,"
 similar to how I included tika-parser-scientific-package-2.9.1.jar for 
processing other files?

>From what I have found so far, the processor uses Tika and loading of 
>CompositeExternalParser was last updated in 2.8.0 as per 
>https://tika.apache.org/2.8.0/index.html and 
>http://issues.apache.org/jira/browse/TIKA-4022 
I have installed tika 2.9.1 and 2.9.2 separately and can confirm it does 
process the files metadata correctly with MIME Type: video/mpeg or MIME Type: 
video/m2ts by using exiftool

Below screenshot of NiFi test flow











Kind Regards,
Matthew Wilson

This e-mail may contain confidential and privileged material for the sole use 
of the intended recipient. Any review, use, distribution or disclosure by 
others is strictly prohibited. If you are not the intended recipient (or 
authorised to receive for the recipient), please contact the sender by reply 
e-mail and delete all copies of this message.

Reply via email to