Hi Devs,
When using processors included from "nifi-media-nar", like
ExtractMediaMetadata, to handle files with .ts or .m2ts extensions and MIME
Type: video/mpeg or MIME Type: video/m2ts, the metadata isn't processed or
offloaded to external tools exiftool or ffmpeg. Instead it returns metadata as
shown below. This output is from the file attributes processed by
AttributesToJSON.
{ "file.group" : "root", "file.lastModifiedTime" :
"2024-07-30T13:37:03+1000", "X-TIKA:Parsed-By-Full-Set" :
"org.apache.tika.parser.EmptyParser", "file.permissions" : "rw-r--r--",
"uuid" : "50296a00-363a-404f-a9ab-24af547e1db2", "absolute.path" :
"/cluster/nifi-1/testdata/", "path" : "/", "filename" :
"sample_960x400_ocean_with_audio.ts", "X-TIKA:Parsed-By" :
"org.apache.tika.parser.EmptyParser", "file.creationTime" :
"2024-07-30T13:37:03+1000", "file.lastAccessTime" :
"2024-07-31T20:45:33+1000", "file.owner" : "root", "Content-Type" :
"application/octet-stream"}
{ "file.group" : "root", "file.lastModifiedTime" :
"2024-07-30T13:37:03+1000", "X-TIKA:Parsed-By-Full-Set" :
"org.apache.tika.parser.EmptyParser", "file.permissions" : "rw-r--r--",
"uuid" : "5b804484-5ced-40c7-b266-0415a67f7c07", "absolute.path" :
"/cluster/nifi-1/testdata/", "path" : "/", "filename" :
"sample_1280x720_surfing_with_audio.m2ts", "X-TIKA:Parsed-By" :
"org.apache.tika.parser.EmptyParser", "file.creationTime" :
"2024-07-30T13:37:03+1000", "file.lastAccessTime" :
"2024-07-31T21:34:38+1000", "file.owner" : "root", "Content-Type" :
"application/octet-stream"}
The sample files used for testing are from https://filesamples.com/formats/m2ts
and https://filesamples.com/formats/ts
Have tried this in NiFi versions 1.25 through to 2.0.0-M2 to no avail.
Is it possible to fix this issue by adding files to
"work/nar/extensions/nifi-media-nar-1.25.0.nar-unpacked/NAR-INF/bundled-dependencies,"
similar to how I included tika-parser-scientific-package-2.9.1.jar for
processing other files?
>From what I have found so far, the processor uses Tika and loading of
>CompositeExternalParser was last updated in 2.8.0 as per
>https://tika.apache.org/2.8.0/index.html and
>http://issues.apache.org/jira/browse/TIKA-4022
I have installed tika 2.9.1 and 2.9.2 separately and can confirm it does
process the files metadata correctly with MIME Type: video/mpeg or MIME Type:
video/m2ts by using exiftool
Below screenshot of NiFi test flow
Kind Regards,
Matthew Wilson
This e-mail may contain confidential and privileged material for the sole use
of the intended recipient. Any review, use, distribution or disclosure by
others is strictly prohibited. If you are not the intended recipient (or
authorised to receive for the recipient), please contact the sender by reply
e-mail and delete all copies of this message.