Hi Devs, When using processors included from "nifi-media-nar", like ExtractMediaMetadata, to handle files with .ts or .m2ts extensions and MIME Type: video/mpeg or MIME Type: video/m2ts, the metadata isn't processed or offloaded to external tools exiftool or ffmpeg. Instead it returns metadata as shown below. This output is from the file attributes processed by AttributesToJSON.
{ "file.group" : "root", "file.lastModifiedTime" : "2024-07-30T13:37:03+1000", "X-TIKA:Parsed-By-Full-Set" : "org.apache.tika.parser.EmptyParser", "file.permissions" : "rw-r--r--", "uuid" : "50296a00-363a-404f-a9ab-24af547e1db2", "absolute.path" : "/cluster/nifi-1/testdata/", "path" : "/", "filename" : "sample_960x400_ocean_with_audio.ts", "X-TIKA:Parsed-By" : "org.apache.tika.parser.EmptyParser", "file.creationTime" : "2024-07-30T13:37:03+1000", "file.lastAccessTime" : "2024-07-31T20:45:33+1000", "file.owner" : "root", "Content-Type" : "application/octet-stream"} { "file.group" : "root", "file.lastModifiedTime" : "2024-07-30T13:37:03+1000", "X-TIKA:Parsed-By-Full-Set" : "org.apache.tika.parser.EmptyParser", "file.permissions" : "rw-r--r--", "uuid" : "5b804484-5ced-40c7-b266-0415a67f7c07", "absolute.path" : "/cluster/nifi-1/testdata/", "path" : "/", "filename" : "sample_1280x720_surfing_with_audio.m2ts", "X-TIKA:Parsed-By" : "org.apache.tika.parser.EmptyParser", "file.creationTime" : "2024-07-30T13:37:03+1000", "file.lastAccessTime" : "2024-07-31T21:34:38+1000", "file.owner" : "root", "Content-Type" : "application/octet-stream"} The sample files used for testing are from https://filesamples.com/formats/m2ts and https://filesamples.com/formats/ts Have tried this in NiFi versions 1.25 through to 2.0.0-M2 to no avail. Is it possible to fix this issue by adding files to "work/nar/extensions/nifi-media-nar-1.25.0.nar-unpacked/NAR-INF/bundled-dependencies," similar to how I included tika-parser-scientific-package-2.9.1.jar for processing other files? >From what I have found so far, the processor uses Tika and loading of >CompositeExternalParser was last updated in 2.8.0 as per >https://tika.apache.org/2.8.0/index.html and >http://issues.apache.org/jira/browse/TIKA-4022 I have installed tika 2.9.1 and 2.9.2 separately and can confirm it does process the files metadata correctly with MIME Type: video/mpeg or MIME Type: video/m2ts by using exiftool Below screenshot of NiFi test flow Kind Regards, Matthew Wilson This e-mail may contain confidential and privileged material for the sole use of the intended recipient. Any review, use, distribution or disclosure by others is strictly prohibited. If you are not the intended recipient (or authorised to receive for the recipient), please contact the sender by reply e-mail and delete all copies of this message.