I went ahead and wrote it up here
https://issues.apache.org/jira/browse/NIFI-12709

Thanks

On Wed, Jan 31, 2024 at 10:30 AM James McMahon <jsmcmah...@gmail.com> wrote:

> Happy to do that Joe. How do I create and submit a JIRA for consideration?
> I have not done one - at least, not for years.
> If you get me started, I will do a concise and thorough description in the
> ticket.
> Sincerely,
> Jim
>
> On Wed, Jan 31, 2024 at 12:12 PM Joe Witt <joe.w...@gmail.com> wrote:
>
>> James,
>>
>> Makes sense to create a JIRA to improve UnpackContent to extract these
>> attributes in the event of a zip file that happens to present them.  The
>> concept of lastModifiedDate does appear easily accessed if available in the
>> metadata.  Owner/Creator/Creation information looks less standard in the
>> case of a Zip but perhaps still capturable as extra fields.
>>
>> Thanks
>>
>> On Wed, Jan 31, 2024 at 10:01 AM James McMahon <jsmcmah...@gmail.com>
>> wrote:
>>
>>> I tried to use UnpackContent to extract the files within a zip file
>>> named ABC DEF (1).zip. (the filename has spaces in its name).
>>>
>>> UnpackContent seemed to work, but it did not preserve file attributes
>>> from the files in the zip. For example, the  lastModifiedTime   is not
>>> available so downstream I am unable to do
>>> this: 
>>> ${file.lastModifiedTime:toDate("yyyy-MM-dd'T'HH:mm:ssZ"):format("yyyyMMddHHmmss")}
>>>
>>> I did some digging and found that on the UnpackContent page, it says:
>>> file.lastModifiedTime  "The date and time that the unpacked file was
>>> last modified (*tar only*)."
>>>
>>> I need these file attributes for those files I extract from the zip. So
>>> as an alternative I tried configuring an ExecuteStreamCommand processor
>>> like this:
>>> Command Arguments  -c;"unzip -p -q < -"
>>> Command Path  /bin/bash
>>> Argument Delimiter   ;
>>>
>>> It throws these errors:
>>>
>>> 16:41:30 UTCERROR13023d28-6154-17fd-b4e8-7a30b35980ca
>>> ExecuteStreamCommand[id=13023d28-6154-17fd-b4e8-7a30b35980ca] Failed to
>>> write flow file to stdin due to Broken pipe: java.io.IOException: Broken
>>> pipe 16:41:30 UTCERROR13023d28-6154-17fd-b4e8-7a30b35980ca
>>> ExecuteStreamCommand[id=13023d28-6154-17fd-b4e8-7a30b35980ca] Transferring
>>> flow file FlowFile[filename=ABC DEF (1).zip] to nonzero status. Executable
>>> command /bin/bash ended in an error: /bin/bash: -: No such file or directory
>>>
>>> It does not seem to be applying the unzip to the stdin of the ESC
>>> processor. None of the files in the zip archive are output from ESC.
>>>
>>> What needs to be changed in my ESC configuration?
>>>
>>> Thank you in advance for any help.
>>>
>>>

Reply via email to