Tim Allison created TIKA-4207:
---------------------------------

             Summary: PipesParser should have option to extract raw bytes of embedded files
                 Key: TIKA-4207
                 URL: https://issues.apache.org/jira/browse/TIKA-4207
             Project: Tika
          Issue Type: New Feature
            Reporter: Tim Allison


There are many use cases where text+metadata are important, but users also 
need the raw bytes from embedded files.

Let's make it possible to extract the usual rmeta content _and_ the raw 
bytes. This is a preliminary step that will offer more customization options 
than the proposal in TIKA-3703.

This is targeted to 3.x.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
