[ 
https://issues.apache.org/jira/browse/TIKA-4207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17831785#comment-17831785
 ] 

ASF GitHub Bot commented on TIKA-4207:
--------------------------------------

tballison merged PR #1699:
URL: https://github.com/apache/tika/pull/1699




> PipesParser should have option to extract raw bytes of embedded files
> ---------------------------------------------------------------------
>
>                 Key: TIKA-4207
>                 URL: https://issues.apache.org/jira/browse/TIKA-4207
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Tim Allison
>            Priority: Major
>
> There are many use cases, where text+metadata are important, but users also 
> need the raw bytes from embedded files.
> Let's make it possible to extract the usual rmeta content in _and_ the raw 
> bytes. This is a preliminary step that will offer more customization options 
> than the proposal in TIKA-3703.
> This is targeted to 3.x.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to