[
https://issues.apache.org/jira/browse/NIFI-9647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Handermann updated NIFI-9647:
-----------------------------------
Summary: Add ExtractDocumentText Processor (was: Add support for full text
extraction of binary documents supported by Apache Tika)
> Add ExtractDocumentText Processor
> ---------------------------------
>
> Key: NIFI-9647
> URL: https://issues.apache.org/jira/browse/NIFI-9647
> Project: Apache NiFi
> Issue Type: Improvement
> Reporter: Mike Thomsen
> Assignee: Mike Thomsen
> Priority: Major
> Time Spent: 2h 40m
> Remaining Estimate: 0h
>
> This improvement will wrap Apache Tika using an updated version of Tim
> Spann's ExtractTextProcessor processor. I contacted Tim via LinkedIn, and he
> agreed to make it part of the NiFi code base going forward. In addition, this
> ticket adds the include-media profile which makes it possible to easily add
> the NiFi media bundle to a custom build of NiFi.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)