Sorry for my delay. Reviewing now.

On Wed, Sep 30, 2020 at 12:23 PM Alexander Klimetschek <aklim...@adobe.com.invalid> wrote:
> Hi Tika Committers,
>
> I was wondering if [1] has a chance of getting added. It brings the
> command line options on par with the Tika API for text extraction for the
> very common use case of getting „all text“ for indexing. The patch [2] has
> unit tests and is IMO very straightforward.
>
> We rely on it at Adobe in our use of Tika via the cli, and currently have
> to maintain our own patch build, and it would be great to be able to use
> the standard releases again.
>
> [1] https://issues.apache.org/jira/browse/TIKA-3044
> [2] https://patch-diff.githubusercontent.com/raw/apache/tika/pull/312.patch
>
> Thanks,
> Alexander Klimetschek
>
> Principal Scientist, Adobe
> Apache OpenWhisk & Jackrabbit committer