It could be discussed at our next community meetup, or at a dedicated
one if this topic would dominate.
On Tue, Aug 13, 2024 at 12:21 PM Tim Allison wrote:
All,
Let me know how I can help. If there’s any way we can move people to
tika-pipes, that’d be best.
We have a Solr emitter already in Tika, but that might add too much
complexity for people just beginning.
I’m strongly in favor of extricating Tika’s dependencies from Solr’s for
all of the reas
Alternatively, just like we did with the DataImportHandler (DIH)[1],
we migrate the Tika stuff to an independent project/home on GitHub and
people install it if they need it. Like the DIH, Solr's Tika
integration is quite popular/used so I expect it'll be maintained
instead of abandoned. At that
Their "Pipes" stuff is what we need to integrate ;-).
On Monday, August 12, 2024 at 01:15:11 PM EDT, Christos Malliaridis
wrote:
I tried to find a Java client for Tika, but with no success so far.
The version upgrade would reduce the vulnerabilities from about 21 CVEs to
6, so it would definitely be an improvement and probably worth the
migration effort until a client is available.
On Mon, 12 Aug 2024, 18:15 Jan Høydahl wrote:
Hi
Wrt Tika, I had been hoping that we could replace the extracting handler
with a processor that delegates to Tika Server but otherwise has feature
parity. It would remove tons of dependencies and attack surface from Solr.
I tried a POC once but could not find a suitable Java client for Tika Serve
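For illustration only, the delegation idea could be sketched with the JDK's built-in java.net.http client rather than a dedicated Tika client. This assumes a Tika Server instance listening on its default port 9998 and uses its /tika resource, which accepts a document via PUT and returns the extracted text. The class and method names (TikaServerSketch, buildExtractRequest, extractText) are hypothetical and not part of Solr or Tika, and this is nowhere near feature parity with the extracting handler.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.file.Files;
import java.nio.file.Path;

public class TikaServerSketch {

    // Assumption: a Tika Server is reachable here (9998 is its default port).
    static final URI DEFAULT_BASE = URI.create("http://localhost:9998");

    /** Builds a PUT request to the server's /tika resource, asking for plain text. */
    static HttpRequest buildExtractRequest(URI base, byte[] document) {
        return HttpRequest.newBuilder(base.resolve("/tika"))
                .header("Accept", "text/plain")
                .PUT(HttpRequest.BodyPublishers.ofByteArray(document))
                .build();
    }

    /** Sends the document to Tika Server and returns the extracted text. */
    static String extractText(HttpClient client, URI base, byte[] document)
            throws IOException, InterruptedException {
        HttpResponse<String> response = client.send(
                buildExtractRequest(base, document),
                HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            throw new IOException("Tika Server returned HTTP " + response.statusCode());
        }
        return response.body();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: TikaServerSketch <file>");
            return;
        }
        byte[] document = Files.readAllBytes(Path.of(args[0]));
        System.out.println(extractText(HttpClient.newHttpClient(), DEFAULT_BASE, document));
    }
}
```

A processor inside Solr would then only need an HTTP client and the server URL, with the parsers and their dependencies living in the separate Tika Server process.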
Hello everyone,
I've been looking into the dependencies of the project and thought that we
could update a couple of them, together with their license files (wherever
necessary).
I tried to start with Apache Tika and upgrade it from 1.28.5 to 2.9.2,
which is a huge step due to some restructuring o