Re: Updating Dependencies - Apache Tika

2024-08-13 Thread David Smiley
It could be discussed at our next community meetup. Or a dedicated one for this topic if it will dominate.

On Tue, Aug 13, 2024 at 12:21 PM Tim Allison wrote:
> All,
>
> Let me know how I can help. If there’s any way we can move people to tika-pipes, that’d be best.
>
> We have a Solr emitte

Re: Updating Dependencies - Apache Tika

2024-08-13 Thread Tim Allison
All, Let me know how I can help. If there’s any way we can move people to tika-pipes, that’d be best. We have a Solr emitter already in Tika, but that might add too much complexity for people just beginning. I’m strongly in favor of extricating Tika’s dependencies from Solr’s for all of the reas

Re: Updating Dependencies - Apache Tika

2024-08-13 Thread David Smiley
Alternatively, just like we did with the DataImportHandler (DIH)[1], we migrate the Tika stuff to an independent project/home on GitHub and people install it if they need it. Like the DIH, Solr's Tika integration is quite popular/used so I expect it'll be maintained instead of abandoned. At that

Re: Updating Dependencies - Apache Tika

2024-08-12 Thread David Eric Pugh
Their "Pipes" stuff is what we need to integrate ;-).

On Monday, August 12, 2024 at 01:15:11 PM EDT, Christos Malliaridis wrote:
I tried to find a Java client for Tika, but with no success so far. The version upgrade would reduce the vulnerabilities from about 21 CVEs to 6, so it would d

Re: Updating Dependencies - Apache Tika

2024-08-12 Thread Christos Malliaridis
I tried to find a Java client for Tika, but with no success so far. The version upgrade would reduce the vulnerabilities from about 21 CVEs to 6, so it would definitely be an improvement and probably worth the migration effort until a client is available.

On Mon, 12 Aug 2024, 18:15 Jan Høydahl,

Re: Updating Dependencies - Apache Tika

2024-08-12 Thread Jan Høydahl
Hi, Regarding Tika, I had been hoping that we could replace the extracting handler with a processor that delegates to Tika Server but otherwise has feature parity. It would remove tons of dependencies and attack surface from Solr. I tried a POC once but could not find a suitable Java client for Tika Serve

Updating Dependencies - Apache Tika

2024-08-12 Thread Christos Malliaridis
Hello everyone, I've been looking into the dependencies of the project and thought that we could update a couple of them, together with their license files (wherever necessary). I tried to start with Apache Tika and upgrade it from 1.28.5 to 2.9.2, which is a huge step due to some restructuring o