Re: [VOTE] Release Apache Tika 2.8.0 Candidate #2

2023-05-14 Thread Dave Meikle
On Thu, 11 May 2023 at 21:08, Tim Allison wrote: > > Please vote on releasing this package as Apache Tika 2.8.0. > The vote is open for the next 72 hours and passes if a majority of at > least three +1 Tika PMC votes are cast. > > [ ] +1 Release this package as Apache Tika 2.8.0 > [ ] -1 Do not r

Re: [VOTE] Release Apache Tika 2.2.1 Candidate #3

2021-12-20 Thread Dave Meikle
On Mon, 20 Dec 2021 at 15:59, Tim Allison wrote: > A candidate for the Tika 2.2.1 release is available at: > https://dist.apache.org/repos/dist/dev/tika/2.2.1 > > The release candidate is a zip archive of the sources in: > https://github.com/apache/tika/tree/2.2.1-rc3/ > > The SHA-512 checksum of

Re: [VOTE] Release Apache Tika 2.0.0 Candidate #1

2021-07-18 Thread Dave Meikle
+1 Cheers, Dave On Wed, 14 Jul 2021 at 19:16, Tim Allison wrote: > All, > A candidate for the Tika 2.0.0 release is available > at: > https://dist.apache.org/repos/dist/dev/tika/2.0.0 > > The release candidate is a zip archive of

Re: logging formatter configuration compatible with StackDriver

2021-06-17 Thread Dave Meikle
wrote: >>> >> >>> >> On Fri, 11 Jun 2021, Cristian Zamfir wrote: >>> >> > I think for most people it would be quite critical to have logs working. Do >>> >> > you happen to know how I can reach out to the person maintaining the docker >>> >> > images https://hub.docker.com/u/dameikle to see if they are available to >>> >> > update the images? Sounds like it is mostly >>> >> > https://hub.docker.com/u/dameikle >>> >> >>> >> Paging our very own Dave Meikle! >>> >> >>> >> Nick >>> >>

Re: [ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer

2020-11-29 Thread Dave Meikle
Welcome, Peter! Great to have you on board. On Thu, 26 Nov 2020 at 02:08, Peter Lee wrote: > Many thanks to you, Tim. :) > > Hi, all > > I'm Peter Lee and I was an Apache Commons committer. I'm familiar with many > archivers and compressors. Feel free to ask me if you have some problems in > comp

Re: [VOTE] Release Apache Tika 1.25 Candidate #2

2020-11-25 Thread Dave Meikle
On Wed, 25 Nov 2020 at 12:20, Tim Allison wrote: > Please vote on releasing this package as Apache Tika 1.25. > The vote is open for the next 72 hours and passes if a majority of at > least three +1 Tika PMC votes are cast. > > [ ] +1 Release this package as Apache Tika 1.25 > [ ] -1 Do not relea

Fwd: Travel Assistance applications open. Please inform your communities

2018-02-17 Thread Dave Meikle
Hello, With ApacheCon NA coming up later this year, please see the below from the Travel Assistance Committee (TAC). Cheers, Dave The Travel Assistance Committee (TAC) are pleased to announce that travel assistance applications for ApacheCon NA 2018 are now open! We will be supporting Apa

Re: [VOTE] Release Apache Tika 1.16 Candidate #1

2017-07-12 Thread Dave Meikle
On 8 July 2017 at 03:40, Tim Allison wrote: > > A candidate for the Tika 1.16 release is available at: > https://dist.apache.org/repos/dist/dev/tika/ > > The release candidate is a zip archive of the sources in: > https://github.com/apache/tika/tree/1.16-rc1 > > The SHA1 checksum of the archive i

[VOTE] Apache Tika 1.5 RC2

2014-02-09 Thread Dave Meikle
Hi Guys, A new release candidate for the Tika 1.5 release is now available at: http://people.apache.org/~dmeikle/tika-1.5-rc2/ This fixes the issues with the POM version numbers for tika-dotnet and tika-java7 in Tika 1.5 RC1. The release candidate is a zip archive of the sources in: http://svn.a

Re: Problem parsing large (15MB) text files on Ubuntu 10.10

2013-05-19 Thread Dave Meikle
Thanks Ben. I have raised a JIRA ticket[1] so we can track work on this issue. Seems like it works fine on my Mac but can replicate your issues on various versions of Ubuntu (10.04, 10.10 and 12.04) in my VM Lab. Will do some straces to see what is going on. Cheers, Dave [1] https://issues.apac

Re: Broken socket pipe when writing a PNG to Tika (server mode)

2013-04-30 Thread Dave Meikle
Hi Ben, On 23 Apr 2013, at 08:22, Ben Turner wrote: > Hi Dave, > > Apologies to come back to this over a month later, but we had worked around / > not seen the issue for a while, but as we start to ramp up our testing it's > come back. > Investigating it from several angles today, the problem

Re: Windows Event file parser for Tika

2013-03-12 Thread Dave Meikle
Hi Vijay, On 12 Mar 2013, at 18:57, Vijayakumar Ramdoss wrote: > I would like to know whether a Tika parser for Windows event files (evt, evtx) > is available? Please advise me. I am afraid no such parser exists in Tika at present. If this is something you think we should have, feel free to raise a

Re: Broken socket pipe when writing a PNG to Tika (server mode)

2013-03-12 Thread Dave Meikle
Hi Ben, On 12 Mar 2013, at 05:33, Ben Turner wrote: > * We then talk to it via ruby sockets (for non-rubyists, this streams a > document from the file system into our local tika server over a simple > socket) : > > #!/usr/bin/env ruby > require 'socket' > TCPSocket.open('127.0.0.1', 12345) do

Re: Tika and invisible text from pdf

2013-03-09 Thread Dave Meikle
Hi Brad, On 21 Feb 2013, at 11:28, Brad Stallion wrote: > I'm extracting text from PDF files using my own sax handler. The problem is > that I get both visible and invisible text, i.e. text contained in invisible > parts of the layout. > How can I identify the invisible parts? We use PDFBox u

Re: Tika 1.3 server (JAX-WS) usage

2013-02-03 Thread Dave Meikle
Hi Chris, On 2 Feb 2013, at 06:58, "Mattmann, Chris A (388J)" wrote: > Hey Guys, > > I suggested that we should release the WAR file as part of our Tika > releases: > > http://s.apache.org/qM > > Let's try and include the WAR file as one of the published and signed > artifacts for 1.4. > >

Re: Tika 1.3 server (JAX-WS) usage

2013-02-01 Thread Dave Meikle
Hi, On 1 Feb 2013, at 15:03, AJ Weber wrote: > NEW QUESTION: > Can I use the standard "options" (like -x, -h, -j, -r, etc) to control how > the server responds? I think it's returning generic text output as if -t or > -T was passed. The tika-server only provides two command line options -p f

Re: Tika 1.3 server (JAX-WS) usage

2013-01-31 Thread Dave Meikle
Hi, On 31 Jan 2013, at 18:49, AJ Weber wrote: > Thanks for the quick reply. I would tend to agree with you. Do you know > where I can find those binaries? I don't see any links to them on the Tika > website. > > -AJ We haven't released tika-server as a binary, so you will need to build thi

Re: Tika 1.3 server (JAX-WS) usage

2013-01-31 Thread Dave Meikle
Hi, On 31 Jan 2013, at 18:28, AJ Weber wrote: > Did the JAX-WS (JSR 311 contrib) server module make it into the 1.3 release? > I see ServerCli in the javadocs and the .server package seems to contain > classes to support what I read on the wiki regarding the web-service > "version" of the se

[ANNOUNCE] Apache Tika 1.3 Released

2013-01-22 Thread Dave Meikle
may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: http://www.apache.org/dist/tika/KEYS For more information on Apache Tika, visit the project home page: http://tika.apache.org/ -- Dave

Re: fetching content from archives and images

2013-01-07 Thread Dave Meikle
Hi Maciej, On 7 Jan 2013, at 20:53, Maciej Liżewski wrote: > Hi, > > I downloaded tika sources and noticed that tests (ZipParserTest) check if > AutoDetectParser run with ZIP file return all file names and text content > extracted from those files... and this test passes without errors. however

Re: Return raw text from document

2012-08-18 Thread Dave Meikle
Hi Alex, On 17 Aug 2012, at 08:37, Alexander Cougarman wrote: > I'm using this C# code to call the parser directly via its URL; it returns > JSON: > > var url = @"http://localhost:8983/solr/update/extract"; > > var client = new WebClient(); > client.QueryString.Add("extractOnly","true"); > c

Re: tika-app-1.2.jar in server mode not responding (windows)

2012-07-21 Thread Dave Meikle
Hi Oliver, Wondering if you are getting confused between the Tika Application in server mode (-s or -p option) which allows socket level communication and the Tika Server which allows REST-ful communication. Using the Tika Application you can use the server mode to perform extraction via a TCP
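
For readers unfamiliar with the socket-level style of communication mentioned in this thread, here is a minimal Python sketch of the client side. It is an illustration only, not code from the thread: the function name extract_via_socket is hypothetical, the default port 12345 is simply borrowed from the Ruby example quoted in an earlier thread, and it assumes the listener reads the document until the client closes its sending side and then writes its result back on the same connection.

```python
import socket


def extract_via_socket(data: bytes, host: str = "127.0.0.1",
                       port: int = 12345, timeout: float = 30.0) -> bytes:
    """Stream raw document bytes to a TCP listener and return its reply.

    Hypothetical helper: host, port and the half-close convention are
    assumptions, not confirmed behaviour of any particular Tika release.
    """
    with socket.create_connection((host, port), timeout=timeout) as sock:
        # Send the whole document over the socket.
        sock.sendall(data)
        # Half-close the connection so the listener sees end-of-input
        # while we can still read its response.
        sock.shutdown(socket.SHUT_WR)
        chunks = []
        while True:
            chunk = sock.recv(65536)
            if not chunk:  # listener closed the connection: reply complete
                break
            chunks.append(chunk)
    return b"".join(chunks)
```

A caller would read a file in binary mode and pass its contents, for example extract_via_socket(open("report.pdf", "rb").read()), where report.pdf is a placeholder name. The REST-ful Tika Server discussed in the same message is a separate program spoken to over HTTP, so this sketch does not apply to it.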

Re: TikaApp "-s" option

2011-03-27 Thread Dave Meikle
Hi Zabrane, On Thursday, 17 March 2011, Zabrane Mickael wrote: > Could someone please explain to me the "-s | --server" option and how > to use it? > This option is part of the Tika Network Server functionality being implemented as part of TIKA-593[1]. At present the -server command does not do an