Sorry, I forgot to say: I have tried Tika-app 2.3.0, and it extracts the text content successfully.
The problem only occurs with Tika server.

didon...@126.com

From: Tilman Hausherr
Date: 2023-04-20 02:28
To: user
Subject: Re: Tika server extraction failed

Please update to the current version, 2.7.0. I was able to process both files; however, I used tika-app in non-interactive mode.

If the problem still happens: what happens if you process the second file first?

Tilman

On 19.04.2023 04:07, didon...@126.com wrote:

Hi,

I have two files, 1111.xlsx and 2222.xlsx (included in the compressed package). I use Tika server (2.3.0) to extract their metadata and text content:

    curl -T 1111.xlsx http://127.0.0.1:12000/meta
    curl -T 2222.xlsx http://127.0.0.1:12000/meta

However, one extraction (1111.xlsx) was successful, while the other (2222.xlsx) failed. 2222.xlsx has only one more row of data than 1111.xlsx.

The same is true when extracting the text content:

    curl -T 1111.xlsx http://127.0.0.1:12000/tika --header "Accept: text/plain" -H "Content-Disposition: attachment; filename=1111.xlsx"
    curl -T 2222.xlsx http://127.0.0.1:12000/tika --header "Accept: text/plain" -H "Content-Disposition: attachment; filename=2222.xlsx"

Please assist!

didon...@126.com
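
For anyone reproducing this, here is a minimal sketch of the order-swap test suggested in the reply: it sends the failing file (2222.xlsx) before the working one, using the same /meta and /tika requests as in the original message. The helper names `run` and `extract` and the variables `TIKA_URL` and `DRY_RUN` are hypothetical, introduced only for this illustration. The `-sS` flags were added here to hide curl's progress meter while still showing errors. By default the script only prints the commands; set `DRY_RUN=0` to actually send them to a running server.

```shell
#!/bin/sh
# Base URL from the original message; override via the environment if needed.
TIKA_URL="${TIKA_URL:-http://127.0.0.1:12000}"
# DRY_RUN=1 (default) only prints each command; DRY_RUN=0 executes it.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: print or execute a command, depending on DRY_RUN.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Hypothetical helper: request metadata, then plain text, for one file.
extract() {
    file="$1"
    run curl -sS -T "$file" "$TIKA_URL/meta"
    run curl -sS -T "$file" "$TIKA_URL/tika" \
        -H "Accept: text/plain" \
        -H "Content-Disposition: attachment; filename=$file"
}

# Reversed order: the file that failed goes first.
for f in 2222.xlsx 1111.xlsx; do
    extract "$f"
done
```

If 2222.xlsx still fails when sent first, the failure is probably tied to the file itself rather than to server state left over from the previous request. That is only a rough heuristic, not a diagnosis.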