Please update to the current version, 2.7.0. I was able to process both files; however, I used tika-app in non-interactive mode.

If the problem still happens: what happens if you do the second file first?
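
A minimal sketch of that check, assuming the server is still listening on 127.0.0.1:12000 as in the quoted message and that both files are in the current directory. The function names are made up for illustration; `/version` and `/meta` are the regular tika-server endpoints.

```shell
# Base URL taken from the quoted message; adjust if your server differs.
TIKA_URL="http://127.0.0.1:12000"

# Hypothetical helper: PUT one file to the /meta endpoint.
tika_meta() {
  curl -sS -T "$1" "$TIKA_URL/meta"
}

# Hypothetical helper: print the server version, then send the
# failing file first and the working file second.
check_reverse_order() {
  curl -sS "$TIKA_URL/version"; echo
  tika_meta 2222.xlsx
  tika_meta 1111.xlsx
}
```

Running `check_reverse_order` after the upgrade should show whether the failure follows the file or the order of the requests.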

Tilman

On 19.04.2023 04:07, didon...@126.com wrote:
Hi,

I have two files, 1111.xlsx and 2222.xlsx (included in the compressed package).

I use Tika server (2.3.0) to extract their metadata and text content:
curl -T 1111.xlsx http://127.0.0.1:12000/meta
curl -T 2222.xlsx http://127.0.0.1:12000/meta

However, the extraction of 1111.xlsx succeeded, while the extraction of 2222.xlsx failed.
2222.xlsx has only one more row of data than 1111.xlsx.

The same is true for extracting text data.
curl -T 1111.xlsx http://127.0.0.1:12000/tika --header "Accept: text/plain" -H "Content-Disposition: attachment; filename=1111.xlsx"
curl -T 2222.xlsx http://127.0.0.1:12000/tika --header "Accept: text/plain" -H "Content-Disposition: attachment; filename=2222.xlsx"

Please assist!

------------------------------------------------------------------------
didon...@126.com
