Sorry, I forgot to mention: I have tried Tika-app 2.3.0, and it extracts the
text content successfully.

The problem only occurs on Tika server.



didon...@126.com
 
From: Tilman Hausherr
Date: 2023-04-20 02:28
To: user
Subject: Re: Tika server extraction failed
Please update to the current version, 2.7.0. I was able to process both files;
however, I used tika-app in non-interactive mode.

If the problem still happens: what happens if you process the second file first?

Tilman

On 19.04.2023 04:07, didon...@126.com wrote:
Hi,

    I have two files, 1111.xlsx and 2222.xlsx (included in the compressed
package).

    I use Tika server (2.3.0) to extract their metadata and text content:
    curl -T 1111.xlsx http://127.0.0.1:12000/meta
    curl -T 2222.xlsx http://127.0.0.1:12000/meta

    However, the extraction of 1111.xlsx succeeded, while the extraction of
2222.xlsx failed.
    2222.xlsx has only one more row of data than 1111.xlsx.

    The same happens when extracting the text content:
    curl -T 1111.xlsx http://127.0.0.1:12000/tika --header "Accept: text/plain" \
        -H "Content-Disposition: attachment; filename=1111.xlsx"
    curl -T 2222.xlsx http://127.0.0.1:12000/tika --header "Accept: text/plain" \
        -H "Content-Disposition: attachment; filename=2222.xlsx"
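
For anyone who wants to reproduce this without curl, the requests above can be sketched in Python with only the standard library. This is a rough sketch, not a tested reproduction: the server address and file names are taken from the commands above, the helper names build_put and try_extract are made up for illustration, and the "*/*" Accept value for /meta simply mirrors what curl sends by default. It sends 2222.xlsx first, to check whether the order of the files matters, and prints the HTTP status so a failure is visible.

```python
import urllib.error
import urllib.request

# Server address taken from the curl commands in this thread.
BASE_URL = "http://127.0.0.1:12000"


def build_put(endpoint, filename, data, accept):
    """Build a PUT request similar to what `curl -T` sends (hypothetical helper)."""
    return urllib.request.Request(
        f"{BASE_URL}/{endpoint}",
        data=data,
        method="PUT",
        headers={
            "Accept": accept,
            "Content-Disposition": f"attachment; filename={filename}",
        },
    )


def try_extract(endpoint, filename, accept):
    """Send one file and return (HTTP status, response body as text)."""
    with open(filename, "rb") as fh:
        req = build_put(endpoint, filename, fh.read(), accept)
    try:
        with urllib.request.urlopen(req, timeout=60) as resp:
            return resp.status, resp.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        # Error responses still carry a body that may explain the failure.
        return err.code, err.read().decode("utf-8", "replace")


if __name__ == "__main__":
    # Second file first, to see whether the failure depends on the order.
    for name in ("2222.xlsx", "1111.xlsx"):
        for endpoint, accept in (("meta", "*/*"), ("tika", "text/plain")):
            status, body = try_extract(endpoint, name, accept)
            print(f"{name} /{endpoint}: HTTP {status}")
            print(body[:200])
```

The status code and the start of the response body for the failing file are usually the most useful details to include in a follow-up message.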

Please assist!



didon...@126.com
