Thanks.  I thought that TikaInputStream already automatically saved to disk to 
allow re-reading.

From: Luís Filipe Nassif <lfcnas...@gmail.com>
Sent: Friday, February 19, 2021 3:44 PM
To: user@tika.apache.org
Subject: Re: Re-using a TikaStream

You could call TikaInputStream.getPath() at the beginning of your parser; it 
will spool the stream to a temporary file if it is not already file-based. After 
consuming the original InputStream, create a new one from that temp file.
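
A minimal sketch of the spool-then-reopen pattern described above, written 
with JDK classes only so that it runs without Tika on the classpath. The class 
and method names (SpoolAndReread, spool, readAll) are hypothetical; in a real 
parser the spooling step would be the TikaInputStream.getPath() call mentioned 
above, not a hand-written copy.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

class SpoolAndReread {

    // Copy a one-shot stream to a temp file so it can be opened repeatedly.
    // Assumption: this stands in for what TikaInputStream.getPath() does
    // when the stream is not already backed by a file.
    static Path spool(InputStream in) throws IOException {
        Path tmp = Files.createTempFile("spool-", ".tmp");
        Files.copy(in, tmp, StandardCopyOption.REPLACE_EXISTING);
        return tmp;
    }

    // Open a fresh stream on the spooled file and read it to the end.
    static String readAll(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        InputStream original =
                new ByteArrayInputStream("hello tika".getBytes(StandardCharsets.UTF_8));
        Path file = spool(original);
        try {
            String firstPass = readAll(file);   // first parse
            String secondPass = readAll(file);  // second parse, new stream
            System.out.println(firstPass.equals(secondPass));
        } finally {
            Files.deleteIfExists(file);         // the caller owns the temp file here
        }
    }
}
```

Each pass gets its own stream on the same file, so nothing depends on reset() 
being supported by the original stream.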

If you are using 2.0.0-ALPHA, there is:

https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/io/InputStreamFactory.java

Use it with the new methods on TikaInputStream:
public static TikaInputStream get(InputStreamFactory factory)
public InputStreamFactory getInputStreamFactory()
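
A rough sketch of the factory idea, again JDK-only so it compiles without 
Tika. StreamFactory is a hypothetical stand-in for Tika's InputStreamFactory, 
and its single getInputStream() method is an assumption about the interface's 
shape; with Tika 2.0.0-ALPHA you would pass the real factory to 
TikaInputStream.get(InputStreamFactory) and retrieve it later with 
getInputStreamFactory(), as listed above.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

class FactoryReread {

    // Hypothetical stand-in for org.apache.tika.io.InputStreamFactory.
    interface StreamFactory {
        InputStream getInputStream() throws IOException;
    }

    // Ask the factory for a brand-new stream and read it to the end.
    static String readAll(StreamFactory factory) throws IOException {
        try (InputStream in = factory.getInputStream()) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "hello tika".getBytes(StandardCharsets.UTF_8);
        // The factory knows how to re-create the stream from its source.
        StreamFactory factory = () -> new ByteArrayInputStream(data);

        String firstPass = readAll(factory);
        String secondPass = readAll(factory);
        System.out.println(firstPass.equals(secondPass));
    }
}
```

The point of the factory is that re-reading never touches an already consumed 
stream; every pass starts from a new one.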

Hope this helps,
Luis

On Fri, Feb 19, 2021 at 16:09, Peter Kronenberg 
<peter.kronenb...@torch.ai> wrote:
If I finish parsing a TikaStream, can I re-use the stream (before it is 
closed)?  I know you said that there is some magic behind the scenes where it 
spools it to a file.  Can I just call reset() to start from the beginning?

Peter


Peter Kronenberg  |  Senior AI Analytic ENGINEER
C: 703.887.5623
4303 W. 119th St., Leawood, KS 66209
WWW.TORCH.AI<http://www.torch.ai/>

