Thanks. I thought that TikaInputStream already automatically saved to disk to allow re-reading.
From: Luís Filipe Nassif <lfcnas...@gmail.com> Sent: Friday, February 19, 2021 3:44 PM To: user@tika.apache.org Subject: Re: Re-using a TikaStream You could call TikaInputStream.getPath() at the beginning of your parser, it will spool to file if not file based. After consuming the original inputStream, create a new one from the temp file created. If you are using 2.0.0-ALPHA, there is: https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/io/InputStreamFactory.java Use with the new methods from TikaInputStream: public static TikaInputStream get(InputStreamFactory factory) public InputStreamFactory getInputStreamFactory() Hope this helps, Luis Em sex., 19 de fev. de 2021 às 16:09, Peter Kronenberg <peter.kronenb...@torch.ai<mailto:peter.kronenb...@torch.ai>> escreveu: If I finish parsing a TikaStream, can I re-use the stream (before it is closed)? I know you said that there is some magic behind the scenes where it spools it to a file. Can I just call reset() to start from the beginning? Peter Peter Kronenberg | Senior AI Analytic ENGINEER C: 703.887.5623 [Torch AI]<http://www.torch.ai/> 4303 W. 119th St., Leawood, KS 66209 WWW.TORCH.AI<http://www.torch.ai/>