Hi,

On 10/13/07, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote:
> On 10/13/07, Keith R. Bennett <[EMAIL PROTECTED]> wrote:
> > ...In order to accomplish the reading of an original resource only once, we
> > have the RereadableInputStream....
>
> Is that used for all parsers?

I actually managed to get rid of the re-reading of the input stream in
the Microsoft parsers, see TIKA-63.

> I'd prefer this to be an internal concern of Tika, rather than putting
> the burden on the user to decide if the input can be read several
> times safely. Unless someone really needs that feature now, of course.

+1 The parser class should have the best knowledge of how many passes
will be required for reading the stream. Most of the time, I guess, a
parser will either need just a single pass or will just read the whole
document into memory.

BR,

Jukka Zitting
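
For illustration only, here is a minimal sketch of a parser that keeps the multi-pass decision internal. It is not Tika code: the TwoPassParser class and its readFully and countLines helpers are hypothetical names, and the "two passes" (a line count, then text decoding) are placeholders for whatever a real parser would do. The caller hands over a plain InputStream and never needs to know whether it is read once or several times.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

/** Hypothetical parser that needs two passes over its input (not a Tika class). */
class TwoPassParser {

    /** The caller passes a plain stream; buffering is the parser's own concern. */
    static String parse(InputStream in) throws IOException {
        // Read the original resource exactly once, into memory.
        byte[] data = readFully(in);

        // First pass over the buffered bytes: count lines.
        int lines = countLines(new ByteArrayInputStream(data));

        // Second pass over the same bytes: decode the text (UTF-8 assumed).
        String text = new String(data, StandardCharsets.UTF_8);

        return "[" + lines + " lines] " + text;
    }

    /** Drains the stream into a byte array. */
    static byte[] readFully(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[8192];
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
        }
        return out.toByteArray();
    }

    /** Counts newline-terminated lines, plus a final unterminated line if any. */
    static int countLines(InputStream in) throws IOException {
        int lines = 0;
        int last = -1;
        int b;
        while ((b = in.read()) != -1) {
            if (b == '\n') {
                lines++;
            }
            last = b;
        }
        if (last != -1 && last != '\n') {
            lines++;
        }
        return lines;
    }
}
```

A parser that needs only a single pass would skip readFully and consume the stream directly; either way the choice stays inside the parser rather than with the user.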
