> Since the operation in the streams is inherently sequential, that
> schema makes sense in my opinion.

I agree. The new tell operation is now in the trunk.

> The addition of a new function to the tokeniser returning the offset
> of the beginning of the last read token would then use this
> pdf_stm_tell function.
>
> What do you think? If you agree I will implement it as part of my
> type4+tokeniser patch.

It looks good to me. The easiest way to implement it in the token
reader would probably be to replace the "reader->state = X" statements
with calls to a private function (e.g. enter_state) that would also
store the token's starting position. Then in flush_token, store this
value when emitting a token.

Many thanks for the hints :)

--
Jose E. Marchesi <[email protected]> http://www.jemarch.net
GNU Project http://www.gnu.org
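For reference, the enter_state/flush_token idea discussed above could look roughly like the sketch below. This is only an illustration under stated assumptions, not the code in the trunk: the toy_stm_t and toy_reader_t types, toy_stm_tell (standing in for pdf_stm_tell, whose real signature is not shown in this thread), toy_read_token and the whitespace-only tokenisation are all hypothetical. Only the names enter_state and flush_token come from the message.

```c
#include <stddef.h>
#include <ctype.h>

/* Hypothetical in-memory stream; toy_stm_tell stands in for pdf_stm_tell. */
typedef struct { const char *buf; size_t len; size_t pos; } toy_stm_t;

static size_t toy_stm_tell (const toy_stm_t *stm) { return stm->pos; }

enum toy_state { TOY_STATE_NONE, TOY_STATE_WORD };

/* Hypothetical token reader that only splits on whitespace. */
typedef struct
{
  toy_stm_t *stm;
  enum toy_state state;
  size_t state_pos;   /* stream offset at which the current state was entered */
  size_t token_pos;   /* offset of the beginning of the last emitted token */
  char token[64];
  size_t token_len;
} toy_reader_t;

/* Replaces every direct "reader->state = X" assignment and
   also remembers where in the stream the new state begins. */
static void enter_state (toy_reader_t *r, enum toy_state s)
{
  r->state = s;
  r->state_pos = toy_stm_tell (r->stm);
}

/* Emits the pending token, if any, and stores its starting offset.
   Returns 1 when a token was emitted, 0 otherwise. */
static int flush_token (toy_reader_t *r)
{
  if (r->state != TOY_STATE_WORD)
    return 0;
  r->token[r->token_len] = '\0';
  r->token_pos = r->state_pos;
  enter_state (r, TOY_STATE_NONE);
  return 1;
}

/* Reads the next whitespace-delimited token; returns 0 at end of input. */
static int toy_read_token (toy_reader_t *r)
{
  while (r->stm->pos < r->stm->len)
    {
      char c = r->stm->buf[r->stm->pos];
      if (isspace ((unsigned char) c))
        {
          r->stm->pos++;
          if (flush_token (r))
            return 1;
        }
      else
        {
          if (r->state == TOY_STATE_NONE)
            {
              r->token_len = 0;
              enter_state (r, TOY_STATE_WORD);  /* before consuming c */
            }
          if (r->token_len < sizeof r->token - 1)
            r->token[r->token_len++] = c;
          r->stm->pos++;
        }
    }
  return flush_token (r);
}
```

The point of routing every state change through enter_state is that the offset is recorded in one place, before the first character of the token is consumed, so flush_token only has to copy it. A new accessor returning token_pos would then give the caller the offset of the beginning of the last read token.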
