On 10/1/07, Jukka Zitting <[EMAIL PROTECTED]> wrote: > ...I can dig up > some of my old code and contribute it to commons-io and/or Tika...
Cool - I think it has its place in commons-io. > ...should we still create a temporary copy of the data while parsing or > can we rely on rereading the source of the data? A temporary copy > introduces quite a bit of overhead, but avoids nasty problems... I'd go for a temp copy, at least initially. As you mention, rereading can have "interesting" side effects sometimes... -Bertrand
