https://issues.apache.org/bugzilla/show_bug.cgi?id=54790

--- Comment #2 from Dmitry <[email protected]> ---
To be more precise:
- Opening fails with -Xmx800m
- Opening succeeds with -Xmx900m

Expected:
- Opening succeeds with -Xmx300m

To repeat: the DOC file is 70 MB. If needed, I can trim it down or upload it
as is to a file share.

> And to use TextPiece just as some lightweight proxy to DocumentStream is going
> to be very ineffective (due to required character encoding-decoding process).

Deferred encoding/decoding is not a problem: the only state it needs is the
flag "unicode=true|false". The problem is that DocumentStream is cut into
millions of tiny char buffers.
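
A minimal sketch of the lightweight-proxy idea, to show why the flag is enough.
LazyTextPiece is a hypothetical class, not the POI TextPiece API, and
ISO-8859-1 only stands in for the document's 8-bit code page (an assumption).
Each piece keeps an offset, a length and the flag into one shared byte array
and decodes on demand, so no per-piece char buffer is allocated up front:

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration, not the POI TextPiece class.
public class LazyTextPiece {
    private final byte[] stream;   // shared document stream, never copied
    private final int offset;      // byte offset of this piece in the stream
    private final int byteLength;  // length of this piece in bytes
    private final boolean unicode; // true: UTF-16LE, false: 8-bit code page

    public LazyTextPiece(byte[] stream, int offset, int byteLength, boolean unicode) {
        this.stream = stream;
        this.offset = offset;
        this.byteLength = byteLength;
        this.unicode = unicode;
    }

    // Number of characters, computed without decoding anything.
    public int length() {
        return unicode ? byteLength / 2 : byteLength;
    }

    // Decode only when the text is actually requested.
    public String text() {
        // Assumption: ISO-8859-1 stands in for the real 8-bit code page.
        Charset cs = unicode ? StandardCharsets.UTF_16LE : StandardCharsets.ISO_8859_1;
        return new String(stream, offset, byteLength, cs);
    }

    public static void main(String[] args) {
        byte[] ansi = "Hello ".getBytes(StandardCharsets.ISO_8859_1); // 6 bytes
        byte[] wide = "world".getBytes(StandardCharsets.UTF_16LE);    // 10 bytes

        // One shared stream holding an 8-bit piece followed by a UTF-16LE piece.
        byte[] stream = new byte[ansi.length + wide.length];
        System.arraycopy(ansi, 0, stream, 0, ansi.length);
        System.arraycopy(wide, 0, stream, ansi.length, wide.length);

        LazyTextPiece first = new LazyTextPiece(stream, 0, ansi.length, false);
        LazyTextPiece second = new LazyTextPiece(stream, ansi.length, wide.length, true);
        System.out.println(first.text() + second.text());
    }
}
```

With this layout the per-piece cost is a few fields, regardless of how many
pieces the piece table contains.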

> Also, disabling preserveTextTable means the whole text is reconstructed into
> a single buffer (StringBuilder).

The OOM happens before the whole text is reconstructed. I could accept 3x
memory consumption, that is 70 MB -> 210 MB of heap, but 10x is too much. And
yes, "preserveTextTable" is disabled by default as far as I can see, unless it
is enabled by a system property.
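
The arithmetic behind these numbers, as a small sketch. HeapBudget and its
method names are made up for illustration, and treating the -Xmx limit as the
actual consumption is a rough assumption:

```java
import java.util.Locale;

// Hypothetical helper, only to spell out the heap arithmetic.
public class HeapBudget {
    // Heap needed if the parser uses `factor` times the file size.
    static long expectedHeapMb(long fileMb, long factor) {
        return fileMb * factor;
    }

    // Ratio of a heap limit to the file size.
    static double factor(long heapMb, long fileMb) {
        return (double) heapMb / fileMb;
    }

    public static void main(String[] args) {
        long fileMb = 70;
        // Acceptable budget: 3x the file size.
        System.out.println("3x budget:  " + expectedHeapMb(fileMb, 3) + " MB");
        // Already too much: 10x the file size.
        System.out.println("10x budget: " + expectedHeapMb(fileMb, 10) + " MB");
        // Observed: fails at 800 MB, succeeds at 900 MB, so the real
        // factor lies somewhere between these two ratios.
        System.out.println(String.format(Locale.ROOT, "observed factor: %.1f to %.1f",
                factor(800, fileMb), factor(900, fileMb)));
    }
}
```

So the observed consumption is above even the 10x figure.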
