https://bz.apache.org/bugzilla/show_bug.cgi?id=60284
Bug ID: 60284
Summary: OldExcelExtractor should throw an EncryptedDocumentException
Product: POI
Version: unspecified
Hardware: PC
OS: Windows NT
Status: NEW
Severity: normal
Priority: P2
Component: HSSF
Assignee: [email protected]
Reporter: [email protected]
On TIKA-2118, Seva Alekseyev shared a document that causes an
UnsupportedCodePage exception. The file is an old xls (BIFF5) that is
encrypted.
After looking through https://www.openoffice.org/sc/excelfileformat.pdf and
experimenting with some files that cause similar exceptions in Tika's
regression corpus, it appears that the contents of all records after a
FILEPASS record are encrypted, including the contents of the CODEPAGE record.
That would explain the UnsupportedCodePage exception: the code page value is
read from encrypted bytes.
Let's throw an EncryptedDocumentException ("Encryption not supported for old
Excel files") instead.
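
A minimal sketch of the check, assuming the usual BIFF record layout (2-byte
little-endian record id, 2-byte little-endian length, then the data) and the
record ids listed in the OpenOffice document (FILEPASS = 0x002F, EOF = 0x000A).
FilePassScan and its method names are hypothetical, not existing POI API, and
IllegalStateException stands in for POI's EncryptedDocumentException so that
the snippet compiles without POI on the classpath.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical helper, not part of POI: walks a raw BIFF record stream and
// reports whether a FILEPASS record appears before the first EOF record.
final class FilePassScan {
    static final int SID_FILEPASS = 0x002F; // marks an encrypted stream
    static final int SID_EOF = 0x000A;      // end of the current substream

    static boolean hasFilePass(byte[] stream) {
        ByteBuffer buf = ByteBuffer.wrap(stream).order(ByteOrder.LITTLE_ENDIAN);
        while (buf.remaining() >= 4) {
            int sid = buf.getShort() & 0xFFFF; // record id
            int len = buf.getShort() & 0xFFFF; // length of the record data
            if (sid == SID_FILEPASS) {
                return true;
            }
            if (sid == SID_EOF || len > buf.remaining()) {
                break; // end of substream, or truncated record
            }
            buf.position(buf.position() + len); // skip the record data
        }
        return false;
    }

    // Stand-in for throwing org.apache.poi.EncryptedDocumentException.
    static void assertNotEncrypted(byte[] stream) {
        if (hasFilePass(stream)) {
            throw new IllegalStateException(
                "Encryption not supported for old excel files");
        }
    }
}
```

In the extractor itself the same test could presumably be made while iterating
over records, before the CODEPAGE record is interpreted.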
I did find one file that doesn't appear to be encrypted (attached to
TIKA-2118): I can open it and copy and paste contents out of it. However, it
is write protected, so the inner contents are encrypted anyway.