[ https://issues.apache.org/jira/browse/TIKA-823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Antoni Mylka updated TIKA-823:
------------------------------
    Attachment: testStarOffice-5.2-write.sdw
                testStarOffice-5.2-impress.sdd
                testStarOffice-5.2-draw.sda
                testStarOffice-5.2-calc.sdc

The files I want to distinguish inside POIFSContainerDetector. Impress and Draw have the same set of top-level names. I'd like to distinguish them by strings contained in the raw content of the CompObj entry, but I don't know how to get that content via POI. Please have a look at my user@poi question.

> Detect StarOffice files
> -----------------------
>
>                 Key: TIKA-823
>                 URL: https://issues.apache.org/jira/browse/TIKA-823
>             Project: Tika
>          Issue Type: Improvement
>    Affects Versions: 1.1
>            Reporter: Antoni Mylka
>         Attachments: testStarOffice-5.2-calc.sdc, testStarOffice-5.2-draw.sda, testStarOffice-5.2-impress.sdd, testStarOffice-5.2-write.sdw
>
>
> I would like both MimeTypes and the POIFSContainerDetector to be able to detect files created with Star Office Draw, Impress, Writer and Calc.
> I started working on this, but stumbled upon a POI issue, which I posted to poi-user.
> http://thread.gmane.org/gmane.comp.jakarta.poi.user/17857
> Nick? Yegor? I know you're on the Tika list as well. Could you take a look?
> How to get the raw content of CompObj entry?
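To make the idea concrete, here is a minimal sketch of the string check itself, assuming the raw bytes of the CompObj entry are already in hand. It deliberately leaves out the POI call, since how to obtain those bytes is the open question. The class name CompObjSniffer, the method names, and the marker strings "StarDraw" and "StarImpress" are assumptions made for illustration; the message does not say which strings the entry actually contains.

```java
import java.nio.charset.StandardCharsets;

public class CompObjSniffer {
    // Assumed marker strings; the real CompObj content may differ.
    static final byte[] STAR_DRAW = "StarDraw".getBytes(StandardCharsets.US_ASCII);
    static final byte[] STAR_IMPRESS = "StarImpress".getBytes(StandardCharsets.US_ASCII);

    // Naive byte-level substring search; fine for a small entry like CompObj.
    static boolean contains(byte[] haystack, byte[] needle) {
        if (needle.length == 0) {
            return true;
        }
        outer:
        for (int i = 0; i + needle.length <= haystack.length; i++) {
            for (int j = 0; j < needle.length; j++) {
                if (haystack[i + j] != needle[j]) {
                    continue outer;
                }
            }
            return true;
        }
        return false;
    }

    // Returns "impress", "draw", or "unknown" based on the assumed markers.
    static String classify(byte[] compObj) {
        if (contains(compObj, STAR_IMPRESS)) {
            return "impress";
        }
        if (contains(compObj, STAR_DRAW)) {
            return "draw";
        }
        return "unknown";
    }

    public static void main(String[] args) {
        // Hypothetical stand-in for the raw entry content, not real file data.
        byte[] fake = "\u0001\u0000junkStarImpress 5.0\u0000"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(classify(fake));
    }
}
```

Searching bytes rather than decoding the whole entry to a String avoids guessing the encoding of the binary parts around the marker.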