[ https://issues.apache.org/jira/browse/TIKA-105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Niall Pemberton updated TIKA-105: --------------------------------- Attachment: ExcelEventParser.java > Excel parser implementation based on POI's Event API > ---------------------------------------------------- > > Key: TIKA-105 > URL: https://issues.apache.org/jira/browse/TIKA-105 > Project: Tika > Issue Type: Improvement > Components: parser > Reporter: Niall Pemberton > Priority: Minor > Attachments: ExcelEventParser.java > > > Tika's existing ExcelParser implementation uses POI's HSSFWorkbook to extract > text from an Excel file. POI also provides an alternative "Event API"[1] for > processing Excel files - the advantage being that it has a much smaller > memory footprint, but at the cost of a slightly more complex API. > I have written an alternative excel parser implementation based on the Event > API - if its of interest to the Tika project I'll write a test case for it. > [1] http://poi.apache.org/hssf/how-to.html#event_api -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.