Excel parser implementation based on POI's Event API
----------------------------------------------------

                 Key: TIKA-105
                 URL: https://issues.apache.org/jira/browse/TIKA-105
             Project: Tika
          Issue Type: Improvement
          Components: parser
            Reporter: Niall Pemberton
            Priority: Minor


Tika's existing ExcelParser implementation uses POI's HSSFWorkbook to extract
text from an Excel file, which loads the whole workbook into memory. POI also
provides an alternative "Event API"[1] for processing Excel files. Its
advantage is a much smaller memory footprint, at the cost of a slightly more
complex API.
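
To illustrate the trade-off, here is a minimal sketch of the event-style
idea using only the JDK. It is not POI's API and not the proposed parser:
EventStyleSketch, CellEvent and CellListener are hypothetical names, and the
List passed in merely stands in for a stream of records read from a file.
The point is that the listener sees one cell at a time and keeps only the
extracted text, rather than holding a whole workbook model.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

/**
 * Illustrative only: every type here is hypothetical and is not part of
 * POI or Tika.
 */
public class EventStyleSketch {

    /** One cell's worth of data, handed to the listener and then dropped. */
    public static final class CellEvent {
        final int row;
        final int column;
        final String text;

        public CellEvent(int row, int column, String text) {
            this.row = row;
            this.column = column;
            this.text = text;
        }
    }

    /** Callback invoked once per cell, in document order. */
    public interface CellListener {
        void onCell(CellEvent event);
    }

    /** Stand-in for an event-style reader: pushes cells one at a time. */
    public static void process(Iterator<CellEvent> source, CellListener listener) {
        while (source.hasNext()) {
            listener.onCell(source.next());
        }
    }

    /**
     * Collects cell text, separating cells in the same row with a tab and
     * rows with a newline. Only the output buffer and the last row index
     * are retained between events.
     */
    public static String extractText(List<CellEvent> cells) {
        StringBuilder out = new StringBuilder();
        int[] lastRow = {-1}; // array so the lambda can update it
        process(cells.iterator(), event -> {
            if (lastRow[0] != -1) {
                out.append(event.row == lastRow[0] ? '\t' : '\n');
            }
            out.append(event.text);
            lastRow[0] = event.row;
        });
        return out.toString();
    }

    public static void main(String[] args) {
        List<CellEvent> cells = Arrays.asList(
                new CellEvent(0, 0, "Name"),
                new CellEvent(0, 1, "Qty"),
                new CellEvent(1, 0, "Widget"),
                new CellEvent(1, 1, "3"));
        System.out.println(extractText(cells));
    }
}
```

The extra complexity mentioned above shows up in the listener: state such
as the current row has to be tracked by hand, since there is no workbook
object to query.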

I have written an alternative Excel parser implementation based on the Event
API - if it's of interest to the Tika project, I'll write a test case for it.


[1] http://poi.apache.org/hssf/how-to.html#event_api

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.