https://bz.apache.org/bugzilla/show_bug.cgi?id=60707

            Bug ID: 60707
           Summary: [PATCH] Reading very large excel files using StAX made
                    easier.
           Product: POI
           Version: unspecified
          Hardware: PC
                OS: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: SXSSF
          Assignee: dev@poi.apache.org
          Reporter: renjithjava...@gmail.com
  Target Milestone: ---

Created attachment 34731
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=34731&action=edit
Patch that contains the new classes and files introduced for enabling this
functionality.

This API helps the user read very large Excel files and returns the data in
String format. It represents the Excel file as an instance of
‘org.apache.poi.xssf.streaming.reader.StreamedWorkbook’. This workbook is
subdivided into StreamedSheets > StreamedRows > StreamedCells. A StreamedCell
is the basic building block: it represents an Excel cell and holds the String
representation of the cell's value. To reduce memory usage, a StreamedCell
stores only the String value and the cell number. Since the string value is
exactly the same as seen in the Excel file, the user can convert it to whatever
type they require.
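
To illustrate the last point, here is a minimal sketch of converting string
cell values into typed values on the caller's side. The Cell class below is a
hypothetical stand-in that only mirrors the description above (a cell number
plus a String value); its field and method names are assumptions, not the
patch's StreamedCell API.

```java
import java.math.BigDecimal;
import java.util.Arrays;
import java.util.List;

final class StreamedCellSketch {

    /** Hypothetical stand-in for a streamed cell: cell number and String value only. */
    static final class Cell {
        final int cellNumber;
        final String value;

        Cell(int cellNumber, String value) {
            this.cellNumber = cellNumber;
            this.value = value;
        }
    }

    /** The caller decides the target type; here the string is parsed as a decimal. */
    static BigDecimal asDecimal(Cell cell) {
        return new BigDecimal(cell.value.trim());
    }

    /** Assumes column 1 holds a unit price and column 2 a quantity (illustrative layout). */
    static BigDecimal lineTotal(List<Cell> row) {
        return asDecimal(row.get(1)).multiply(asDecimal(row.get(2)));
    }

    public static void main(String[] args) {
        List<Cell> row = Arrays.asList(
                new Cell(0, "Widget"),
                new Cell(1, "12.50"),
                new Cell(2, "3"));
        System.out.println(lineTotal(row));
    }
}
```

Keeping only strings in the cell means no type information is held per cell;
the cost is that the caller must know which columns to parse and how.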

Apart from that, it uses a pull parser (the StAX API) for streaming, so that
the user has more control over the parsing. The user can read N rows of data,
process them, then read the next N rows, and so on.
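
The batch-wise reading described above can be sketched with the JDK's own StAX
classes (javax.xml.stream). This is not the code from the patch:
RowBatchReader, nextBatch and readAll are hypothetical names, and the sample
XML is a simplified sheet fragment in which every <v> holds a literal value
(in a real .xlsx file, <v> may instead hold an index into the shared strings
table).

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

/** Hypothetical helper that pulls rows from sheet XML in batches of N. */
final class RowBatchReader {

    static final String SAMPLE = "<sheetData>"
            + "<row r=\"1\"><c r=\"A1\"><v>1</v></c><c r=\"B1\"><v>2</v></c></row>"
            + "<row r=\"2\"><c r=\"A2\"><v>3</v></c><c r=\"B2\"><v>4</v></c></row>"
            + "<row r=\"3\"><c r=\"A3\"><v>5</v></c><c r=\"B3\"><v>6</v></c></row>"
            + "</sheetData>";

    private final XMLStreamReader xml;

    RowBatchReader(XMLStreamReader xml) {
        this.xml = xml;
    }

    /** Reads up to maxRows rows; an empty list means the input is exhausted. */
    List<List<String>> nextBatch(int maxRows) throws XMLStreamException {
        List<List<String>> rows = new ArrayList<>();
        List<String> current = null;
        // The caller drives the parser: nothing is read beyond the requested rows.
        while (rows.size() < maxRows && xml.hasNext()) {
            int event = xml.next();
            if (event == XMLStreamConstants.START_ELEMENT) {
                String name = xml.getLocalName();
                if ("row".equals(name)) {
                    current = new ArrayList<>();
                } else if ("v".equals(name) && current != null) {
                    current.add(xml.getElementText()); // leaves the parser on </v>
                }
            } else if (event == XMLStreamConstants.END_ELEMENT
                    && "row".equals(xml.getLocalName())) {
                rows.add(current);
                current = null;
            }
        }
        return rows;
    }

    /** Convenience wrapper: reads the whole document as a list of batches. */
    static List<List<List<String>>> readAll(String sheetXml, int batchSize) {
        try {
            XMLStreamReader reader = XMLInputFactory.newInstance()
                    .createXMLStreamReader(new StringReader(sheetXml));
            RowBatchReader batches = new RowBatchReader(reader);
            List<List<List<String>>> all = new ArrayList<>();
            List<List<String>> batch;
            while (!(batch = batches.nextBatch(batchSize)).isEmpty()) {
                all.add(batch); // a real caller would process the batch here
            }
            reader.close();
            return all;
        } catch (XMLStreamException e) {
            throw new IllegalStateException("Malformed sheet XML", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(readAll(SAMPLE, 2));
    }
}
```

Because the parser only advances when nextBatch is called, at most one batch
of rows is held in memory at a time, regardless of the size of the sheet.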

Patch attached. Please take a look and let me know your comments.

thanks,
Renjith
