https://bz.apache.org/bugzilla/show_bug.cgi?id=60707
Bug ID: 60707 Summary: [PATCH] Reading very large excel files using StAX made easier. Product: POI Version: unspecified Hardware: PC OS: All Status: NEW Severity: normal Priority: P2 Component: SXSSF Assignee: dev@poi.apache.org Reporter: renjithjava...@gmail.com Target Milestone: --- Created attachment 34731 --> https://bz.apache.org/bugzilla/attachment.cgi?id=34731&action=edit Patch that contains the new classes anf files introduced for enabling this functionality. This API helps the user to 'read' very large excel files and return the data in String format. It represents the excel file as an instance of ‘org.apache.poi.xssf.streaming.reader.StreamedWorkbook’. This workbook can be sub divided into StreamedSheets> StreamedRows > StreamedCells. StreamedCells are the basic building block. StreamedCell represent the excel cell and holds the String representation of the excel cell value. In order to reduce the memory usage StreamedCell is restricted to store only the String value & Cell Number. Since the string value is exactly same as seen in excel file, user can format it to whatever type he requires. Apart from that, it uses the pull parser(StAX api) for streaming, so that the user has more control over the parsing. User can read N rows of data, process it and then read the next N blocks as so on.. Patch attached. Please take a look and let me know your comments. thanks, Renjith -- You are receiving this mail because: You are the assignee for the bug. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional commands, e-mail: dev-h...@poi.apache.org