Hey David, I really appreciate your help, and thanks for the info below. I have tried all of those options: reading cells, using the extractor, and the XSSF SAX approach. They all give me the data in the regular sheets, but not the data in the pivot table cache, which is what I am hunting for.
One thing I tried that partially worked was to get the package parts and dump them into a zip:

    XSSFWorkbook wb = new XSSFWorkbook(inp);
    String outFilename = "outfile.zip";
    ZipOutputStream out = new ZipOutputStream(new FileOutputStream(outFilename));
    // Write each related package part into the zip
    for (POIXMLDocumentPart p : wb.getRelations()) {
        PackagePart pt = p.getPackagePart();
        pt.save(out);
    }
    out.close();

Upon unzipping I can see the XML files; however, they do not have the schema, and hence the data is pretty much useless. For example, one of the XML files was pivotCacheDefinition1.xml:

    <pivotCacheDefinition r:id="rId1" refreshedBy="" refreshedDate="40097.989743171296" createdVersion="1" recordCount="6907">
      <cacheSource type="external" connectionId="2"/>
      <cacheFields count="86">
        <cacheField name="Number" numFmtId="0" sqlType="4">
          <sharedItems containsSemiMixedTypes="0" containsString="0" containsNumber="1" containsInteger="1" minValue="1" maxValue="9750" count="6904">
            <n v="9739"/>
            <n v="9738"/>
            <n v="9736"/>
            ...

It has neither a style nor a schema associated with it. However, when you use the Save As option in Excel to manually save the xlsx as XML, the schema is well formed and the XML file is usable.

I will keep trying. Hopefully I won't have to get an intern to press the SAVE AS button. :)

Again, I really appreciate your help. Thanks to all working on this project. If there is something I am missing, or if I am thinking in the wrong direction, please feel free to let me know.

Thanks.
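
In case it is useful to anyone following along, here is a minimal sketch of how the values in a dumped pivotCacheDefinition part could be read with the JDK's built-in DOM parser, without a schema. The class name PivotCacheSketch and the method sharedItems are hypothetical names for illustration, and the sample XML is a trimmed, closed-off copy modeled on the excerpt above, not the real file. It only collects the v attributes listed under each cacheField's sharedItems; I am assuming the per-row records live in a separate records part, which this does not touch.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper: maps each cacheField name to its shared item values.
public class PivotCacheSketch {

    static Map<String, List<String>> sharedItems(String xml) throws Exception {
        // The default factory is not namespace-aware, so elements are
        // matched by their plain tag names.
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

        Map<String, List<String>> result = new LinkedHashMap<>();
        NodeList fields = doc.getElementsByTagName("cacheField");
        for (int i = 0; i < fields.getLength(); i++) {
            Element field = (Element) fields.item(i);
            List<String> values = new ArrayList<>();
            NodeList shared = field.getElementsByTagName("sharedItems");
            if (shared.getLength() > 0) {
                // Each child element (<n>, <s>, ...) carries its value in "v".
                NodeList items = shared.item(0).getChildNodes();
                for (int j = 0; j < items.getLength(); j++) {
                    Node item = items.item(j);
                    if (item instanceof Element && ((Element) item).hasAttribute("v")) {
                        values.add(((Element) item).getAttribute("v"));
                    }
                }
            }
            result.put(field.getAttribute("name"), values);
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        // Assumed sample input, shortened from the excerpt above.
        String xml = "<pivotCacheDefinition recordCount=\"3\">"
                + "<cacheFields count=\"1\">"
                + "<cacheField name=\"Number\" numFmtId=\"0\">"
                + "<sharedItems containsNumber=\"1\" count=\"3\">"
                + "<n v=\"9739\"/><n v=\"9738\"/><n v=\"9736\"/>"
                + "</sharedItems></cacheField></cacheFields>"
                + "</pivotCacheDefinition>";
        System.out.println(sharedItems(xml)); // prints {Number=[9739, 9738, 9736]}
    }
}
```

The values come back as strings in document order, so any typing (number, date, and so on) would still have to be inferred from the element name or the sharedItems attributes.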