Hi, Unfortunately I have to restart the run, commons-compress was missing which caused 70k failures out of 1.2mio documents. Thus XmlBeans was not properly tested and I need to re-trigger.
However I already noticed the following failures that look new (however these are all very rare, not related to XmlBeans at all, mostly in HSSF/HSLF!): 4 ERROR java.lang.ClassCastException: o.a.p.hssf.record.StyleRecord cannot be cast to o.a.p.hssf.record.ExtendedFormatRecord java.lang.ClassCastException: o.a.p.hssf.record.StyleRecord cannot be cast to o.a.p.hssf.record.ExtendedFormatRecord at o.a.p.hssf.model.InternalWorkbook.getExFormatAt(InternalWorkbook.java:870) at o.a.p.hssf.usermodel.HSSFCell.getCellStyle(HSSFCell.java:943) at o.a.p.hssf.usermodel.HSSFCell.getCellStyle(HSSFCell.java:71) at o.a.p.stress.HSSFFileHandler.handleFile(HSSFFileHandler.java:63) at org.dstadler.commoncrawl.FileHandlingRunnable.run(FileHandlingRunnable.java:64) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) Download <http://people.apache.org/~centic/poi_regression/reports/download.oldindex/com.adapchain.www_files_MMP_Based_Sequence.xls> 1 ERROR java.lang.NullPointerException java.lang.NullPointerException at o.a.p.hslf.model.textproperties.HSLFTabStopPropCollection.writeTabStops(HSLFTabStopPropCollection.java:83) at o.a.p.hslf.record.TextRulerAtom.writeIf(TextRulerAtom.java:150) at o.a.p.hslf.record.TextRulerAtom.writeOut(TextRulerAtom.java:126) at o.a.p.hslf.record.EscherTextboxWrapper.writeOut(EscherTextboxWrapper.java:90) at o.a.p.hslf.usermodel.HSLFTextParagraph.refreshRecords(HSLFTextParagraph.java:1158) at o.a.p.hslf.usermodel.HSLFTextParagraph.storeText(HSLFTextParagraph.java:969) at o.a.p.hslf.usermodel.HSLFTextShape.storeText(HSLFTextShape.java:851) at o.a.p.hslf.usermodel.HSLFSlideShow.writeDirtyParagraphs(HSLFSlideShow.java:485) at o.a.p.hslf.usermodel.HSLFSlideShow.writeDirtyParagraphs(HSLFSlideShow.java:477) at o.a.p.hslf.usermodel.HSLFSlideShow.write(HSLFSlideShow.java:451) at o.a.p.stress.SlideShowHandler.writeToArray(SlideShowHandler.java:63) at o.a.p.stress.SlideShowHandler.handleSlideShow(SlideShowHandler.java:49) at o.a.p.stress.HSLFFileHandler.handleFile(HSLFFileHandler.java:49) at org.dstadler.commoncrawl.FileHandlingRunnable.run(FileHandlingRunnable.java:64) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) Download <http://people.apache.org/~centic/poi_regression/reports/download.oldindex/ca.casid-acedi.www_sites_default_files_GDS_20save_20the_20date_20Nov_204.ppt> 1 ERROR java.lang.ArrayIndexOutOfBoundsException: * java.lang.ArrayIndexOutOfBoundsException: * at o.a.p.hssf.usermodel.HSSFOptimiser.optimiseCellStyles(HSSFOptimiser.java:229) at o.a.p.stress.HSSFFileHandler.handleFile(HSSFFileHandler.java:59) at org.dstadler.commoncrawl.FileHandlingRunnable.run(FileHandlingRunnable.java:64) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) Download <http://people.apache.org/~centic/poi_regression/reports/download.oldindex/com.9thtee.www_tivoconv3release.xls> Dominik. On Mon, Jun 25, 2018 at 7:30 AM, Dominik Stadler <dominik.stad...@gmx.at> wrote: > A regression run on the > 1mio commoncrawl documents is underway... > > Dominik > > On Sun, Jun 24, 2018, 22:50 Andreas Beeker <kiwiwi...@apache.org> wrote: > >> I'm +1 too ... we can refactor the code in the next release. >> >> A govdocs run would be nice, to see if we have further OOM problems. >> >> Andi >> >> >>