Hey Tim et al.,
Do the tests fail for you with Java 12? [INFO] Running org.apache.tika.parser.pkg.GzipParserTest [INFO] Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.397 s - in org.apache.tika.parser.pkg.GzipParserTest [INFO] Running org.apache.tika.TestXMLEntityExpansion [WARNING] Tests run: 3, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.085 s - in org.apache.tika.TestXMLEntityExpansion [INFO] Running org.apache.tika.mime.MimeTypeTest [INFO] Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.001 s - in org.apache.tika.mime.MimeTypeTest [INFO] Running org.apache.tika.mime.MimeTypesTest [INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.001 s - in org.apache.tika.mime.MimeTypesTest [INFO] Running org.apache.tika.mime.TestMimeTypes [INFO] Tests run: 80, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 8.997 s - in org.apache.tika.mime.TestMimeTypes [INFO] Running org.apache.tika.TestCorruptedFiles [WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.001 s - in org.apache.tika.TestCorruptedFiles [INFO] [INFO] Results: [INFO] [ERROR] Failures: [ERROR] TesseractOCRParserTest.confirmMultiPageTiffHandling:290->TikaTest.assertContains:110 Page 2 not found in: <html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta name="Exif Image:Page Number" content="1 2" /> <meta name="Exif IFD0:Strip Offsets" content="8 28680 46835 73454" /> <meta name="Exif IFD0:JPEG Tables" content="[289 values]" /> <meta name="Exif Image:Samples Per Pixel" content="3 samples/pixel" /> <meta name="Exif Image:Image Height" content="600 pixels" /> <meta name="tiff:ImageLength" content="600" /> <meta name="Exif Image:Compression" content="JPEG" /> <meta name="Exif Image:Y Resolution" content="96 dots per inch" /> <meta name="Exif IFD0:X Resolution" content="96 dots per inch" /> <meta name="tiff:ResolutionUnit" content="Inch" /> <meta name="Exif IFD0:Image Height" content="600 pixels" /> <meta name="Exif IFD0:Strip Byte Counts" content="28672 18155 26619 4002 bytes" /> <meta name="File Size" content="156867 bytes" /> <meta name="Exif IFD0:Image Width" content="800 pixels" /> <meta name="Exif Image:Photometric Interpretation" content="RGB" /> <meta name="Exif IFD0:Samples Per Pixel" content="3 samples/pixel" /> <meta name="Exif IFD0:Planar Configuration" content="Chunky (contiguous for each subsampling pixel)" /> <meta name="Exif IFD0:Rows Per Strip" content="160 rows/strip" /> <meta name="Exif Image:Image Width" content="800 pixels" /> <meta name="File Name" content="apache-tika-17704590698477286878.tmp" /> <meta name="Exif IFD0:Bits Per Sample" content="8 8 8 bits/component/pixel" /> <meta name="tiff:BitsPerSample" content="8" /> <meta name="Exif IFD0:Resolution Unit" content="Inch" /> <meta name="Content-Type" content="image/tiff" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.image.TiffParser" /> <meta name="Exif Image:Planar Configuration" content="Chunky (contiguous for each subsampling pixel)" /> <meta name="File Modified Date" content="Wed Mar 18 01:27:47 +00:00 2020" /> <meta name="tiff:XResolution" content="96.0" /> <meta name="tiff:SamplesPerPixel" content="3" /> <meta name="exif:PageCount" content="2" /> <meta name="Exif Image:Strip Byte Counts" content="28672 18155 27500 4002 bytes" /> <meta name="Exif IFD0:Orientation" content="Top, left side (Horizontal / normal)" /> <meta name="tiff:Orientation" content="1" /> <meta name="Exif IFD0:Compression" content="JPEG" /> <meta name="Exif IFD0:Page Number" content="0 1" /> <meta name="Exif Image:Rows Per Strip" content="160 rows/strip" /> <meta name="Exif Image:X Resolution" content="96 dots per inch" /> <meta name="Exif Image:Orientation" content="Top, left side (Horizontal / normal)" /> <meta name="Exif IFD0:Photometric Interpretation" content="RGB" /> <meta name="Exif Image:JPEG Tables" content="[289 values]" /> <meta name="tiff:ImageWidth" content="800" /> <meta name="tiff:YResolution" content="96.0" /> <meta name="Exif Image:Bits Per Sample" content="8 8 8 bits/component/pixel" /> <meta name="Exif Image:Strip Offsets" content="77997 106669 124824 152324" /> <meta name="Exif IFD0:Y Resolution" content="96 dots per inch" /> <meta name="Exif Image:Resolution Unit" content="Inch" /> <title></title> </head> <body><div class="ocr">Multipage TIFF Example Page 1 </div> </body></html> [ERROR] TesseractOCRParserTest.testOCROutputsHOCR:146->TikaTest.assertContains:110 Happy</span> not found in: <html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta name="pdf:docinfo:custom:AAPL:Keywords" content="" /> <meta name="pdf:PDFVersion" content="1.3" /> <meta name="pdf:docinfo:title" content="Presentation1" /> <meta name="xmp:CreatorTool" content="PowerPoint" /> <meta name="pdf:hasXFA" content="false" /> <meta name="access_permission:modify_annotations" content="true" /> <meta name="access_permission:can_print_degraded" content="true" /> <meta name="AAPL:Keywords" content="" /> <meta name="dc:creator" content="grantingersoll" /> <meta name="dcterms:created" content="2014-02-08T19:57:12Z" /> <meta name="dcterms:modified" content="2014-02-08T19:57:12Z" /> <meta name="Last-Modified" content="2014-02-08T19:57:12Z" /> <meta name="dc:format" content="application/pdf; version=1.3" /> <meta name="pdf:docinfo:creator_tool" content="PowerPoint" /> <meta name="access_permission:fill_in_form" content="true" /> <meta name="pdf:docinfo:keywords" content="" /> <meta name="pdf:docinfo:modified" content="2014-02-08T19:57:12Z" /> <meta name="meta:save-date" content="2014-02-08T19:57:12Z" /> <meta name="pdf:encrypted" content="false" /> <meta name="dc:title" content="Presentation1" /> <meta name="cp:subject" content="" /> <meta name="pdf:docinfo:subject" content="" /> <meta name="pdf:hasMarkedContent" content="false" /> <meta name="Content-Type" content="application/pdf" /> <meta name="pdf:docinfo:creator" content="grantingersoll" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.pdf.PDFParser" /> <meta name="meta:author" content="grantingersoll" /> <meta name="dc:subject" content="" /> <meta name="dc:subject" content="" /> <meta name="dc:subject" content="" /> <meta name="dc:subject" content="" /> <meta name="meta:creation-date" content="2014-02-08T19:57:12Z" /> <meta name="access_permission:extract_for_accessibility" content="true" /> <meta name="access_permission:assemble_document" content="true" /> <meta name="xmpTPg:NPages" content="1" /> <meta name="pdf:hasXMP" content="false" /> <meta name="access_permission:extract_content" content="true" /> <meta name="access_permission:can_print" content="true" /> <meta name="meta:keyword" content="" /> <meta name="access_permission:can_modify" content="true" /> <meta name="pdf:docinfo:producer" content="Mac OS X 10.9.1 Quartz PDFContext" /> <meta name="pdf:docinfo:created" content="2014-02-08T19:57:12Z" /> <title>Presentation1</title> </head> <body><div class="page"><p /> <img src="embedded:image0.png" alt="image0.png" /></div> </body></html><html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta name="Transparency Alpha" content="none" /> <meta name="tiff:ImageLength" content="261" /> <meta name="Compression CompressionTypeName" content="deflate" /> <meta name="Data BitsPerSample" content="8 8 8" /> <meta name="Data PlanarConfiguration" content="PixelInterleaved" /> <meta name="Dimension VerticalPixelSize" content="0.35273367" /> <meta name="IHDR" content="width=934, height=261, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" /> <meta name="embeddedResourceType" content="INLINE" /> <meta name="Chroma ColorSpaceType" content="RGB" /> <meta name="tiff:BitsPerSample" content="8 8 8" /> <meta name="Content-Type" content="image/png" /> <meta name="height" content="261" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" /> <meta name="pHYs" content="pixelsPerUnitXAxis=2835, pixelsPerUnitYAxis=2835, unitSpecifier=meter" /> <meta name="Dimension PixelAspectRatio" content="1.0" /> <meta name="resourceName" content="image0.png" /> <meta name="pdf:hasXMP" content="false" /> <meta name="Compression NumProgressiveScans" content="1" /> <meta name="Dimension HorizontalPixelSize" content="0.35273367" /> <meta name="Chroma BlackIsZero" content="true" /> <meta name="Compression Lossless" content="true" /> <meta name="X-TIKA:embedded_depth" content="1" /> <meta name="width" content="934" /> <meta name="Dimension ImageOrientation" content="Normal" /> <meta name="X-TIKA:embedded_resource_path" content="/image0.png" /> <meta name="tiff:ImageWidth" content="934" /> <meta name="Chroma NumChannels" content="3" /> <meta name="Data SampleFormat" content="UnsignedIntegral" /> <title></title> </head> <body><div class="ocr"> <div class="ocr_page" id="page_1" title="image "/var/folders/n5/1d_k3z4s2293q8ntx_n8sw54mm5n_8/T/apache-tika-1878472717677617651.tmp"; bbox 0 0 934 261; ppageno 0"> <div class="ocr_carea" id="block_1_1" title="bbox 21 34 465 66"> <p class="ocr_par" id="par_1_1" lang="eng" title="bbox 21 34 465 66"> <span class="ocr_line" id="line_1_1" title="bbox 21 34 465 66; baseline 0.005 -7; x_size 33; x_descenders 7; x_ascenders 8"> <span class="ocrx_word" id="word_1_1" title="bbox 21 36 135 66; x_wconf 96"><strong>Happy</strong></span> <span class="ocrx_word" id="word_1_2" title="bbox 161 37 232 60; x_wconf 96"><strong>New</strong></span> <span class="ocrx_word" id="word_1_3" title="bbox 259 38 345 60; x_wconf 96"><strong>Year</strong></span> <span class="ocrx_word" id="word_1_4" title="bbox 375 34 465 61; x_wconf 96"><strong>2003!</strong></span> </span> </p> </div> </div> </div> </body></html> [INFO] [ERROR] Tests run: 1188, Failures: 2, Errors: 0, Skipped: 48 [INFO] [INFO] ------------------------------------------------------------------------ [INFO] Reactor Summary for Apache Tika 2.0.0-SNAPSHOT: [INFO] [INFO] Apache Tika parent ................................. SUCCESS [ 8.822 s] [INFO] Apache Tika core ................................... SUCCESS [ 39.589 s] [INFO] Apache Tika parsers ................................ FAILURE [09:04 min] [INFO] Apache Tika OSGi bundle ............................ SKIPPED [INFO] Apache Tika XMP .................................... SKIPPED [INFO] Apache Tika serialization .......................... SKIPPED [INFO] Apache Tika batch .................................. SKIPPED [INFO] Apache Tika language detection ..................... SKIPPED [INFO] Apache Tika application ............................ SKIPPED [INFO] Apache Tika translate .............................. SKIPPED [INFO] Apache Tika server ................................. SKIPPED [INFO] Apache Tika eval ................................... SKIPPED [INFO] Apache Tika examples ............................... SKIPPED [INFO] Apache Tika Java-7 Components ...................... SKIPPED [INFO] Apache Tika Deep Learning (powered by DL4J) ........ SKIPPED [INFO] Apache Tika Natural Language Processing ............ SKIPPED [INFO] Apache Tika ........................................ SKIPPED [INFO] ------------------------------------------------------------------------ [INFO] BUILD FAILURE [INFO] ------------------------------------------------------------------------ [INFO] Total time: 09:57 min [INFO] Finished at: 2020-03-17T18:31:10-07:00 [INFO] ------------------------------------------------------------------------ [ERROR] Failed to execute goal org.apache.maven.plugins:maven-surefire-plugin:3.0.0-M4:test (default-test) on project tika-parsers: There are test failures. [ERROR] [ERROR] Please refer to /Users/mattmann/src/tika/tika-parsers/target/surefire-reports for the individual test results. [ERROR] Please refer to dump files (if any exist) [date].dump, [date]-jvmRun[N].dump and [date].dumpstream. [ERROR] -> [Help 1] [ERROR] [ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch. [ERROR] Re-run Maven using the -X switch to enable full debug logging. [ERROR] [ERROR] For more information about the errors and possible solutions, please read the following articles: [ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException [ERROR] [ERROR] After correcting the problems, you can resume the build with the command [ERROR] mvn <goals> -rf :tika-parsers pomodoro:tika mattmann$ java -version openjdk version "12.0.1" 2019-04-16 OpenJDK Runtime Environment (build 12.0.1+12) OpenJDK 64-Bit Server VM (build 12.0.1+12, mixed mode, sharing) pomodoro:tika mattmann$ Any ideas? Cheers, Chris