dependabot[bot] opened a new pull request #179:
URL: https://github.com/apache/any23/pull/179


   Bumps `tika.version` from 1.24 to 1.27.
   Updates `tika-core` from 1.24 to 1.27
   <details>
   <summary>Changelog</summary>
   <p><em>Sourced from <a href="https://github.com/apache/tika/blob/main/CHANGES.txt">tika-core's changelog</a>.</em></p>
   <blockquote>
   <p>Release 2.1.1 - ???</p>
   <ul>
   <li>
   <p>Improve robustness and features of the httpfetcher (TIKA-3543)</p>
   </li>
   <li>
   <p>Add optional fetch ranges to FetchEmitTuple to allow range fetching from,
   e.g. http or s3 (TIKA-3542).</p>
   </li>
   <li>
   <p>Exclude dependencies on jsoup and ehcache in ucar grib/cdm 
(TIKA-3003).</p>
   </li>
   </ul>
   <p>Release 2.1.0 - 08/18/2021</p>
   <p>MAJOR CHANGES in 2.1.0:</p>
   <ul>
   <li>
   <p>Improved packaging for tika-parsers-extended. Use the 
tika-parser-scientific-package and
   tika-parser-sqlite3-package artifacts if you want fat jars with 
dependencies. (TIKA-3510)</p>
   </li>
   <li>
   <p>Tika app writes UTF-8 when an encoding is not specified; the legacy 
behavior
   was UTF-8 on Mac OS, but System default on other OSs (TIKA-3515).</p>
   </li>
   <li>
   <p>Change the default rendering strategy for PDFs from NO_TEXT to ALL 
(TIKA-3520).</p>
   </li>
   </ul>
   <p>Other changes:</p>
   <ul>
   <li>
   <p>Fixed bug that pointed to the wrong tessdata directory if the user 
specified
   a tesseract path but not also a tessdata path (TIKA-3518).</p>
   </li>
   <li>
   <p>Fixed bug in Icu4j's encoding detector where it would return non-standard
   names for charsets, e.g. IBM424_rtl is now returned as IBM424 
(TIKA-3516).</p>
   </li>
   <li>
   <p>Add a simple UrlFetcher in tika-core as a basic alternative
   to tika-fetcher-http (TIKA-3527).</p>
   </li>
   <li>
   <p>Add tika-pipes support for Google Cloud Storage (TIKA-3524).</p>
   </li>
   <li>
   <p>Fix markup ordering errors in xhtml output for ODT files (TIKA-2242).</p>
   </li>
   <li>
   <p>Fix serialization of embedded docs in OpenSearch emitter
   and fix embedded documents not being indexed in some use
   cases in the Solr emitter (TIKA-3490).</p>
   </li>
   <li>
   <p>Add pipesClientId system property to PipesServer so that each
   forked process can log to its own logger (TIKA-3480).</p>
   </li>
   <li>
   <p>Add DateNormalizingMetadataFilter to let users ensure that all dates
   emitted to Solr/OpenSearch are in UTC. Users can configure which
   timezone they'd like to use in cases where the file format does
   not store a timezone (TIKA-3496).</p>
   </li>
   <li>
   <p>Breaking change in the Solr and OpenSearch emitters. To achieve</p>
   </li>
   </ul>
   <!-- raw HTML omitted -->
   </blockquote>
   <p>... (truncated)</p>
   </details>
   <details>
   <summary>Commits</summary>
   <ul>
   <li><a href="https://github.com/apache/tika/commit/ccf94425e1cd6136ab3c06046580704393039294"><code>ccf9442</code></a> [maven-release-plugin] prepare release 1.27-rc1</li>
   <li><a href="https://github.com/apache/tika/commit/31d44e99f22ab62cb778f0b9b081052e60eed4b5"><code>31d44e9</code></a> prep for 1.27-rc1</li>
   <li><a href="https://github.com/apache/tika/commit/f41413050c58befdbe4d020ba7e9d8ca4783646e"><code>f414130</code></a> TIKA-3459 -- integrate Drew Noakes metadata-extractor as the underlying MP4 p...</li>
   <li><a href="https://github.com/apache/tika/commit/74c5e5a8e934f4adc23cdf3a696021667663dbce"><code>74c5e5a</code></a> TIKA-3460 -- add missing properties files for jaiimageio-core</li>
   <li><a href="https://github.com/apache/tika/commit/57f5912b5c46a53bd8e5a04bfd117142ac2034d4"><code>57f5912</code></a> TIKA-3457 -- general upgrades for 1.27</li>
   <li><a href="https://github.com/apache/tika/commit/4ba5fd7eb8b1a6ccc45fd773b73e6f809a652370"><code>4ba5fd7</code></a> TIKA-3456 -- LanguageDetector should chunk long strings and test for hasEnoug...</li>
   <li><a href="https://github.com/apache/tika/commit/90c6ea47c6c76aa5df1f3bf7f6e0c64e30344a2d"><code>90c6ea4</code></a> TIKA-3444 -- upgrade to pdfbox 2.0.24</li>
   <li><a href="https://github.com/apache/tika/commit/1224f881a1dc54282281cdf2fc9ecf3f3e429393"><code>1224f88</code></a> TIKA-3441 -- improve likelihood that tesseract processes will be shutdown on ...</li>
   <li><a href="https://github.com/apache/tika/commit/e8ec2233faa57f6dab8f2aa7f582203a3c707af6"><code>e8ec223</code></a> Merge remote-tracking branch 'origin/branch_1x' into branch_1x</li>
   <li><a href="https://github.com/apache/tika/commit/d7fa2cd284a0d400a1ef29f7111018bb16b1cc5d"><code>d7fa2cd</code></a> TIKA-3441 -- improve likelihood that tesseract processes will be shutdown on ...</li>
   <li>Additional commits viewable in <a href="https://github.com/apache/tika/compare/1.24...1.27">compare view</a></li>
   </ul>
   </details>
   <br />
   
   Updates `tika-parsers` from 1.24 to 1.27
   <details>
   <summary>Changelog</summary>
   <p><em>Sourced from <a href="https://github.com/apache/tika/blob/main/CHANGES.txt">tika-parsers's changelog</a>.</em></p>
   <blockquote>
   <p>Release 2.1.1 - ???</p>
   <ul>
   <li>
   <p>Improve robustness and features of the httpfetcher (TIKA-3543)</p>
   </li>
   <li>
   <p>Add optional fetch ranges to FetchEmitTuple to allow range fetching from,
   e.g. http or s3 (TIKA-3542).</p>
   </li>
   <li>
   <p>Exclude dependencies on jsoup and ehcache in ucar grib/cdm 
(TIKA-3003).</p>
   </li>
   </ul>
   <p>Release 2.1.0 - 08/18/2021</p>
   <p>MAJOR CHANGES in 2.1.0:</p>
   <ul>
   <li>
   <p>Improved packaging for tika-parsers-extended. Use the 
tika-parser-scientific-package and
   tika-parser-sqlite3-package artifacts if you want fat jars with 
dependencies. (TIKA-3510)</p>
   </li>
   <li>
   <p>Tika app writes UTF-8 when an encoding is not specified; the legacy 
behavior
   was UTF-8 on Mac OS, but System default on other OSs (TIKA-3515).</p>
   </li>
   <li>
   <p>Change the default rendering strategy for PDFs from NO_TEXT to ALL 
(TIKA-3520).</p>
   </li>
   </ul>
   <p>Other changes:</p>
   <ul>
   <li>
   <p>Fixed bug that pointed to the wrong tessdata directory if the user 
specified
   a tesseract path but not also a tessdata path (TIKA-3518).</p>
   </li>
   <li>
   <p>Fixed bug in Icu4j's encoding detector where it would return non-standard
   names for charsets, e.g. IBM424_rtl is now returned as IBM424 
(TIKA-3516).</p>
   </li>
   <li>
   <p>Add a simple UrlFetcher in tika-core as a basic alternative
   to tika-fetcher-http (TIKA-3527).</p>
   </li>
   <li>
   <p>Add tika-pipes support for Google Cloud Storage (TIKA-3524).</p>
   </li>
   <li>
   <p>Fix markup ordering errors in xhtml output for ODT files (TIKA-2242).</p>
   </li>
   <li>
   <p>Fix serialization of embedded docs in OpenSearch emitter
   and fix embedded documents not being indexed in some use
   cases in the Solr emitter (TIKA-3490).</p>
   </li>
   <li>
   <p>Add pipesClientId system property to PipesServer so that each
   forked process can log to its own logger (TIKA-3480).</p>
   </li>
   <li>
   <p>Add DateNormalizingMetadataFilter to let users ensure that all dates
   emitted to Solr/OpenSearch are in UTC. Users can configure which
   timezone they'd like to use in cases where the file format does
   not store a timezone (TIKA-3496).</p>
   </li>
   <li>
   <p>Breaking change in the Solr and OpenSearch emitters. To achieve</p>
   </li>
   </ul>
   <!-- raw HTML omitted -->
   </blockquote>
   <p>... (truncated)</p>
   </details>
   <details>
   <summary>Commits</summary>
   <ul>
   <li><a href="https://github.com/apache/tika/commit/ccf94425e1cd6136ab3c06046580704393039294"><code>ccf9442</code></a> [maven-release-plugin] prepare release 1.27-rc1</li>
   <li><a href="https://github.com/apache/tika/commit/31d44e99f22ab62cb778f0b9b081052e60eed4b5"><code>31d44e9</code></a> prep for 1.27-rc1</li>
   <li><a href="https://github.com/apache/tika/commit/f41413050c58befdbe4d020ba7e9d8ca4783646e"><code>f414130</code></a> TIKA-3459 -- integrate Drew Noakes metadata-extractor as the underlying MP4 p...</li>
   <li><a href="https://github.com/apache/tika/commit/74c5e5a8e934f4adc23cdf3a696021667663dbce"><code>74c5e5a</code></a> TIKA-3460 -- add missing properties files for jaiimageio-core</li>
   <li><a href="https://github.com/apache/tika/commit/57f5912b5c46a53bd8e5a04bfd117142ac2034d4"><code>57f5912</code></a> TIKA-3457 -- general upgrades for 1.27</li>
   <li><a href="https://github.com/apache/tika/commit/4ba5fd7eb8b1a6ccc45fd773b73e6f809a652370"><code>4ba5fd7</code></a> TIKA-3456 -- LanguageDetector should chunk long strings and test for hasEnoug...</li>
   <li><a href="https://github.com/apache/tika/commit/90c6ea47c6c76aa5df1f3bf7f6e0c64e30344a2d"><code>90c6ea4</code></a> TIKA-3444 -- upgrade to pdfbox 2.0.24</li>
   <li><a href="https://github.com/apache/tika/commit/1224f881a1dc54282281cdf2fc9ecf3f3e429393"><code>1224f88</code></a> TIKA-3441 -- improve likelihood that tesseract processes will be shutdown on ...</li>
   <li><a href="https://github.com/apache/tika/commit/e8ec2233faa57f6dab8f2aa7f582203a3c707af6"><code>e8ec223</code></a> Merge remote-tracking branch 'origin/branch_1x' into branch_1x</li>
   <li><a href="https://github.com/apache/tika/commit/d7fa2cd284a0d400a1ef29f7111018bb16b1cc5d"><code>d7fa2cd</code></a> TIKA-3441 -- improve likelihood that tesseract processes will be shutdown on ...</li>
   <li>Additional commits viewable in <a href="https://github.com/apache/tika/compare/1.24...1.27">compare view</a></li>
   </ul>
   </details>
   <br />
   
   
   Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`.
   
   [//]: # (dependabot-automerge-start)
   [//]: # (dependabot-automerge-end)
   
   ---
   
   <details>
   <summary>Dependabot commands and options</summary>
   <br />
   
   You can trigger Dependabot actions by commenting on this PR:
   - `@dependabot rebase` will rebase this PR
   - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
   - `@dependabot merge` will merge this PR after your CI passes on it
   - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
   - `@dependabot cancel merge` will cancel a previously requested merge and block automerging
   - `@dependabot reopen` will reopen this PR if it is closed
   - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
   - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
   - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
   - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
   
   
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@any23.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

