Author: grossws
Date: Sat Apr 20 20:59:46 2019
New Revision: 1857883

URL: http://svn.apache.org/viewvc?rev=1857883&view=rev
Log:
Updated links to Apache sites

Updated:
* http://*.apache.org/* links to https://*.apache.org/*;
* links to wiki (from MoinMoin to Confluence);
* links for old CHANGES.txt to https://archive.apache.org/dist/tika/CHANGES-X.Y.txt;
* repository in doap.rdf

Modified:
    tika/site/src/site/apt/1.20/configuring.apt
    tika/site/src/site/apt/1.20/detection.apt
    tika/site/src/site/apt/1.20/examples.apt
    tika/site/src/site/apt/1.20/formats.apt
    tika/site/src/site/apt/1.20/gettingstarted.apt
    tika/site/src/site/apt/1.20/index.apt
    tika/site/src/site/apt/1.20/parser.apt
    tika/site/src/site/apt/1.20/parser_guide.apt
    tika/site/src/site/apt/contribute.apt.vm
    tika/site/src/site/apt/download.apt.vm
    tika/site/src/site/apt/index.apt.vm
    tika/site/src/site/apt/mail-lists.apt
    tika/site/src/site/apt/security.apt
    tika/site/src/site/resources/doap.rdf
    tika/site/src/site/site.vm
    tika/site/src/site/site.xml

Modified: tika/site/src/site/apt/1.20/configuring.apt
URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/configuring.apt?rev=1857883&r1=1857882&r2=1857883&view=diff
==============================================================================
--- tika/site/src/site/apt/1.20/configuring.apt (original)
+++ tika/site/src/site/apt/1.20/configuring.apt Sat Apr 20 20:59:46 2019
@@ -9,7 +9,7 @@
~~ (the "License"); you may not use this file except in compliance with
~~ the License.
You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, Modified: tika/site/src/site/apt/1.20/detection.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/detection.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/detection.apt (original) +++ tika/site/src/site/apt/1.20/detection.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -142,7 +142,7 @@ MediaType detect(java.io.InputStream inp by the default Mime Magic detection only. For more information on container formats and Tika, see - {{{http://wiki.apache.org/tika/MetadataDiscussion}}} + {{{https://cwiki.apache.org/confluence/display/TIKA/MetadataDiscussion}}} * {The default Tika Detector} Modified: tika/site/src/site/apt/1.20/examples.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/examples.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/examples.apt (original) +++ tika/site/src/site/apt/1.20/examples.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. 
You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -21,8 +21,8 @@ Apache Tika API Usage Examples This page provides a number of examples on how to use the various Tika APIs. All of the examples shown are also available in the - {{{https://svn.apache.org/repos/asf/tika/trunk/tika-example}Tika Example - module}} in SVN. + {{{https://github.com/apache/tika/tree/master/tika-example}Tika Example + module}} in GIT. %{toc|section=1|fromDepth=1} @@ -144,5 +144,5 @@ Apache Tika API Usage Examples A number of other examples are also available, including all of the examples from the {{{http://manning.com/mattmann/}Tika In Action book}}. These can all be found in the - {{{https://svn.apache.org/repos/asf/tika/trunk/tika-example}Tika Example - module}} in SVN. + {{{https://github.com/apache/tika/tree/master/tika-example}Tika Example + module}} in GIT. Modified: tika/site/src/site/apt/1.20/formats.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/formats.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/formats.apt (original) +++ tika/site/src/site/apt/1.20/formats.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. 
You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -61,7 +61,7 @@ Supported Document Formats {{{./api/org/apache/tika/parser/microsoft/OfficeParser.html}OfficeParser}} and {{{./api/org/apache/tika/parser/microsoft/ooxml/OOXMLParser.html}OOXMLParser}} - classes use {{{http://poi.apache.org/}Apache POI}} libraries to support + classes use {{{https://poi.apache.org/}Apache POI}} libraries to support text and metadata extraction from both OLE2 and OOXML documents. Old, pre-OLE2 Excel files (Excel 2, 3 and 4) are handled by the @@ -104,7 +104,7 @@ Supported Document Formats The {{{./api/org/apache/tika/parser/pdf/PDFParser.html}PDFParser}} class parsers Portable Document Format (PDF) documents using the - {{{http://pdfbox.apache.org/}Apache PDFBox}} library. + {{{https://pdfbox.apache.org/}Apache PDFBox}} library. * {Electronic Publication Format} @@ -123,7 +123,7 @@ Supported Document Formats * {Compression and packaging formats} - Tika uses the {{{http://commons.apache.org/compress/}Commons Compress}} + Tika uses the {{{https://commons.apache.org/compress/}Commons Compress}} library to support various compression and packaging formats. The {{{./api/org/apache/tika/parser/pkg/CompressorParser.html}CompressorParser}} class handles parsing of the top level compression formats, then @@ -237,7 +237,7 @@ Supported Document Formats can be used (if the required tool is installed) to generate a numeric representation of the video suitable for similarity searches. More details on this approach, and setup instructions for the parser + tool, can be - found on {{{https://wiki.apache.org/tika/PooledTimeSeriesParser}the Tika + found on {{{https://cwiki.apache.org/confluence/display/TIKA/PooledTimeSeriesParser}the Tika wiki page for the parser}}. 
* {Java class files and archives} @@ -365,7 +365,7 @@ Supported Document Formats datasets are required to enable or power this support. Details on the requirements and setup steps are generally given either in - the parser's javadocs, or on the {{{https://wiki.apache.org/tika/}Tika wiki}}. + the parser's javadocs, or on the {{{https://cwiki.apache.org/tika/}Tika wiki}}. The {{{./api/org/apache/tika/parser/sentiment/analysis/SentimentParser.html}SentimentParser}} class classifies documents based on the sentiment of document, powered by Apache @@ -383,7 +383,7 @@ Supported Document Formats frameworks are generally required, often accessed via REST services. The {{{./api/org/apache/tika/parser/recognition/}recognition package}} contains most of these. Details on the requirements and setup steps are generally given - on the {{{https://wiki.apache.org/tika/}Tika wiki}}. + on the {{{https://cwiki.apache.org/tika/}Tika wiki}}. Full list of Supported Formats Modified: tika/site/src/site/apt/1.20/gettingstarted.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/gettingstarted.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/gettingstarted.apt (original) +++ tika/site/src/site/apt/1.20/gettingstarted.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -30,7 +30,7 @@ Getting and building the sources version control. Once you have the sources, you can build them using the - {{{http://maven.apache.org/}Maven 2}} build system. Executing the + {{{https://maven.apache.org/}Maven 3}} build system. 
Executing the following command in the base directory will build the sources and install the resulting artifacts in your local Maven repository. @@ -63,7 +63,7 @@ Build artifacts [tika-server/target/tika-server-*.jar] Tika JAX-RS REST application. This is a Jetty web server running Tika - REST services as described in {{{http://wiki.apache.org/tika/TikaJAXRS}this page}}. + REST services as described in {{{https://cwiki.apache.org/confluence/display/TIKA/TikaJAXRS}this page}}. [tika-bundle/target/tika-bundle-*.jar] Tika bundle. An OSGi bundle that combines tika-parsers with non-OSGified @@ -126,7 +126,7 @@ dependencies { Using Tika in an Ant project - If you are using {{{http://ant.apache.org/ivy/}Apache Ivy}} as your + If you are using {{{https://ant.apache.org/ivy/}Apache Ivy}} as your dependency manager tool with Ant, then to include Tika with the full set of parsers, you should depend on the <<< tika-parsers >>> artifact like this: Modified: tika/site/src/site/apt/1.20/index.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/index.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/index.apt (original) +++ tika/site/src/site/apt/1.20/index.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -22,35 +22,35 @@ Apache Tika 1.20 The most notable changes in Tika 1.20 over the previous release are: - * Upgrade to Apache POI 4.0.1 ({{{http://issues.apache.org/jira/browse/TIKA-2751}TIKA-2751}}). + * Upgrade to Apache POI 4.0.1 ({{{https://issues.apache.org/jira/browse/TIKA-2751}TIKA-2751}}). 
* Integrate/parameterize new angles handling in - PDFBox ({{{http://issues.apache.org/jira/browse/TIKA-2779}TIKA-2779}}). + PDFBox ({{{https://issues.apache.org/jira/browse/TIKA-2779}TIKA-2779}}). - * Upgrade to PDFBox 2.0.13 ({{{http://issues.apache.org/jira/browse/TIKA-2788}TIKA-2788}}). + * Upgrade to PDFBox 2.0.13 ({{{https://issues.apache.org/jira/browse/TIKA-2788}TIKA-2788}}). * Prevent content within <style/> and <script/> elements - to be written in the ToTextContentHandler ({{{http://issues.apache.org/jira/browse/TIKA-2550}TIKA-2550}}). + to be written in the ToTextContentHandler ({{{https://issues.apache.org/jira/browse/TIKA-2550}TIKA-2550}}). * Switch child to parent communication to a shared memory-mappedfile in tika-server's -spawnChild mode. * Fix bug in tika-server when run in legacy mode (not -spawnChild) that caused it to return 503 on documents submitted after - it hit an OutOfMemoryError ({{{http://issues.apache.org/jira/browse/TIKA-2776}TIKA-2776}}). + it hit an OutOfMemoryError ({{{https://issues.apache.org/jira/browse/TIKA-2776}TIKA-2776}}). - * Upgrade jaxb-runtime and javax.activation ({{{http://issues.apache.org/jira/browse/TIKA-2778}TIKA-2778}}). + * Upgrade jaxb-runtime and javax.activation ({{{https://issues.apache.org/jira/browse/TIKA-2778}TIKA-2778}}). * tika-app in batch mode now requires an interrupt or kill signal to the parent process to stop the parent - and the child processes ({{{http://issues.apache.org/jira/browse/TIKA-2780}TIKA-2780}}). + and the child processes ({{{https://issues.apache.org/jira/browse/TIKA-2780}TIKA-2780}}). - * Bulk upgrade of dependencies ({{{http://issues.apache.org/jira/browse/TIKA-2775}TIKA-2775}}). + * Bulk upgrade of dependencies ({{{https://issues.apache.org/jira/browse/TIKA-2775}TIKA-2775}}). - * Improve language id efficiency in tika-eval ({{{http://issues.apache.org/jira/browse/TIKA-2777}TIKA-2777}}). 
+ * Improve language id efficiency in tika-eval ({{{https://issues.apache.org/jira/browse/TIKA-2777}TIKA-2777}}). - * 25.2 ({{{http://issues.apache.org/jira/browse/TIKA-2773}TIKA-2773}}). + * 25.2 ({{{https://issues.apache.org/jira/browse/TIKA-2773}TIKA-2773}}). - * Remove duplication of notes in PPT slides ({{{http://issues.apache.org/jira/browse/TIKA-2735}TIKA-2735}}) + * Remove duplication of notes in PPT slides ({{{https://issues.apache.org/jira/browse/TIKA-2735}TIKA-2735}}) * Use -javaHome or $JAVA_HOME (if they exist) when spawning child in tika-server's -spawnChild mode. Modified: tika/site/src/site/apt/1.20/parser.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/parser.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/parser.apt (original) +++ tika/site/src/site/apt/1.20/parser.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -233,8 +233,8 @@ try { warmly welcome. The goal of Tika is to reuse existing parser libraries like - {{{http://pdfbox.apache.org/}PDFBox}} or - {{{http://poi.apache.org/}Apache POI}} as much as possible, and so most + {{{https://pdfbox.apache.org/}PDFBox}} or + {{{https://poi.apache.org/}Apache POI}} as much as possible, and so most of the parser classes in Tika are adapters to such external libraries. 
Tika also contains some general purpose parser implementations that are Modified: tika/site/src/site/apt/1.20/parser_guide.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.20/parser_guide.apt?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/1.20/parser_guide.apt (original) +++ tika/site/src/site/apt/1.20/parser_guide.apt Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -71,7 +71,7 @@ Get Tika parsing up and running in 5 min * (the "License"); you may not use this file except in compliance with * the License. You may obtain a copy of the License at * - * http://www.apache.org/licenses/LICENSE-2.0 + * https://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, software * distributed under the License is distributed on an "AS IS" BASIS, Modified: tika/site/src/site/apt/contribute.apt.vm URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/contribute.apt.vm?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/contribute.apt.vm (original) +++ tika/site/src/site/apt/contribute.apt.vm Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. 
You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -63,7 +63,7 @@ New Parsers, Detectors and Mime Types If your new Parser or Detector depends on libraries which we cannot include in Tika for license reasons, you are encouraged to list it on the - {{{http://wiki.apache.org/tika/3rd%20party%20parser%20plugins}3rd Party + {{{https://cwiki.apache.org/confluence/display/TIKA/3rd+party+parser+plugins}3rd Party Parser Plugins}} page on the Tika wiki. Submitting Enhancements and Fixes @@ -78,7 +78,7 @@ Submitting Enhancements and Fixes of changed and new files, including binaries, which can then be attached to an issue. - Github Pulls - If you are working from our + GitHub Pull Requests - If you are working from our {{{https://github.com/apache/tika/}GitHub mirror}}, it is possible to open a pull request for your change. Please include the JIRA Issue number in the pull request, so it can be linked by the ASF GitHub bot. @@ -101,7 +101,7 @@ Dependencies Any new dependencies introduced must be under a suitable license. Broadly, they must be Open Source, and must not place restrictions on larger works they are incorporated within. A list of the allowed licenses is maintained - by the {{{http://www.apache.org/legal/resolved.html}ASF Legal Affairs + by the {{{https://www.apache.org/legal/resolved.html}ASF Legal Affairs Committee}}. If in doubt, check on the dev list. All new and updated dependencies must be in Maven Central. 
(It is not @@ -133,12 +133,12 @@ Code Formatting Other Resources - * The {{{http://community.apache.org/}Apache Community Development + * The {{{https://community.apache.org/}Apache Community Development project (ComDev)}} provide general advice on getting started with contributing to Apache projects * The Apache Nutch project provide a comprehensive guide on - {{{http://wiki.apache.org/nutch/Becoming_A_Nutch_Developer}becoming a + {{{https://wiki.apache.org/nutch/Becoming_A_Nutch_Developer}becoming a Nutch Devloper}}, much of which applies equally for Apache Tika too * The book {{{http://manning.com/mattmann/}Tika in Action}} has a lot Modified: tika/site/src/site/apt/download.apt.vm URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/download.apt.vm?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/download.apt.vm (original) +++ tika/site/src/site/apt/download.apt.vm Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, Modified: tika/site/src/site/apt/index.apt.vm URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/index.apt.vm?rev=1857883&r1=1857882&r2=1857883&view=diff ============================================================================== --- tika/site/src/site/apt/index.apt.vm (original) +++ tika/site/src/site/apt/index.apt.vm Sat Apr 20 20:59:46 2019 @@ -9,7 +9,7 @@ ~~ (the "License"); you may not use this file except in compliance with ~~ the License. 
You may obtain a copy of the License at ~~ -~~ http://www.apache.org/licenses/LICENSE-2.0 +~~ https://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, @@ -34,8 +34,8 @@ Apache Tika - a content analysis toolkit or send an email to the {{{./mail-lists.html}Tika development list}}. Tika is a project of the - {{{http://www.apache.org/}Apache Software Foundation}}, and was formerly - a subproject of {{{http://lucene.apache.org/}Apache Lucene}}. + {{{https://www.apache.org/}Apache Software Foundation}}, and was formerly + a subproject of {{{https://lucene.apache.org/}Apache Lucene}}. Latest News @@ -49,7 +49,7 @@ Latest News [9 October 2018: Apache Tika Release] Apache Tika 1.19.1 has been released! This release includes two critical bug fixes: a) fixing the MP3Parser (TIKA-2730) and b) limiting entity expansions in SAX parsing (TIKA-2727). - Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.19.1.txt}CHANGES.txt}} + Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.19.1.txt}CHANGES.txt}} file for the full list of changes in the release and have a look at the download page for more information on how to obtain Apache Tika 1.19.1. @@ -57,14 +57,14 @@ Latest News [18 September 2018: Apache Tika Release] Apache Tika 1.19 has been released! This release requires Java 8. This release includes bug fixes, improved mime detection, security fixes and upgrades to dependencies. - Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.19.txt}CHANGES.txt}} + Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.19.txt}CHANGES.txt}} file for the full list of changes in the release and have a look at the download page for more information on how to obtain Apache Tika 1.19. [24 April 2018: Apache Tika Release] Apache Tika 1.18 has been released! This release includes bug fixes (e.g. 
extraction from grouped shapes in PPT), security fixes and upgrades to dependencies. <<PLEASE NOTE:>> The next versions will require Java 8. - Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.18.txt}CHANGES.txt}} + Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.18.txt}CHANGES.txt}} file for the full list of changes in the release and have a look at the download page for more information on how to obtain Apache Tika 1.18. @@ -72,7 +72,7 @@ Latest News Apache Tika 1.17 has been released! This release includes new support for automatic image captioning, as well as numerous bug fixes and upgrades to dependencies. <<PLEASE NOTE:>> this will be the last version that will support Java 7. The next versions will require Java 8. - Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.17.txt}CHANGES.txt}} + Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.17.txt}CHANGES.txt}} file for the full list of changes in the release and have a look at the download page for more information on how to obtain Apache Tika 1.17. @@ -80,21 +80,21 @@ Latest News Apache Tika 1.16 has been released! This release includes integration with USCDataScience's Age Predictor, more warnings for missing optional dependencies, extraction of text from charts and diagrams in ooxml files, and numerous improvements to mime detection. This release removes two dependencies that - may have been incompatible with ASL 2.0 -- org.json and jj2000. Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.16.txt}CHANGES.txt}} + may have been incompatible with ASL 2.0 -- org.json and jj2000. Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.16.txt}CHANGES.txt}} file for the full list of changes in the release and have a look at the download page for more information on how to obtain Apache Tika 1.16. [30 May 2017: Apache Tika Release] Apache Tika 1.15 has been released! 
This release includes integration with Google's Tensorflow Object Recognition via the OpenCV API, a new 'tika-eval' module, configurable encoding detectors - and several new parsers. Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.15.txt}CHANGES.txt}} + and several new parsers. Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.15.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.15. [19 Oct 2016: Apache Tika Release] Apache Tika 1.14 has been released! This release includes integration with Google's Tensorflow Image Recognition via the Inception API, improvements to PDF parsing using OCR, message parsing - and MIME detection. Please see the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.14.txt}CHANGES.txt}} + and MIME detection. Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.14.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the downlpage page for more information on how to obtain Apache Tika 1.14. @@ -102,7 +102,7 @@ Latest News Apache Tika 1.13 has been released! This release includes some significant changes to the PDF support, including PDFBox 2.0.1, two new NER system support (MIT-NLP Information Extraction and GROBID Quantities), a new tika-langdetect module, and much more. Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.13.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.13.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.13. @@ -117,7 +117,7 @@ Latest News to Named Entity Recognition (Stanford NER integration and Apache OpenNLP) and additionally efficiency improvements to the GeoTopicParser. There are also bugfixes to Tika REST server in this release. 
Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.12.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.12.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.12. @@ -126,7 +126,7 @@ Latest News that better utilize Java7 support, that help extract more content using the cTAKES clinical extraction system and GROBID journal parser, and improvements to Tesseract extraction. Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.11.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.11.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.11. @@ -134,7 +134,7 @@ Latest News Apache Tika 1.10 has been released! This release includes several improvements including the ability to parse MS Access Files, composite parser creation via Tika Config XML, and many more! Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.10.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.10.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.10. @@ -143,7 +143,7 @@ Latest News parsers that extract additional content e.g., from images using EXIF and FFMPEG, along with improvements to MIME detection using probabilistic means, and updates to the Tika REST server supporting translation and language detection. Please see - the {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.9.txt}CHANGES.txt}} + the {{{https://archive.apache.org/dist/tika/CHANGES-1.9.txt}CHANGES.txt}} file for a full list of changes in this release. Have a look at the download page for more information on how to obtain Apache Tika 1.9. 
@@ -151,7 +151,7 @@ Latest News Apache Tika 1.8 has been released! This release includes several bug fixes, tika-batch (a batch processing system for processing large sets of files), and more! Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.8.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.8.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.8. @@ -159,7 +159,7 @@ Latest News Apache Tika 1.7 has been released! This release includes bug fixes and new features including a new Tesseract OCR Parser; a new GDAL Parser; more supported formats, and overall improvements in Tika stability. Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.7.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.7.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.7. @@ -167,7 +167,7 @@ Latest News Apache Tika 1.6 has been released! This release includes bug fixes and new features including a new Translation API; more supported formats, and overall improvements in Tika stability. Please see the - {{{https://dist.apache.org/repos/dist/release/tika/CHANGES-1.6.txt}CHANGES.txt}} + {{{https://archive.apache.org/dist/tika/CHANGES-1.6.txt}CHANGES.txt}} file for a full list of changes in this release and have a look at the download page for more information on how to obtain Apache Tika 1.6. @@ -181,7 +181,7 @@ Latest News [19 Feb 2014: Apache Tika Release] Apache Tika 1.5 has been released! This release includes several important bugfixes - and new features. Please see the {{{http://www.apache.org/dist/tika/CHANGES-1.5.txt}CHANGES.txt}} + and new features. 
Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.5.txt}CHANGES.txt}} file for a full list of changes in this release, and have a look at the download page for more information on how to obtain Apache Tika 1.5. @@ -195,13 +195,13 @@ Latest News [3 July 2013: Apache Tika Release] Apache Tika 1.4 has been released! This release includes several important bugfixes - and new features. Please see the {{{http://www.apache.org/dist/tika/CHANGES-1.4.txt}CHANGES.txt}} + and new features. Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.4.txt}CHANGES.txt}} file for a full list of changes in this release, and have a look at the download page for more information on how to obtain Apache Tika 1.4. [22 January 2013: Apache Tika Release] Apache Tika 1.3 has been released! This release includes several important bugfixes - and new features. Please see the {{{http://www.apache.org/dist/tika/CHANGES-1.3.txt}CHANGES.txt}} + and new features. Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.3.txt}CHANGES.txt}} file for a full list of changes in this release, and have a look at the download page for more information on how to obtain Apache Tika 1.3. @@ -210,7 +210,7 @@ Latest News new core sub-modules, including the Tika JAX-RS Network Server, as well as new support for handling XMP metadata. Of course, new file formats have been added and improvements have been made to parsing and detection of existing - formats. Please see the {{{http://www.apache.org/dist/tika/CHANGES-1.2.txt}CHANGES.txt}} + formats. Please see the {{{https://archive.apache.org/dist/tika/CHANGES-1.2.txt}CHANGES.txt}} file for a full list of changes in this release, and have a look at the download page for more information on how to obtain Apache Tika 1.2. @@ -218,7 +218,7 @@ Latest News Apache Tika 1.1 is out the door! We've made a number of improvements to PDF, RTF and MP3 parsing. 
We've also provided some new features on the command line including the ability to list detectors. Other bug fixes and - improvements are listed in the {{{http://www.apache.org/dist/tika/CHANGES-1.1.txt}CHANGES.txt}} + improvements are listed in the {{{https://archive.apache.org/dist/tika/CHANGES-1.1.txt}CHANGES.txt}} file for this release. Have a look at the download page for more information on the release. @@ -412,9 +412,9 @@ Latest News [December 27th, 2007: Tika 0.1-incubating Released!] Tika has made its first official release, titled 0.1-incubating. See - the {{{http://www.apache.org/dist/incubator/tika/CHANGES-0.1-incubating.txt}CHANGES.txt}} + the {{{https://archive.apache.org/dist/incubator/tika/CHANGES-0.1-incubating.txt}CHANGES.txt}} file for more information on the list of updates in this initial release. Thanks to all - who contributed! You can download the official source tarball {{{http://www.apache.org/dyn/closer.cgi/incubator/tika}here}}. + who contributed! You can download the official source tarball {{{https://www.apache.org/dyn/closer.cgi/incubator/tika}here}}. [October 8th, 2007: Welcome Keith Bennett!] The Tika PPMC has {{{http://www.nabble.com/Please-welcome-Keith-Bennett-as-a-Tika-committer%21-tf4586151.html#a13107428}elected}} @@ -424,4 +424,4 @@ Latest News The Apache Tika project was formally started when the {{{http://wiki.apache.org/incubator/TikaProposal}Tika proposal}} was {{{http://mail-archives.apache.org/mod_mbox/incubator-general/200703.mbox/%[email protected]%3e}accepted}} - by the {{{http://incubator.apache.org/}Apache Incubator PMC}}. + by the {{{https://incubator.apache.org/}Apache Incubator PMC}}. 
Modified: tika/site/src/site/apt/mail-lists.apt
URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/mail-lists.apt?rev=1857883&r1=1857882&r2=1857883&view=diff
==============================================================================
--- tika/site/src/site/apt/mail-lists.apt (original)
+++ tika/site/src/site/apt/mail-lists.apt Sat Apr 20 20:59:46 2019
@@ -9,7 +9,7 @@
 ~~ (the "License"); you may not use this file except in compliance with
 ~~ the License. You may obtain a copy of the License at
 ~~
-~~ http://www.apache.org/licenses/LICENSE-2.0
+~~ https://www.apache.org/licenses/LICENSE-2.0
 ~~
 ~~ Unless required by applicable law or agreed to in writing, software
 ~~ distributed under the License is distributed on an "AS IS" BASIS,
@@ -21,8 +21,8 @@ Tika Mailing Lists
 Mailing lists are the primary communication channel of the Tika project. See the Apache
- {{{http://www.apache.org/foundation/mailinglists.html}mailing list}} and
- {{{http://www.apache.org/dev/contrib-email-tips.html}email contribution}}
+ {{{https://www.apache.org/foundation/mailinglists.html}mailing list}} and
+ {{{https://www.apache.org/dev/contrib-email-tips.html}email contribution}}
 pages for general information about the use of our mailing list and the expected list behaviour.
@@ -50,7 +50,7 @@ Subscribing and unsubscribing
 {{{https://lists.apache.org/[email protected]}lists.apache.org Tika pages}}. New users may also find the Apache
- {{{http://www.apache.org/foundation/mailinglists.html}mailing list}}
+ {{{https://www.apache.org/foundation/mailinglists.html}mailing list}}
 page useful for discovering additional mailing list features.
 If you have problems, please contact
Modified: tika/site/src/site/apt/security.apt
URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/security.apt?rev=1857883&r1=1857882&r2=1857883&view=diff
==============================================================================
--- tika/site/src/site/apt/security.apt (original)
+++ tika/site/src/site/apt/security.apt Sat Apr 20 20:59:46 2019
@@ -9,7 +9,7 @@
 ~~ (the "License"); you may not use this file except in compliance with
 ~~ the License. You may obtain a copy of the License at
 ~~
-~~ http://www.apache.org/licenses/LICENSE-2.0
+~~ https://www.apache.org/licenses/LICENSE-2.0
 ~~
 ~~ Unless required by applicable law or agreed to in writing, software
 ~~ distributed under the License is distributed on an "AS IS" BASIS,
@@ -143,13 +143,13 @@ Acronyms and Terms
 * Command Execution -- A malicious client could execute anything on tika-server's commandline
- * Deserialization Vulnerability- {{{https://www.owasp.org/index.php/Deserialization_Cheat_Sheet}OWASP's
- Cheat Sheet}}. A malicious actor could run arbitrary code on your computer.
+ * Deserialization Vulnerability -- {{{https://www.owasp.org/index.php/Deserialization_Cheat_Sheet}OWASP's
+ Cheat Sheet}}. A malicious actor could run arbitrary code on your computer.
- * OOM - Out of Memory Error - Parsers may allocate more memory than is available. This can sometimes be caused
+ * OOM -- Out of Memory Error -- Parsers may allocate more memory than is available. This can sometimes be caused
 by parsers not performing sanity checks before allocation. See, for example: {{{https://issues.apache.org/jira/browse/TIKA-1631}TIKA-1631}}
- * XXE - {{{https://www.owasp.org/index.php/XML_External_Entity_(XXE)_Processing}
+ * XXE -- {{{https://www.owasp.org/index.php/XML_External_Entity_(XXE)_Processing}
 XML External Entity Processing}} A malicious client could access data on your system.
Modified: tika/site/src/site/resources/doap.rdf
URL: http://svn.apache.org/viewvc/tika/site/src/site/resources/doap.rdf?rev=1857883&r1=1857882&r2=1857883&view=diff
==============================================================================
--- tika/site/src/site/resources/doap.rdf (original)
+++ tika/site/src/site/resources/doap.rdf Sat Apr 20 20:59:46 2019
@@ -13,12 +13,12 @@
 =======================================================================
 -->
- <Project rdf:about="http://tika.apache.org">
+ <Project rdf:about="https://tika.apache.org">
 <created>2007-03-22</created>
 <license rdf:resource="http://usefulinc.com/doap/licenses/asl20" />
 <name>Apache Tika</name>
- <homepage rdf:resource="http://tika.apache.org" />
- <asfext:pmc rdf:resource="http://tika.apache.org" />
+ <homepage rdf:resource="https://tika.apache.org" />
+ <asfext:pmc rdf:resource="https://tika.apache.org" />
 <shortdesc>The Apache Tika toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
 </shortdesc>
@@ -27,9 +27,9 @@
 applications that work with various kinds of digital documents to easily detect and extract metadata and content from all major file formats.
 </description>
- <bug-database rdf:resource="http://issues.apache.org/jira/browse/TIKA" />
- <mailing-list rdf:resource="http://tika.apache.org/mail-lists.html" />
- <download-page rdf:resource="http://tika.apache.org/download.html" />
+ <bug-database rdf:resource="https://issues.apache.org/jira/browse/TIKA" />
+ <mailing-list rdf:resource="https://tika.apache.org/mail-lists.html" />
+ <download-page rdf:resource="https://tika.apache.org/download.html" />
 <foaf:account>
 <foaf:OnlineAccount>
 <foaf:accountServiceHomepage rdf:resource="https://www.twitter.com/"/>
@@ -38,7 +38,7 @@
 </foaf:OnlineAccount>
 </foaf:account>
 <programming-language>Java</programming-language>
- <category rdf:resource="http://projects.apache.org/category/library" />
+ <category rdf:resource="https://projects.apache.org/projects.html?category#library" />
 <release>
 <Version>
 <name>Apache Tika 1.20</name>
@@ -250,10 +250,10 @@
 </Version>
 </release>
 <repository>
- <SVNRepository>
- <location rdf:resource="http://svn.apache.org/repos/asf/tika/"/>
- <browse rdf:resource="http://svn.apache.org/viewvc/tika/"/>
- </SVNRepository>
+ <GitRepository>
+ <location rdf:resource="https://github.com/apache/tika.git"/>
+ <browse rdf:resource="https://github.com/apache/tika/"/>
+ </GitRepository>
 </repository>
 </Project>
 </rdf:RDF>
Modified: tika/site/src/site/site.vm
URL: http://svn.apache.org/viewvc/tika/site/src/site/site.vm?rev=1857883&r1=1857882&r2=1857883&view=diff
==============================================================================
--- tika/site/src/site/site.vm (original)
+++ tika/site/src/site/site.vm Sat Apr 20 20:59:46 2019
@@ -10,7 +10,7 @@
 "License"); you may not use this file except in compliance with the License.
You may obtain a copy of the License at
- http://www.apache.org/licenses/LICENSE-2.0
+ https://www.apache.org/licenses/LICENSE-2.0
 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an
@@ -184,12 +184,12 @@
 <body onLoad="initProvider();">
 <div id="body">
 <div id="banner">
- <a href="http://tika.apache.org" id="bannerLeft" title="Apache Tika"
- ><img src="http://tika.apache.org/tika.png" alt="Apache Tika"
+ <a href="https://tika.apache.org" id="bannerLeft" title="Apache Tika"
+ ><img src="https://tika.apache.org/tika.png" alt="Apache Tika"
 width="292" height="100"/></a>
- <a href="http://www.apache.org/" id="bannerRight"
+ <a href="https://www.apache.org/" id="bannerRight"
 title="The Apache Software Foundation"
- ><img src="http://tika.apache.org/asf-logo.gif" alt="The Apache Software Foundation"
+ ><img src="https://tika.apache.org/asf-logo.gif" alt="The Apache Software Foundation"
 width="387" height="100"/></a>
 </div>
 <div id="content">
@@ -228,8 +228,8 @@
 <p>
 #set ( $currentYear = ${currentDate.year} + 1900 )
 Copyright © $currentYear
- <a href="http://www.apache.org/">The Apache Software Foundation</a>.
- Site powered by <a href="http://maven.apache.org/">Apache Maven</a>.
+ <a href="https://www.apache.org/">The Apache Software Foundation</a>.
+ Site powered by <a href="https://maven.apache.org/">Apache Maven</a>.
 Search powered by <a href="http://www.lucidimagination.com">Lucid Imagination</a> and <a href="http://sematext.com">Sematext</a>.
Modified: tika/site/src/site/site.xml
URL: http://svn.apache.org/viewvc/tika/site/src/site/site.xml?rev=1857883&r1=1857882&r2=1857883&view=diff
==============================================================================
--- tika/site/src/site/site.xml (original)
+++ tika/site/src/site/site.xml Sat Apr 20 20:59:46 2019
@@ -8,7 +8,7 @@
 "License"); you may not use this file except in compliance with the License.
You may obtain a copy of the License at
- http://www.apache.org/licenses/LICENSE-2.0
+ https://www.apache.org/licenses/LICENSE-2.0
 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an
@@ -20,13 +20,13 @@
 <project name="Apache Tika">
 <bannerLeft>
 <alt>Apache Tika</alt>
- <src>http://tika.apache.org/tika.png</src>
- <href>http://tika.apache.org/</href>
+ <src>https://tika.apache.org/tika.png</src>
+ <href>https://tika.apache.org/</href>
 </bannerLeft>
 <bannerRight>
 <alt>The Apache Software Foundation</alt>
- <src>http://tika.apache.org/asf-logo.gif</src>
- <href>http://www.apache.org/</href>
+ <src>https://tika.apache.org/asf-logo.gif</src>
+ <href>https://www.apache.org/</href>
 </bannerRight>
 <body>
 <head>
@@ -36,7 +36,7 @@
 <item name="Download" href="download.html"/>
 <item name="Contribute" href="contribute.html"/>
 <item name="Mailing Lists" href="mail-lists.html"/>
- <item name="Tika Wiki" href="http://wiki.apache.org/tika/"/>
+ <item name="Tika Wiki" href="https://cwiki.apache.org/confluence/display/tika"/>
 <item name="Issue Tracker" href="https://issues.apache.org/jira/browse/TIKA"/>
 <item name="Security" href="security.html"/>
 </menu>
@@ -175,11 +175,11 @@
 </item>
 </menu>
 <menu name="The Apache Software Foundation">
- <item name="About" href="http://www.apache.org/foundation/"/>
- <item name="License" href="http://www.apache.org/licenses/"/>
- <item name="Security" href="http://www.apache.org/security/"/>
- <item name="Sponsorship" href="http://www.apache.org/foundation/sponsorship.html"/>
- <item name="Thanks" href="http://www.apache.org/foundation/thanks.html"/>
+ <item name="About" href="https://www.apache.org/foundation/"/>
+ <item name="License" href="https://www.apache.org/licenses/"/>
+ <item name="Security" href="https://www.apache.org/security/"/>
+ <item name="Sponsorship" href="https://www.apache.org/foundation/sponsorship.html"/>
+ <item name="Thanks" href="https://www.apache.org/foundation/thanks.html"/>
 </menu>
 </body>
 </project>
