[ https://issues.apache.org/jira/browse/TIKA-3620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney resolved TIKA-3620. ---------------------------------------- Resolution: Fixed https://tika.apache.org/2.2.0/detection.html#Language_Detection > Language detection documentation needs attention > ------------------------------------------------ > > Key: TIKA-3620 > URL: https://issues.apache.org/jira/browse/TIKA-3620 > Project: Tika > Issue Type: Improvement > Components: languageidentifier > Affects Versions: 2.1.0 > Reporter: Lewis John McGibbney > Assignee: Lewis John McGibbney > Priority: Major > Fix For: 2.2.0 > > > This language identifier/detection suffers from a few problems > # Clarity is needed on identifier/identification Vs detector/detection. Which > is it? The source code says identifier whereas the [documentation is nested > under > detection|https://tika.apache.org/2.1.0/detection.html#Language_Detection]. > # The > [org.apache.tika.language.LanguageIdentifier|https://tika.apache.org/2.1.0/api/org/apache/tika/language/LanguageIdentifier.html] > returns 404. What is this meant to resolve to? > # Generally speaking the [documentation is literally > non-existent|https://tika.apache.org/2.1.0/detection.html#Language_Detection]. > I checked the wiki and failed to find anything. I did find some [minor > documentation|https://tika.apache.org/2.1.0/examples.html#Language_Identification] > but this is also severely lacking. Also note the broken hyperlink. > Some suggestions for improvement > # Fix the broken hyperlinks. > # Hyperlink to the existing example namely > [LanguageDetectorExample.java|https://github.com/apache/tika/blob/main/tika-example/src/main/java/org/apache/tika/example/LanguageDetectorExample.java], > > [LanguageDetectingParser.java|https://github.com/apache/tika/blob/main/tika-example/src/main/java/org/apache/tika/example/LanguageDetectingParser.java] > and > [Language.java|https://github.com/apache/tika/blob/main/tika-example/src/main/java/org/apache/tika/example/Language.java] > # Hyperlink to the [LanguageDetector > Javadoc|https://tika.apache.org/2.1.0/api/index.html?org/apache/tika/language/detect/LanguageDetector.html] > and atleast mention some of the other implementations. -- This message was sent by Atlassian Jira (v8.20.1#820001)