[ https://issues.apache.org/jira/browse/TIKA-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14542386#comment-14542386 ]
Chris A. Mattmann commented on TIKA-1622: ----------------------------------------- hi [~tledoux] so I tried the patch out, and for whatever reason, Tika's language identifier detects the corrected french sentence as italian, and thus fails the unit test. Sigh. FYI this: {noformat} ------------------------------------------------------- T E S T S ------------------------------------------------------- Running org.apache.tika.server.DetectorResourceTest Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1.774 sec - in org.apache.tika.server.DetectorResourceTest Running org.apache.tika.server.LanguageResourceTest Tests run: 4, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 0.277 sec <<< FAILURE! - in org.apache.tika.server.LanguageResourceTest testDetectFrenchString(org.apache.tika.server.LanguageResourceTest) Time elapsed: 0.048 sec <<< FAILURE! org.junit.ComparisonFailure: expected:<[fr]> but was:<[it]> at org.junit.Assert.assertEquals(Assert.java:115) at org.junit.Assert.assertEquals(Assert.java:144) at org.apache.tika.server.LanguageResourceTest.testDetectFrenchString(LanguageResourceTest.java:82) testDetectFrenchFile(org.apache.tika.server.LanguageResourceTest) Time elapsed: 0.031 sec <<< FAILURE! org.junit.ComparisonFailure: expected:<[fr]> but was:<[it]> at org.junit.Assert.assertEquals(Assert.java:115) at org.junit.Assert.assertEquals(Assert.java:144) at org.apache.tika.server.LanguageResourceTest.testDetectFrenchFile(LanguageResourceTest.java:106) {noformat} > Expose Tika LanguageIdentifier via Tika Server > ---------------------------------------------- > > Key: TIKA-1622 > URL: https://issues.apache.org/jira/browse/TIKA-1622 > Project: Tika > Issue Type: Bug > Components: languageidentifier, server > Reporter: Chris A. Mattmann > Assignee: Chris A. Mattmann > Fix For: 1.9 > > Attachments: TIKA-1622-commeci.patch > > > The LanguageIdentifier in Tika should be exposed via Tika JAX-RS. -- This message was sent by Atlassian JIRA (v6.3.4#6332)