[ 
https://issues.apache.org/jira/browse/TIKA-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14542386#comment-14542386
 ] 

Chris A. Mattmann commented on TIKA-1622:
-----------------------------------------

hi [~tledoux] so I tried the patch out, and for whatever reason, Tika's 
language identifier detects the corrected french sentence as italian, and thus 
fails the unit test. Sigh. FYI this:

{noformat}
-------------------------------------------------------
 T E S T S
-------------------------------------------------------
Running org.apache.tika.server.DetectorResourceTest
Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1.774 sec - in 
org.apache.tika.server.DetectorResourceTest
Running org.apache.tika.server.LanguageResourceTest
Tests run: 4, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 0.277 sec <<< 
FAILURE! - in org.apache.tika.server.LanguageResourceTest
testDetectFrenchString(org.apache.tika.server.LanguageResourceTest)  Time 
elapsed: 0.048 sec  <<< FAILURE!
org.junit.ComparisonFailure: expected:<[fr]> but was:<[it]>
        at org.junit.Assert.assertEquals(Assert.java:115)
        at org.junit.Assert.assertEquals(Assert.java:144)
        at 
org.apache.tika.server.LanguageResourceTest.testDetectFrenchString(LanguageResourceTest.java:82)

testDetectFrenchFile(org.apache.tika.server.LanguageResourceTest)  Time 
elapsed: 0.031 sec  <<< FAILURE!
org.junit.ComparisonFailure: expected:<[fr]> but was:<[it]>
        at org.junit.Assert.assertEquals(Assert.java:115)
        at org.junit.Assert.assertEquals(Assert.java:144)
        at 
org.apache.tika.server.LanguageResourceTest.testDetectFrenchFile(LanguageResourceTest.java:106)
{noformat}


> Expose Tika LanguageIdentifier via Tika Server
> ----------------------------------------------
>
>                 Key: TIKA-1622
>                 URL: https://issues.apache.org/jira/browse/TIKA-1622
>             Project: Tika
>          Issue Type: Bug
>          Components: languageidentifier, server
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.9
>
>         Attachments: TIKA-1622-commeci.patch
>
>
> The LanguageIdentifier in Tika should be exposed via Tika JAX-RS.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to