[ 
https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15536363#comment-15536363
 ] 

susserj commented on TIKA-2105:
-------------------------------

Hi Tim

I tried added chcp 65001 which didn't work and then I tried chcp 1252 which 
didn't work.
I tried it directly in the cmd prompt and then in I executed the command 
"c:\Program Files (x86)\Java\jre7\bin\java" -jar c:\temp\tika-app-1.13.jar -m 
"S:\0300 Audit and Evaluation\360 Internal Audits\2008-09\2009-10\IC IT 
Environment 2009\français.docx".

Again, received the same errors messages.
Exception in thread "main" java.net.MalformedURLException: unknown protocol: s
        at java.net.URL.<init>(Unknown Source)
        at java.net.URL.<init>(Unknown Source)
        at java.net.URL.<init>(Unknown Source)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:472)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:145)



> Unable to process documents with french accents in filenames
> ------------------------------------------------------------
>
>                 Key: TIKA-2105
>                 URL: https://issues.apache.org/jira/browse/TIKA-2105
>             Project: Tika
>          Issue Type: Bug
>          Components: batch
>    Affects Versions: 1.13
>         Environment: Windows 7, Java version 1.7.0.111
>            Reporter: susserj
>
> When I execute the following batch test1.bat script from my command prompt,  
> I get this error message:
> test1.bat
> @echo off
> "C:\Program Files (x86)\Java\jre7\bin\java" -jar c:\temp\tika-app-1.13.jar -m 
> "S:\2008-09\2009-10\IC IT Environment 2009\français.docx"
> Error:
> Exception in thread "main" java.net.MalformedURLException: unknown protocol: s
>         at java.net.URL.<init>(Unknown Source)
>         at java.net.URL.<init>(Unknown Source)
>         at java.net.URL.<init>(Unknown Source)
>         at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:472)
>         at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:145)
> When the filenames don't have special French characters, it works fine. (I 
> cannot change the names of  all the files that need to be processed).
> I apologise, my experience with java and TIKA is very limited.
> Thanks



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to