susserj created TIKA-2105: ----------------------------- Summary: Unable to process documents with french accents in filenames Key: TIKA-2105 URL: https://issues.apache.org/jira/browse/TIKA-2105 Project: Tika Issue Type: Bug Components: batch Affects Versions: 1.13 Environment: Windows 7, Java version 1.7.0.111 Reporter: susserj
When I execute the following batch test1.bat script from my command prompt, I get this error message: test1.bat @echo off "C:\Program Files (x86)\Java\jre7\bin\java" -jar c:\temp\tika-app-1.13.jar -m "S:\2008-09\2009-10\IC IT Environment 2009\français.docx" Error: Exception in thread "main" java.net.MalformedURLException: unknown protocol: s at java.net.URL.<init>(Unknown Source) at java.net.URL.<init>(Unknown Source) at java.net.URL.<init>(Unknown Source) at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:472) at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:145) When the filenames don't have special French characters, it works fine. (I cannot change the names of all the files that need to be processed). I apologise, my experience with java and TIKA is very limited. Thanks -- This message was sent by Atlassian JIRA (v6.3.4#6332)