W00t!  I'm wondering if we should add them to our containers?  @Dave
Meikle @Konstantin Gribov  @lewi...@apache.org and other fellow devs,
wdyt?

‪On Mon, May 1, 2023 at 2:20 PM ‫שי ברק‬‎ <shai...@gmail.com> wrote:‬
>
> Adding these lines to my docker file works for me!
> Thank you very much :)
> On Mon, 1 May 2023 at 20:45 Tim Allison <talli...@apache.org> wrote:
>>
>> I'm sorry that I can't debug this for you without the triggering file.
>> My memory is that you have to set the locale in the container along
>> these lines: 
>> https://stackoverflow.com/questions/28405902/how-to-set-the-locale-inside-a-debian-ubuntu-docker-container
>>
>> I defer to fellow Tika devs and others who know more about this on how
>> to fix it. :(
>>
>>
>> ‪On Mon, May 1, 2023 at 1:04 PM ‫שי ברק‬‎ <shai...@gmail.com> wrote:‬
>> >
>> > It’s not working for me.
>> > I’m trying to pass in the entrypoint on my Dockerfile the 
>> > ‘-Dfile.encoding=UTF-8’.
>> > Like this:
>> >
>> > ```
>> > FROM apache/tika:2.7.0.1-full
>> > ADD tika-config.xml /tika-config.xml
>> > ENTRYPOINT [ "/bin/sh", "-c", "exec java -Dfile.encoding=UTF-8 -cp 
>> > \"/tika-server-standard-2.7.0.jar:/tika-extras/*\" 
>> > org.apache.tika.server.core.TikaServerCli -h 0.0.0.0 --config 
>> > /tika-config.xml $0 $@"]
>> > ```
>> >
>> > But I get the same exception.
>> > Any other suggestions?
>> >
>> > On Mon, 1 May 2023 at 17:25 Tim Allison <talli...@apache.org> wrote:
>> >>
>> >> How is the locale set on your server?
>> >>
>> >> Maybe this will help?
>> >> https://superuser.com/questions/1673843/extract-files-with-special-characters-in-filenames-from-a-rar-archive-using-the
>> >>
>> >> ‪On Sun, Apr 30, 2023 at 12:44 PM ‫שי ברק‬‎ <shai...@gmail.com> wrote:‬
>> >> >
>> >> > Hey,
>> >> > I'm using the tika server 2.7.0 and trying to unpack RAR files.
>> >> > I've noticed that when the rar contains files that there names are in 
>> >> > English, it works fine but
>> >> > when it's not I get an exception that says:
>> >> >
>> >> > INFO  [qtp573102881-28] 16:39:34,983 
>> >> > org.apache.tika.server.core.resource.UnpackerResource /unpack 
>> >> > (autodetecting type)
>> >> > WARN  [qtp573102881-28] 16:39:41,238 
>> >> > org.apache.tika.server.core.resource.UnpackerResource unpack/all: Text 
>> >> > extraction failed (null)
>> >> > org.apache.tika.exception.TikaException: Unrecoverable problem with rar 
>> >> > file, exitValue=9 :
>> >> > Cannot create ?? ??? ????? ?????????.pdf
>> >> > No such file or directory
>> >> > Cannot create ???? ?????? ??????
>> >> >         at 
>> >> > org.apache.tika.parser.pkg.UnrarParser.parse(UnrarParser.java:101) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:195)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.tika.server.core.resource.TikaResource.parse(TikaResource.java:352)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.tika.server.core.resource.UnpackerResource.process(UnpackerResource.java:145)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.tika.server.core.resource.UnpackerResource.unpackAll(UnpackerResource.java:109)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native 
>> >> > Method) ~[?:?]
>> >> >         at 
>> >> > jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
>> >> >  ~[?:?]
>> >> >         at 
>> >> > jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> >> >  ~[?:?]
>> >> >         at java.lang.reflect.Method.invoke(Method.java:568) ~[?:?]
>> >> >         at 
>> >> > org.apache.cxf.service.invoker.AbstractInvoker.performInvocation(AbstractInvoker.java:179)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.service.invoker.AbstractInvoker.invoke(AbstractInvoker.java:96)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.jaxrs.JAXRSInvoker.invoke(JAXRSInvoker.java:201) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.jaxrs.JAXRSInvoker.invoke(JAXRSInvoker.java:104) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.interceptor.ServiceInvokerInterceptor$1.run(ServiceInvokerInterceptor.java:59)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.interceptor.ServiceInvokerInterceptor.handleMessage(ServiceInvokerInterceptor.java:96)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:265)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:247)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:79)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:235)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1440)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:190)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1355)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:191)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at org.eclipse.jetty.server.Server.handle(Server.java:516) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.HttpChannel.lambda$handle$1(HttpChannel.java:487)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.HttpChannel.dispatch(HttpChannel.java:732) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:479) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:277)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:311)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:105) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.io.ChannelEndPoint$1.run(ChannelEndPoint.java:104) 
>> >> > ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:883)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at 
>> >> > org.eclipse.jetty.util.thread.QueuedThreadPool$Runner.run(QueuedThreadPool.java:1034)
>> >> >  ~[tika-server-standard-2.7.0.jar:2.7.0]
>> >> >         at java.lang.Thread.run(Thread.java:833) ~[?:?]
>> >> >
>> >> > How can I work around this issue?

Reply via email to