[ 
https://issues.apache.org/jira/browse/TIKA-1796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15009146#comment-15009146
 ] 

Nick Burch commented on TIKA-1796:
----------------------------------

All the container aware functionality is now available via DefaultDetector 
(assuming you have all of the necessary Tika dependencies on your classpath)

See http://wiki.apache.org/tika/Troubleshooting%20Tika#Detectors_Missing for 
how to check if you have all the detectors you should have via DefaultDector

> Issues with tika jar and Microsoft documents like doc.,ppt, xls etc
> -------------------------------------------------------------------
>
>                 Key: TIKA-1796
>                 URL: https://issues.apache.org/jira/browse/TIKA-1796
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 0.9
>         Environment: UNIX server
>            Reporter: Femi
>
> We have had a problem with tika-app-0.9.jar when it comes to using Microsoft 
> documents (we do not have issues with PDFs and images). It creates tika files 
> which are held by our weblogic java process.
> For example, if one runs the command :- lsof -p 27305|grep deleted
> java      27305  oracle  330r      REG              253,1   295674         68 
> /tmp/apache-tika-5125182301796025972.tmp (deleted)
> java      27305  oracle  334r      REG              253,1   272896         69 
> /tmp/apache-tika-8997882426533237375.tmp (deleted)
> java      27305  oracle  335r      REG              253,1   295674         78 
> /tmp/apache-tika-5232377327199509251.tmp (deleted)
> java      27305  oracle  336r      REG              253,1    45327         43 
> /tmp/apache-tika-6884061409786039638.tmp (deleted)
> java      27305  oracle  339r      REG              253,1   272895         41 
> /tmp/apache-tika-6752501215118342524.tmp (deleted)
> java      27305  oracle  340r      REG              253,1   272895         41 
> /tmp/apache-tika-6752501215118342524.tmp (deleted)
> java      27305  oracle  341r      REG              253,1    45327         75 
> /tmp/apache-tika-7548218713808428132.tmp (deleted)
> The above is a long list of held tika files from Microsoft docs in deleted 
> state but they are still handled by the weblogic process.
> The only way we can get these tika files closed or released is by restarting 
> the weblogic server.
> This cost us money as we had to stop the server to get rid of the tika files 
> filling up our tmp folder.
> We have had this issue for almost 3 years now. I have been researching on the 
> web to see if there are solutions out there in an upgraded tika-jar but it 
> seems there are none.
> I was thinking it will be resolved in an upgraded jar file but it seems that 
> is not the case.
> Please is there any solution to this issue?
> Regards,
> Femi Balogun,
> Application Support Engineer,



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to