[ 
https://issues.apache.org/jira/browse/SOLR-10350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Waleed Raza updated SOLR-10350:
-------------------------------
    Comment: was deleted

(was: Aby bata dega to kia mar jayega)

> By posting documents by post.jar i saw that it uses 
> org.apache.tika.parser.txt.TXTParser" how can i change the parse that it also 
> extract text from images which are inside pdf and also separate images like 
> jpg
> -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-10350
>                 URL: https://issues.apache.org/jira/browse/SOLR-10350
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: Schema and Analysis
>    Affects Versions: 6.4.1
>            Reporter: Waleed Raza
>




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to