[jira] [Updated] (SOLR-2332) TikaEntityProcessor retrieves only File Names from Zip extraction

2012-09-07 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-2332:
---

Fix Version/s: (was: 4.0)

removing fixVersion=4.0 since there is no evidence that anyone is currently 
working on this issue.  (this can certainly be revisited if volunteers step 
forward)


> TikaEntityProcessor retrieves only File Names from Zip extraction
> -
>
> Key: SOLR-2332
> URL: https://issues.apache.org/jira/browse/SOLR-2332
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - DataImportHandler
>Reporter: Jayendra Patil
> Attachments: SOLR-2332.patch, solr-word.zip
>
>
> Extraction of Zip files using TikaEntityProcessor results in only names of 
> file.
> It does not extract the contents of the Files in the Zip

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2332) TikaEntityProcessor retrieves only File Names from Zip extraction

2011-03-17 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-2332:
---

Affects Version/s: (was: 4.0)
Fix Version/s: 3.2

I can't find any docs suggestion how exactly TikaEntityProcessor should be 
expected to deal with zip files, particularly what to expect if a zip files 
contains multiple documents.

FWIW: TikaEntityProcessor did not exist in Solr 1.4.1, so the behavior 
currently seen in the 3x branch (and the 3.1rc1 artifacts) is not a regression.

> TikaEntityProcessor retrieves only File Names from Zip extraction
> -
>
> Key: SOLR-2332
> URL: https://issues.apache.org/jira/browse/SOLR-2332
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - DataImportHandler
>Reporter: Jayendra Patil
> Fix For: 3.2
>
> Attachments: SOLR-2332.patch, solr-word.zip
>
>
> Extraction of Zip files using TikaEntityProcessor results in only names of 
> file.
> It does not extract the contents of the Files in the Zip

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2332) TikaEntityProcessor retrieves only File Names from Zip extraction

2011-01-23 Thread Jayendra Patil (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jayendra Patil updated SOLR-2332:
-

Attachment: solr-word.zip
SOLR-2332.patch

Attached is the Patch for the fix and Testcase.
Also attached is the Test zip file.

> TikaEntityProcessor retrieves only File Names from Zip extraction
> -
>
> Key: SOLR-2332
> URL: https://issues.apache.org/jira/browse/SOLR-2332
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - DataImportHandler
>Affects Versions: 4.0
>Reporter: Jayendra Patil
> Attachments: SOLR-2332.patch, solr-word.zip
>
>
> Extraction of Zip files using TikaEntityProcessor results in only names of 
> file.
> It does not extract the contents of the Files in the Zip

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org