[
https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris Harris updated SOLR-284:
------------------------------
Attachment: rich.patch
Here's a new version of rich.patch. My previous attempt didn't actually include
all the necessary files! (Curses upon you, TortoiseSVN.) This one also includes
preliminary support for plaintext and HTML files. (HTML support is done by
running the input through the HTMLStripReader.)
> Parsing Rich Document Types
> ---------------------------
>
> Key: SOLR-284
> URL: https://issues.apache.org/jira/browse/SOLR-284
> Project: Solr
> Issue Type: New Feature
> Components: update
> Affects Versions: 1.3
> Reporter: Eric Pugh
> Fix For: 1.3
>
> Attachments: libs.zip, rich.patch, rich.patch, rich.patch,
> source.zip, test-files.zip, test.zip
>
>
> I have developed a RichDocumentRequestHandler based on the CSVRequestHandler
> that supports streaming a PDF, Word, Powerpoint, Excel, or PDF document into
> Solr.
> There is a wiki page with information here:
> http://wiki.apache.org/solr/UpdateRichDocuments
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.