[
https://issues.apache.org/jira/browse/TIKA-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683363#action_12683363
]
Uwe Schindler edited comment on TIKA-200 at 3/19/09 1:41 AM:
-------------------------------------------------------------
For a more advanced parsing of content type and also support of compressed HTTP
streams, have a look at
http://panfmp.svn.sourceforge.net/viewvc/panfmp/main/trunk/src/de/pangaea/metadataportal/harvester/OAIHarvesterBase.java?view=markup
line 177 ff.
This is a nice method that creates a SAX InputSource with all properties
correctly set from an HTTP URL with some extra features. The InputSource with
only a given SystemID does not support compression, retry-after. For the
underlying parser to work correct, the charset encoding should be set (if
available from the HTTP response). This more complex example was needed for an
OAI-PMH harvester for effective metadata harvesting with compression,
last-modified and so on.
was (Author: thetaphi):
For a more advanced parsing of content type and also support of compressed
HTTP streams, have a look at
http://panfmp.svn.sourceforge.net/viewvc/panfmp/main/trunk/src/de/pangaea/metadataportal/harvester/OAIHarvesterBase.java?view=markup
line 177 ff.
This is a nice method that creates a SAX InputSource with all properties
correctly set from an HTTP urlswith some extra features, the InputSource with
only a given SystemID does not support (compression, retry-after). For the
underlying parser to work correct, the charset encoding should be set (if
available from the HTTP response). This mpore complex example was needed for an
OAI-PMH harvester for effective metadata harvesting with compression and so on.
> Allow URL drag and drop in the Tika GUI
> ---------------------------------------
>
> Key: TIKA-200
> URL: https://issues.apache.org/jira/browse/TIKA-200
> Project: Tika
> Issue Type: New Feature
> Components: gui
> Reporter: Jukka Zitting
> Priority: Minor
> Fix For: 0.4
>
> Attachments: TIKA-200.diff
>
>
> It would be nice if I could drag a URL from my browser to the Tika GUI window
> to have the linked document downloaded and parsed by Tika.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.