[ http://issues.apache.org/jira/browse/NUTCH-53?page=all ]
Rohit Kulkarni updated NUTCH-53:
--------------------------------
Attachment: parse-zip.zip
The plugin is tested with the latest nutch SVN and seems to work
fine.
Currently handles and calls parsers for the following types of files within the
zip file..
text/plain
text/html
msexcel
mspowerpoint
msword
pdf
rtf
mp3
zip
Please try it out and let me know if anyone has any suggestions.
Plugin is attached as a zip file
thanks,
Rohit & Ashish
> Parser plugin for Zip files
> ---------------------------
>
> Key: NUTCH-53
> URL: http://issues.apache.org/jira/browse/NUTCH-53
> Project: Nutch
> Type: Improvement
> Components: fetcher
> Reporter: Rohit Kulkarni
> Priority: Trivial
> Attachments: parse-zip.zip
>
> Nutch plugin to parse Zip files (using java.util.zip)
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers