I see on the wiki that HTTPS is supported by protocol-httpclient but not
protocol-http.
However, protocol-httpclient is not recommended for use (
https://issues.apache.org/jira/browse/NUTCH-990).
Is there a plan for supporting HTTPS? Happy to help implement if possible :)
Thanks
Matt
don't need
any plugins for that
HTH
Julien
On 12 July 2011 13:35, Matthew Painter matthew.pain...@kusiri.com
wrote:
Hi all,
I was wondering about the feasibility of creating a plugin for nutch
that create a solr update command, and added it to a queue for
indexing
Hi all,
I was wondering about the feasibility of creating a plugin for nutch that
create a solr update command, and added it to a queue for indexing after it
first parses the page, rather than when crawling has finished.
This would allow you to do real-time indexing when crawling.
Drawbacks:
Hi all,
I was wondering about the feasibility of creating a plugin for nutch that
create a solr update command, and added it to a queue for indexing after it
first parses the page, rather than when crawling has finished.
This would allow you to do real-time indexing when crawling.
Drawbacks:
-parse-update-linkdb sequence. You don't need any plugins
for that
HTH
Julien
On 12 July 2011 13:35, Matthew Painter matthew.pain...@kusiri.comwrote:
Hi all,
I was wondering about the feasibility of creating a plugin for nutch that
create a solr update command, and added it to a queue
5 matches
Mail list logo