[ 
https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-1360:
----------------------------------------

    Attachment: NUTCH-1360-trunkv4.patch

Hi [~jnioche] thank you for review. Yes you are certainly right, this is 
expensive indeed. The attached patch obtains the Configuration and 
NutchConfiguration from accessing HttpBase.getConf(). 
There was no existing getConf() implementation for protocol-http's Http class, 
howeever the Configuration is the same as this class extends HttpBase anyways.
If this is OK I will go and change this for 2.x HEAD as well.   

> Suport the storing of IP address connected to when web crawling
> ---------------------------------------------------------------
>
>                 Key: NUTCH-1360
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1360
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>    Affects Versions: nutchgora, 1.5
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 2.3, 1.8
>
>         Attachments: NUTCH-1360-NUTCH-289-nutch-1.5.1.patch, 
> NUTCH-1360-nutchgora-v2.patch, NUTCH-1360-nutchgora.patch, 
> NUTCH-1360-trunk.patch, NUTCH-1360-trunkv2.patch, NUTCH-1360-trunkv3.patch, 
> NUTCH-1360-trunkv4.patch, NUTCH-1360v3.patch, NUTCH-1360v4.patch, 
> NUTCH-1360v5.patch
>
>
> Simple issue enabling us to capture the specific IP address of the host which 
> we connect to to fetch a page.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to