[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney updated NUTCH-1360: ---------------------------------------- Attachment: NUTCH-1360-trunkv4.patch Hi [~jnioche] thank you for review. Yes you are certainly right, this is expensive indeed. The attached patch obtains the Configuration and NutchConfiguration from accessing HttpBase.getConf(). There was no existing getConf() implementation for protocol-http's Http class, howeever the Configuration is the same as this class extends HttpBase anyways. If this is OK I will go and change this for 2.x HEAD as well. > Suport the storing of IP address connected to when web crawling > --------------------------------------------------------------- > > Key: NUTCH-1360 > URL: https://issues.apache.org/jira/browse/NUTCH-1360 > Project: Nutch > Issue Type: New Feature > Components: protocol > Affects Versions: nutchgora, 1.5 > Reporter: Lewis John McGibbney > Assignee: Lewis John McGibbney > Priority: Minor > Fix For: 2.3, 1.8 > > Attachments: NUTCH-1360-NUTCH-289-nutch-1.5.1.patch, > NUTCH-1360-nutchgora-v2.patch, NUTCH-1360-nutchgora.patch, > NUTCH-1360-trunk.patch, NUTCH-1360-trunkv2.patch, NUTCH-1360-trunkv3.patch, > NUTCH-1360-trunkv4.patch, NUTCH-1360v3.patch, NUTCH-1360v4.patch, > NUTCH-1360v5.patch > > > Simple issue enabling us to capture the specific IP address of the host which > we connect to to fetch a page. -- This message was sent by Atlassian JIRA (v6.1.5#6160)