[ https://issues.apache.org/jira/browse/NUTCH-2280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Steve Yao updated NUTCH-2280: ----------------------------- Attachment: NUTCH-2280.YAO.160712.patch.txt format the code with eclipse format with 2 spaces. > HTTP Post form authentication CookiePolicy configuration > -------------------------------------------------------- > > Key: NUTCH-2280 > URL: https://issues.apache.org/jira/browse/NUTCH-2280 > Project: Nutch > Issue Type: New Feature > Components: protocol > Affects Versions: 1.11 > Reporter: Steve Yao > Priority: Minor > Labels: authentication > Attachments: NUTCH-2280.YAO.160705.patch.txt, > NUTCH-2280.YAO.160712.patch.txt > > Original Estimate: 168h > Remaining Estimate: 168h > > The protocol-httpclient plugin supports HTTP form authentication with form > values post back to the assigned login URL and store the session cookie for > following content retrieving. > The httpclient default CookiePolicy setting is in use. This default setting > will reject cookie has domain set starting as ".", for example > domain=".domain.com". This kind of domain value could be accepted by most web > browsers. > I suggest to add an configurable option in conf/httpclient-auth.xml: > {code:xml}<credentials authMethod="formMethod" ...> > ... > <loginCookie> > <policy>DEFAULT | BROWSER_COMPATIBILITY | NETSCAPE RFC_2109 | > RFC_2965</policy> > </loginCookie> > </credentials>{code} > Then, the httpclient could take this Cookie policy value. > I am working on a patch for this feature. But before i implement the > configuration format change, i would like to hear any other suggestions or > comments. -- This message was sent by Atlassian JIRA (v6.3.4#6332)