BasicURLNormalizer should collapse runs of slashes with a single slash
----------------------------------------------------------------------
Key: NUTCH-620
URL: https://issues.apache.org/jira/browse/NUTCH-620
Project: Nutch
Issue Type: Bug
Components: fetcher
Affects Versions: 0.9.0
Environment: JDK 1.6 update 5, Tomcat 6, Windows Server 2003,
Reporter: Mark DeSpain
Fix For: 1.0.0
The BasicURLNormalizer should collapse runs of slash characters '/' with a
single slash.
For example, the following URLs should be normalized to
http://lucene.apache.org/nutch/about.html
* http://lucene.apache.org/nutch//about.html
* http://lucene.apache.org//nutch/about.html
* http://lucene.apache.org/////nutch////about.html (an exaggerated example)
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.