[ http://jira.codehaus.org/browse/WAGON-218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joakim Erdfelt closed WAGON-218.
--------------------------------

    Resolution: Fixed

Work completed in revision 662070.

After a proof-of-concept LinkParser replacement in a wagon-http-with-webdav branch and a discussion on the [EMAIL PROTECTED] mailing list, the following changes have been made:

1) Replaced jtidy with nekohtml.
   This resulted in a smaller dependency list and improved memory utilization.
2) Replaced reliance on String URL manipulation with use of java.net.URI.
   This change makes the detection of content that belongs to the page more accurate, and makes some complex relative URI resolution almost trivial.
3) Added more unit tests for real-world scenarios encountered since the original implementation was let loose on the world.

> Link Parsing in http is flawed
> ------------------------------
>
>                 Key: WAGON-218
>                 URL: http://jira.codehaus.org/browse/WAGON-218
>             Project: Maven Wagon
>          Issue Type: Improvement
>          Components: wagon-http, wagon-http-lightweight
>    Affects Versions: 1.0-beta-2
>            Reporter: Joakim Erdfelt
>            Assignee: Joakim Erdfelt
>
> The link parsing in wagon http has a few issues:
> a) Not all links are detected.
> b) The various ways that page content is identified via URL string
>    manipulation do not work in many example cases.
> c) The use of jtidy introduces a large dependency and high memory usage.
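
For readers unfamiliar with the java.net.URI approach mentioned in change 2, the following is a minimal sketch of how link resolution and "does this link belong to the page" detection might look. It is not the code committed in the revision above. The class name LinkResolutionSketch, the method relativeChild, and the repo.example.org URLs are all hypothetical and exist only for illustration.

```java
import java.net.URI;

// Hypothetical illustration only; not the Wagon LinkParser implementation.
class LinkResolutionSketch {

    /**
     * Resolves a raw href against the page's base URI and returns the path
     * relative to the base, or null when the link does not point to content
     * underneath the base (parent links, other hosts, mailto:, and so on).
     */
    static String relativeChild(URI base, String href) {
        URI resolved;
        try {
            // resolve() handles "../", "./", absolute paths and full URLs
            resolved = base.resolve(href.trim()).normalize();
        } catch (IllegalArgumentException e) {
            return null; // href is not a syntactically valid URI
        }

        // relativize() returns the given URI unchanged when it is not
        // located under the base, so an absolute result means "not ours".
        URI relative = base.relativize(resolved);
        if (relative.isAbsolute()) {
            return null;
        }

        // Links that resolve to the base itself have an empty relative path.
        String path = relative.getPath();
        if (path == null || path.isEmpty()) {
            return null;
        }
        return path;
    }

    public static void main(String[] args) {
        URI base = URI.create("http://repo.example.org/maven2/org/");
        String[] hrefs = {
            "commons-io/", "../", "/maven2/org/apache/",
            "http://other.example.com/x/", "mailto:admin@example.org"
        };
        for (String href : hrefs) {
            System.out.println(href + " -> " + relativeChild(base, href));
        }
    }
}
```

The sketch relies on URI.resolve and URI.relativize rather than on substring checks against the page URL, which is the general idea behind change 2. A real link parser would also need to deal with hrefs that contain unescaped characters such as spaces; here they are simply skipped.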