Hello,
I started a bigger fetch using latest HTMLParser code and got some
NullPointer exceptions from HTMLParser class (line 201)
if (metaTags.getRefresh()) {
status.setMinorCode(ParseStatus.SUCCESS_REDIRECT);
201: status.setMessage(metaTags.getRefreshHref().toString());
}
Example of problematic URL:
http://calgary.foundlocally.com/Travel/Attr-CityWalks.htm
The problem is with meta tags:
Meta tags for http://calgary.foundlocally.com/Travel/Attr-CityWalks.htm:
base=null, noCache=false, noFollow=false, noIndex=false, refresh=true,
refreshHref=null
So it simply has empty refreshHref so some check before invoking
toString() is needed. I am not sure what should be done here right now
- but probably this case should not be handled as redirect.
Regards
Piotr