Hello,
I started a bigger fetch using latest HTMLParser code and got some NullPointer exceptions from HTMLParser class (line 201)

        if (metaTags.getRefresh()) {
              status.setMinorCode(ParseStatus.SUCCESS_REDIRECT);
201:           status.setMessage(metaTags.getRefreshHref().toString());
         }

Example of problematic URL: http://calgary.foundlocally.com/Travel/Attr-CityWalks.htm
The problem is with meta tags:
Meta tags for http://calgary.foundlocally.com/Travel/Attr-CityWalks.htm: base=null, noCache=false, noFollow=false, noIndex=false, refresh=true, refreshHref=null

So it simply has empty refreshHref so some check before invoking toString() is needed. I am not sure what should be done here right now - but probably this case should not be handled as redirect.
Regards
Piotr




Reply via email to