Cached.jsp has a bug with encoding
----------------------------------

                 Key: NUTCH-910
                 URL: https://issues.apache.org/jira/browse/NUTCH-910
             Project: Nutch
          Issue Type: Bug
          Components: web gui
    Affects Versions: 1.0.0
         Environment: Any environment
            Reporter: Attila Pados
            Priority: Minor
             Fix For: 1.0.0


cached.jsp

For pages that use a non-default encoding (not UTF-8, etc.), the cached content 
is displayed garbled. This is quite annoying, but it does not critically harm 
functionality.

add      : Metadata parseData = bean.getParseData(details).getParseMeta();
original : Metadata metaData = bean.getParseData(details).getContentMeta();

replace  : String encoding = (String) parseData.get("CharEncodingForConversion");

In cached.jsp, the encoding variable is read from the wrong metadata source, 
contentMeta, which doesn't include this value.
The value resides in the parse metadata instead.

The first line above is not a replacement; it has to be added. The original 
metaData is still needed there for other things.
The encoding line further down is the one that has to be changed; that one is 
a replacement.
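
To illustrate why the lookup source matters, here is a minimal standalone 
sketch. It does not use the Nutch classes: plain Map objects stand in for the 
content metadata and the parse metadata, and the class name, the method names, 
and the ISO-8859-1 fallback are assumptions made only for this example. Only 
the key "CharEncodingForConversion" is taken from the change above.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the lookup done in cached.jsp; not Nutch code.
class CachedEncodingSketch {
    // Key name taken from the change described in this issue.
    static final String ENCODING_KEY = "CharEncodingForConversion";
    // Assumed fallback for this sketch only.
    static final String FALLBACK_ENCODING = "ISO-8859-1";

    // Returns the encoding stored under ENCODING_KEY, or the fallback when
    // the key is missing or names a charset this JVM does not support.
    static String resolveEncoding(Map<String, String> meta) {
        String encoding = meta.get(ENCODING_KEY);
        if (encoding == null) {
            return FALLBACK_ENCODING;
        }
        try {
            return Charset.isSupported(encoding) ? encoding : FALLBACK_ENCODING;
        } catch (IllegalArgumentException e) {
            // Thrown for syntactically illegal charset names.
            return FALLBACK_ENCODING;
        }
    }

    // Decodes the raw cached bytes with the given charset name.
    static String decode(byte[] raw, String encoding) {
        return new String(raw, Charset.forName(encoding));
    }

    public static void main(String[] args) {
        // Raw bytes of a UTF-8 page containing a single e-acute.
        byte[] raw = "\u00e9".getBytes(StandardCharsets.UTF_8);

        // Stand-in for the content metadata: no encoding entry here.
        Map<String, String> contentMeta = new HashMap<>();
        contentMeta.put("Content-Type", "text/html");

        // Stand-in for the parse metadata: this is where the value lives.
        Map<String, String> parseMeta = new HashMap<>();
        parseMeta.put(ENCODING_KEY, "UTF-8");

        // Wrong source: the lookup misses, the fallback is used, and the
        // two UTF-8 bytes are decoded as two separate characters.
        String wrong = decode(raw, resolveEncoding(contentMeta));
        // Right source: the stored encoding is found and used.
        String right = decode(raw, resolveEncoding(parseMeta));

        System.out.println("from content metadata: " + wrong.length() + " chars");
        System.out.println("from parse metadata:   " + right.length() + " chars");
    }
}
```

With the content metadata stand-in, the single character comes back as two 
garbled characters; with the parse metadata stand-in, it comes back intact.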

This fix is for Nutch version 1.0. I didn't find an issue in the list that 
would cover this, just a mail found with Google, on a mailing list, that 
referred to it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.