[ 
https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066426#comment-13066426
 ] 

Lewis John McGibbney commented on NUTCH-657:
--------------------------------------------

I have been unsuccessful in submitting a patch for a file name change as oppose 
to content changes within the file... any pointers please? I am not familiar 
with submitting patches for file name changes.

Yes Markus, non of these files exist within trunk... strange. From doing some 
background reading into the classes I can see that two authors are Sami Siren 
and Jerome Charron. Is there anyone on board that has experience working with 
the language identifier code? This is really the first time I have looked over 
it...

> Estonian N-gram profile has wrong name
> --------------------------------------
>
>                 Key: NUTCH-657
>                 URL: https://issues.apache.org/jira/browse/NUTCH-657
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 0.8.1, 0.9.0
>            Reporter: Jonathan Young
>            Priority: Trivial
>
> The Nutch language identifier plugin contains an ngram profile, ee.ngp, in 
> src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang .  "ee" 
> is the ISO-3166-1-alpha-2 code for Estonia (see 
> http://www.iso.org/iso/country_codes/iso_3166_code_lists/english_country_names_and_code_elements.htm),
>  but it is the ISO-639-2 code for Ewe (see 
> http://www.loc.gov/standards/iso639-2/php/English_list.php).  "et" is the 
> ISO-639-2 code for Estonian, and the language profile in ee.ngp is clearly 
> Estonian.
> Proposed solution: rename ee.ngp to et.ngp .

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to