[GitHub] [nutch] lewismc commented on pull request #233: NUTCH-2449: Replace Tika LanguageIdentifier in language-identifier

2021-12-17 Thread GitBox
lewismc commented on pull request #233: URL: https://github.com/apache/nutch/pull/233#issuecomment-996990713 Closing this PR now in favor of https://github.com/apache/nutch/pull/716 -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [nutch] lewismc commented on pull request #233: NUTCH-2449: Replace Tika LanguageIdentifier in language-identifier

2021-12-15 Thread GitBox
lewismc commented on pull request #233: URL: https://github.com/apache/nutch/pull/233#issuecomment-995421003 OK @sebastian-nagel, yes, I will take it on. I'm nearly finished with my metrics documentation, then I'll come back to this one.

[GitHub] [nutch] lewismc commented on pull request #233: NUTCH-2449: Replace Tika LanguageIdentifier in language-identifier

2021-12-15 Thread GitBox
lewismc commented on pull request #233: URL: https://github.com/apache/nutch/pull/233#issuecomment-994961039 @YossiTamari @sebastian-nagel this has been sitting for way too long. By the looks of the above correspondence, the decision was made to overwrite the logic in language-identifier