janhoy opened a new pull request, #4326: URL: https://github.com/apache/solr/pull/4326
Replace com.cybozu.labs:langdetect (abandoned since 2012) with io.github.azagniotov:language-detection:12.5.2, a maintained fork with an active release history.

The new library bundles its own language profiles, so the 53 profile files previously shipped in the langid module resources are removed. The factory no longer loads profiles at startup; it creates a shared LanguageDetectionOrchestrator instead. The processor converts the field-content Reader to a String and calls orchestrator.detectAll(). `commons-io` was only used for profile loading and is also removed from the langid module dependencies.

Some tests needed reworking because the two libraries behave differently, and because the new library supports more languages, which introduces some ambiguity.

https://issues.apache.org/jira/browse/SOLR-18208

Implemented entirely by Claude Code
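
For reviewers, the Reader-to-String step described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the class name `ReaderDetectSketch`, the helper names, and the `Function<String, List<String>>` stand-in for `orchestrator.detectAll()` are all hypothetical, and the real library's signature and return type may differ.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.util.List;
import java.util.function.Function;

public class ReaderDetectSketch {

  /** Drains the Reader into a String, since the detector takes text rather than a stream. */
  public static String readAll(Reader reader) throws IOException {
    StringWriter out = new StringWriter();
    reader.transferTo(out); // Reader.transferTo(Writer) is available since Java 10
    return out.toString();
  }

  /**
   * Hypothetical glue: detectAll stands in for orchestrator.detectAll().
   * The String-to-List-of-language-codes shape is an assumption made for this sketch.
   */
  public static List<String> detect(Reader content, Function<String, List<String>> detectAll)
      throws IOException {
    return detectAll.apply(readAll(content));
  }

  public static void main(String[] args) throws IOException {
    // Fake detector used only to make the sketch runnable without the real library.
    Function<String, List<String>> fake =
        text -> text.isBlank() ? List.of() : List.of("en");
    System.out.println(detect(new StringReader("Hello world"), fake));
  }
}
```

Reading the whole field into memory keeps the call simple, but it also means very large field values are fully buffered before detection runs.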
