Github user amiara514 commented on the pull request: https://github.com/apache/jena/pull/52#issuecomment-101035985 > But since there is only a relatively small number of Lucene analyzers anyway, maybe this is OK. It's why it's done like this :-) >No, that wouldn't work. You have to use the same analyzer for both indexing and queries (in this case, the language-specific analyzer), otherwise the tokens won't match. Exactly > But I think it should still be possible to share the same index, if you have a field that specifies the language and make sure to target your queries only to the specific language. Store the language as an extra field is easy to do during the document creation (on the addEntity method). Add an extra param in queries is not a problem either (done in my solution). But how to change correctly the existent code to target Lucene taking that extra language into account ?
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---