Github user amiara514 commented on the pull request:

    https://github.com/apache/jena/pull/53#issuecomment-97955739
  
    Hi Osma,
    > ex:paris rdfs:label "Paris"@fr .
    ex:paris rdfs:label "Paris"@en .
    
    Indeed, both of them will be dropped (although this could be solved by the multi-language proposal).
    
    >Similar things may happen for variants that differ only in letter case 
(which will be folded to lowercase by most analyzers), or singular/plural forms 
that get stemmed into the same base form. These may all cause false matches 
when entries are about to be deleted.
    
    Yes, it works for my needs, but it's true that it's not a general solution.
    
    I don't think storing the original literal outside debug mode is the right approach: it would make the index much larger and hurt performance.
    Instead, storing a smaller identifier generated from a hash of a key (such as uri+literal+language) would likely be a better solution.
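    
    A minimal sketch of such an identifier, assuming SHA-256 as the hash function and a length-prefixed join of the three fields. The class `EntryHash`, its method `of`, and the example URI are hypothetical names used only for illustration; none of them are part of Jena or of this pull request.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper (not part of Jena): derives a fixed-size identifier
// for an index entry from its subject URI, literal text and language tag.
final class EntryHash {
    private EntryHash() {}

    // Hashes uri + literal + language. Each field is length-prefixed so that
    // ("ab", "c") and ("a", "bc") never produce the same input bytes.
    static String of(String uri, String literal, String lang) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            String[] parts = { uri, literal, lang == null ? "" : lang };
            for (String part : parts) {
                byte[] bytes = part.getBytes(StandardCharsets.UTF_8);
                md.update(Integer.toString(bytes.length).getBytes(StandardCharsets.UTF_8));
                md.update((byte) ':');
                md.update(bytes);
            }
            // Render the 32-byte digest as 64 lowercase hex characters.
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException(e);
        }
    }
}
```

    With this sketch, `EntryHash.of("http://example.org/paris", "Paris", "fr")` and the same call with `"en"` give different identifiers, so the two labels from the example above could be told apart on deletion. Because the hash is taken over the original literal rather than the analyzed tokens, variants that differ only in case also get distinct identifiers, while the stored value stays at a fixed 64 characters regardless of literal length.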
    
    Do you agree?



