Mayya Sharipova created LUCENE-8100:
---------------------------------------

             Summary: Error on reindex using WordNet synonyms file
                 Key: LUCENE-8100
                 URL: https://issues.apache.org/jira/browse/LUCENE-8100
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
    Affects Versions: 7.0.1
            Reporter: Mayya Sharipova
            Priority: Minor


Originally reported in the Elasticsearch issue tracker:
https://github.com/elastic/elasticsearch/issues/27798#issuecomment-351838983

but it looks like the issue was introduced in Lucene 7.0.x.

Copying the user's issue here:

------------------------------------------------------

I'm encountering the following error on indexing when trying to use the wn_s.pl 
synonyms file (which I've moved to /usr/local/etc/elasticsearch):


{code:javascript}
{
        "error": {
                "root_cause": [{
                        "type": "illegal_argument_exception",
                        "reason": "failed to build synonyms"
                }],
                "type": "illegal_argument_exception",
                "reason": "failed to build synonyms",
                "caused_by": {
                        "type": "parse_exception",
                        "reason": "Invalid synonym rule at line 2",
                        "caused_by": {
                                "type": "illegal_argument_exception",
                                "reason": "term: physical entity analyzed to a token with posinc != 1"
                        }
                }
        }
}
{code}
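
The nested "caused_by" entries in the response above can be flattened to get at the innermost reason. A minimal Python sketch, assuming the response body has exactly the shape shown; cause_chain is a hypothetical helper written for illustration, not part of Elasticsearch or Lucene:

```python
import json

def cause_chain(error):
    """Walk the nested "caused_by" objects, outermost first.

    Hypothetical helper; assumes each level is a dict with "type",
    "reason" and an optional "caused_by", as in the response above.
    """
    chain = []
    node = error
    while node is not None:
        chain.append((node.get("type"), node.get("reason")))
        node = node.get("caused_by")
    return chain

# Response body copied from the report (root_cause omitted for brevity).
response = json.loads("""
{
  "error": {
    "type": "illegal_argument_exception",
    "reason": "failed to build synonyms",
    "caused_by": {
      "type": "parse_exception",
      "reason": "Invalid synonym rule at line 2",
      "caused_by": {
        "type": "illegal_argument_exception",
        "reason": "term: physical entity analyzed to a token with posinc != 1"
      }
    }
  }
}
""")

chain = cause_chain(response["error"])
for depth, (exc_type, reason) in enumerate(chain):
    print("  " * depth + f"{exc_type}: {reason}")
```

The last entry of the chain is the message about the term "physical entity", which is the part relevant to Lucene.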

Here's the line it's objecting to:

s(100001930,1,'physical entity',n,1,0).

I'm using the WordNet Prolog synonyms file from
http://wordnetcode.princeton.edu/3.0/WNprolog-3.0.tar.gz2
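
For reference, the rejected line is an s fact from wn_s.pl, whose fields are documented in the WordNet Prolog database format as s(synset_id, w_num, 'word', ss_type, sense_number, tag_count). A small Python sketch that splits such a line into its fields, to make it easier to see what text is handed to the analyzer; parse_s_fact and the regular expression are assumptions made for illustration and are not how WordnetSynonymParser is implemented:

```python
import re

# One s/6 fact: s(synset_id,w_num,'word',ss_type,sense_number,tag_count).
S_FACT = re.compile(
    r"^s\((\d+),(\d+),'(.*)',([a-z]),(\d+),(\d+)\)\.\s*$"
)

def parse_s_fact(line):
    """Return the fields of one wn_s.pl line as a dict, or None if it does not match.

    Hypothetical helper for inspection only.
    """
    m = S_FACT.match(line.strip())
    if m is None:
        return None
    synset_id, w_num, word, ss_type, sense_number, tag_count = m.groups()
    return {
        "synset_id": int(synset_id),
        "w_num": int(w_num),
        # Prolog escapes a single quote inside an atom by doubling it.
        "word": word.replace("''", "'"),
        "ss_type": ss_type,
        "sense_number": int(sense_number),
        "tag_count": int(tag_count),
    }

fact = parse_s_fact("s(100001930,1,'physical entity',n,1,0). ")
print(fact["word"], len(fact["word"].split()))  # physical entity 2
```

The word field here is a two-word term, which matches the "term: physical entity" in the innermost error message. The sketch only shows what the input looks like; it does not establish why the analyzer reports posinc != 1 for it.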
------------------------------------------------------

It looks like the error comes from Lucene's *WordnetSynonymParser* and 
*SynonymMap* classes, and from changes introduced in Lucene 7.0.




