I am building a classifier with OpenNLP and leveraging JWNLDictionary.  In my 
experiments I am finding after many invocations of the classifier it hangs in 
jwnl.  Specifically doing a read.  I am using WordNet 3.0 (normal princeton 
distro, not stanfords).  



The thread dump is below (well not all of it but the OpenNLP + jwnl part.  In 
the halt case it was trying to work with the word 
"found_r_n_rnhttpsttlc_blablacompost_show_post_full_view_dw_w_b_ar_dxxcfazbay_ppid_rnrn"

Now clearly that isnt a word so I should work on my tokenization :-}  

I tried this in a single thread and tried the default jwnl defined in the 
opennlp pom and also 1.4 rc3

Any pointers would be helpful.

Cheers

C


java.lang.Thread.State: RUNNABLE
          at java.io.RandomAccessFile.read(RandomAccessFile.java:-1)
          at java.io.RandomAccessFile.readLine(RandomAccessFile.java:871)
          at 
net.didion.jwnl.princeton.file.PrincetonRandomAccessDictionaryFile.readLine(PrincetonRandomAccessDictionaryFile.java:48)
          at 
net.didion.jwnl.dictionary.file_manager.FileManagerImpl.getIndexedLinePointer(FileManagerImpl.java:220)
          - locked <0x1071> (a 
net.didion.jwnl.princeton.file.PrincetonRandomAccessDictionaryFile)
          at 
net.didion.jwnl.dictionary.FileBackedDictionary.getIndexWord(FileBackedDictionary.java:171)
          at 
net.didion.jwnl.dictionary.morph.LookupIndexWordOperation.execute(LookupIndexWordOperation.java:15)
          at 
net.didion.jwnl.dictionary.morph.AbstractDelegatingOperation.delegate(AbstractDelegatingOperation.java:47)
          at 
net.didion.jwnl.dictionary.morph.TokenizerOperation.tryAllCombinations(TokenizerOperation.java:131)
          at 
net.didion.jwnl.dictionary.morph.TokenizerOperation.tryAllCombinations(TokenizerOperation.java:102)
          at 
net.didion.jwnl.dictionary.morph.TokenizerOperation.execute(TokenizerOperation.java:75)
          at 
net.didion.jwnl.dictionary.morph.DefaultMorphologicalProcessor$LookupInfo.executeNextOperation(DefaultMorphologicalProcessor.java:172)
          at 
net.didion.jwnl.dictionary.morph.DefaultMorphologicalProcessor.lookupNextBaseForm(DefaultMorphologicalProcessor.java:125)
          at 
net.didion.jwnl.dictionary.morph.DefaultMorphologicalProcessor.lookupAllBaseForms(DefaultMorphologicalProcessor.java:142)
          at 
opennlp.tools.coref.mention.JWNLDictionary.getLemmas(JWNLDictionary.java:99)

Reply via email to