When I used command line training tool on my data (training.txt)  it gives 
error as follows:
------------------------------------------------------------------------------------------------------------------------
C:\OpenNLP\apache-opennlp-1.5.1-incubating-bin\apache-opennlp-1.5.1-incubating>java
 -jar lib\opennlp-tools-*.jar TokenNameFinderTrainer -encoding UTF-8 -lang en
 -data trainingFile.txt -model mymodel.bin
Indexing events using cutoff of 5
        Computing event counts...  java.nio.charset.MalformedInputException: 
Input length = 1
Incorporating indexed data for training...
Exception in thread "main" java.lang.NullPointerException
        at opennlp.maxent.GISTrainer.trainModel(GISTrainer.java:272)
        at opennlp.maxent.GISTrainer.trainModel(GISTrainer.java:252)
        at opennlp.maxent.GIS.trainModel(GIS.java:228)
        at opennlp.maxent.GIS.trainModel(GIS.java:179)
        at opennlp.tools.namefind.NameFinderME.train(NameFinderME.java:345)
        at opennlp.tools.namefind.NameFinderME.train(NameFinderME.java:356)
        at opennlp.tools.cmdline.namefind.TokenNameFinderTrainerTool.run(TokenNa
meFinderTrainerTool.java:87)
        at opennlp.tools.cmdline.CLI.main(CLI.java:183) 
---------------------------------------------------------------------------
I do not know what is the problem and this is part of my data in the text file
 
Professor <START> Michael <END> 
Professor <START> Naci  <END>
Dr <START> Richard <END> ( p / t ) 
Dr <START> David  <END>
Professor <START> Vic <END> 
Dr <START> Adrian  <END>
Dr <START> Martin <END>
Dr <START> Timothy  <END>
Dr <START> Ian  <END>
Dr <START> Ali <END> 
-----------------------------------------------------------------------------------------------------------------------
                                          

Reply via email to