See <https://builds.apache.org/job/OpenNLP_java8/org.apache.opennlp$opennlp-tools/115/changes>
Changes:
[tommaso] OPENNLP-777 - naive bayes classifier (patch from Cohan Sujay Carlos)
[joern] OPENNLP-799
The NewlineSentenceDetector does not work because of a small comparison error.
The patch fixes the problem, and also adds a class with some tests.
Thanks to Gustavo Knuppe for providing a patch.
------------------------------------------
[...truncated 7534 lines...]
Done indexing.
Incorporating indexed data for training...
done.
Number of Event Tokens: 1493
Number of Outcomes: 2
Number of Predicates: 3564
...done.
Computing model parameters ...
Performing 100 iterations.
1: ... loglikelihood=-1147.1585838266672 0.8507552870090634
2: ... loglikelihood=-472.967500991906 0.9601208459214502
3: ... loglikelihood=-291.07723687760796 0.9848942598187311
4: ... loglikelihood=-206.26897413824335 0.9897280966767371
5: ... loglikelihood=-158.3922436217585 0.9903323262839879
6: ... loglikelihood=-128.24522995997793 0.9909365558912386
7: ... loglikelihood=-107.74217629199612 0.9909365558912386
8: ... loglikelihood=-92.97035107242277 0.9915407854984895
9: ... loglikelihood=-81.84501995931943 0.9933534743202417
10: ... loglikelihood=-73.16874284671613 0.9933534743202417
11: ... loglikelihood=-66.21090895123683 0.9939577039274925
12: ... loglikelihood=-60.50338961954675 0.9939577039274925
13: ... loglikelihood=-55.733845669524406 0.9939577039274925
14: ... loglikelihood=-51.68653772919517 0.9939577039274925
15: ... loglikelihood=-48.207847831607644 0.9939577039274925
16: ... loglikelihood=-45.18539821693645 0.9945619335347432
17: ... loglikelihood=-42.53498749725327 0.9945619335347432
18: ... loglikelihood=-40.19217936680927 0.9945619335347432
19: ... loglikelihood=-38.10674165383808 0.9945619335347432
20: ... loglikelihood=-36.23887732676898 0.9945619335347432
21: ... loglikelihood=-34.55660952475116 0.9945619335347432
22: ... loglikelihood=-33.03392703100277 0.9945619335347432
23: ... loglikelihood=-31.649442000456848 0.9945619335347432
24: ... loglikelihood=-30.385400121524597 0.9945619335347432
25: ... loglikelihood=-29.226938178083326 0.9945619335347432
26: ... loglikelihood=-28.161518597897903 0.9951661631419939
27: ... loglikelihood=-27.17849286226126 0.9951661631419939
28: ... loglikelihood=-26.268760267607313 0.9969788519637462
29: ... loglikelihood=-25.42449829026183 0.997583081570997
30: ... loglikelihood=-24.63894744084868 0.997583081570997
31: ... loglikelihood=-23.906238084701968 0.997583081570997
32: ... loglikelihood=-23.221249932839708 0.997583081570997
33: ... loglikelihood=-22.579497214621284 0.9981873111782478
34: ... loglikelihood=-21.97703421569259 0.9981873111782478
35: ... loglikelihood=-21.41037709412822 0.9981873111782478
36: ... loglikelihood=-20.876438802575137 0.9981873111782478
37: ... loglikelihood=-20.37247463286615 0.9987915407854985
38: ... loglikelihood=-19.896036423328383 0.9987915407854985
39: ... loglikelihood=-19.444933871098907 0.9987915407854985
40: ... loglikelihood=-19.0172017030897 0.9987915407854985
41: ... loglikelihood=-18.6110717021854 0.9987915407854985
42: ... loglikelihood=-18.22494877620019 0.9987915407854985
43: ... loglikelihood=-17.85739040818644 0.9987915407854985
44: ... loglikelihood=-17.507088946936864 0.9987915407854985
45: ... loglikelihood=-17.172856292797512 0.9987915407854985
46: ... loglikelihood=-16.85361061139514 0.9987915407854985
47: ... loglikelihood=-16.548364770570913 0.9987915407854985
48: ... loglikelihood=-16.256216246763078 0.9987915407854985
49: ... loglikelihood=-15.976338288690007 0.9987915407854985
50: ... loglikelihood=-15.707972160299303 0.9987915407854985
51: ... loglikelihood=-15.45042031304788 0.9987915407854985
52: ... loglikelihood=-15.203040360799628 0.9987915407854985
53: ... loglikelihood=-14.965239749903033 0.9987915407854985
54: ... loglikelihood=-14.736471033061646 0.9987915407854985
55: ... loglikelihood=-14.516227669024024 0.9987915407854985
56: ... loglikelihood=-14.304040281369872 0.9987915407854985
57: ... loglikelihood=-14.099473319130976 0.9987915407854985
58: ... loglikelihood=-13.902122069972096 0.9987915407854985
59: ... loglikelihood=-13.711609983415938 0.9987915407854985
60: ... loglikelihood=-13.52758626733365 0.9987915407854985
61: ... loglikelihood=-13.349723725807632 0.9987915407854985
62: ... loglikelihood=-13.177716810642115 0.9987915407854985
63: ... loglikelihood=-13.011279862365202 0.9987915407854985
64: ... loglikelihood=-12.850145519627702 0.9987915407854985
65: ... loglikelihood=-12.694063278537318 0.9987915407854985
66: ... loglikelihood=-12.542798185737231 0.9987915407854985
67: ... loglikelihood=-12.396129650999749 0.9987915407854985
68: ... loglikelihood=-12.253850366805954 0.9987915407854985
69: ... loglikelihood=-12.11576532385566 0.9987915407854985
70: ... loglikelihood=-11.981690912737507 0.9987915407854985
71: ... loglikelihood=-11.851454103104592 0.9987915407854985
72: ... loglikelihood=-11.724891692680847 0.9987915407854985
73: ... loglikelihood=-11.601849619275114 0.9987915407854985
74: ... loglikelihood=-11.482182329732385 0.9987915407854985
75: ... loglikelihood=-11.365752200407057 0.9987915407854985
76: ... loglikelihood=-11.2524290043248 0.9987915407854985
77: ... loglikelihood=-11.142089420708611 0.9987915407854985
78: ... loglikelihood=-11.034616582996058 0.9987915407854985
79: ... loglikelihood=-10.929899661873668 0.9987915407854985
80: ... loglikelihood=-10.827833480207184 0.9987915407854985
81: ... loglikelihood=-10.728318157060322 0.9987915407854985
82: ... loglikelihood=-10.631258778272858 0.9987915407854985
83: ... loglikelihood=-10.536565091316653 0.9987915407854985
84: ... loglikelihood=-10.444151222369774 0.9987915407854985
85: ... loglikelihood=-10.353935413745008 0.9987915407854985
86: ... loglikelihood=-10.265839779986413 0.9987915407854985
87: ... loglikelihood=-10.179790081104313 0.9987915407854985
88: ... loglikelihood=-10.095715511561208 0.9987915407854985
89: ... loglikelihood=-10.013548503747355 0.9987915407854985
90: ... loglikelihood=-9.933224544799275 0.9987915407854985
91: ... loglikelihood=-9.854682005716311 0.9987915407854985
92: ... loglikelihood=-9.777861981823415 0.9987915407854985
93: ... loglikelihood=-9.70270814371098 0.9987915407854985
94: ... loglikelihood=-9.629166597858127 0.9987915407854985
95: ... loglikelihood=-9.557185756213398 0.9987915407854985
96: ... loglikelihood=-9.486716214069075 0.9987915407854985
97: ... loglikelihood=-9.417710635619596 0.9987915407854985
98: ... loglikelihood=-9.350123646646741 0.9987915407854985
99: ... loglikelihood=-9.283911733818757 0.9987915407854985
100: ... loglikelihood=-9.219033150132912 0.9987915407854985
Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.388 sec - in opennlp.tools.tokenize.TokenizerMETest
Running opennlp.tools.tokenize.TokSpanEventStreamTest
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0 sec - in opennlp.tools.tokenize.TokSpanEventStreamTest
Running opennlp.tools.tokenize.DictionaryDetokenizerTest
Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.003 sec - in opennlp.tools.tokenize.DictionaryDetokenizerTest
Running opennlp.tools.tokenize.TokenizerModelTest
Indexing events using cutoff of 0
Computing event counts... done. 9 events
Indexing... done.
Sorting and merging events... done. Reduced 9 events to 9.
Done indexing.
Incorporating indexed data for training...
done.
Number of Event Tokens: 9
Number of Outcomes: 2
Number of Predicates: 60
...done.
Computing model parameters ...
Performing 100 iterations.
1: ... loglikelihood=-6.238324625039508 0.6666666666666666
2: ... loglikelihood=-4.267390099971502 1.0
3: ... loglikelihood=-3.2418789317069647 1.0
4: ... loglikelihood=-2.6098065824093446 1.0
5: ... loglikelihood=-2.181429732130584 1.0
6: ... loglikelihood=-1.872334360148705 1.0
7: ... loglikelihood=-1.6390415672400755 1.0
8: ... loglikelihood=-1.4568638470768123 1.0
9: ... loglikelihood=-1.3107519597433812 1.0
10: ... loglikelihood=-1.191017196220442 1.0
11: ... loglikelihood=-1.0911445131976356 1.0
12: ... loglikelihood=-1.0065942255608578 1.0
13: ... loglikelihood=-0.9341077561309318 1.0
14: ... loglikelihood=-0.8712865048909157 1.0
15: ... loglikelihood=-0.8163263298695127 1.0
16: ... loglikelihood=-0.7678445247739041 1.0
17: ... loglikelihood=-0.724763808132552 1.0
18: ... loglikelihood=-0.6862325748265687 1.0
19: ... loglikelihood=-0.6515688567637903 1.0
20: ... loglikelihood=-0.6202201672253774 1.0
21: ... loglikelihood=-0.5917342195885426 1.0
22: ... loglikelihood=-0.565737237090749 1.0
23: ... loglikelihood=-0.5419176553733099 1.0
24: ... loglikelihood=-0.5200137175054115 1.0
25: ... loglikelihood=-0.4998039195471357 1.0
26: ... loglikelihood=-0.4810995714417062 1.0
27: ... loglikelihood=-0.46373894685403827 1.0
28: ... loglikelihood=-0.4475826400064324 1.0
29: ... loglikelihood=-0.432509848920957 1.0
30: ... loglikelihood=-0.4184153765692203 1.0
31: ... loglikelihood=-0.4052071933464748 1.0
32: ... loglikelihood=-0.39280444210887405 1.0
33: ... loglikelihood=-0.3811357948648633 1.0
34: ... loglikelihood=-0.37013809092929856 1.0
35: ... loglikelihood=-0.3597552019054489 1.0
36: ... loglikelihood=-0.34993708064463436 1.0
37: ... loglikelihood=-0.34063896033509855 1.0
38: ... loglikelihood=-0.33182067680170807 1.0
39: ... loglikelihood=-0.3234460924726966 1.0
40: ... loglikelihood=-0.31548260466707856 1.0
41: ... loglikelihood=-0.30790072415618486 1.0
42: ... loglikelihood=-0.3006737125631292 1.0
43: ... loglikelihood=-0.2937772692413833 1.0
44: ... loglikelihood=-0.2871892599360768 1.0
45: ... loglikelihood=-0.2808894808692854 1.0
46: ... loglikelihood=-0.27485945297234643 1.0
47: ... loglikelihood=-0.26908224186739715 1.0
48: ... loglikelihood=-0.2635422999181489 1.0
49: ... loglikelihood=-0.25822532725861264 1.0
50: ... loglikelihood=-0.2531181491933867 1.0
51: ... loglikelihood=-0.248208607764115 1.0
52: ... loglikelihood=-0.24348546560965584 1.0
53: ... loglikelihood=-0.2389383205249888 1.0
54: ... loglikelihood=-0.23455752935593194 1.0
55: ... loglikelihood=-0.23033414006154845 1.0
56: ... loglikelihood=-0.22625983094012614 1.0
57: ... loglikelihood=-0.2223268561531997 1.0
58: ... loglikelihood=-0.21852799679951024 1.0
59: ... loglikelihood=-0.21485651689062477 1.0
60: ... loglikelihood=-0.21130612366500146 1.0
61: ... loglikelihood=-0.20787093175003185 1.0
62: ... loglikelihood=-0.20454543074392073 1.0
63: ... loglikelihood=-0.20132445584283049 1.0
64: ... loglikelihood=-0.19820316118486242 1.0
65: ... loglikelihood=-0.19517699562229196 1.0
66: ... loglikelihood=-0.1922416806679774 1.0
67: ... loglikelihood=-0.18939319039177877 1.0
68: ... loglikelihood=-0.18662773306883887 1.0
69: ... loglikelihood=-0.18394173440426667 1.0
70: ... loglikelihood=-0.18133182217853244 1.0
71: ... loglikelihood=-0.17879481217523166 1.0
72: ... loglikelihood=-0.1763276952680289 1.0
73: ... loglikelihood=-0.17392762555694571 1.0
74: ... loglikelihood=-0.17159190945588323 1.0
75: ... loglikelihood=-0.16931799564360095 1.0
76: ... loglikelihood=-0.1671034657995311 1.0
77: ... loglikelihood=-0.16494602605384556 1.0
78: ... loglikelihood=-0.16284349908838996 1.0
79: ... loglikelihood=-0.1607938168314145 1.0
80: ... loglikelihood=-0.15879501369470103 1.0
81: ... loglikelihood=-0.15684522030668507 1.0
82: ... loglikelihood=-0.15494265769967447 1.0
83: ... loglikelihood=-0.1530856319132385 1.0
84: ... loglikelihood=-0.15127252897944293 1.0
85: ... loglikelihood=-0.149501810258773 1.0
86: ... loglikelihood=-0.1477720080984901 1.0
87: ... loglikelihood=-0.1460817217877075 1.0
88: ... loglikelihood=-0.14442961378581715 1.0
89: ... loglikelihood=-0.14281440620295088 1.0
90: ... loglikelihood=-0.14123487751306105 1.0
91: ... loglikelihood=-0.1396898594818759 1.0
92: ... loglikelihood=-0.13817823429353202 1.0
93: ... loglikelihood=-0.13669893186105736 1.0
94: ... loglikelihood=-0.13525092730712387 1.0
95: ... loglikelihood=-0.13383323860263727 1.0
96: ... loglikelihood=-0.13244492435174648 1.0
97: ... loglikelihood=-0.13108508171279148 1.0
98: ... loglikelihood=-0.12975284444556362 1.0
99: ... loglikelihood=-0.12844738107601078 1.0
100: ... loglikelihood=-0.12716789317023386 1.0
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.089 sec - in opennlp.tools.tokenize.TokenizerModelTest
Running opennlp.tools.stemmer.PorterStemmerTest
Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.002 sec - in opennlp.tools.stemmer.PorterStemmerTest
Results :
Tests run: 394, Failures: 0, Errors: 0, Skipped: 1
[JENKINS] Recording test results
log4j:WARN No appenders could be found for logger (org.apache.commons.beanutils.converters.BooleanConverter).
log4j:WARN Please initialize the log4j system properly.
[INFO]
[INFO] --- maven-bundle-plugin:2.3.4:bundle (default-bundle) @ opennlp-tools ---
[INFO]
[INFO] --- maven-site-plugin:3.4:attach-descriptor (attach-descriptor) @ opennlp-tools ---
[INFO]
[INFO] --- maven-javadoc-plugin:2.9.1:jar (create-javadoc-jar) @ opennlp-tools ---
[INFO]
3 errors
100 warnings