Sergey Kamov created NLPCRAFT-223:
-------------------------------------
Summary: Suspicious log output.
Key: NLPCRAFT-223
URL: https://issues.apache.org/jira/browse/NLPCRAFT-223
Project: NLPCraft
Issue Type: Improvement
Components: probe
Reporter: Sergey Kamov
Assignee: Aaron Radzinski
Fix For: 0.7.4
See NCEnricherLimitSpec.
See the variant below:
Jan-19|17:15:52|INF| NCAsciiTable - (NCAsciiTable.scala:626) - Parsing variant #1 of 3 for: 'handful of a b'
+===================================================================================================================+
| idx | origtext   | lemma      | pos | quoted | stopword | freeword | wordindexes | direct | sparsity | token data |
+===================================================================================================================+
| 0   | handful of | handful of | --- | false  | false    | true     | [0,1]       | true   | 0        |            |
| 1   | A          | a          | DT  | false  | false    | false    | [2]         | true   | 0        | <<A>> id=A |
| 2   | B          | b          | NN  | false  | false    | false    | [3]         | true   | 0        | <<B>> id=B |
+-------------------------------------------------------------------------------------------------------------------+
The first token `handful of` seems strange.
It is not a `stop words` chunk and it isn't `limit`, but it contains 2
split words (word indexes [0,1]).
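
The condition described above can be written as a small check. This is only a sketch for discussion: the `Token` class and `suspicious_tokens` function are hypothetical names that mirror the columns of the logged table and are not part of the NLPCraft API, and the rule itself (free word, not a stop word, more than one word index) is an assumption about what makes the token look wrong.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Token:
    # Hypothetical mirror of the table columns; not an NLPCraft class.
    idx: int
    origtext: str
    stopword: bool
    freeword: bool
    wordindexes: List[int]


def suspicious_tokens(tokens: List[Token]) -> List[Token]:
    """Return free-word, non-stop-word tokens that cover more than one word index."""
    return [
        t for t in tokens
        if t.freeword and not t.stopword and len(t.wordindexes) > 1
    ]


# Values copied from the variant shown in the log output.
variant = [
    Token(0, "handful of", False, True, [0, 1]),
    Token(1, "A", False, False, [2]),
    Token(2, "B", False, False, [3]),
]

for t in suspicious_tokens(variant):
    print(t.idx, repr(t.origtext), t.wordindexes)
```

On the variant above, only token 0 (`handful of`) matches the rule.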
--
This message was sent by Atlassian Jira
(v8.3.4#803005)