Author: rwesten
Date: Tue Nov 19 14:33:09 2013
New Revision: 1543439
URL: http://svn.apache.org/r1543439
Log:
STANBOL-1211: added documentation for the Minimum Chunk Match Score option for
the FST linking engine
Modified:
stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext
Modified:
stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext
URL:
http://svn.apache.org/viewvc/stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext?rev=1543439&r1=1543438&r2=1543439&view=diff
==============================================================================
---
stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext
(original)
+++
stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext
Tue Nov 19 14:33:09 2013
@@ -154,7 +154,8 @@ The Entity Linking Configuration of this
* __Redirect Field__ _(enhancer.engines.linking.redirectField)_</s>: Note
implemented. __NOTE__ This might not be possible to efficiently implement. When
those redirects need already be considered when building the FST models.
* <s>__Use EntityRankings__
_(enhancer.engines.linking.useEntityRankings)_</s>: This configuration gets
__IGNORED__. EntityRanking based sorting is enabled as soon as the _Entity
Ranking Field_ is configured.
* <s>__Lemma based Matching__ _(enhancer.engines.linking.lemmaMatching)_</s>:
Not Yet implemented
-* <s>__Min Match Score__ _(enhancer.engines.linking.minMatchScore)_</s>: Not
Yet Implemented. The FST linking engine is based on the Lucene Analyzer chains
configured for the _index_ and _store_ field of the FST configuration. Only if
Tokens do match after the Analyzers where applied a Entity is suggested.
+* __Min Match Score__ _(enhancer.engines.linking.minMatchScore)_: The FST
linking engine uses [Levenshtein
distance](http://en.wikipedia.org/wiki/Levenshtein_distance) between the
mention in the text and the best matching label of an Entity. It only adds
suggestions if the match is greater as the configured value. _NOTE_ that this
might filter suggestions of the FST for several reasons but typical reasons are
stemming on short labels as well as case insensitive analyzers combined with
case sensitive matching.
+* __Minimum Chunk Match Score__
_(enhancer.engines.linking.minChunkMatchScore)_: Tags provided by FST linking
are reduced if they do match less as the configured percentage of tokens in a
chunk. Implemented as `TagClusterReducer`.
In addition the following properties are __IGNORED__ as they are not relevant
for the FST Linking Engine: