Revision: 17733
http://sourceforge.net/p/gate/code/17733
Author: ian_roberts
Date: 2014-03-24 20:45:07 +0000 (Mon, 24 Mar 2014)
Log Message:
-----------
More detail on annotations.
Modified Paths:
--------------
gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html
gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html
gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html
gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html
gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html
gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html
gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html
gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html
gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html
gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html
gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html
gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html
gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html
gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html
gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html
gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html
Modified: gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -6,3 +6,50 @@
<p>It is the prototypical information extraction pipeline distributed
with the <a href="http://gate.ac.uk">GATE framework</a> and forms the base of
many more complex GATE-based IE applications.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
+
Modified:
gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -1,4 +1,51 @@
<p>A named entity recognition pipeline that identifies basic entity types,
such
as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em>
amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents
-in the Arabic language.</p>
\ No newline at end of file
+in the Arabic language.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
+
Modified:
gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -1,4 +1,51 @@
<p>A named entity recognition pipeline that identifies basic entity types,
such
as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em>
amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents
-in the French language.</p>
\ No newline at end of file
+in the French language.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
+
Modified:
gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -1,4 +1,51 @@
<p>A named entity recognition pipeline that identifies basic entity types,
such
as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em>
amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents
-in the German language.</p>
\ No newline at end of file
+in the German language.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
+
Modified:
gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -1,4 +1,51 @@
<p>A named entity recognition pipeline that identifies basic entity types,
such
as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em>
amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents
-in the Romanian language.</p>
\ No newline at end of file
+in the Romanian language.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
+
Modified:
gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -5,3 +5,58 @@
<p>This version of the pipeline includes an <em>inflexional gazetteer</em> to
recognise more morphological variants of target names.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Lookup</tt></td>
+ <td>Individual gazetteer lookups – for those lookups that come from
the inflectional gazetteer this includes a "lemma" feature giving the base word
form</td>
+ </tr>
+ <tr>
+ <td><tt>:MSD</tt></td>
+ <td>"Morpho-Syntactic Description" for selected tokens, including features
for "lemma" (the base form of inflected words) and "type" (roughly equivalent
to a part of speech tag in English, though more complex as it encodes features
such as gender, grammatical case, etc.)</td>
+ </tr>
+</table>
+
Modified:
gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -2,3 +2,57 @@
as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em>
amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents
in the Russian language.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Lookup</tt></td>
+ <td>Individual gazetteer lookups</td>
+ </tr>
+ <tr>
+ <td><tt>:MSD</tt></td>
+ <td>"Morpho-Syntactic Description" for selected tokens, including features
for "lemma" (the base form of inflected words) and "type" (roughly equivalent
to a part of speech tag in English, though more complex as it encodes features
such as gender, grammatical case, etc.)</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -7,3 +7,57 @@
recognise more morphological variants of target names, and an
<em>orthomatcher</em> to perform basic coreference resolution based on
orthographic similarity.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Lookup</tt></td>
+ <td>Individual gazetteer lookups – for those lookups that come from
the inflectional gazetteer this includes a "lemma" feature giving the base word
form</td>
+ </tr>
+ <tr>
+ <td><tt>:MSD</tt></td>
+ <td>"Morpho-Syntactic Description" for selected tokens, including features
for "lemma" (the base form of inflected words) and "type" (roughly equivalent
to a part of speech tag in English, though more complex as it encodes features
such as gender, grammatical case, etc.)</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -5,3 +5,57 @@
<p>This version of the pipeline includes an <em>orthomatcher</em> to perform
basic coreference resolution based on orthographic similarity.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Lookup</tt></td>
+ <td>Individual gazetteer lookups</td>
+ </tr>
+ <tr>
+ <td><tt>:MSD</tt></td>
+ <td>"Morpho-Syntactic Description" for selected tokens, including features
for "lemma" (the base form of inflected words) and "type" (roughly equivalent
to a part of speech tag in English, though more complex as it encodes features
such as gender, grammatical case, etc.)</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -3,3 +3,17 @@
<a href="http://opennlp.apache.org/">Apache OpenNLP</a>. The components are
based on the maxent machine learning algorithm, and produce Token and Sentence
annotations in a form compatible with other standard GATE tools.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
Modified: gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -4,3 +4,45 @@
<a href="http://opennlp.apache.org/">Apache OpenNLP</a>. The components are
based on the maxent machine learning algorithm, and produce Token and Sentence
annotations in a form compatible with other standard GATE tools.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percentage</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Time</tt></td>
+ <td>Time expressions</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS and
"chunk" feature for for the I/O/B-style chunk tags. Complete chunks derived
from the tags are also available as their respective annotation types (e.g. a
sequence of tokens tagged B-NP, I-NP, I-NP gives rise to an "NP" annotation
spanning the sequence).</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -4,3 +4,45 @@
<a href="http://opennlp.apache.org/">Apache OpenNLP</a>. The components are
based on the maxent machine learning algorithm, and produce Token and Sentence
annotations in a form compatible with other standard GATE tools.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percentage</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Time</tt></td>
+ <td>Time expressions</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS and
"chunk" feature for for the I/O/B-style chunk tags. Complete chunks derived
from the tags are also available as their respective annotation types (e.g. a
sequence of tokens tagged B-NP, I-NP, I-NP gives rise to an "NP" annotation
spanning the sequence).</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -6,3 +6,13 @@
<p>For full details, see Tanabe and Wilbur (2002) "Tagging gene and protein
names in biomedical text", <em>Bioinformatics</em> 18(8):1124&endash;1132,
<a
href="http://dx.doi.org/10.1093/bioinformatics/18.8.1124">doi:10.1093/bioinformatics/18.8.1124</a>.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Gene</tt></td>
+ <td>Expressions denoting genes</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -3,3 +3,24 @@
<p>This application tags English language text using the
<a
href="http://code.google.com/p/hunpos/downloads/detail?name=en_wsj.model.gz">en_wsj</a>
model from the Hunpos distribution.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -3,3 +3,24 @@
<p>This application tags Hungarian language text using the
<a
href="http://code.google.com/p/hunpos/downloads/detail?name=hu_szeged_kr.model.gz">hu_szeged_kr</a>
model from the Hunpos distribution.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -6,3 +6,57 @@
<p>This pipeline combines the basic ANNIE named entity system with taggers to
recognise numeric expressions (digits and words) and to annotate and normalise
measurement expressions with features giving their value in SI units.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td><tt>:Measurement</tt></td>
+ <td>Measurement expressions, with features giving the value and unit of
the measurement, both in the original form specified in the document and in a
form normalized to SI units</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td><tt>:Ratio</tt></td>
+ <td>Expressions denoting a ratio rather than a simple measurement,
typically percentages but also expressions like "300 parts per million"</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -6,3 +6,58 @@
one unit matching results expressed in another.</p>
<p>As a side-effect this pipeline also annotates tokens and sentences.</p>
+
+<table>
+ <tr>
+ <td colspan="4"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td colspan="2"><tt>:Measurement</tt></td>
+ <td colspan="2">Measurement expressions, with features:</td>
+ </tr>
+ <tr>
+ <td> </td>
+ <td colspan="2"><tt>type</tt></td>
+ <td>"scalar" for single measurements, or "interval" for intervals (e.g. "1
to 5 pounds")</td>
+ </tr>
+ <tr>
+ <td> </td>
+ <td colspan="2"><tt>unit</tt></td>
+ <td>The unit of the measurement (gram, mile, ...)</td>
+ </tr>
+ <tr>
+ <td> </td>
+ <td colspan="2"><tt>value</tt></td>
+ <td>The numeric value of the measurement quantity as specified in the
text</td>
+ </tr>
+ <tr>
+ <td> </td>
+ <td colspan="2"><tt>normalizedUnit</tt></td>
+ <td>The "normalized" unit for the measurement in the SI system (kilogram,
metre, etc.)</td>
+ </tr>
+ <tr>
+ <td> </td>
+ <td colspan="2"><tt>normalizedValue</tt></td>
+ <td>The equivalent value of the measurement in the normalized unit. For
interval measurements this is replaced by a "normalizedMaxValue" and
"normalizedMinValue" giving the end-points of the interval.</td>
+ </tr>
+ <tr>
+ <td> </td>
+ <td colspan="2"><tt>dimension</tt></td>
+ <td>Speed, volume, area, time, etc.</td>
+ </tr>
+ <tr>
+ <td colspan="4"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td colspan="2"><tt>:Sentence</tt></td>
+ <td colspan="2">Sentences detected by the sentence splitter</td>
+ </tr>
+ <tr>
+ <td colspan="2"><tt>:Token</tt></td>
+ <td colspan="2">The individual tokens of the text</td>
+ </tr>
+ <tr>
+ <td colspan="2"><tt>:Ratio</tt></td>
+ <td colspan="2">Expressions denoting a ratio rather than a simple
measurement, typically percentages but also expressions like "300 parts per
million"</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -2,3 +2,28 @@
phrases with a <em>NounChunk</em> annotation. This application also includes a
tokeniser, sentence splitter and POS tagger as these are required by the
chunking algorithm.</p>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:NounChunk</tt></td>
+ <td>Noun chunks discovered by the chunker</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html
===================================================================
---
gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++
gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -13,3 +13,40 @@
</ul>
</li>
</ul>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:gene</tt></td>
+ <td>Expressions denoting genes</td>
+ </tr>
+ <tr>
+ <td><tt>:malignancy-type</tt></td>
+ <td>Expressions denoting malignancy types</td>
+ </tr>
+ <tr>
+ <td><tt>:location</tt></td>
+ <td rowspan="5">Expressions relating to genomic variation</td>
+ </tr>
+ <tr>
+ <td><tt>:state-original</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:state-altered</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:variation</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:type</tt></td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text</td>
+ </tr>
+</table>
Modified:
gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html
===================================================================
--- gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html
2014-03-24 19:35:52 UTC (rev 17732)
+++ gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html
2014-03-24 20:45:07 UTC (rev 17733)
@@ -10,3 +10,65 @@
<em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em>
amounts, <em>Time</em> and <em>Date</em> expressions.</li>
</ul>
+
+<table>
+ <tr>
+ <td colspan="2"><b>Default annotations</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Person</tt></td>
+ <td rowspan="4">Standard named entity types</td>
+ </tr>
+ <tr>
+ <td><tt>:Location</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Organization</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Date</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Address</tt></td>
+ <td>Includes email and IP addresses as well as street addresses</td>
+ </tr>
+ <tr>
+ <td><tt>:Token</tt></td>
+ <td>The individual tokens of the text, with "category" feature for POS</td>
+ </tr>
+ <tr>
+ <td><tt>:Emoticon</tt></td>
+ <td>Emoticons such as <tt>:-)</tt></td>
+ </tr>
+ <tr>
+ <td><tt>:Hashtag</tt></td>
+ <td>Hashtags, including the leading # character</td>
+ </tr>
+ <tr>
+ <td><tt>:URL</tt></td>
+ <td>URL mentions</td>
+ </tr>
+ <tr>
+ <td><tt>:UserID</tt></td>
+ <td>The username part of @user mentions, <em>not</em> including the
leading @ sign</td>
+ </tr>
+ <tr>
+ <td colspan="2"><b>Additional annotations available if selected</b></td>
+ </tr>
+ <tr>
+ <td><tt>:Money</tt></td>
+ <td>Monetary amounts</td>
+ </tr>
+ <tr>
+ <td><tt>:Percent</tt></td>
+ <td>Expressions representing percentages</td>
+ </tr>
+ <tr>
+ <td><tt>:SpaceToken</tt></td>
+ <td>The spaces between tokens</td>
+ </tr>
+ <tr>
+ <td><tt>:Sentence</tt></td>
+ <td>Sentences detected by the sentence splitter</td>
+ </tr>
+</table>
This was sent by the SourceForge.net collaborative development platform, the
world's largest Open Source development site.
------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/13534_NeoTech
_______________________________________________
GATE-cvs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/gate-cvs