Please read "the Entity linking is for Entities in the Text"

On Tue, Sep 23, 2014 at 2:20 AM, Bhoomin Pandya <bhoominpan...@gmail.com> wrote:
> Hi Rupert:
>
> I agree with you that:
> 1. the Entity linking is for Entities the Text
> 2. the Categorization is at Content or Pages Level
>
> Could you help me with following please:
>
> Does Categorization at content or page level mean categorization at URI level?
>
> Are "context' along with the triples (N-Quads) allowed in Stanbol
>
> Best Regards,
>
> Bhoomin Pandya
>
> On Mon, Sep 22, 2014 at 8:07 PM, Maatari Daniel Okouya
> <okouy...@yahoo.fr> wrote:
>> I understand better.
>>
>> I think the key sentence here was: “Important is that Entity Linking 
>> requires an actual mention of the
>> Entity in the text while categories do not depend on such mentions. "
>>
>>
>> -So basically wether the category is based on a SKOS DataSet or Not, this 
>> does not matter at all  !!!
>>
>> -In both case they link to a dataset, it does not matter if it is SKOS based 
>> or not. The difference is how the entity to which we link comes up.
>>
>>
>>
>> Few questions here if you don’t mind. I’m not trying to reemployment things 
>> here, but simply to better understand things so i can use the tool properly.
>>
>>
>> 1) How would the information of a specific category set be fetch ? The 
>> process of linking in categorisation must be different, in that you do not 
>> have the type to guide you. You may well end up with synonyms, without the 
>> type erros would occurs. I can see why using a controlled vocabulary would 
>> be more easy. There, the disambiguation is within the label directly.
>> Would you confirm my assumption here ? That categorisation with a Skos based 
>> dataset (thesarus) is more easy ?
>>
>> 2) Is the reason for the Named Entity Recognition to limit itself to these 
>> three specific Type “Pertinence” ? Also would this type be customisable, 
>> meaning could you have a bit more types ?
>>
>>
>>
>> 3)  What i want to achieve is describing some content resource according to 
>> schema.org. For creativeWork, it has the property “schema:about” which must 
>> point to a “schema:Thing”. I presume by that, google is expecting here, 
>> something else than a controlled Concept. I’m not saying that it is not 
>> possible. In the sameWay, with FOAF:Topic that i would also use, I want to 
>> point to the real thing rather than a control vocabulary Concept. I would 
>> rather use, dc:subject for the SKOS:Concept.  Does it make sense? Can the 
>> enhancement indeed, categorise according to non-skos instance, that are in 
>> an external dataset?
>>
>>
>> Many thanks,
>>
>> Maatari
>>
>>
>>
>> --
>> Maatari Daniel Okouya
>> Sent with Airmail
>>
>> On 22 Sep 2014 at 06:49:14, Rupert Westenthaler 
>> (rupert.westentha...@gmail.com) wrote:
>>
>> Hi Maatari,
>>
>> On Mon, Sep 22, 2014 at 8:22 AM, Maatari Daniel Okouya
>> <okouy...@yahoo.fr> wrote:
>>> I’m a bit confused about few concept. Could someone clarify them a bit.
>>>
>>>
>>> When it comes to assigning some topics to a content resource, what would be 
>>> the difference between entity linking and categorization ?
>>>
>>
>> First lets explain the terminology as used by Stanbol. For that I will
>> use a todays headline:
>>
>> "Lewis Hamilton not thinking about title after winning Singapore GP"
>>
>> Named Entity Recognition: Detects mentions of Entity types within the
>> text. Typically Persons, Organizations and Locations
>> * Lewis Hamilton -> person
>> * Singapore -> location
>>
>> Entity Linking: Detects mentions of known Entities within the processed Text
>> * Lewis Hamilton -> http://en.wikipedia.org/wiki/Lewis_Hamilton
>> * Singapore Grand Prix -> http://en.wikipedia.org/wiki/Singapore_Grand_Prix
>>
>> Categorization: Assigns the content to a fixed set of categories.
>> Categories might be hierarchical. A typical example are the IPTC Media
>> Topics [1] which I will use for this example.
>> * sport -> http://cv.iptc.org/newscodes/mediatopic/15000000
>> * Formula One -> http://cv.iptc.org/newscodes/mediatopic/20000994
>>
>> Important is that Entity Linking requires an actual mention of the
>> Entity in the text while categories do not depend on such mentions.
>>
>>> What I see as of now, within some tools well established is the 
>>> classification part. Usually it makes use of a control vocabulary to 
>>> classify the content. Output = resource dc:Subject controledVocabularyTerm
>>>
>>> However, what i also see in the description of content resource online 
>>> within some authority website is to link the document to external non skos 
>>> resource via for instance the Foaf:Topic.
>>>
>>> In that second case, do we have both an entity linking and a classification 
>>> ? or is it that both are the same, it is just that the knowledge base 
>>> change, from external source to controlled vocabulary. Which would mean 
>>> that in the world of linked data, content classification / categorization 
>>> include entity linking? In that case i would say that, the same was 
>>> happening when linking to a controlled vocabulary term.
>>>
>>
>> IMO the properties used to represent analysis results do not
>> necessarily indicate if the results express linked entities or
>> categorizations. Based on the definition both dc:subject and
>> foaf:topic they should be both used for categories.
>>
>>>
>>> I'm little confused here. If someone, could clarify these notion i would 
>>> appreciate.
>>
>> hope this helps
>> best
>> Rupert
>>
>> [1] http://cv.iptc.org/newscodes/mediatopic
>>
>> --
>> | Rupert Westenthaler rupert.westentha...@gmail.com
>> | Bodenlehenstraße 11 ++43-699-11108907
>> | A-5500 Bischofshofen
>> | REDLINK.CO 
>> ..........................................................................
>> | http://redlink.co/

Reply via email to