Re: NIF + Stanbol

Sebastian Hellmann Thu, 30 May 2013 17:57:19 -0700

Hi Reto,

I am not sure whether I understand your example correctly. What does engine B do exactly?
The example below is taken from http://persistence.uni-leipzig.org/nlp2rdf/examples/wikilex/contextcollection
Note that Marl was the first thing I could find on the web about sentiment. Maybe there is something better.
Interestingly, it fits perfectly for the modeling with the NIF Stanbol profile.

For OA, we would need to create at least two more URNs: one for the specific resource, and then we would need to split up the marl:Opinion into oa:Annotation (for provenance: dc:created and creator) and oa:hasBody (for opinion properties).
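A rough sketch of that split, written as plain Python tuples rather than a real RDF library. The two urn:example: identifiers are hypothetical, marl:polarityValue is only my assumption for one of the omitted opinion properties, and the selector that would point the specific resource at the exact character range is left out.

```python
# Namespaces named in this thread; the property choices are assumptions.
OA = "http://www.w3.org/ns/oa#"
MARL = "http://purl.org/marl/ns#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def oa_split(annotation, body, target, source, created, creator, polarity):
    """Return triples that split one opinion into annotation, body and target."""
    return [
        # Provenance stays on the annotation itself.
        (annotation, RDF_TYPE, OA + "Annotation"),
        (annotation, DCT + "created", created),
        (annotation, DCT + "creator", creator),
        (annotation, OA + "hasBody", body),
        (annotation, OA + "hasTarget", target),
        # Opinion properties move to the body.
        (body, RDF_TYPE, MARL + "Opinion"),
        (body, MARL + "polarityValue", polarity),
        # The specific resource points back at the annotated document.
        (target, RDF_TYPE, OA + "SpecificResource"),
        (target, OA + "hasSource", source),
    ]


triples = oa_split(
    annotation="http://uri_or_urn_for_the_marl_opinion",
    body="urn:example:opinion-body",          # hypothetical extra URN
    target="urn:example:specific-resource",   # hypothetical extra URN
    source="http://en.wikipedia.org/wiki/Alcoholism",
    created="2013-05-31T00:45:56.555Z",
    creator="org.apache.stanbol.enhancer.engines.sentdetect.SentimentDetectionEnhancementEngine",
    polarity="-0.80",
)
subjects = sorted({s for s, _, _ in triples})
print(len(triples), "triples over", len(subjects), "subject URIs")
```

Counting the distinct subjects makes the cost visible: one opinion needs three identifiers in this layout.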

# context (the plaintext content of the Wikipedia article http://en.wikipedia.org/wiki/Alcoholism)

<Alcoholism.txt#char=0,91429>
    a nif:Context, nif:RFC5147String ;
    nif:beginIndex "0" ;
    nif:endIndex "91429" ;
    nif:sourceUrl <http://en.wikipedia.org/wiki/Alcoholism> ;
    nif:isString "(snip) ... Benzodiazepines, while useful in the management of acute alcohol withd....." .



<Alcoholism.txt#char=37028,37043>
    a nif:RFC5147String ;
    nif:anchorOf "Benzodiazepines, while useful in the management of acute alcohol withdrawal, if used long-term can cause a worse outcome in alcoholism." ;
    nif:beginIndex "37028" ;
    nif:endIndex "37165" ;
# NIF simple profile, just two properties
    nif:sentimentValue "-0.80"^^xsd:decimal ;
    nif:sentimentValueConfidence "0.9999978209631343" ;
# NIF Stanbol profile
    nif:opinion <http://uri_or_urn_for_the_marl_opinion> ;
    nif:referenceContext <Alcoholism.txt#char=0,91429>  .

<http://uri_or_urn_for_the_marl_opinion>
# some properties omitted
    marl:extractedFrom <Alcoholism.txt#char=37028,37043> ;
    <http://fise.iks-project.eu/ontology/confidence> "0.9999978209631343"^^<http://www.w3.org/2001/XMLSchema#double> ;
    <http://fise.iks-project.eu/ontology/extracted-from> <urn:content-item-sha1-b744059cdc5f802db787e9c40a7c3df53c5b6e68> ;
    <http://purl.org/dc/terms/created> "2013-05-31T00:45:56.555Z"^^<http://www.w3.org/2001/XMLSchema#dateTime> ;
    <http://purl.org/dc/terms/creator> "org.apache.stanbol.enhancer.engines.sentdetect.SentimentDetectionEnhancementEngine"^^<http://www.w3.org/2001/XMLSchema#string> ;
    rdf:type <http://purl.org/marl/ns#Opinion> .
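Since the string resources carry their offsets twice (in the #char= fragment and in nif:beginIndex / nif:endIndex) and also repeat the text in nif:anchorOf, a consumer can cross-check them. A minimal sketch, assuming offsets are zero-based with an exclusive end, and using a toy context because the full article text is not part of this mail; the function names are made up for illustration.

```python
import re


def parse_char_fragment(uri):
    """Extract (begin, end) from a '#char=begin,end' fragment."""
    match = re.search(r"#char=(\d+),(\d+)$", uri)
    if match is None:
        raise ValueError(f"no char fragment in {uri!r}")
    return int(match.group(1)), int(match.group(2))


def check_string(uri, begin_index, end_index, anchor_of, context):
    """Return a list of problems; an empty list means the resource is consistent."""
    problems = []
    if parse_char_fragment(uri) != (begin_index, end_index):
        problems.append("URI fragment and begin/end index differ")
    if context[begin_index:end_index] != anchor_of:
        problems.append("anchorOf is not the context substring")
    return problems


# Toy context standing in for the value of nif:isString.
context = "Alcohol misuse is common. Benzodiazepines can help."
begin = context.index("Benzodiazepines")
end = begin + len("Benzodiazepines")
uri = f"Toy.txt#char={begin},{end}"

ok = check_string(uri, begin, end, "Benzodiazepines", context)
bad = check_string(uri, begin, end + 5, "Benzodiazepines", context)
print(ok)
print(bad)
```

Run against the excerpt in this mail, the first check would flag the sentence resource, whose URI fragment ends at 37043 while nif:endIndex is "37165".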

All the best,
Sebastian

On 30.05.2013 17:28, Reto Bachmann-Gmür wrote:

Hi Sebastian

Indeed there seems to be quite some overlap between Open Annotation and NIF. The annotations we need clearly go beyond NLP annotation. We might want to say that according to engine A the second sentence was written in a bad mood, and that according to engine B a particular [word] refers to the person with profile http://...

It would be good to see examples of how this looks in NIF and Open Annotation.


Cheers,
Reto

On Sun, May 26, 2013 at 10:31 PM, Sebastian Hellmann <[email protected]> wrote:


    Hi Reto,
    let me offer you a different perspective:

    On 24.05.2013 15:40, Reto Bachmann-Gmür wrote:

        I haven't had the time yet to look at NIF. I'm happy that you agree that
        Stanbol should express the annotations (wherever possible) using the
        emerging W3C annotation standard.


    Which standard do you mean by "emerging W3C annotation standard"?
    NIF or Open Annotation?
    From a formal perspective, NIF will be very official in June when
    Internationalization Tag Set (ITS) Version 2.0
    (http://www.w3.org/TR/its20) becomes a W3C recommendation.
    Open Annotation is a community of practice (also a W3C community
    group). The formal W3C requirements for working groups and W3C
    recommendations do not apply to it.

    NIF targets NLP web services and NLP middleware and frameworks.
    Requirements for NIF are scalability, exactness, reproducibility
    of annotations.
    Open Annotation currently targets annotation of web resources from
    an end user perspective, especially seen through a browser.
    I am currently active on their mailing list, to help them with the
    NLP use case (which is not at the core of their interest).

    From both viewpoints NIF is a more formal standard as well as more
    pertinent for Stanbol.
    There is a NIF community as well, but we wanted to get the
    technical core straight, which is finished now, but not properly
    documented yet (due to moving infrastructure).

    Please have a look at the (scientific) description:
    http://svn.aksw.org/papers/2013/ISWC_NIF/public.pdf

    The core idea of the FISE ontology is quite good, but for us to
    include it into NIF, it still needs some work.
    The problem is, if Stanbol moves to Open Annotation completely,
    NIF will need to fill this gap and create a model that is similar
    to the way Stanbol works now.
    This is why I started this issue.

    All the best,
    Sebastian



        To my understanding, "Stanbol Enhancement Structure" is supposed to describe
        which ontological terms and constructs are used to describe the
        enhancements.

        Cheers,
        Reto


        On Thu, May 23, 2013 at 5:38 PM, Sebastian Hellmann <[email protected]> wrote:

            Hello Reto,
            ah yes, I read up on some of the Apache procedures to see
            how you are
            working and I see now that the mailing list is the most
            important means of
            communication here.   Thanks for pointing me to the issue.
            In principle, do
            you want me to comment here or in the issue tracker?

            Regarding the different models:

            First of all alignment should happen with the "Open
            Annotation Data Model":
            http://www.w3.org/ns/oa#
            This is the most current version. Annotation Ontology was
            merged into it.

            I really recommend grounding any work on their model, as it is really good
            and powerful. I am not sure, however, whether it provides the right level
            of scalability for NLP.
            Looking at:
            http://de.slideshare.net/paolociccarese/open-annotation-specifiers-and-specific-resources-tutorial


            There are 3 important things missing:
            - inclusion of the actual text in the web service request
            - providing best practices for identifiers, e.g.
              http://purl.org/olia/penn.owl#DT
            - reducing the number of URNs and triples

            This is where NIF comes in. (If you are in doubt, please
            try to create an
            OA example where a simple sentence is POS annotated over a
            web service).

            Regarding Rupert's problem with backward compatibility:
            In a first step, it should be enough to build an RDF
            parser/serializer
            based on the new OWL file.

            I don't yet understand what exactly is meant by "Stanbol Enhancement
            Structure" [1].
            Is this the OWL file for serializing annotations (e.g. for
            use in SPARQL)
            or does it describe the internal structure of the Stanbol
            Java Framework?

            I think the second one can stay as it is for now and then
            the new
            structure should be created (as serialization format)
            meanwhile with the
            clear aim to replace the former in the future. This would
            give all clients
            enough time to adapt.

            What do you think?

            All the best,
            Sebastian

            [1] http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure.html





            On 23.05.2013 14:12, Reto Bachmann-Gmür wrote:

                Hi Sebastian

                Are you aware of
                https://issues.apache.org/jira/browse/STANBOL-351 ?

                Rather than doing telcos, we should discuss things on
                the list.

                Cheers,
                Reto


                On Thu, May 23, 2013 at 9:27 AM, Sebastian Hellmann <[email protected]> wrote:

                  Hi all,

                    we created an OWL schema called NLP Interchange Format (NIF), which
                    leverages Apache Stanbol's FISE ontology.
                    Recent documentation is here:
                    http://svn.aksw.org/papers/2013/ISWC_NIF/public.pdf



                    Personally, I think the general structure (using a URN for each annotation)
                    is quite good, but I am a little bit unhappy with some facts:
                    1. URL persistence: when will the FISE ontology move from IKS to the
                    Apache Stanbol namespace? In my opinion, sooner is better. The longer it is
                    out there, the more side effects it will cause: http://xkcd.com/1172/
                    2. Some issues need discussion and some streamlining. I would be happy to
                    be of assistance and would offer to hold some ontology telcos to get it
                    straight.
                    http://svn.apache.org/repos/asf/stanbol/trunk/enhancer/generic/servicesapi/src/main/resources/fise.owl
                    e.g.
                    - start and end have xsd:int, limiting offsets to 2^31-1
                    (a text file of roughly 2 GB)
                    - extracted-from might not need to be functional.
                    Also there might be a
                    relation to prov:wasDerivedFrom
                    These issues all need discussion however.
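To put a number on the offset limit: a small sketch, assuming xsd:int is read as the signed 32-bit type of XML Schema. The comparison with xsd:long is only an illustration, not a proposal made in this thread, and the helper name is made up.

```python
XSD_INT_MAX = 2**31 - 1    # largest value of xsd:int (signed 32-bit)
XSD_LONG_MAX = 2**63 - 1   # largest value of xsd:long (signed 64-bit)


def fits(offset, max_value):
    """True if a character offset can be stored in a datatype with this maximum."""
    return 0 <= offset <= max_value


three_billion = 3_000_000_000
print(XSD_INT_MAX)
print(fits(three_billion, XSD_INT_MAX))
print(fits(three_billion, XSD_LONG_MAX))
```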

                    Any ideas on how to proceed?

                    All the best,
                    Sebastian

                    --
                    Dipl. Inf. Sebastian Hellmann
                    Department of Computer Science, University of Leipzig
                    Events: NLP & DBpedia 2013 (http://nlp-dbpedia2013.blogs.aksw.org,
                    Deadline: *July 8th*)
                    Come to Germany as a PhD: http://bis.informatik.uni-leipzig.de/csf
                    Projects: http://nlp2rdf.org , http://linguistics.okfn.org ,
                    http://dbpedia.org/Wiktionary , http://dbpedia.org
                    Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
                    Research Group: http://aksw.org




--
Dipl. Inf. Sebastian Hellmann
Department of Computer Science, University of Leipzig

Events: NLP & DBpedia 2013 (http://nlp-dbpedia2013.blogs.aksw.org, Deadline: *July 8th*)

Come to Germany as a PhD: http://bis.informatik.uni-leipzig.de/csf

Projects: http://nlp2rdf.org , http://linguistics.okfn.org , http://dbpedia.org/Wiktionary , http://dbpedia.org

Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
Research Group: http://aksw.org
