All right, I had a quick chat with Rupert as I needed to understand some
more things.

We'll split the two goals:
 1. provide a Task Request-based API in the enhancer scope
 2. provide a Text Extraction feature as a Preprocessing Engine.

Task Request-based API, features:
 1. a new end-point that can be added at /enhancer/task
 2. the end-point takes a Task Request (interface to be defined)
 3. the Task Request will allow posting:
      a) content or a URL
      b) per-call engine parameters
      c) per-call EnhancementChain definitions
 4. it supports synchronous operations
 5. eventually it can support asynchronous operations with a callback URL
    (this point needs review, as a proxy/gateway is probably more appropriate)
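
To make the feature list above a bit more concrete, here is a rough sketch of
what a Task Request could look like in Java. The interface is still to be
defined, so everything here is an assumption: the class name TaskRequest, its
methods, and the engine and parameter names used in the usage note are
hypothetical and do not exist in Stanbol.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical Task Request sketch; none of these names are Stanbol API. */
public final class TaskRequest {
    private final String content;   // inline content, null when a URL is submitted
    private final URI contentUrl;   // URL to fetch, null when content is inline
    private final List<String> chain = new ArrayList<>();  // per-call chain: ordered engine names
    private final Map<String, Map<String, String>> engineParams = new LinkedHashMap<>();
    private URI callbackUrl;        // optional, only used by the possible asynchronous mode

    private TaskRequest(String content, URI contentUrl) {
        // Point 3a: exactly one of inline content or a URL must be given.
        if ((content == null) == (contentUrl == null)) {
            throw new IllegalArgumentException("exactly one of content or URL is required");
        }
        this.content = content;
        this.contentUrl = contentUrl;
    }

    public static TaskRequest forContent(String content) {
        return new TaskRequest(content, null);
    }

    public static TaskRequest forUrl(URI url) {
        return new TaskRequest(null, url);
    }

    /** Point 3c: appends an engine to the per-call chain definition. */
    public TaskRequest engine(String name) {
        chain.add(name);
        return this;
    }

    /** Point 3b: sets a per-call parameter for one engine. */
    public TaskRequest param(String engine, String key, String value) {
        engineParams.computeIfAbsent(engine, k -> new LinkedHashMap<>()).put(key, value);
        return this;
    }

    /** Point 5: a callback URL marks the request as asynchronous. */
    public TaskRequest callback(URI url) {
        this.callbackUrl = url;
        return this;
    }

    public boolean isAsynchronous() { return callbackUrl != null; }
    public String getContent() { return content; }
    public URI getContentUrl() { return contentUrl; }
    public List<String> getChain() { return Collections.unmodifiableList(chain); }

    public Map<String, String> getParams(String engine) {
        return engineParams.getOrDefault(engine, Collections.emptyMap());
    }
}
```

A caller could then build a request with something like
TaskRequest.forUrl(URI.create("http://example.org/page.html")).engine("text-extraction").param("text-extraction", "extractor", "readability"),
where the engine and parameter names are placeholders. How such an object
would be serialized for the end-point is left open here.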

To implement the above, the JIRA issue STANBOL-488 [1] must be taken into
consideration.

Text Extraction features:
 1. currently Readability
 2. it might be interesting to try out Boilerpipe and Goose to compare their
    performance and quality
 3. to implement, just use the ContentReference to create the ContentItem;
    it will load the data automatically.
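
One way to keep Readability, Boilerpipe and Goose comparable is to put them
behind a small extraction service, which also fits the suggestion in this
thread to base both the engine and the RESTful service on a shared text
extraction service. The sketch below assumes a hypothetical TextExtractor
interface (not part of Stanbol). The only implementation shown is a naive
tag stripper used as a stand-in, not a port of any of the three libraries.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public final class TextExtractionSketch {

    /** Hypothetical service interface; an engine or REST resource would delegate to it. */
    public interface TextExtractor {
        String extract(String html);
    }

    /** Naive stand-in: drops script/style blocks and all tags. Not Readability. */
    public static final class TagStrippingExtractor implements TextExtractor {
        @Override
        public String extract(String html) {
            return html
                .replaceAll("(?is)<(script|style)[^>]*>.*?</\\1>", " ") // remove script/style with content
                .replaceAll("(?s)<[^>]+>", " ")                         // remove remaining tags
                .replaceAll("\\s+", " ")                                // collapse whitespace
                .trim();
        }
    }

    /** Runs every registered extractor on the same input for a side-by-side comparison. */
    public static Map<String, String> compare(Map<String, TextExtractor> extractors, String html) {
        Map<String, String> results = new LinkedHashMap<>();
        for (Map.Entry<String, TextExtractor> entry : extractors.entrySet()) {
            results.put(entry.getKey(), entry.getValue().extract(html));
        }
        return results;
    }
}
```

Wrappers around the real libraries could be registered under their own names
in the map passed to compare, so the outputs can be checked against each other
on the same pages.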

[1] https://issues.apache.org/jira/browse/STANBOL-488

BR
David


On Mon, Jan 14, 2013 at 8:22 PM, Rupert Westenthaler <
[email protected]> wrote:

> Hi
>
> On Mon, Jan 14, 2013 at 2:21 PM, David Riccitelli <[email protected]>
> wrote:
> >> Are you now asking how this could be made available via the /api/tasks
> >> API you proposed?
> >
> > Yep, should it be then restricted to that specific URL [1] or would the
> > engine be allowed to create an additional end-point at /api/tasks?
> >
>
> I would not add another RESTful service to an EnhancementEngine,
> because this goes against modularity, forcing users to have both the
> engine and the RESTful API.
> I would rather have a service that does the TextExtraction and then
> implement an EnhancementEngine and a RESTful service based on that.
>
> best
> Rupert
>
> > BR,
> > David
> >
> > [1] http://{host}:{port}/{stanbol-root}/enhancer/engine/{engine-name}
> >
> >
> > On Mon, Jan 14, 2013 at 2:21 PM, Fabian Christ <
> [email protected]
> >> wrote:
> >
> >> 2013/1/14 David Riccitelli <[email protected]>
> >>
> >> > About point a) I have a question. As the API allows for selection of
> >> > the Enhancement Chain, how would that work if we move the API into an
> >> > engine? Can the engine be executed outside of the scope of an
> >> > enhancement chain?
> >> >
> >>
> >> You can call a single engine [1] by using
> >>
> >> http://{host}:{port}/{stanbol-root}/enhancer/engine/{engine-name}
> >>
> >> Are you now asking how this could be made available via the /api/tasks
> >> API you proposed?
> >>
> >> [1] http://stanbol.apache.org/docs/trunk/components/enhancer/engines/index.html
> >>
> >>
> >> > Shall we move this thread on a JIRA thread?
> >> >
> >>
> >> No, discussing this here is totally fine. We should create a JIRA that
> >> describes what to do/implement after the discussion and we have some
> >> consensus.
> >>
> >> Best,
> >>  - Fabian
> >>
> >>
> >> --
> >> Fabian
> >> http://twitter.com/fctwitt
> >>
> >
> >
> >
> > --
> > David Riccitelli
> >
> > -- check the Swagger for WordLift <http://bit.ly/VtoM5H>
> >
> ********************************************************************************
> > InsideOut10 s.r.l.
> > P.IVA: IT-11381771002
> > Fax: +39 0110708239
> > ---
> > LinkedIn: http://it.linkedin.com/in/riccitelli
> > Twitter: ziodave
> > ---
> > Layar Partner Network<
> http://www.layar.com/publishing/developers/list/?page=1&country=&city=&keyword=insideout10&lpn=1
> >
> >
> ********************************************************************************
>
>
>
> --
> | Rupert Westenthaler             [email protected]
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen
>



-- 
David Riccitelli

