Hello Bertrand,
I think what you're describing has the same aim and many points in common,
and adds even more value to it.
It should therefore be taken into consideration when further defining the
scope of the activity.
> 1. Simple enhancement of textual content
The POST request is very similar to the one in the Enhancer API [1],
although the returned data would be different. You're proposing to define an
output different from the Enhancement Structure [2], such as an XML/JSON
format indexed by entity, correct? For example:
{
  language: "en",
  entities: [{
    "<about>": {
      type: "{Person|Organization|Product|...}",
      confidence: 1.0
    }
  },{
    ...
  }]
}
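To make the shape concrete, here is a minimal Python sketch of how a client
might consume such a response. It only assumes the layout drafted above: the
sample URIs, the confidence values and the entities_above helper are made up
for illustration and are not part of any existing Stanbol API.

```python
import json

# Hypothetical response in the entity-indexed shape drafted above.
# The URIs and confidence values are invented sample data.
SAMPLE_RESPONSE = """
{
  "language": "en",
  "entities": [
    {"http://example.org/entity/Acme": {"type": "Organization", "confidence": 0.9}},
    {"http://example.org/entity/Bob": {"type": "Person", "confidence": 0.4}}
  ]
}
"""


def entities_above(response_text, threshold):
    """Return (about, type, confidence) tuples with confidence >= threshold."""
    doc = json.loads(response_text)
    found = []
    for entry in doc.get("entities", []):
        # Each list entry maps one entity URI ("<about>") to its details.
        for about, details in entry.items():
            confidence = details.get("confidence", 0.0)
            if confidence >= threshold:
                found.append((about, details.get("type"), confidence))
    return found


if __name__ == "__main__":
    for about, entity_type, confidence in entities_above(SAMPLE_RESPONSE, 0.5):
        print(about, entity_type, confidence)
```

Indexing by entity keeps the client code down to a single loop, which is
probably the main attraction compared to walking the full Enhancement
Structure.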
We could also try to mimic the output formats of similar APIs, to make it
easy to switch from one system to another.
> 2. Enhancement of binary content
Agree.
> 3. Enhancement of remote content
I think this matches the proposal for the Task Request JSON. We would also
allow adding some per-call analysis settings here. Maybe something like
this (similar to what has been implemented so far):
{
  url: "http://server/path/doc.ext", -- or -- content: "actual content",
  mimeType: "content/mime-type",
  parameters: {
    "engine-a-param-1": "value-1",
    "engine-b-param-2": "value-2",
    "engine-c-param-n": "value-3"
  }
}
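As a rough sketch of how a client could assemble such a request, assuming
that exactly one of url and content is given per call (my reading of the
"-- or --" above): the build_task_request helper and its argument names are
hypothetical, only the JSON keys come from the draft.

```python
import json


def build_task_request(mime_type, url=None, content=None, parameters=None):
    """Build a task request dict carrying either `url` or `content`, not both."""
    # Assumption: the two fields are mutually exclusive, one is mandatory.
    if (url is None) == (content is None):
        raise ValueError("provide exactly one of 'url' or 'content'")
    request = {
        "mimeType": mime_type,
        # Per-call analysis settings, keyed by engine parameter name.
        "parameters": dict(parameters or {}),
    }
    if url is not None:
        request["url"] = url
    else:
        request["content"] = content
    return request


if __name__ == "__main__":
    task = build_task_request(
        "text/html",
        url="http://server/path/doc.ext",
        parameters={"engine-a-param-1": "value-1"},
    )
    print(json.dumps(task, indent=2))
```

Rejecting ambiguous requests on the client side is only one option; the
server could equally define a precedence between the two fields.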
> Same as 2. but the posted (json?) document contains URLs of content
> that Stanbol first retrieves
The DCE could retrieve the content directly. In the case of Readability,
this is required so that it can access content that is spread across
multiple pages.
BR,
David
[1] http://stanbol.apache.org/docs/trunk/components/enhancer/#RESTful_API
[2] http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure.html
On Tue, Jan 15, 2013 at 12:17 PM, Bertrand Delacretaz <
[email protected]> wrote:
> On Tue, Jan 15, 2013 at 11:02 AM, Bertrand Delacretaz
> <[email protected]> wrote:
> ...
> > 4. Requests including enhancement pipeline definitions ("stateless
> Stanbol")...
> > Using a multipart POST in the previous use cases, one part can be a
> > pipeline definition...
>
> We could also use a part to supply initial metadata about the content
> being submitted.
>
> -Bertrand
>
--
David Riccitelli