Dear Stanbol Community,

I work at Netgen, a small web agency that mostly uses the eZ Publish CMS;
we have also been an eZ Publish business partner for a long time. Together
with Paul Borgermans, an independent eZ consultant, we are interested in
adding some semantic capabilities to it, so we were in Salzburg last month
to try to figure out what exactly we could do. My colleague Petar followed
up on the discussion with Mr. Suat Gonul after the event.

After seeing some solutions already built by early adopters, and knowing
the eZ Publish CMS architecture really well, we have an idea for what we
could do as a proof of concept. But we need your help to evaluate the idea
and give some feedback.

The idea is simple: to map the eZ taxonomy to Stanbol and use the
enhancements to help the editor annotate the content.

Before I ask a few questions about how to implement that idea, here are a
few facts about eZ that could be of interest:
- all content is stored as objects, without the HTML markup
- objects are instantiated from a class (there is a collection of default
classes but it is possible to add more)
- a class has a collection of attributes
- objects are stored hierarchically as nodes
- objects can have relations with other objects
- objects can be tagged with tags
- tags are organized separately and can be hierarchical too

In summary, we can export some general knowledge from eZ, e.g.:
object1 is_a_parent_of object2
object1 is_instance_of class1 (e.g. Person, Blog post, Folder, ... )
object1 is_related_to object3
object1 is_tagged_with tag1
tag1 is_a_parent_of tag2

Of course, we could add more specific knowledge (depending on the
project), but we would like to keep it as general as possible. What we
could easily do is generate RDF from this data.
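
As a rough illustration of what such an export could look like, here is a
minimal Python sketch that serializes the relations listed above as
N-Triples. Everything in it is an assumption made for illustration: the
http://example.org/ez/ namespace, the predicate names, and the shape of the
input dictionaries are hypothetical and do not come from eZ Publish or
Stanbol.

```python
# Hypothetical namespace; a real export would use the site's own URIs.
BASE = "http://example.org/ez/"


def uri(kind, ident):
    """Build an N-Triples URI reference, e.g. <http://example.org/ez/object/1>."""
    return "<{}{}/{}>".format(BASE, kind, ident)


def triple(subject, predicate, obj):
    """Format one N-Triples statement using an illustrative predicate name."""
    return "{} <{}vocab#{}> {} .".format(subject, BASE, predicate, obj)


def export_triples(objects, tags):
    """Turn plain dictionaries describing objects and tags into N-Triples lines."""
    lines = []
    for o in objects:
        s = uri("object", o["id"])
        lines.append(triple(s, "is_instance_of", uri("class", o["class"])))
        for child in o.get("children", []):
            lines.append(triple(s, "is_a_parent_of", uri("object", child)))
        for rel in o.get("related", []):
            lines.append(triple(s, "is_related_to", uri("object", rel)))
        for tag in o.get("tags", []):
            lines.append(triple(s, "is_tagged_with", uri("tag", tag)))
    for t in tags:
        s = uri("tag", t["id"])
        for child in t.get("children", []):
            lines.append(triple(s, "is_a_parent_of", uri("tag", child)))
    return lines


# Made-up sample data standing in for what would be read from eZ.
sample_objects = [
    {"id": 1, "class": "folder", "children": [2], "related": [3], "tags": [1]},
]
sample_tags = [{"id": 1, "children": [2]}]

ntriples = export_triples(sample_objects, sample_tags)
print("\n".join(ntriples))
```

The output is one statement per line, so it could be written to a file and
loaded by any tool that reads N-Triples.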

So our main goal would be to push part of this knowledge to Stanbol and use
it to enhance newly created objects, relying on our internal data rather
than on DBpedia.


So here are some questions:

1. What could be added to the Entityhub from the knowledge listed above,
and what is the best (stable enough) way to add entities to the Entityhub:
via REST, or manually as described here:
http://incubator.apache.org/stanbol/docs/trunk/customvocabulary.html ? (Of
course REST would be more interesting, but if it is not ready yet, a manual
approach would be fine too.)

2. Would we need to use the Ontology Manager in any way to organize the
entities, or can this be skipped? (This is the vaguest part I have
encountered.)

3. How do we connect the Enhancer so that it uses our entities? Is the
KeywordLinkingEngine the way to go?
http://incubator.apache.org/stanbol/docs/trunk/enhancer/engines/keywordlinkingengine.html

4. Would it be faster to use the same entity types as DBpedia (Person,
Company, ...), or should introducing new ones be straightforward?

Any kind of feedback would be welcome and would help us in our
evaluation.

Best regards


-- 
Ivo Lukač

Netgen d.o.o. - A.M.Tripala 3/I, 10000 Zagreb, Croatia
web: http://www.netgen.hr, tel: +385 (0)1 3879722, mob: +385 (0)91 5251566
---------------------------------------------------------
everyday tweets: http://twitter.com/ilukac
company blog: http://www.netgen.hr/eng/blog
professional profile: http://www.linkedin.com/in/ivolukac
personal blog: http://ilukac.com/
member of the CISEx board: http://www.exportboomers.com/
presenting: http://ezsummercamp.com
