Hi Alessio, and welcome aboard!
It is great to see you are considering contributing so many engines.
I would like to contribute to the project by submitting this module
(obviously as open source).
Is this possible? How can I obtain a user/password in order to submit
the code to STANBOL svn repository?
You might want to run for committership. If so, you are very welcome to
register an account on the Apache issue tracker, open "Improvement" or "New
feature" tickets and post the code of your engines as patches:
https://issues.apache.org/jira/browse/STANBOL
I downloaded the latest version of STANBOL (0.9.0-incubating-SNAPSHOT)
and successfully tested the module integration within the framework.
What svn revision have you worked with? All pre-release modules are
versioned as 0.9.0-incubating-SNAPSHOT.
I'm asking because recent revisions added support for enhancement chains
and, given how your engines handle multilinguality, they look like they
could be arranged into pre-configured or selectable chains. I don't
actually know whether chains support branching, but it would be great if
a language-specific chain could be invoked once your Language Identifier
engine has detected the language.
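To make the idea concrete, here is a minimal sketch in Python of what I
mean by picking a chain from the detected language. Everything in it is
an assumption for illustration: the chain names are hypothetical, the
mapping is something you would configure yourself, and the
/enhancer/chain/<name> path is how I understand named chains to be
exposed, so please check it against the revision you are running.

```python
from urllib.parse import quote

# Hypothetical mapping from a detected language code to the name of a
# pre-configured enhancement chain. These chain names are made up.
LANGUAGE_CHAINS = {
    "fr": "french-ner",
    "it": "italian-lemmatizer",
    "de": "german-lemmatizer",
}

# Chain used when no language-specific chain is configured.
DEFAULT_CHAIN = "default"


def select_chain(detected_language):
    """Return the chain name for a language code, or the default chain."""
    if not detected_language:
        return DEFAULT_CHAIN
    return LANGUAGE_CHAINS.get(detected_language.lower(), DEFAULT_CHAIN)


def chain_endpoint(base_url, detected_language):
    """Build the URL of the chain to call (path layout is an assumption)."""
    chain = select_chain(detected_language)
    return f"{base_url.rstrip('/')}/enhancer/chain/{quote(chain)}"
```

A client would first run language identification, then post the document
to the URL returned by chain_endpoint(). Unknown or missing languages
fall back to the default chain, so nothing breaks for unsupported input.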
The services included so far in the module as Enhancement Engines are:
- a Named Entity Recognition service for French
- a Lemmatizer for Italian, German, Romanian, Russian, Danish (it
creates an annotation on the document whose content is the lemmatized form
of the document)
- a Language Identifier for Italian, French, German, Spanish, Portuguese,
Polish, Hungarian, Dutch, Swedish, Arabic, Russian, Turkish, Romanian,
Greek, Norwegian
- a Document Classification service for Italian, French, German,
English, Spanish, Portuguese that associates a document with DBpedia classes
The services are free to use for research purposes, but require a
license fee for commercial use.
The Enhancement Engines are by default provided with a demo license that
allows a certain number of daily requests in order to test them.
Proper license keys (both for research purposes and commercial use) can
be obtained on the http://linguagrid.org/ web page (or by sending me an
email request), and you are encouraged to request them in order to lift
the limits of the demo license key.
So, you wrapped calls to your linguistic services into engines for the
Stanbol enhancer, am I right? I guess that is fine for a Stanbol
contribution, since the Zemanta engine does the same, so long as you are
okay with licensing the engine's client code under the Apache License 2.0.
Do you have a demo installation of Stanbol online with your engines set
up?
Best,
Alessandro