Hi there are also inconsistencies in the names of the OSGI parameters, default names of Engines ... Thats why I would like to make an other 0.* release and than change/fix all those things while working towards the 1.0 release
best Rupert On Tue, Nov 27, 2012 at 8:38 PM, Reto Bachmann-Gmür <r...@apache.org> wrote: > Which reminded me that we already discussed once that the artifact names > are unncessarily long, create STANBOL-820. Maybe some other renaming could > be done with that? > > Cheers, > Reto > > On Tue, Nov 27, 2012 at 12:17 PM, Rupert Westenthaler < > rupert.westentha...@gmail.com> wrote: > >> Hi Fabian >> >> Short version: >> >> I totally agree. Our vocabulary has changed over time, but the Engines >> still use the names as when they where introduced. Changing them >> (artifactIds and class names) is dangerous as this does break >> backwards compatibility. So I would suggest change names only if we >> can also come up with better implementation/design. >> >> Regarding Vocabulary I think we should prefer the terms >> "EntityLinking" and "NamedEntityLinking" and deprecate all others like >> "keyword" instead of "entity" or "extraction" or "tagging" instead of >> "linking". >> >> The 'engines/entitylinking' and 'engines/entityhublinking' introduced >> by STANBOL-733 do already use this new terminology. They also >> deprecate the 'engines/keywordextraction'. >> >> - - - >> >> Long version with more background information >> >> Regarding the linking of Entities there are currently two different >> principles: >> >> * "NamedEntityLinking": A "NamedEntity" has a 'selected text' AND a >> 'type'. So the selected text AND the type can be used for linking >> * "EntityLinking": An "Entity" does only have a 'selected text'. Here >> linking is only possible based on the selected text. >> >> The plan would be to also have two Engine implementations that support >> those linking models. >> >> * 'NamedEntityLinkingEngine' (currently /engines/entitytagging) >> * 'EntityLinkingEngine' (was /engines/keywordextraction (now >> deprecated) ; since yesterday /engines/entitylinking) >> >> Those should not have external dependencies (meaning to Stanbol >> components other than Stanbol Commons, Enhancer module; also not other >> major frameworks such as Solr or OpenNLP; no calls to external >> services). That would allow to keep those Engines within the enhancer >> module but also means that those implementation can not be directly >> used by the user (as the Service used for linking will be just defined >> by an Interface without an actual implementation. >> >> Because of that there will be "Engines" that are based on the above, >> but come with adapters to Services that do support the EntityLookup. >> The default will be implementations based on the StanbolEntityhub, but >> Stanbol users could also implement versions for their own >> infrastructure needs. >> >> The "EntityhubLinking" module [1] is the first example. When you look >> at the module you will recognize that it does not contain an single >> EnhancementEngine implementation. It only provides Entityhub specific >> implementations of the EntitySearcher interface defined by the >> "EntityLinkingEngine" and a OSGI component that allows users to >> configure an EntityLinkingEngine instance that uses the Entityhub to >> lookup Entities. >> >> Current state: >> >> Currently we are not yet there. The '/engines/entitytagging' still >> implements both NamedEntityLinking AND Lookup via the Entityhub. This >> engine could be replaced by a 'engines/namedentitylinking' that >> follows the design as described above. The new >> '/engines/entitylinking' already implements the above design. However >> it still depends on the Entityhub, because the EntitySearcher >> interface [3] that is still using the Entityhub Model classes. >> >> 'engines/entityhublinking' currently provides the ability to do >> 'entitylinking' with the Entityhub. As soon as the >> 'engines/namedentitylinking' is available I would add named entity >> linking functionality to that module. In a last step this module will >> also move out of the /enhancer component (as already suggested by >> STANBOL-805 [4]). >> >> >> BTW this design was the result of this [2] discussion on the Stanbol >> dev mailing list. >> >> best >> Rupert >> >> >> >> [1] >> http://svn.apache.org/repos/asf/stanbol/trunk/enhancer/engines/entityhublinking/ >> [2] http://markmail.org/message/nptkntyuthv7wwqh >> [3] >> http://stanbol.staging.apache.org/docs/trunk/components/enhancer/engines/entitylinking#entitysearcher >> [4] https://issues.apache.org/jira/browse/STANBOL-805 >> >> >> On Tue, Nov 27, 2012 at 11:14 AM, Fabian Christ >> <christ.fab...@googlemail.com> wrote: >> > Hi, >> > >> > enhancement engines in Stanbol can have several names and this is >> confusing >> > myself and very likely our users. Here are some examples that I came >> across >> > when trying to identify the running engines. I started to look at the >> > Web-UI and clicked through the OSGi console. >> > >> > dbpediaLinking (NamedEntityTaggingEngine) -> >> > Named Entity Tagging -> Entity Tagging -> >> > /engines/entitytagging >> > >> > entityhubExtraction (EntityLinkingEngine) -> >> > Entityhub Linking -> Entityhub Linking -> >> > /engines/entityhublinking >> > >> > Could we simplify this a bit to make it more obvious especially for new >> > users what is going on? >> > >> > Best, >> > - Fabian >> > >> > -- >> > Fabian >> > http://twitter.com/fctwitt >> >> >> >> -- >> | Rupert Westenthaler rupert.westentha...@gmail.com >> | Bodenlehenstraße 11 ++43-699-11108907 >> | A-5500 Bischofshofen >> -- | Rupert Westenthaler rupert.westentha...@gmail.com | Bodenlehenstraße 11 ++43-699-11108907 | A-5500 Bischofshofen