Hi Rupert, Thanks again for detailed answer.
I could setup the demo and look at the results. It's extremely powerful. One quick question, what's the purpose of these bin files under resources directory. bionlp2004-DNA-en.bin bionlp2004-RNA-en.bin bionlp2004-cell_line-en.bin bionlp2004-cell_type-en.bin bionlp2004-protein-en.bin With best regards, Rajan On Thu, May 28, 2015 at 3:52 AM, Rupert Westenthaler < rupert.westentha...@gmail.com> wrote: > Hi Rajan, > > The demo never included any Java code. > > The module just provides configurations [1] and datafiles [2]. Those > will be installed with the bundle using the Sling Installer and > Stanbol DataFileProvider infrastructure when the bundle is installed. > Note the <Install-Path> and <Data-Files> instructions configured for > the maven-bundle-plugin in the pom.xml file. > > The demo also provides a shell script [3] the indexes eHealth related > datasets and of corse the README explaining the demo > > best > Rupert > > [1] > http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/src/main/resources/config/ > [2] > http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/src/main/resources/datafiles/ > [3] > http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/index.sh > > On Wed, May 27, 2015 at 1:27 PM, <raja...@gmail.com> wrote: > > Hi Rupert, > > > > Thanks a lot for the detailed answers. Let me play a little bit further > before I ask additional follow-up questions. > > > > As far as demo is concerned, I am interested in eHealth demo as it > covers lots of items from my questions. At present, the Java code for it is > missing. Is it possible to restore Java code for eHealth demo in 0.12 > branch? > > > > With best regards, > > Rajan > > > > Sent from my iPhone > > > >> On May 27, 2015, at 6:42 AM, Rupert Westenthaler < > rupert.westentha...@gmail.com> wrote: > >> > >> Hi > >> > >>> On Wed, May 27, 2015 at 5:31 AM, Rajan Shah <raja...@gmail.com> wrote: > >>> Hi, > >>> > >>> As I am trying to get my hands around stanbol, I have couple general > design > >>> questions. > >>> > >>> *1. Enhancement Chain firing and results* > >>> > >>> How to find out which enhancement chain detected which entities? One > way, I > >>> could see that by adding/removing particular chain. Is it possible to > just > >>> enable it via logging within current code? > >>> > >>> For ex. > >>> I have a chain categorized-linking and would like to find out whether > this > >>> chain fired and labeled entities properly > >> > >> A enhancement chain has 1..* enhancement engines. The engines create > >> the annotations not the chain. So your question should be what engine > >> is creating an annotation. This information is provided by the > >> dc:creator and dc:contributor metadata of the enhancement. See also > >> the documentation at [1] > >> > >>> > >>> *2. Categorize entities differently* > >>> > >>> Is it possible to categorize your detected entities as something else? > >>> i.e. other than People, Organizations or Places > >>> > >>> What steps one need to take in current framework to achieve the same? > >> > >> You can use the Custom NER Model Extraction Engine [2]. > >> The models used in the documentation of this engine can be found at [3] > >> > >>> > >>> *3. Domain specific modeling* > >>> > >>> Suppose, I have a small domain and various types of entities. I am > >>> interested in > >>> > >>> a. analyzing various entities > >>> b. linking them with other entities and find relations from > dbpedia/freebase > >>> c. infer interesting aspects using reasoning > >>> > >>> Is Stanbol the way to go or Marmotta? or Is it preferred to develop a > >>> custom engine using Stanbol which uses internal components to perform > all > >>> of the above tasks? > >> > >> * Entity linking to your custom vocabulary in Stanbol > >> * If you want to have your custom entities linked with > >> dbpedia/freebase it is better to do that in the vocabulary. I think > >> Google refine provided reconciliation to freebase. that could be > >> definitely an option. > >> * If you want to find additional entities contained in > >> freebase/dbpedia configuring an other entity linking in Stanbol makes > >> complete sense. > >> > >> Not sure what you mean with "infer interesting aspects using reasoning". > >> > >>> > >>> *4. Enhance detected entities by annotation* > >>> > >>> Suppose, opennlp-ner detected an entity xyz. If I want to annotate this > >>> entity with additional attributes/fields using different custom > >>> vocabularies, what are the dev. steps I need to take? > >>> > >> > >> If you just want to link Named Entities with a controlled vocabulary > >> you can use the FST linking engine [4] with the Linking Mode set to > >> NER (read the Linking Mode of the engines documentation). In short you > >> will want to configure a "Apache Stanbol Enhancer Engine: FST Linking: > >> Named Entities" for the vocabulary you want to link against. > >> > >> > >>> *5. Previous demo project(s)* > >>> > >>> At the same time, any luck with restoring demo project(s) within 0.12 > >>> branch ? I believe, it demonstrates various aspects and it would be > great > >>> to have it restored. > >>> > >> > >> I hope those are still functional in the 0.12 branch. No immediate > >> plans to move them to 1.0.0 (mainly because of lack of time). > >> Contributions are very welcome. > >> > >> Hope this helps > >> best > >> Rupert > >> > >>> Thanks in advance, > >>> Rajan > >> > >> > >> [1] > http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure#fiseenhancement > >> [2] > https://stanbol.apache.org/docs/trunk/components/enhancer/engines/opennlpcustomner > >> [3] > http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/src/main/resources/datafiles/ > >> [4] > http://stanbol.apache.org/docs/trunk/components/enhancer/engines/lucenefstlinking > >> > >> > >> -- > >> | Rupert Westenthaler rupert.westentha...@gmail.com > >> | Bodenlehenstraße 11 ++43-699-11108907 > >> | A-5500 Bischofshofen > >> | REDLINK.CO > .......................................................................... > >> | http://redlink.co/ > > > > -- > | Rupert Westenthaler rupert.westentha...@gmail.com > | Bodenlehenstraße 11 ++43-699-11108907 > | A-5500 Bischofshofen > | REDLINK.CO > .......................................................................... > | http://redlink.co/ >