Hi Alessandro, Thanks a lot for your ideas. I'm a newbie to Solr, will look into the above options working with Solr.
Thanks, Dileepa On Fri, Oct 25, 2013 at 4:33 PM, Alessandro Benedetti < [email protected]> wrote: > Hi Dileepa, > I agree with my colleagues, i don't think that trying to modify contehub > concept is the right way. > An easy solution can be to move the Data Import handler logic for your use > case to a specific Solrj index application. > Using java will be really easy to write a custom indexing application that > extract the info from your database, enrich the content and finally it > indexes the content in your Solr instance. > > Customizing the Data import handler can be another good solution, in that > case you have to write a customised transformer[1] to enrich your rows and > add new Solr fields. > The you can simply include that in your db-config for the DIH. > Another strategy is eventually customise the Sql Processor. But i prefer > the first strategy. it's more polite. > > Cheers > > [1] http://wiki.apache.org/solr/DIHCustomTransformer > > > 2013/10/25 Rafa Haro <[email protected]> > > > Hi Dileepa, > > > > Why not just enhance the content before indexing it in your Solr. If you > > are using a Solr DataImportHandler, you could customize it for making the > > proper REST requests to the Enhancer. > > > > WDYT? > > > > Cheers, > > Rafa > > > > El 25/10/13 11:37, Dileepa Jayakody escribió: > > > > Hi Antonio, > >> > >> Thanks for your valuable input. > >> > >> My idea of the best data flow will be; > >> > >> mysqlDB >> Solr contenthub >> enhancement-chain (to extract > >> Person/Organization) >> Solr contenthub (so facet search on the content > >> can > >> be done now) > >> > >> It will be even better to run the entity linking enhancement flow (to > tag > >> Person and Organization) when importing data from mysql db straight to > >> Stanbol. > >> > >> What will be the best way to achieve my requirement? Appreciate all the > >> ideas I can get.. :) > >> > >> Thanks, > >> Dileepa > >> > >> > >> On Fri, Oct 25, 2013 at 3:02 PM, Antonio David Perez Morales < > >> [email protected]> wrote: > >> > >> Hi > >>> > >>> I don't know if what you want is possible, but importing directly data > >>> into > >>> the Solr ContentHub will not extract entities neither store them in > Solr > >>> because it's done using the REST interface of ContentHub component. > >>> > >>> Regards > >>> > >>> > >>> On Fri, Oct 25, 2013 at 11:26 AM, Dileepa Jayakody < > >>> [email protected]> wrote: > >>> > >>> Hi Antonio and all, > >>>> > >>>> My requirement is not extracting entities from the external Solr > >>>> > >>> instance. > >>> > >>>> It contains indexed content imported from a mysql database > periodically > >>>> using a Solr DataImportHandler. > >>>> I need to enhance those content using Stanbol to identify Person and > >>>> Organization entities, so that they can be used as > >>>> facets<http://searchhub.org/**2009/09/02/faceted-search-**with-solr/< > http://searchhub.org/2009/09/02/faceted-search-with-solr/>> > >>>> for > >>>> Solr search. > >>>> > >>>> So I was thinking, this will be easier if I can configure Stanbol > >>>> contenthub to use the external Solr instance as a contenthub-store. > >>>> Another option is to import data straight from the mysql database to > the > >>>> Solr index in Stanbol contenthub store..Since Solr has the data import > >>>> handler functionality does Stanbol support it by default? > >>>> > >>>> Open for ideas. > >>>> > >>>> Thanks, > >>>> Dileepa > >>>> > >>>> > >>>> > >>>> > >>>> On Fri, Oct 25, 2013 at 2:31 PM, Antonio David Perez Morales < > >>>> [email protected]> wrote: > >>>> > >>>> Hi Dileepa. > >>>>> > >>>>> I don't know if you can change the Solr instance of the ContentHub (I > >>>>> > >>>> think > >>>> > >>>>> it is not possible) but thinking in your problem, you could do the > >>>>> following: > >>>>> > >>>>> - Create a new SolrYard pointing to your Solr instance (instead of > >>>>> > >>>> creating > >>>> > >>>>> a new core in the the Embedded Solr instance by default) > >>>>> - Create a new Site for your entities using this SolrYard > >>>>> - Create an Enhancement Chain configuring the linking engine pointing > >>>>> > >>>> to > >>> > >>>> your site and the specific field of your site to perform the linking > >>>>> - Create a new ContentHub using this EnhancementChain and defining a > >>>>> > >>>> LDPath > >>>> > >>>>> program to extract the properties of the extracted entities you want > to > >>>>> store in the Solr Content Hub. > >>>>> - Use the ContentHub to perform queries over the content and the > >>>>> > >>>> extracted > >>>> > >>>>> entities of it (only the fields extracted by the LDPath program) > >>>>> > >>>>> I would do so but I hope Rupert or another Stanbol guy can give more > >>>>> information about this problem. > >>>>> > >>>>> Regards > >>>>> > >>>>> > >>>>> On Thu, Oct 24, 2013 at 8:03 PM, Dileepa Jayakody < > >>>>> [email protected] > >>>>> > >>>>>> wrote: > >>>>>> Hi All, > >>>>>> > >>>>>> Is $subject possible? > >>>>>> I have a Solr server which imports data from a mysql database and > >>>>>> > >>>>> index > >>> > >>>> the > >>>>> > >>>>>> content periodically. I need to identify Person and Organization > >>>>>> > >>>>> mentions > >>>> > >>>>> in those content and make them searchable in the Solr index. > >>>>>> > >>>>>> > >>>>>> Can I configure Stanbol contenthub to use my existing Solr core and > >>>>>> > >>>>> enhance > >>>>> > >>>>>> content by adding Person and Organization metadata to it? > >>>>>> > >>>>>> Thanks, > >>>>>> Dileepa > >>>>>> > >>>>>> -- > >>>>> > >>>>> ------------------------------ > >>>>> This message should be regarded as confidential. If you have received > >>>>> > >>>> this > >>>> > >>>>> email in error please notify the sender and destroy it immediately. > >>>>> Statements of intent shall only become binding when confirmed in hard > >>>>> > >>>> copy > >>>> > >>>>> by an authorised signatory. > >>>>> > >>>>> Zaizi Ltd is registered in England and Wales with the registration > >>>>> > >>>> number > >>> > >>>> 6440931. The Registered Office is Brook House, 229 Shepherds Bush > Road, > >>>>> London W6 7AN. > >>>>> > >>>>> -- > >>> > >>> ------------------------------ > >>> This message should be regarded as confidential. If you have received > >>> this > >>> email in error please notify the sender and destroy it immediately. > >>> Statements of intent shall only become binding when confirmed in hard > >>> copy > >>> by an authorised signatory. > >>> > >>> Zaizi Ltd is registered in England and Wales with the registration > number > >>> 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road, > >>> London W6 7AN. > >>> > >>> > > > > -- > > > > ------------------------------ > > This message should be regarded as confidential. If you have received > this > > email in error please notify the sender and destroy it immediately. > > Statements of intent shall only become binding when confirmed in hard > copy > > by an authorised signatory. > > > > Zaizi Ltd is registered in England and Wales with the registration number > > 6440931. The Registered Office is Brook House, 229 Shepherds Bush Road, > > London W6 7AN. > > > > > > -- > -------------------------- > > Benedetti Alessandro > Visiting card : http://about.me/alessandro_benedetti > > "Tyger, tyger burning bright > In the forests of the night, > What immortal hand or eye > Could frame thy fearful symmetry?" > > William Blake - Songs of Experience -1794 England >
