Re: large dataset into EntityHub

2013-07-23 Thread Rupert Westenthaler
On Tue, Jul 23, 2013 at 9:24 PM, aj...@virginia.edu wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA1 > > Hi, Stanbol folks! > > I'm trying to index a largeish (about 260M triples) dataset into a > Solr-backed EntityHub [1], but not having much success. I'm getting "out of > heap" errors

Re: Issue while adding field mappings

2013-07-23 Thread Fabian Christ
Hi, just as an additional info. The Stanbol dev mailing list (as any other public list at the ASF) is archived. You can search in this archive, e.g. at http://stanbol.markmail.org/ Best, - Fabian 2013/7/23 Tarandeep Singh, Sawhney : > thanks so much Rupert for your reaponse. > > i recently joi

large dataset into EntityHub

2013-07-23 Thread aj...@virginia.edu
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi, Stanbol folks! I'm trying to index a largeish (about 260M triples) dataset into a Solr-backed EntityHub [1], but not having much success. I'm getting "out of heap" errors in the load-to-Jena stage, even with a 4GB heap. The process doesn't make

Entity Tagging by RE pattern

2013-07-23 Thread Erik Antelman
I know several pipelines (GATE and several commercial NLP pipelines) annotate dates, measurements, address etc http://gate.ac.uk/sale/tao/splitap6.html#x36-736000F.7. I would like to have the same type of rule/re pattern annotation capability in a Stanbol chain. I would rather not just throw in GA

Re: Trouble with Stanbol setup

2013-07-23 Thread Fabian Christ
Hi Arohi, as Antonio already pointed out, your system is out of resources, i.e. memory. You need to provide more RAM to the Maven process. You can configure this by setting the MAVEN_OPTS system variable. A setting like the following should definitely give Maven enough memory. MAVEN_OPTS="-Xmx10

Re: Issue while adding field mappings

2013-07-23 Thread Tarandeep Singh, Sawhney
thanks so much Rupert for your reaponse. i recently joined dev community so wasnt aware of this issue reported earlier also. thanks again and i will try out work around options you have suggested amd will revert if some more help is needed best regards tarandeep On Jul 23, 2013 7:57 PM, "Rupert

Re: Issue while adding field mappings

2013-07-23 Thread Rupert Westenthaler
Hi Tarandeep, this Issue does not appear the first time here on the list. So you might also want to search the archives for more information. When using the Felix Webconsole to configure an OSGI component the size of the configuration MUST NOT exceed the headerBufferSize [1] of the embedded Jetty

Re: Open NLP Vs. GATE

2013-07-23 Thread Tarandeep Singh, Sawhney
A polite reminder. Can anyone share some inputs on this. best regards tarandeep On Mon, Jul 22, 2013 at 12:10 PM, Sawhney, Tarandeep Singh < tsawh...@innodata.com> wrote: > Hi All > > Can you please provide some inputs to understand how does Open NLP > compares with GATE (Stanford Core NLP). >

Issue while adding field mappings

2013-07-23 Thread Tarandeep Singh, Sawhney
Hi All, We are facing an issue while adding field mappings beyond 12-14 fields as defined in EntityHubLinking Engine property *"Fields used for dereferencing".* * * We have created our own reference site for additional DBpedia data. But when we try to map properties for dereferenced entities beyon

EntityHub string search + path query combined - is it possible?

2013-07-23 Thread Alessandro Adamou
Hi, I need to configure a search service that looks up entities on a ReferencedSite, but it should also retrieve property paths of length 2 for each entity found. But I would like to achieve it with a single EntityHub call. A use case example would be to have an autocomplete widget, where if

Re: Re:Re: How to persist the configurations across restarts of Stanbol.

2013-07-23 Thread Rupert Westenthaler
Hi Arthi On Tue, Jul 23, 2013 at 12:22 PM, wrote: > Hi Rupert, > Thanks a lot for information. > Is there any copy which happens of these files when Stanbol is shut down or > at any other time. I had tried manually copying these files and there was an > error which stated that the filesystem

RE: Re:Re: How to persist the configurations across restarts of Stanbol.

2013-07-23 Thread arthi.venkat
Hi Rupert, Thanks a lot for information. Is there any copy which happens of these files when Stanbol is shut down or at any other time. I had tried manually copying these files and there was an error which stated that the filesystem does not handles files of such large length. Is there is a po

Re: Simple example for querying across multiple RDF graphs

2013-07-23 Thread Reto Bachmann-Gmür
Hi Fabian > interesting, can you give some more information about the differences of > the Contenthub and fusepool-ecs? ECS uses an index on top of the RDF graph. Lucene is used via Clerezza CRIS. ECS provides the following advantages: - Facet values like everything else are RDF resources, so pr

Re: Using an Entityhub site as a (live) LD cache

2013-07-23 Thread Szabolcs Grünwald
Sorry, I was gone for some days. Thank you, I'll try that. Best, Szaby On 18 July 2013 16:54, Alessandro Adamou wrote: > On 18/07/2013 14:32, Rafa Haro wrote: > >> Hi Szaby! >> >> El 18/07/13 14:30, Szaby Grünwald escribió: >> >>> Hi! >>> >>> I would like to use Entityhub with some predefined

Re: Using an Entityhub site as a (live) LD cache

2013-07-23 Thread Szaby Grünwald
Sorry, I was gone for some days. Thank you, I'll try that. Best, Szaby On 18 July 2013 16:54, Alessandro Adamou wrote: > On 18/07/2013 14:32, Rafa Haro wrote: > >> Hi Szaby! >> >> El 18/07/13 14:30, Szaby Grünwald escribió: >> >>> Hi! >>> >>> I would like to use Entityhub with some predefined