Hi

Based on your Request I have worked the last two days on several
improvements of the Indexing Tool.
Most important the Indexing Util now directly creates a Bundle that
when installed in the Entityhub will create all the necessary
Entityhub components to use the Indexed RDF data as an Referenced Site

I have also created the generic RDF configuration with a lot of
additional documentation.

I am currently working on some final things. So expect to see the
stuff in the SVN tomorrow.

best
Rupert Westenthaler

On Wed, Jun 1, 2011 at 10:32 AM, Olivier Grisel
<[email protected]> wrote:
> 2011/6/1 Florent André <[email protected]>:
>> Hi Rupert,
>>
>> Thanks for your valuables answers !
>>
>> In fact, if get it now, the meaning of indexing in entity hub is not just
>> about index, but about create a new (offline) entity hub.
>>
>> You said :
>>> The Solr Yard provides better performance especially for big Datasets.
>> ...
>>> The Clerezza  is fine for smaller data sets.
>>
>> Do you have a "magic number" (a vague will be fine :) ) that define the
>> limit for a big dataset ?
>
> The SolrYard implementation should be pretty scalable (tens or
> hundreds millions of entities). The ClerezzaYard will suffer from a
> limitation though. It won't be scalable to more than a couple of
> thousands of entities as long as the following is not fixed:
>
>  https://issues.apache.org/jira/browse/CLEREZZA-466
>
> --
> Olivier
> http://twitter.com/ogrisel - http://github.com/ogrisel
>



-- 
| Rupert Westenthaler             [email protected]
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Reply via email to