Hi there,

Sorry to bother again, but can I get any feedback on the last email
regarding the configuration for building the dbpedia index?

Thanks,
Antero

On Fri, 25 Mar 2016 at 18:38 Antero Duarte <a.fduar...@gmail.com> wrote:

> Hi,
>
> Thank you for your reply.
> I don't think you need to host the big indexes, as long as the
> documentation is clear on how to build them.
>
> Can you give me some input on how the configuration for the 50G index
> would be like? I've been trying to build one but the max I got was 7G and
> for some reason it only worked once following the same exact steps.
>
> I am willing to try to create a docker container to build the big index
> and obviously make it available to everyone, but I need to be able to do it
> myself first and in a way that I can reproduce it.
>
> As for the version of dbpedia, I think that can be solved by being able to
> work with DBPedia current. If it works once, it should always work with the
> most recent version available.
>
> Regards,
> Antero
>
> On Fri, 25 Mar 2016 12:05 pm Rupert Westenthaler, <
> rupert.westentha...@gmail.com> wrote:
>
>> Hi Pooja,
>>
>> I agree with Antero.
>>
>> Regarding DBPedia indexing: We had DBPedia indexes (up to version 3.8)
>> but the server used to download those had a hardware breakdown and was
>> not replaced afterwards. I think I still have full indexes of 3.8, but
>> I never built one for a newer version of DBPedia
>>
>> ATM I am not eager to build a new DBPedia index as the current dataset
>> is ~1 year old (April 2015) and I expect a new version to be released
>> soon. But even If I would build a new index I would still not have a
>> server to host the files that could be (depending on the configuration
>> form 5 to 50GByte in size).
>>
>> best
>> Rupert
>>
>>
>> On Tue, Mar 22, 2016 at 11:37 AM, Antero Duarte <a.fduar...@gmail.com>
>> wrote:
>> > Hi there,
>> >
>> > When you first start stanbol you get a small dbpedia index, that's why
>> you
>> > don't get the 2nd or 3rd tier cities/towns. When you use dbpedia
>> spotlight,
>> > you connect to an external service that probably uses a bigger index
>> than
>> > the 43k entities, that's why you get more data but not everything. You
>> can
>> > follow the documentation to create a bigger index, start with these
>> links:
>> >
>> >
>> https://stanbol.apache.org/docs/trunk/customvocabulary.html#building-full-local-indexes-with-the-entityhub-indexing-tool
>> >
>> >  http://svn.apache.org/repos/as
>> >
>> f/stanbol/trunk/entityhub/indexing/genericrdf/src/main/resources/indexing/config/indexing.properties
>> >
>> > Also:
>> >
>> https://stanbol.apache.org/docs/0.9.0-incubating/customvocabulary.html#examples
>> >
>> > But a lot of people seem to be struggling with this, can I suggest that
>> > someone that doesn't struggle with this and is successful to build
>> bigger
>> > indexes goes through the documentation and updates it to both index
>> dbpedia
>> > current (should be applicable to future versions or at least 3.9, maybe
>> use
>> > this script <https://github.com/apache/stanbol/pull/3>) and remove
>> older
>> > parts of the documentation that no longer apply to the current version
>> of
>> > Stanbol? A lot of the users are struggling with this, myself included, I
>> > was able to build a bigger index, but it doesn't work every time and I
>> > can't understand why.
>> >
>> > Best Regards,
>> > Antero Duarte
>> >
>> > On Tue, 22 Mar 2016 at 10:17 Pooja H Bavishi <
>> > pooja.bavi...@iet.ahduni.edu.in> wrote:
>> >
>> >> Hello,
>> >>
>> >> I noticed that apache stanbol is able to recognize famous indian cities
>> >> like Bangalore, Delhi etc . but when I give a tier 2 or tier3 city or
>> town
>> >> name it is not recognizing. though the depedia has entries for such
>> tier 3
>> >> cities (such as Bijapur, Gulbarga etc).I tried with Dbpedia spotlight
>> >> engines (dbpspotlightcandidates and dbpspotlightspot) though the
>> Enitity
>> >> recognition has improved but I am still not getting the locations(lat
>> and
>> >> long) which is present in the dbpedia link generated by the stanbol (
>> >> http://dbpedia.org/page/Bijapur) is there any way I could get the geo
>> >> mapping from stanbol for any location that is there in dbpedia?
>> >>
>> >>
>> >> Regards,
>> >> Pooja
>> >>
>>
>>
>>
>> --
>> | Rupert Westenthaler             rupert.westentha...@gmail.com
>> | Bodenlehenstraße 11                              ++43-699-11108907
>> | A-5500 Bischofshofen
>> | REDLINK.CO
>> ..........................................................................
>> | http://redlink.co/
>>
>

Reply via email to