Hi Rupert,

I followed the README file, but the problem still exists.

I enabled debug logging and saw the following lines:

15:48:43,318 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator - > line =     141 m.0432b
15:48:43,318 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator -  - id = m.0432b
15:48:43,318 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator -  - entity =
http://rdf.freebase.com/ns/.0432b
15:48:43,318 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator -  - score =
15:48:43,318 [Indexing: Entity Source Reader Deamon] DEBUG
jenatdb.RdfIndexingSource - No Statements found for id
http://rdf.freebase.com/key/.0432b (Node: http://rdf.freebase.com/ns/.0432b
)!


It seems like the 'm' in the entity id is being dropped. I created a patch
for that, so that the leading characters of the entity id are not dropped
when the default namespace is used.
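
To illustrate the kind of bug described above, here is a minimal sketch in
plain Java. It is not the actual Stanbol code and not the patch itself: the
class name, the method names, and the assumption that the id is split as
"prefix:local" are all hypothetical. It only shows one way an empty default
prefix could cause the first character of an id such as m.0432b to be lost,
and one possible way to avoid that.

```java
import java.util.Map;

/** Hypothetical sketch only; names and logic are assumptions, not Stanbol code. */
final class EntityIdExpander {

    // Assumed mapping: the empty prefix stands for the default namespace.
    static final Map<String, String> PREFIXES =
            Map.of("", "http://rdf.freebase.com/ns/");

    /** Buggy variant: assumes every id has the form "prefix:local". */
    static String expandBuggy(String id, String prefix) {
        // Skips the prefix plus one separator character, even when the id
        // contains no separator. With the empty prefix this removes the
        // first character of the id.
        String local = id.substring(prefix.length() + 1);
        return PREFIXES.get(prefix) + local;
    }

    /** Safer variant: only strips a prefix that is actually present. */
    static String expandFixed(String id) {
        int sep = id.indexOf(':');
        if (sep < 0) {
            // No separator: the whole id is the local name in the default namespace.
            return PREFIXES.get("") + id;
        }
        String ns = PREFIXES.get(id.substring(0, sep));
        if (ns == null) {
            // Unknown prefix: leave the id untouched rather than guess.
            return id;
        }
        return ns + id.substring(sep + 1);
    }
}
```

In this sketch, expandBuggy("m.0432b", "") yields
http://rdf.freebase.com/ns/.0432b, which matches the entity URL in the log
above, while expandFixed("m.0432b") keeps the leading 'm'.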


However, even after fixing this problem, I still get the same debug output,
now with the correct entity URL:

15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
impl.EntityIdBasedIndexingDaemon - unable to get Data for Entity
http://rdf.freebase.com/key/m.041yjm (score=norm:0.314499|orig:141.0)
15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator - > line =     141 m.041s8n
15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator -  - id = m.041s8n
15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator -  - entity =
http://rdf.freebase.com/ns/m.041s8n
15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
source.LineBasedEntityIterator -  - score =
15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
jenatdb.RdfIndexingSource - No Statements found for id
http://rdf.freebase.com/ns/m.041s8n (Node:
http://rdf.freebase.com/key/m.041s8n)!
15:48:43,319 [Indexing: Entity Source Reader Deamon] DEBUG
impl.EntityIdBasedIndexingDaemon - unable to get Data for Entity
http://rdf.freebase.com/ns/m.041s8n (score=norm:0.314499|orig:141.0)

When I checked the code, this happens because
indexingDataset.getDefaultGraph()
(RdfIndexingSource.getEntityData(String id), line 406) returns an empty
graph, so the parsed entity cannot be found in it. The
indexing/resources/tdb folder, which is used to create the
indexingDataset, exists and contains 26 data files.


Do you have any idea why this happens?

Thanks
Amindri



On 11 February 2015 at 23:03, Rupert Westenthaler <
rupert.westentha...@gmail.com> wrote:

> Hi Amindri,
>
> The file to look at is the README.md file of the freebase indexer [1]. If
> something is missing in this file please create an issue [2] and if
> possible provide a patch.
>
> thx
> Rupert
>
>
> [1]
> http://svn.apache.org/repos/asf/stanbol/trunk/entityhub/indexing/freebase/README.md
> [2] https://issues.apache.org/jira/browse/STANBOL
>
>
> On Wed, Feb 11, 2015 at 1:00 AM, Amindri Udugala
> <amindriudug...@gmail.com> wrote:
> > Hi Rupert,
> >
> > Thanks for the informative reply.
> > I was able to specify an empty String as the namespace prefix in the
> > namespaceprefix.mapping file. Exactly as you mentioned, indexing
> > started with no logging for quite some time. Then the process
> > finished without indexing a single entity.
> >
> > I used all the default configuration files created by the init
> > process. I'm trying to build a Freebase index for multilingual FST
> > linking. I would much appreciate it if you could point me to a
> > resource where I can get the information needed to correctly
> > configure the properties files.
> >
> > Thanks,
> > Amindri
>
>
>
> --
> | Rupert Westenthaler             rupert.westentha...@gmail.com
> | Bodenlehenstraße 11                              ++43-699-11108907
> | A-5500 Bischofshofen
> | REDLINK.CO
> ..........................................................................
> | http://redlink.co/
>



-- 
Regards
Amindri Udugala
