[Dbp-spotlight-users] spotlight concepts

Radim Rehurek Sat, 07 Jun 2014 03:00:07 -0700

Hello all,


I am a new spotlight user -- thanks for the awesome project!





I'd appreciate help with some basic concepts:




1. I read the "statistical" paper [0], and ran the "statistical" backend, 
but I'm not clear on which parameters it uses. What is the difference 
between en2+2 or en4+8, does it use the language-dependent or independent 
version? How do I switch to the language-dependent one?




2. I tried Lucene spotlight 0.5, and it runs out of heap space even with the
smallest spotter dict (threshold 75), with Xmx 22GB, debian with openjdk. Is
that normal or am I doing it wrong? I followed the installation instructions
at [1]. Is there any advantage to the Lucene implementation anyway? When 
should I use Lucene vs. statistical?




3. Is there a list of tools like idio's "spotlight editor" [2]? In 
particular I'm interested in ways to manually add new "things" to the DB, or
manually tweak detections that are wrong. Is there any support out of the 
box? Are there other ways, apart from that idio's editor?




Thank you again for the great tool,

Radim




[0] http://jodaiber.de/doc/entity.pdf

[1] https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Installation

[2] https://github.com/idio/spotlight-model-editor

------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their 
applications. Written by three acclaimed leaders in the field, 
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech

_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

[Dbp-spotlight-users] spotlight concepts

Reply via email to