+1 to both suggestions
LARQ has the benefit that it gives you proper full text search so you
can do more than simple auto-completion (rankings, result limits, full
lucene queries) and that it is a relatively easy to drop in module
Trie indexes are the simplest way to do prefix searches for
auto-completion but they require you to do all the implementation
yourselves because AFAIK we don't have a drop in Trie index module for ARQ
Damian - Is your Trie index code you could share?
Rob
On 4/19/12 8:12 AM, Damian Steer wrote:
On 19 Apr 2012, at 15:42, Kunal Jain wrote:
Dear All,
Hi there,
I am developing an application using Jena along with TDB store. I have
loaded around 4 million triples in my store. A small subset of my triples is
as follows:
...
Now I want to do a free text matching for autosuggest kind functionality. I
have got this query to run against my store
?subject vs:name ?subjectName
FILTER regex(?subjectName, \"^Light\", \"i\")
I this query I am trying to find triples which start with a particular word,
i.e starting with 'Light'. Execution of this query is taking around 20
seconds.
While it's possible this could be improved, it's never going to be great since
the search is unindexed.
LARQ adds a proper free text index [1] which should be much better. This is now
a separate module, I believe (Paolo?).
Personally I've used a separate trie index for autocompletion, since it tends
to get hammered.
Damian
[1]<http://incubator.apache.org/jena/documentation/larq/index.html>