Lucene's analysis module has ArabicAnalyzer.
It also supports multi-word synonyms, and includes
WordnetSynonymParser to parse WordNet's file format specifically.
Seems like you have the building blocks for what you need?
Mike McCandless
http://blog.mikemccandless.com
On Sat, Mar 28, 2015
Hi
I recently tried to upgrade Lucene from 4.x to 5.0. I found out from the
documentation that FieldCache was removed in Lucene 5 and that I should use
DocValues fields for sorting.
I upgraded my sources, and everything works fine except spatial sorting by
distance.
When I looked into
Hello,
I'm new to Lucene and am looking for advice. I want to search the entire
DB (or almost the entire DB) for a keyword. The users also want to know which
field the string occurred in.
I can think of two ways to do this, but neither is ideal and I'm looking for
suggestions:
1)
You could do this with a custom Collector which, for every hit, visits
all child scorers asking each one whether it matched the current hit.
Your collector would have to somehow store this information away so
that once the search is done and you pull the top N hits, you know
which fields those hits
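
The bookkeeping described above can be sketched in plain Java. This is only a
library-free model and not Lucene API: FieldMatchSketch, FieldClause, and
fieldsFor are hypothetical names, and each predicate merely stands in for a
child scorer that is asked whether it matched the current hit.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.IntPredicate;

// Library-free model (assumption: NOT the Lucene Collector/Scorer API).
class FieldMatchSketch {

    // Hypothetical stand-in for one per-field sub-query (a "child scorer").
    static final class FieldClause {
        final String field;
        final IntPredicate matches;

        FieldClause(String field, IntPredicate matches) {
            this.field = field;
            this.matches = matches;
        }
    }

    private final List<FieldClause> clauses;

    // docID -> names of the fields whose clause matched that hit.
    private final Map<Integer, List<String>> matchedFields = new LinkedHashMap<>();

    FieldMatchSketch(List<FieldClause> clauses) {
        this.clauses = clauses;
    }

    // Plays the role of collect(docID): ask every clause about this hit
    // and store the answer away for later.
    void collect(int docID) {
        List<String> fields = new ArrayList<>();
        for (FieldClause clause : clauses) {
            if (clause.matches.test(docID)) {
                fields.add(clause.field);
            }
        }
        matchedFields.put(docID, fields);
    }

    // Looked up after the search, once the top N hits are known.
    List<String> fieldsFor(int docID) {
        return matchedFields.getOrDefault(docID, Collections.emptyList());
    }

    public static void main(String[] args) {
        FieldMatchSketch sketch = new FieldMatchSketch(Arrays.asList(
                new FieldClause("title", doc -> doc % 2 == 0),
                new FieldClause("body", doc -> doc >= 2)));
        for (int hit : new int[] {0, 2, 3}) {
            sketch.collect(hit);
        }
        System.out.println(sketch.fieldsFor(2)); // prints [title, body]
    }
}
```

Keeping the matches in a map keyed by docID means memory grows with the
number of hits collected, not just with the top N, so a real collector would
probably want to bound or prune what it stores.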
Yes, thank you so much.
Does WordnetSynonymParser parse only WordNet files in the Prolog format (.pl)?
Regards,
Amal
On 30 March 2015 at 11:40, Michael McCandless luc...@mikemccandless.com
wrote:
Lucene's analysis module has ArabicAnalyzer.
And it also has support for multi-word synonyms,
I am attempting to loop through the ChildScorer objects returned by
scorer.getChildren() inside my collect() call, but the collection is empty.
Is there something else I should do or some setup that I am missing?
Thanks
@Override
public void collect(int docID) throws IOException {
for(ChildScorer