Hi all,
First of all, sorry for my poor English. It's not my native language.
I'm trying to use Lucene to index hierarchical kind of information: I have
structured html and pdf/word documents and I want to index them in ways to
perform search in titles, text, paragraphs or tables only, or any
AUTOMATIC REPLY
Tom Roberts is out of the office till 2nd September 2008.
LUX reopens on 1st September 2008
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
AUTOMATIC REPLY
Tom Roberts is out of the office till 2nd September 2008.
LUX reopens on 1st September 2008
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Hi John,
I am not sure about the way Solr implements range query.
But it looks like, that Solr is using
org.apache.lucene.search.ConstantScoreRangeQuery
which itself is using
org.apache.lucene.search.RangeFilter
So Solr do not rewrite the query to a large Boolean SHOULD, but it is
reading all
Hi,
We need a feature for time based cleanup of IndexCommits. Would it be
possible to add a method to IndexCommit class to get the timestamp of an
index commit?
Thanks.
--
Regards,
Akshay Ukey.
Hi Otis,
The original message text is:
Hi,
I'd appreciate if someone could explain the results I'm getting.
I've written a simple custom analyzer that applies the NGramTokenFilter to
the token stream during indexing. It's never applied during searching. The
purpose of this is to
Hi,
I enabled the TermVector for required fields using following piece of code,
Field titleField = new Field(title, title, Field.Store.NO,
Field.Index.TOKENIZED, TermVector.YES);
and then re-indexed it. But again it shows no result.
I checked the stored documents and they TermVector exists and
Any comments, suggestions? Maybe I should rephrase my original message or
describe it in detail?
I really would like to get any response if possible.
Thanks a lot in advance!
On Mon, Sep 1, 2008 at 10:25 AM, Leonid Maslov [EMAIL PROTECTED] wrote:
Hi all,
First of all, sorry for my poor
Hello,
Recently I developed an interest in making a lucene based structure for
tagging. As we all know lucene's update is not real-time and one has to
delete a document prior to updating it.
I have been googling for different approaches to a lucene based tagging
structure, and I stumbled upon
Hi Dave:
MoreLikeThis object has two parameters which controls his functionality:
mlt.setMinTermFreq(minTermFreq.intValue());
mlt.setMinDocFreq(minDocFreq.intValue());
By default MinTermFreq is 2, so if your document has no terms with
freq greater than 2 will return a query
MoreLikeThis essentially shortlists a large list of terms (found in example
text or an existing doc) and uses them in a query.
To see what terms have been shortlisted try calling query.rewrite(reader) and
then call toString() or extractTerms.
If this reveals no terms try using a debugger which
Thanks so much for hints, now it works correctly, the problem was with
mlt.setMinTermFreq.
Many thanks.
--
View this message in context:
http://www.nabble.com/Re%3A-MoreLikeThis-return-no-results-tp19230763p19256118.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
Hi,
Assume I have a String z-4. That would be properly indexed by my Analyzer,
so I'd find the belonging document if I search for z-4. Now I also want to
find that document if I search for z4.
Now my approach would be to inject an additional token z4 at indexing
time. There may also be several
Is my subscription working... I got no reply on my previous question.
Sorry the disturbance.
On Mon, Sep 1, 2008 at 10:29 PM, Markus Lux [EMAIL PROTECTED] wrote:
Hi,
Assume I have a String z-4. That would be properly indexed by my
Analyzer,
so I'd find the belonging document if I search for
14 matches
Mail list logo