Newbie question: using Lucene to index hierarchical information.

2008-09-01 Thread Leonid Maslov
Hi all, First of all, sorry for my poor English. It's not my native language. I'm trying to use Lucene to index hierarchical information: I have structured HTML and PDF/Word documents, and I want to index them in a way that lets me search in titles, text, paragraphs or tables only, or any

Re: Newbie question: using Lucene to index hierarchical information.

2008-09-01 Thread tom
AUTOMATIC REPLY Tom Roberts is out of the office till 2nd September 2008. LUX reopens on 1st September 2008

Re: Index types

2008-09-01 Thread Karsten F.
Hi John, I am not sure how Solr implements range queries, but it looks like Solr uses org.apache.lucene.search.ConstantScoreRangeQuery, which itself uses org.apache.lucene.search.RangeFilter. So Solr does not rewrite the query to a large Boolean SHOULD, but it is reading all

getTimestamp method in IndexCommit

2008-09-01 Thread Akshay
Hi, We need a feature for time based cleanup of IndexCommits. Would it be possible to add a method to IndexCommit class to get the timestamp of an index commit? Thanks. -- Regards, Akshay Ukey.

Re: Confused with NGRAM results

2008-09-01 Thread gaz77
Hi Otis, The original message text is: Hi, I'd appreciate it if someone could explain the results I'm getting. I've written a simple custom analyzer that applies the NGramTokenFilter to the token stream during indexing. It's never applied during searching. The purpose of this is to
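
For readers following this thread, here is a minimal plain-Java sketch of what n-gram expansion of a single token produces. It does not use Lucene; the class name `NGramSketch`, the method `ngrams`, and the gram sizes 2 and 3 are hypothetical choices for illustration, and the order in which a real NGramTokenFilter emits its grams may differ.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: lists the character n-grams of one token.
final class NGramSketch {

    // Returns every substring of `token` whose length lies between minGram and maxGram.
    static List<String> ngrams(String token, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        for (int size = minGram; size <= maxGram; size++) {
            for (int start = 0; start + size <= token.length(); start++) {
                grams.add(token.substring(start, start + size));
            }
        }
        return grams;
    }

    public static void main(String[] args) {
        // prints [lu, uc, ce, en, ne, luc, uce, cen, ene]
        System.out.println(ngrams("lucene", 2, 3));
    }
}
```

Under these assumptions, if grams are produced only at indexing time, a query term matches only when it is exactly equal to one of the indexed grams: "luc" would match, while "lucen" (longer than the largest gram) would not.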

Re: MoreLikeThis return no results

2008-09-01 Thread davood
Hi, I enabled the TermVector for the required fields using the following piece of code: Field titleField = new Field(title, title, Field.Store.NO, Field.Index.TOKENIZED, TermVector.YES); and then re-indexed. But again it shows no results. I checked the stored documents and the TermVector exists and

Re: Newbie question: using Lucene to index hierarchical information.

2008-09-01 Thread Leonid Maslov
Any comments, suggestions? Maybe I should rephrase my original message or describe it in detail? I really would like to get any response if possible. Thanks a lot in advance! On Mon, Sep 1, 2008 at 10:25 AM, Leonid Maslov [EMAIL PROTECTED] wrote: Hi all, First of all, sorry for my poor

lucene based tagging structure

2008-09-01 Thread Cam Bazz
Hello, Recently I developed an interest in making a Lucene-based structure for tagging. As we all know, Lucene's update is not real-time, and one has to delete a document prior to updating it. I have been googling for different approaches to a Lucene-based tagging structure, and I stumbled upon

Re: MoreLikeThis return no results

2008-09-01 Thread Marcelo Ochoa
Hi Dave: The MoreLikeThis object has two parameters which control its functionality: mlt.setMinTermFreq(minTermFreq.intValue()); mlt.setMinDocFreq(minDocFreq.intValue()); By default MinTermFreq is 2, so if your document has no terms with a frequency of at least 2 it will return a query
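
To illustrate why a minimum term frequency can leave nothing to search with, here is a small plain-Java sketch. It is not the MoreLikeThis implementation; the class `MinTermFreqSketch` and the method `candidateTerms` are hypothetical, and the sketch assumes a term is kept when its frequency in the source text is at least the threshold.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of a minimum-term-frequency cutoff (not Lucene code).
final class MinTermFreqSketch {

    // Keeps the terms that occur at least minTermFreq times, in first-seen order.
    static List<String> candidateTerms(String[] tokens, int minTermFreq) {
        Map<String, Integer> freq = new LinkedHashMap<>();
        for (String token : tokens) {
            freq.merge(token, 1, Integer::sum);
        }
        List<String> kept = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : freq.entrySet()) {
            if (entry.getValue() >= minTermFreq) {
                kept.add(entry.getKey());
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        String[] title = {"using", "lucene", "for", "tagging"};
        System.out.println(candidateTerms(title, 2)); // every term occurs once, so nothing survives
        System.out.println(candidateTerms(title, 1)); // lowering the threshold keeps all four terms
    }
}
```

A short field such as a title rarely repeats a term, so with a threshold of 2 the candidate list is empty; lowering the threshold to 1 is the usual way to test whether this cutoff is the cause.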

Re: MoreLikeThis return no results

2008-09-01 Thread mark harwood
MoreLikeThis essentially shortlists a large list of terms (found in example text or an existing doc) and uses them in a query. To see what terms have been shortlisted try calling query.rewrite(reader) and then call toString() or extractTerms. If this reveals no terms try using a debugger which

Re: MoreLikeThis return no results

2008-09-01 Thread davood
Thanks so much for hints, now it works correctly, the problem was with mlt.setMinTermFreq. Many thanks.

Injecting additional tokens

2008-09-01 Thread Markus Lux
Hi, Assume I have a String z-4. That would be properly indexed by my Analyzer, so I'd find the corresponding document if I search for z-4. Now I also want to find that document if I search for z4. My approach would be to inject an additional token z4 at indexing time. There may also be several
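
The idea of injecting an extra token can be sketched in plain Java as follows. This is not a Lucene TokenFilter; the classes `TokenInjectionSketch` and `Tok` are hypothetical, and the rule "strip hyphens to form the variant" is an assumption based on the z-4 / z4 example. A position increment of 0 is used to mark the variant as sitting at the same position as the original token, which is the convention Lucene uses for stacked tokens such as synonyms.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of token injection at indexing time (not Lucene code).
final class TokenInjectionSketch {

    // A token with its position increment; 0 means "same position as the previous token".
    static final class Tok {
        final String text;
        final int posInc;

        Tok(String text, int posInc) {
            this.text = text;
            this.posInc = posInc;
        }

        @Override
        public String toString() {
            return text + "(+" + posInc + ")";
        }
    }

    // Emits each token unchanged and, for hyphenated tokens, a hyphen-free variant after it.
    static List<Tok> inject(List<String> tokens) {
        List<Tok> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(new Tok(token, 1));
            if (token.indexOf('-') >= 0) {
                String joined = token.replace("-", "");
                if (!joined.isEmpty()) {
                    out.add(new Tok(joined, 0)); // stacked on the original token
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // prints [model(+1), z-4(+1), z4(+0)]
        System.out.println(inject(Arrays.asList("model", "z-4")));
    }
}
```

Keeping the variant at the same position means phrase queries containing either z-4 or z4 would see the same token positions, assuming the index is built from this stream.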

Re: Injecting additional tokens

2008-09-01 Thread Raymond Balmès
Is my subscription working? I got no reply to my previous question. Sorry for the disturbance. On Mon, Sep 1, 2008 at 10:29 PM, Markus Lux [EMAIL PROTECTED] wrote: Hi, Assume I have a String z-4. That would be properly indexed by my Analyzer, so I'd find the belonging document if I search for