RE: MemoryIndex "field must not be added more than once"

2012-03-07 Thread Uwe Schindler
Hi, Can you open a bug report in JIRA about this? The IndexWriter/IndexReader contract allows to add the same field several times (internally concatenating, but with a positionIncrement gap). If you append the fields before, phrase queries may behave differently. Uwe - Uwe Schindler H.-H.-Me

Re: More About NOT Optimizing

2012-03-07 Thread Michael McCandless
Maybe try TieredMergePolicy to see if it'd do any merges here...? More responses below: On Tue, Mar 6, 2012 at 8:00 PM, Paul Hill wrote: > I have an index with 421163 documents (including body text) > after running a test index for a couple of months with 3.4 code with the > default LogByteSiz

Re: MemoryIndex "field must not be added more than once"

2012-03-07 Thread Stephen Howe
Easiest way to do that would be to append all your values together and then add them in one pass. Unfortunately, you can only add a field + values once. -Stephen On Wed, Mar 7, 2012 at 9:39 AM, Dave Seltzer wrote: > I'm using a MemoryIndex to search in-memory content. > > I was wondering if the

MemoryIndex "field must not be added more than once"

2012-03-07 Thread Dave Seltzer
I'm using a MemoryIndex to search in-memory content. I was wondering if there's a way to specify multiple values for the same field in a MemoryIndex. I've tried addField but it throws an exception: index.addField("foobar", "value1", LuceneAnalyzer); index.addField("foobar", "value2", LuceneAnaly

Re: How to construct this query ?

2012-03-07 Thread heikki
iorixxx: that works ! Thank you so much. Ian: don't know why, but my original query returned no results, and iorixxx's suggestion works as expected. I'm using Lucene 3.5. thank you very much, kind regards Heikki Doeleman -- View this message in context: http://lucene.472066.n3.nabble.com/How-t

Re: How to construct this query ?

2012-03-07 Thread Ian Lea
Looks fine to me. Care to supply any evidence? http://wiki.apache.org/lucene-java/LuceneFAQ#Why_am_I_getting_no_hits_.2BAC8_incorrect_hits.3F has good tips. -- Ian. On Wed, Mar 7, 2012 at 1:53 PM, heikki wrote: > hello, > > I'm trying to programmatically create a query but don't get it worki

Re: How to construct this query ?

2012-03-07 Thread Ahmet Arslan
> I'm trying to programmatically create a query but don't get > it working. > > The query should return all results that match some prefix, > but not any > results that /exactly/ match the prefix (in the same field). > So only the > results where the field contents are longer than the > prefix. >

How to construct this query ?

2012-03-07 Thread heikki
hello, I'm trying to programmatically create a query but don't get it working. The query should return all results that match some prefix, but not any results that /exactly/ match the prefix (in the same field). So only the results where the field contents are longer than the prefix. I tried thi

Re: Help on DOCX and XLSX

2012-03-07 Thread Ian Lea
So you want to index different fields and search on those fields and are asking whether you can do that in lucene? The answer is yes. I still think you should look at Solr but if you are determined to use Lucene, get hold of a copy of the second edition of Lucene In Action http://www.manning.com/

RE: Help on DOCX and XLSX

2012-03-07 Thread Prasad KVSH
Hi Ian, Thanks for your quick reply. Our documents will have the following common key information like 1. Document Type ID, 2. Document Date, 3. Document Author ID, 4. Document Status 5. Document Group ID. While creating the indexing, we would like to add the above key values along the content

Re: Help on DOCX and XLSX

2012-03-07 Thread Ian Lea
You'll have to find something that parses the formats you are interested in and extracts the text you want. Apache Tika comes to mind. Why are you using such an old version of Lucene? Why aren't you using Solr? That might just work for you out of the box. See also http://www.lucidimagination.c