Nested Queries

2006-12-26 Thread Kapil Chhabra
Hi, Please see the following data-structure ++--+ | FIELD1 | FIELD2 | ++--+ | 1 | 2,3,4,6, | | 2 | 3,1,5,7, | | 3 | 1,2, | | 4 | 1,8,10, | | 5 | 2,9, | | 6 | 1, | | 7 | 2,9, | | 8 | 4,9, | | 9 |

Re: Nested Queries

2006-12-26 Thread Kapil Chhabra
Just to mention, I have tokenized FIELD2 on "," and indexed it. FIELD2:3 should return 1,2 FIELD2:(FIELD2:3) should return something like the output of: *FIELD2: 1 OR FIELD2: 2 * Regards, kapilChhabra* * Kapil Chhabra wrote: Hi, Please see the following data-structure ++--+ |

Re: Merge Index Filling up Disk Space

2006-12-26 Thread Harini Raghavan
Yes I think I got hit IOException. I assumed that the.tmp files are not required and deleted them manually from the indes directory as they were more than 10G. Is that ok? Michael McCandless wrote: Harini Raghavan wrote: Thank you for the response. I don't have readers open on the index, bu

Re: Merge Index Filling up Disk Space

2006-12-26 Thread Michael McCandless
Harini Raghavan wrote: Yes I think I got hit IOException. I assumed that the.tmp files are not required and deleted them manually from the indes directory as they were more than 10G. Is that ok? Yes, they are indeed not necessary so deleting them is fine. This (deleting partially created file

Re: Merge Index Filling up Disk Space

2006-12-26 Thread Harini Raghavan
Hi, I have another related problem. I am adding news articles for a company to the lucene index. As of now if the articles are mapped to more than one company, they are added so many times in the index. As the no. of companies mapped to each article increases, this will not be a scalable impl

Re: Merge Index Filling up Disk Space

2006-12-26 Thread Erick Erickson
First, it probably would have been a good thing to start a new thread on this topic, since it's only vaguely related to disk space ... That said, sure. Note that there's no requirement in lucene that all documents in an index have the same fields. Also, there's no reason you can't use two separat

Re: Better way for use lucene search in ejb

2006-12-26 Thread Grant Ingersoll
Hi Alex, This seems like a pretty small setup, so I think we may need some more info. I don't know a lot about EJBs, but what I wonder is if your bean is opening an Index Searcher for every connection. Perhaps you could describe your architecture a little bit more and that will help us

Re: boosting instead of sorting WAS: to boost or not to boost

2006-12-26 Thread Andrzej Bialecki
Suman Ghosh wrote: Andrzej, I have been trying to solve a similar problem where I need to boost score based on the document type. Your approach is very interesting and I want to give it a try. I have a implementation specific question. When you mention to put as many "1" as the boost need to be

help finding docs, creating analyzer objects

2006-12-26 Thread Eric Haszlakiewicz
I'm been having a hard time finding any kind of reasonable documentation on lucene. It seems that the javadocs are mostly empty, and the stuff on the wiki really doesn't explain anything. Is there a better place to be looking? So far, I've managed to get some basic stuff working. Now I'm try

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Grant Ingersoll
Hi Eric, I'm sorry you are not finding what you need. The snowball analyzers come in a separate jar, in the release zip, under the contrib/ snowball directory. You may also want/need the analyzers in contrib/ analyzers for other languages. The README delivered w/ the release probably sh

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Haszlakiewicz, Eric
>>> Grant Ingersoll 12/26/2006 2:43 PM >>> Hi Eric, >I'm sorry you are not finding what you need. The snowball analyzers >come in a separate jar, in the release zip, under the contrib/ >snowball directory. You may also want/need the analyzers in contrib/ >analyzers for other languages. The

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Haszlakiewicz, Eric
>>> Grant Ingersoll 12/26/2006 2:43 PM >>> >I'm sorry you are not finding what you need. The snowball analyzers >come in a separate jar, in the release zip, under the contrib/ >snowball directory. You may also want/need the analyzers in contrib/ >analyzers for other languages. The README de

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Mark Miller
You sound like a man that wants a manual. Lucene In Action is the way to go. It targets 1.4 instead of 2.0, so I'm sure you will still have complaints, but if you want to get up to speed on Lucene fast, LIA is the way to go. LIA will give you the overview and important classes, and then the doc

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Erick Erickson
Did you look in contrib/snowball? It should have been unpacked along with everything else when you unpacked lucene 2.0, it's named something like lucene-showball-2.0.0.jar. That said, I've also had to poke around to find things, and I still don't understand large portions of what Lucene does for

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Grant Ingersoll
Hi Eric, Take a look at http://lucene.apache.org/java/docs/releases.html Clicking through to the appropriate mirror, download/unzip lucene-2.0.0.zip. In it you'll find the jars, contrib, README, demo, etc. It really should contain everything you need outside of the actual Lucene source

Re: Nested Queries

2006-12-26 Thread Kapil Chhabra
Hi All, Any pointers in this direction? Thanks in advance. Kapil Kapil Chhabra wrote: Just to mention, I have tokenized FIELD2 on "," and indexed it. FIELD2:3 should return 1,2 FIELD2:(FIELD2:3) should return something like the output of: *FIELD2: 1 OR FIELD2: 2 * Regards, kapilChhabra* *

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Erik Hatcher
Mark's message was very nice to see about LIA, but I want to reply and second Eric's comments about the Lucene distributable. I just downloaded and unpacked the 2.0 .zip to test out a newbie to Lucene, but Java savvy (our target audience), experience. I opened docs/index.html and the first

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Erik Hatcher
I want to again thank Grant for recently taking a dedicated lead effort on the documentation revamping. I haven't yet built a recent archive from trunk to see what it's out of the box experience is like, but I'm guessing that at least the issue mentioned below has been alleviated (?).

Re: help finding docs, creating analyzer objects

2006-12-26 Thread Peter W.
Hello, I just got this working in three or four steps: 1. goto http://www.apache.org/dyn/closer.cgi/lucene/java/ 2. click on any of the mirrors and download "lucene-2.0.0.zip" 3. unzip into preferred directory (step not shown), then use jar to look at snowball items: jar tvf /opt/lucene-2.0.