Re: Customized search with Lucene?

2007-10-24 Thread Doron Cohen
Lukas, Thanks for the link to the book, seems very interesting. Your original question made me think you are intending to maintain personalized score modifications - i.e. for each user you are going to customize the scores differently and I was curious to know how you are going to maintain custom sco

Re: Same query different results?

2007-10-24 Thread Karl Wettin
On 25 Oct 2007, at 06:33, Warren wrote: This query works correctly in my app and Luke +(upc:almonds description:almonds content:almonds) +dept:bulk +zone:1 Query 2: This query works correctly in Luke but returns no results in my app +(upc:almonds description:almonds content:almonds) +dept:"Bulk"

Same query different results?

2007-10-24 Thread Warren
I have two similar queries that give me the same results when I run them in Luke, but when I run them in my app, one of the queries gives me the correct results and the other query gives me no results. I am using StandardAnalyzer in both Luke and my app. Query 1: This query works correctly in m

Re: Highlighter and href fields

2007-10-24 Thread Cool Coder
Ok I understand now that I have a lot of work ahead of me. >2. Use an Analyzer that recognizes URLs. That way you won't get partial BTW, do you know any analyzer that can recognize URLs? - BR Mark Miller <[EMAIL PROTECTED]> wrote: Nothing in the Highlighter per se that will help you ther

Re: Boost Sub-Query

2007-10-24 Thread Erick Erickson
What I'd do is get a copy of Luke, put the query in the appropriate box and click on the explain tab. Trying that several ways should answer your question as well as give you a better feel for how Lucene handles queries. Best Erick On 10/24/07, java_user_ <[EMAIL PROTECTED]> wrote: > > > Is it po

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-24 Thread markharw00d
lucene user wrote: Thanks for all your help! We are using Lucene 2.1.0 and TermsFilter seems to be new in Lucene 2.2.0. I have not been able to find SortedVIntList in the javadocs at all. No, SortedVIntList is in the patch I provided a link to earlier. Because both SortedVIntList and a r

Re: Index Reader Writer - 2 JVMs

2007-10-24 Thread Mark Miller
Lucene does provide locking so that two JVMs can access the same index. Check out the LockFactory to control how this is done. There are a couple of advantages of using IndexAccessor: For one you could use a NoLockFactory and avoid Lucene's locking mechanism. Also, you would only need to run one

Re: Customized search with Lucene?

2007-10-24 Thread Lukas Vlcek
Doron, Sorry for the late reply. I got the inspiration for this question from the book Programming Collective Intelligence by Toby Segaran (you can check here for some O'Reilly comments on the book). You can find a chapter Searching and

Re: Highlighter and href fields

2007-10-24 Thread Mark Miller
Nothing in the Highlighter per se that will help you there. I see two options off the top of my head: 1. break the text before feeding it to the highlighter and feed all but the URL parts, and then stitch back together -- much as you might do if highlighting an XML doc. Ugly though. 2. Use
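
For illustration, option 1 above could be sketched roughly like this in plain Java. This is only a sketch under assumptions, not Highlighter code: the class name UrlSafeHighlighter, the simple URL regex, and highlightPlain (a stand-in that wraps literal matches in <B> tags) are all hypothetical. In a real application highlightPlain would be replaced by the actual highlighter call.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class UrlSafeHighlighter {
    // Hypothetical, deliberately simple URL pattern: scheme followed by non-whitespace.
    private static final Pattern URL = Pattern.compile("https?://\\S+");

    // Stand-in for the real highlighter (assumption): wraps each literal
    // occurrence of the term in <B> tags.
    static String highlightPlain(String text, String term) {
        return text.replace(term, "<B>" + term + "</B>");
    }

    // Highlight only the non-URL parts of the text, then stitch everything
    // back together with the URLs left untouched.
    static String highlight(String text, String term) {
        StringBuilder out = new StringBuilder();
        Matcher m = URL.matcher(text);
        int last = 0;
        while (m.find()) {
            // Text between the previous URL (or the start) and this URL.
            out.append(highlightPlain(text.substring(last, m.start()), term));
            // The URL itself is copied through unchanged.
            out.append(m.group());
            last = m.end();
        }
        // Trailing text after the last URL.
        out.append(highlightPlain(text.substring(last), term));
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(highlight("see http://jkjsd.net for .net tips", ".net"));
    }
}
```

With the example input, the URL stays intact while the standalone ".net" is wrapped. A real URL pattern would need to handle trailing punctuation and URLs without a scheme, which this sketch ignores.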

Index Reader Writer - 2 JVMs

2007-10-24 Thread Dino Korah
Hi All, I have a scenario where there are two processes (2 JVMs) accessing the same index. One of them is doing the indexing as documents arrive into the system and the second one serves search queries. Both processes are running on the same machine. Is there a need to do some kind of lockin

Highlighter and href fields

2007-10-24 Thread Cool Coder
Is there any way I can stop highlighting text if it is an href/URL etc.? The problem occurs when the field content is a URL which contains the query, e.g. my search is for .net and the field has the value http://jkjsd.net. After applying the highlighter, it becomes http://jkjsd.net, which is a wrong URL. Can I

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-24 Thread lucene user
Thanks for all your help! We are using Lucene 2.1.0 and TermsFilter seems to be new in Lucene 2.2.0. I have not been able to find SortedVIntList in the javadocs at all. Because both SortedVIntList and a regular BitSet are based on Lucene Document Numbers, which are not permanent, it seems we will

Stemmer and Synonym analyzer

2007-10-24 Thread java_user_
I am planning on building an analyzer that has stemming, stopwords and synonyms. I am planning on using the Snowball Porter stemmer and the WordNet synonym engine. Does it make sense to stem the synonym index? I do not want to stem the term “history” and then try to find the synonym. The st

Boost Sub-Query

2007-10-24 Thread java_user_
Is it possible to boost subqueries with QueryParser? For example: ((apple AND banana)^10 OR orange) Thanks

Re: Corpus interpretation

2007-10-24 Thread Steven Rowe
Hi Liaqat, Liaqat Ali wrote: > I want to index the Urdu language corpus (200 documents in CES XML DTD > format). Is it necessary to break the XML file into 200 different files > or can it be indexed in the original form using Lucene? Kindly guide in > this regard. A Lucene document is composed o

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-24 Thread mark harwood
If you were talking about a reasonably stable dataset e.g. products in a catalogue that would be manageable in Lucene because the volume of updates is comparatively low (one set of categorisations maintained by the site owner e.g. something like Cnet where Solr/faceted search came from). If you

Corpus interpretation

2007-10-24 Thread Liaqat Ali
I want to index the Urdu language corpus (200 documents in CES XML DTD format). Is it necessary to break the XML file into 200 different files, or can it be indexed in the original form using Lucene? Kindly guide me in this regard.

Re: Integration of Lucene

2007-10-24 Thread Grant Ingersoll
On Oct 24, 2007, at 3:07 AM, Liaqat Ali wrote: Hi All, I'm developing a search engine for the Urdu language. I want to use Lucene for that purpose. Now the situation is that ---I have a corpus of 2000 Urdu (variant of Persian and Arabic) documents in XML form, how I will make an index of them us

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-24 Thread lucene user
Thanks very much. How large can my end users' categories grow before this implementation you have outlined will start to bog down? If my users had thousands of items categorized, would you still recommend using a term filter in this way? Tens of thousands? What is a realistic max? Is there another

Re: Is there bug in CJKAnalyzer?

2007-10-24 Thread Ivan Vasilev
Hi Steven, Thank you very much for your answer. I tested with the StandardAnalyzer and it really tokenizes the text ideograph by ideograph. Maybe, as Samir says in his mail, this is not convenient for people who use a CJK language because too many documents will match. But the thing is in this cas

Re: NullPointerException while sorting on INT

2007-10-24 Thread Bhavin Pandya
Hi, I am getting a NullPointerException while sorting on a field using SortField.INT; the same works for SortField.STRING. Sorry guys... I found my mistake. I was using the wrong type: my data is FLOAT. I used SortField.AUTO and it's working fine now. Thanks. Bhavin Pandya - O

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-24 Thread mark harwood
Given the volatility in the set membership I'd be tempted to keep that grouping info in a database rather than doing the reader/writer-open/close dance in Lucene before you can see any updates. (I suspect this is the reason you've opted not to keep the info in Lucene). You can pull a user's list

Integration of Lucene

2007-10-24 Thread Liaqat Ali
Hi All, I'm developing a search engine for the Urdu language. I want to use Lucene for that purpose. Now the situation is that ---I have a corpus of 2000 Urdu (variant of Persian and Arabic) documents in XML form, how I will make an index of them using Lucene. ---Well there will be need some stemming

NullPointerException while sorting on INT

2007-10-24 Thread Bhavin Pandya
Hi friends, I am getting a NullPointerException while sorting on a field using SortField.INT; the same works for SortField.STRING. Code Snippet: SortField sortfield = new SortField("CURRENT_BID",SortField.INT,false); // ascending sort on current bid value Sort sort =