Re: Using Lucene for searching tokens, not storing them.

2006-04-17 Thread karl wettin
On 16 Apr 2006, at 19:18, karl wettin wrote: For any interested party, I do this because I have a fairly small corpus under very heavy load. I think there is a lot to gain by not creating new instances of whatnot, seeking in the file-centric Directory, parsing pseudo-UTF8, etc. at query time.

Re: search pdf

2006-04-17 Thread Shajahan
Hi, thank you for your reply. I am very sorry for asking again, but I am new to Lucene. Please tell me how to run this code. I downloaded the LuceneInAction zip file and did not find any readme file with instructions. I have also downloaded lucene-1.4.3. So please tell me how

RE: search pdf

2006-04-17 Thread Aditya Liviandi
Please take a moment to learn Java and how to use Java APIs. After that, re-read the emails you just sent us, and answer your own question. -----Original Message----- From: Shajahan [mailto:[EMAIL PROTECTED] Sent: Monday, April 17, 2006 2:22 PM To: java-user@lucene.apache.org Subject: Re:

How lucene search

2006-04-17 Thread wu fox
Hi all: Somebody told me recently that when Lucene searches a query that contains several clauses, for example the query title:lucene in action AND publisher:manning, Lucene first searches +title:lucene in action, then searches +publisher:manning, and combines the results of the two searches.

Re: search pdf

2006-04-17 Thread Erik Hatcher
There _is_ a README file at the root of the unzipped Lucene In Action code. It does require some basic Java and Ant know-how. Erik On Apr 17, 2006, at 2:21 AM, Shajahan wrote: Hi, thank you for your reply. I am very sorry for asking again, but I am new to Lucene. please

Re: Not able to retrieve hits for a phrase

2006-04-17 Thread Erik Hatcher
This could be related to the analyzer you used during indexing. Be aware that matches are *exact*, including case. Erik On Apr 17, 2006, at 1:34 AM, Vishal Bathija wrote: Hi, I am not able to retrieve the number of hits for a particular phrase. The code below retrieves the hits

RE: Catching BooleanQuery.TooManyClauses

2006-04-17 Thread bb
Thanks Erick Paul, I also found a great example of a custom filter in LIA (6.4 Using a custom filter) Here's my updated testcase if anybody is interested... = QueryParserTest.java ... public class QueryParserTest extends LuceneTestCase {

Re: Not able to retrieve hits for a phrase

2006-04-17 Thread Vishal Bathija
I currently use writer = new IndexWriter(index, new StandardAnalyzer(),true); Should I use a different analyzer? Yes, I am aware that the matches are case sensitive. Regards Vishal On 4/17/06, Erik Hatcher [EMAIL PROTECTED] wrote: This could be related to the analyzer you

Re: Not able to retrieve hits for a phrase

2006-04-17 Thread Erik Hatcher
Are the terms you're adding to PhraseQuery lowercased? If not, then that is most likely the issue. Erik On Apr 17, 2006, at 9:39 AM, Vishal Bathija wrote: I currently use writer = new IndexWriter(index, new StandardAnalyzer(),true); Should I use any other

Re: Not able to retrieve hits for a phrase

2006-04-17 Thread Vishal Bathija
Hi Erik, Thanks, that seems to have solved the problem. Can you please elaborate on the kind of input PhraseQuery takes? Am I supposed to add only lowercased terms to PhraseQuery? Is it possible to search for a phrase in a way that is not case sensitive? Regards Vishal On 4/17/06, Erik Hatcher [EMAIL

Re: Not able to retrieve hits for a phrase

2006-04-17 Thread Erik Hatcher
PhraseQuery needs terms that match what got indexed, simple as that. QueryParser does this for you by using the specified analyzer on the phrase text within double quotes and creating a PhraseQuery out of the tokens. When you're creating a PhraseQuery directly with the API, you need to
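The point about terms having to match what got indexed can be illustrated with a toy positional index. This is only a sketch in plain Java, not Lucene's API: the class `PhraseCaseDemo` and its methods `index`, `matchesPhrase`, and `analyze` are hypothetical names, and the "analyzer" here is assumed to do nothing more than split on whitespace and lowercase.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Toy positional index; a stand-in for illustration, not Lucene's API. */
final class PhraseCaseDemo {
    // term -> positions at which the term occurs in the single indexed text
    private final Map<String, List<Integer>> postings = new HashMap<>();

    /** Index-time analysis (assumed): split on whitespace, lowercase each token. */
    void index(String text) {
        String[] tokens = text.trim().split("\\s+");
        for (int pos = 0; pos < tokens.length; pos++) {
            String term = tokens[pos].toLowerCase(Locale.ROOT);
            postings.computeIfAbsent(term, k -> new ArrayList<>()).add(pos);
        }
    }

    /** Exact phrase match: terms are compared as given, with no analysis applied. */
    boolean matchesPhrase(String... terms) {
        if (terms.length == 0) return false;
        List<Integer> starts = postings.get(terms[0]);
        if (starts == null) return false;
        for (int start : starts) {
            boolean ok = true;
            // every following term must sit at the next consecutive position
            for (int i = 1; i < terms.length && ok; i++) {
                List<Integer> p = postings.get(terms[i]);
                ok = p != null && p.contains(start + i);
            }
            if (ok) return true;
        }
        return false;
    }

    /** Query-time analysis mirroring index(), as a query parser would apply it. */
    static String[] analyze(String phrase) {
        String[] tokens = phrase.trim().split("\\s+");
        for (int i = 0; i < tokens.length; i++) {
            tokens[i] = tokens[i].toLowerCase(Locale.ROOT);
        }
        return tokens;
    }
}
```

In this sketch, after `index("Lucene In Action")`, calling `matchesPhrase("Lucene", "In")` finds nothing because only lowercased terms were stored, while `matchesPhrase(PhraseCaseDemo.analyze("Lucene In"))` succeeds because the query terms went through the same analysis as the indexed text.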

hyphens

2006-04-17 Thread John Powers
Hello, If I have a user search for b-trunk I would like them to be able to find b-trunk (with hyphen). I would also like someone searching for b trunk to also find b-trunk. On the other side, if someone searches for 12412 I would like them to be able to find 12412-235, 12412-121,

how to match Documents from Hits with Documents from Query Spans?

2006-04-17 Thread Boris Galitsky
Hello, I am using span queries to get hits (Documents) and occurrences (positions) of search terms within these documents. For some reason, there is a disagreement between the order in which the Documents are returned in hits and the order in which the Documents are referenced (via order number, starting from 0) in the

Indexing - scheduled batch process or server?

2006-04-17 Thread Marc Dauncey
Hi everyone, I'm currently designing a Lucene search system and I'm considering the indexing side of things. Just wondered what kind of architecture people have adopted for indexing - are cron jobs sufficient for high volume drip feed indexing or has anyone implemented a more sophisticated

Re: Indexing - scheduled batch process or server?

2006-04-17 Thread Jeremy Hanna
I'm pretty new with this, but with my index for a database, I'm using a Quartz scheduler. Also at the end of the index update, I set my singleton of IndexSearcher to null. That way the index searcher will be using the latest information. That bit as well as setting it to null and not
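The "set the singleton to null after an update" idea can be sketched as a small holder that reopens lazily. This is an illustration under assumptions, not Jeremy's actual code: `ReloadableHolder`, `get`, and `invalidate` are hypothetical names, and the type parameter `T` stands in for a searcher type such as IndexSearcher.

```java
import java.util.function.Supplier;

/** Hypothetical holder; T stands in for a searcher type such as IndexSearcher. */
final class ReloadableHolder<T> {
    private final Supplier<T> opener; // how to open a fresh instance
    private T current;                // null means "reopen on next use"

    ReloadableHolder(Supplier<T> opener) {
        this.opener = opener;
    }

    /** Returns the cached instance, opening one lazily if none is cached. */
    synchronized T get() {
        if (current == null) {
            current = opener.get();
        }
        return current;
    }

    /** Meant to be called at the end of an index update, e.g. by the scheduled job. */
    synchronized void invalidate() {
        current = null;
    }
}
```

The scheduled indexing job would call `invalidate()` once the update has finished, and the next search would pick up a freshly opened instance through `get()`. The sketch leaves out closing the old instance, which a real searcher would probably need once in-flight searches have completed.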

Technical Lead - Search

2006-04-17 Thread Joe Taylor
All- We are looking for someone with search experience (we leverage Lucene) to lead a small team of developers as described below. If you are interested, send your resume to [EMAIL PROTECTED] Thanks. Joe Job Title: Technical Lead/Engineering Manager - Ariba Content Summary: Ariba

Re: hyphens

2006-04-17 Thread karl wettin
On 17 Apr 2006, at 18:59, John Powers wrote: Hello, If I have a user search for b-trunk I would like them to be able to find b-trunk (with hyphen). I would also like someone searching for b trunk to also find b-trunk. If you don't care about spans, make a filter that rebuilds the token at
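Karl's reply is cut off in this archive, so the following is not necessarily the filter he had in mind. It is one possible sketch, in plain Java rather than as a Lucene TokenFilter, of expanding a hyphenated token into the terms that could be indexed for it. The class `HyphenExpander` and its method `expand` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Hypothetical helper: expands a hyphenated token into index terms. */
final class HyphenExpander {
    /** Returns the lowercased token plus, if it is hyphenated, each non-empty part. */
    static List<String> expand(String token) {
        String lower = token.toLowerCase(Locale.ROOT);
        List<String> out = new ArrayList<>();
        out.add(lower); // keep the whole token so "b-trunk" still matches itself
        if (lower.contains("-")) {
            for (String part : lower.split("-")) {
                if (!part.isEmpty()) {
                    out.add(part); // "b", "trunk" or "12412", "235"
                }
            }
        }
        return out;
    }
}
```

With the expanded terms indexed, a query for b-trunk, for the two terms b and trunk, or for 12412 would each find a matching term. Positions of the parts are not tracked here, which fits the "if you don't care about spans" caveat.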

Maximum Length of String for a Field

2006-04-17 Thread Ananth T. Sarathy
Is there a maximum length for a string that is analyzed and put into a field? I.e., if the String is 1 billion characters and analyzed, tokenized, and the last word in the string only appears once at the end, would searching for that last word against that field end with a hit for that document? --

Maximum Length of String for a Field

2006-04-17 Thread Tom Hill
Is there a maximum length for a string that is analyzed and put into a field? I.e., if the String is 1 billion characters and analyzed, tokenized, and the last word in the string only appears once at the end, would searching for that last word against that field end with a hit for that document?

Subscribe me

2006-04-17 Thread Sunil Kumar PK