Re: How to modify a document Field before the document is indexed?

2010-07-19 Thread Erick Erickson
One subtlety you might be able to use to advantage... This is where getPositionIncrementGap in your analyzer can be used to separate the two bits of data in the same field. If I have my own analyzer (which could be a trivial override of an existing one) that returns, say 10,000 from getPositionIncr
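
The effect of a large gap can be illustrated in plain Java, without Lucene itself. The sketch below assigns token positions to several values added under one field name and inserts a gap between consecutive values. `PositionGapSketch` and its methods are hypothetical stand-ins, not Lucene API, and the whitespace tokenization is a simplification; the exact position arithmetic inside Lucene may differ slightly.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names are Lucene API.
final class PositionGapSketch {

    // Map each token to the positions it occupies, adding `gap`
    // between consecutive values of the same field.
    static Map<String, List<Integer>> index(List<String> values, int gap) {
        Map<String, List<Integer>> positions = new HashMap<>();
        int pos = 0;
        for (int v = 0; v < values.size(); v++) {
            if (v > 0) {
                pos += gap; // the separation an analyzer's gap would introduce
            }
            for (String token : values.get(v).trim().split("\\s+")) {
                positions.computeIfAbsent(token, k -> new ArrayList<>()).add(pos);
                pos++;
            }
        }
        return positions;
    }

    // True if some occurrence of `a` lies within `maxDistance` positions of `b`.
    static boolean near(Map<String, List<Integer>> positions,
                        String a, String b, int maxDistance) {
        List<Integer> none = Collections.emptyList();
        for (int pa : positions.getOrDefault(a, none)) {
            for (int pb : positions.getOrDefault(b, none)) {
                if (Math.abs(pa - pb) <= maxDistance) {
                    return true;
                }
            }
        }
        return false;
    }
}
```

With the values "quick brown fox" and "lazy dog" and a gap of 0, "fox" and "lazy" sit next to each other; with a gap of 10,000 they do not, while "lazy" and "dog" inside the same value still do.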

Re: How to modify a document Field before the document is indexed?

2010-07-19 Thread Joe Hansen
Thanks for your reply, Koji! Your suggestion worked fine. I thought adding a field named "contents" to a document that already contains a field named "contents" would NOT do anything. But it looks like I was wrong! Thank you for your kind help! :) Regards, Joe On Mon, Jul 19, 2010 at 5:12

Re: How to modify a document Field before the document is indexed?

2010-07-19 Thread Koji Sekiguchi
(10/07/20 7:31), Joe Hansen wrote: Hey All, I am using Apache Lucene (2.9.1) and it's fast and it works great! I have a question in connection with Apache PDFBox. The following command creates a Lucene Document from a PDF file: Document document = org.apache.pdfbox.searchengine.lucene.LucenePDFD

How to modify a document Field before the document is indexed?

2010-07-19 Thread Joe Hansen
Hey All, I am using Apache Lucene (2.9.1) and it's fast and it works great! I have a question in connection with Apache PDFBox. The following command creates a Lucene Document from a PDF file: Document document = org.apache.pdfbox.searchengine.lucene.LucenePDFDocument.getDocument(docFile); The Lu

Re: Scoring exact matches higher in a stemmed field

2010-07-19 Thread Itamar Syn-Hershko
On 19/7/2010 5:50 PM, Shai Erera wrote: If your analyzer outputs b and b$ in the same position, then the below query will already be what the QP outputs today. If you want to incorporate boosting, I can suggest that you extend QP, override newTermQuery for example, and if the term is a stemmed term

RE: Building maven artifacts

2010-07-19 Thread Zhang, Lisheng
Hi Pavel, I have not done this build myself; I sent my last message based on my experience using ant on other projects. Maybe people who have worked on the maven artifacts could help? Best regards, Lisheng -Original Message- From: Pavel Minchenkov [mailto:char...@gmail.com] Sent: Monday, July 19, 2010 3:

Re: Scoring exact matches higher in a stemmed field

2010-07-19 Thread Shai Erera
If your analyzer outputs b and b$ in the same position, then the below query will already be what the QP outputs today. If you want to incorporate boosting, I can suggest that you extend QP, override newTermQuery for example, and if the term is a stemmed term, then set the query's boost (Query.setBoo

Re: Get lengthNorm of a field

2010-07-19 Thread Philippe
Hi Yonik, On 19.07.2010 16:21, Yonik Seeley wrote: On Mon, Jul 19, 2010 at 9:53 AM, Philippe wrote: is there a possibility to retrieve the lengthNorm for all (or a specific) fields in a specific document? See IndexReader: public abstract byte[] norms(String field) throws IOExcepti

Re: Get lengthNorm of a field

2010-07-19 Thread Yonik Seeley
On Mon, Jul 19, 2010 at 9:53 AM, Philippe wrote: > is there a possibility to retrieve the lengthNorm for all (or a specific) > fields in a specific document? See IndexReader: public abstract byte[] norms(String field) throws IOException; And Similarity: public float decodeNormValue(byte b) { The
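
As a rough illustration of what each byte in the norms array stands for, the sketch below computes a length norm as 1/sqrt(numTerms), which is what I understand the default similarity to use when boosts are ignored, and packs it into a single byte with a lossy float encoding. `NormSketch` and its methods are hypothetical names, and the bit layout is my assumption about the default encoding, not something stated in this thread. In real code, call `Similarity.decodeNormValue` on the bytes returned by `IndexReader.norms(field)` instead.

```java
// Hypothetical illustration only; none of these names are Lucene API.
final class NormSketch {

    // Assumed default length norm: shorter fields get larger values.
    static float lengthNorm(int numTerms) {
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    // Lossy one-byte encoding: keep only the top bits of the float.
    static byte encode(float f) {
        int bits = Float.floatToRawIntBits(f);
        int small = bits >> (24 - 3);
        int zeroPoint = (63 - 15) << 3;
        if (small <= zeroPoint) {
            // Zero or negative maps to 0; tiny positive values map to 1.
            return (bits <= 0) ? (byte) 0 : (byte) 1;
        }
        if (small >= zeroPoint + 0x100) {
            return (byte) -1; // too large: clamp to the biggest code
        }
        return (byte) (small - zeroPoint);
    }

    // Inverse of encode, up to the precision lost when encoding.
    static float decode(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = (b & 0xff) << (24 - 3);
        bits += (63 - 15) << 24;
        return Float.intBitsToFloat(bits);
    }
}
```

Under these assumptions the precision loss is visible quickly: a three-term field and a four-term field both decode to the same norm, so the stored value cannot be used to recover an exact field length.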

Get lengthNorm of a field

2010-07-19 Thread Philippe
Hi, is there a possibility to retrieve the lengthNorm for all (or a specific) fields in a specific document? Regards, Philippe

Re: API to retrieve search results without scoring or sorting

2010-07-19 Thread Yonik Seeley
On Mon, Jul 19, 2010 at 6:14 AM, Naveen Kumar wrote: > Is there any API using which I can retrieve search results, such that they > are neither scored nor sorted (for performance reasons). I just need the > results, don't need any extra computation on that. Use your own custom Collector class. -
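
A collector of this kind only has to remember which documents matched. The sketch below mimics that shape in plain Java: `DocIdSink` is a hypothetical stand-in interface, not Lucene's `Collector`, with one callback per index segment (supplying that segment's document-number offset) and one per matching document. As far as I recall, the corresponding callbacks in Lucene 2.9 are `setNextReader(IndexReader, int docBase)` and `collect(int doc)`; treat that mapping as an assumption and check the javadoc.

```java
import java.util.BitSet;

// Hypothetical stand-in for a search collector callback; not Lucene API.
interface DocIdSink {
    void startSegment(int docBase); // called once before each segment's hits
    void collect(int segmentDoc);   // called once per matching document
}

// Records matching document numbers only: no scores, no priority queue.
final class UnscoredHits implements DocIdSink {
    private final BitSet hits = new BitSet();
    private int docBase;

    @Override
    public void startSegment(int docBase) {
        this.docBase = docBase;
    }

    @Override
    public void collect(int segmentDoc) {
        // Rebase the segment-relative number to an index-wide one.
        hits.set(docBase + segmentDoc);
    }

    BitSet hits() {
        return hits;
    }
}
```

Because nothing is ranked, the work per hit is a single bit set, and the caller can iterate the `BitSet` afterwards in document order.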

API to retrieve search results without scoring or sorting

2010-07-19 Thread Naveen Kumar
Hi, is there any API using which I can retrieve search results, such that they are neither scored nor sorted (for performance reasons)? I just need the results, don't need any extra computation on that. Any suggestion will be very helpful. -- Thanks Naveen Kumar

Re: Building maven artifacts

2010-07-19 Thread Pavel Minchenkov
Hi, I don't know. I tried to set up something like this: But the error is the same. Maybe there are other parameters? 2010/7/16 Zhang, Lisheng > Hi, > > I never this kind of build before, but just from the error message > I guess it could mean two variables: > > ${project.artifactId} > ${proj