Re: Indexing performance with Lucene 1.9

2006-03-01 Thread Eric Jain
Eric Jain wrote: I'll rerun the indexing procedure with the old version overnight, just to be sure. Just to confirm: There no longer seems to be any difference in indexing performance between the nightly build and 1.4.3. - T

Solr, the Lucene based Search Server

2006-03-01 Thread Yonik Seeley
Solr is a new open-source search server that's based on Lucene, and has XML/HTTP interfaces for updating and querying, declarative specification of analyzers and field types via a schema, extensive caching, replication, and a web admin interface. It's already in production use at CNET, and has rec

Re: Solr, the Lucene based Search Server

2006-03-01 Thread Eric Jain
Yonik Seeley wrote: Solr is a new open-source search server that's based on Lucene, and has XML/HTTP interfaces for updating and querying, declarative specification of analyzers and field types via a schema, extensive caching, replication, and a web admin interface. Just had a look, quite impre

Re: Solr, the Lucene based Search Server

2006-03-01 Thread Yonik Seeley
On 3/1/06, Eric Jain <[EMAIL PROTECTED]> wrote: > I noticed that you have a WordDelimiterFilter; any chance that this will be > contributed back to Lucene? This class is really useful! (In fact I was > just trying to write something similar myself...) Definitely. Solr uses Java5, so it would need

Re: Efficiently updating indexed documents

2006-03-01 Thread Yonik Seeley
Hi Nadav, This is exactly the approach Solr uses by default, and it works fine. see doDeletions() on DirectUpdateHandler2 http://svn.apache.org/viewcvs.cgi/incubator/solr/trunk/src/java/org/apache/solr/update/DirectUpdateHandler2.java?rev=372455&view=markup We keep a Map of id->num_to_save that

Re: Hacking proximity search: looking for feedback

2006-03-01 Thread Jeff Rodenburg
Thanks to everyone on the replies. I'm going to try several of these approaches and with equivalent data sets and run some side-by-side tests. No timeframes guarantees here, but I'll report back with the different approaches and the test results. cheers, -- j On 2/28/06, Chris Hostetter <[EMAI

Term Vector

2006-03-01 Thread Srikanth Kallurkar
Hi all, Can a vector of terms be contructed in Lucene. A term vector for a document, for example, would contain the indexed terms of that document. Also, is it possible to get a handle to the actual term-document matrix. Thanks in advance, Srikanth -

Re: Term Vector

2006-03-01 Thread Rajesh Munavalli
This has been discussed previously. Here are the links http://www.gossamer-threads.com/lists/lucene/java-user/9189#9189 http://www.gossamer-threads.com/lists/lucene/java-user/32362#32362 Hope that helps, Rajesh Munavalli On 3/1/06, Srikanth Kallurkar <[EMAIL PROTECTED]> wrote: > > Hi all, > >

Offsets

2006-03-01 Thread Seeta Somagani
Is there a way to get the startOffset and endOffset if I know the termPosition in the field?

Re: Hacking proximity search: looking for feedback

2006-03-01 Thread Doug Cutting
Jeff Rodenburg wrote: Following on the Range Query approach, how is performance? I found the range approach (albeit with the exact values) to be slower than the parsed-string approach I posited. Note that Hoss suggested RangeFilter, not RangeQuery. Or perhaps ConstantScoreRangeQuery, which i

field search problems

2006-03-01 Thread Standa Opichal
Hi! I have just downloaded the 1.9 release. Put the provided .war file to my Tomcat install and successfully created the index which I am able to query. All that in few minutes. I must say this is great achievement from user point of view. However as I have read the tutorial and attempted to use

Lucene 1.9-final release available

2006-03-01 Thread Doug Cutting
Release 1.9-final of Lucene is now available from: http://www.apache.org/dyn/closer.cgi/lucene/java/ This release has many improvements since release 1.4.3, including new features, performance improvements, bug fixes, etc. For details, see: http://svn.apache.org/viewcvs.cgi/*checkout*/lucene/j

Re: Lucene 1.9-final release available

2006-03-01 Thread info
Great work, team, Will the jars be available as well via maven (http://www.ibiblio.org/maven/lucene/jars/)? It would be great if the core as well as all the contrib jars would. Michael Franken Doug Cutting wrote: Release 1.9-final of Lucene is now available from: http://www.apache.org/dyn/

Re: Get list with found words for a hit?

2006-03-01 Thread Samuru Jackson
Hi! > For the purpose of highlighting, you don't necessarily need to know in > advance > which word matched: you can just highlight any occurance of either Paris or > London - wherever you find them - in the original text. As I just discovered that Highlighter is not what I need in my project. It

Re: Get list with found words for a hit?

2006-03-01 Thread Chris Hostetter
: Is there a way to retrieve only the search "words" the user entered in BNF form? : Query ::= ( Clause )* : Clause ::= ["+", "-"] [ ":"] ( | "(" Query ")" ) : : For example the user entered: +Paris -London : : I want to get a List that contains only {Paris, London} your use of "TERM" is a lit

RE: Adding line count to a document

2006-03-01 Thread Eyal
I think my questions wasn't clear.. Let's say I'm doing something like that (c# code, but that's not the issue..) TextReader reader=new StreamReader("C:\FileToIndex.txt"); Int lineCount=CountLines("C:\FileToIndex.txt"); //This ones reads the entire file and count the number of lines Document do

javadoc lookup

2006-03-01 Thread Larry Ogrodnek
Hey, I put together a little ajax / lucene javadoc lookup site that I just wanted to share I've found it pretty useful to be able to just type a few letters instead of navigating through the standard javadoc frames... http://jdk.representqueens.com:9090/s/jdk/ http://jdk.representqueens.c

Re: javadoc lookup

2006-03-01 Thread Paul Smith
That is neat... nice work. On 02/03/2006, at 10:23 AM, Larry Ogrodnek wrote: Hey, I put together a little ajax / lucene javadoc lookup site that I just wanted to share I've found it pretty useful to be able to just type a few letters instead of navigating through the standard javadoc fr

Re: Hacking proximity search: looking for feedback

2006-03-01 Thread Jeff Rodenburg
Very good note, I missed that. I need the development environment in front of me to remember all the different class names correctly. ;-) -- j On 3/1/06, Doug Cutting <[EMAIL PROTECTED]> wrote: > > Jeff Rodenburg wrote: > > Following on the Range Query approach, how is performance? I found th

Re: javadoc lookup

2006-03-01 Thread Erik Hatcher
Nice! I'm curious, what do you have cooking on the server-side? Having Lucene showing terms from an index of javadocs would allow for partial matches such that QueryParser pops ups if someone types "parser" (provided the text was tokenized at case shifts). Erik On Mar 1, 2006,

sentences and paragraph search requirement

2006-03-01 Thread sog
I am implement a search engine, I do some special search requirement: Grammatical Connectors /s Search terms in the same sentence: design /s defect 同一个句子 /p Search terms in the same paragraph: hearsay /p utterance 同一个段落 +s The first term preceding the second wit

Re: javadoc lookup

2006-03-01 Thread Yonik Seeley
Nice indeed. I use IntelliJ for such things when I have it open, but when I don't I have to click through the SVN repository (I'm normally looking for source, not javadoc). Could you add a link to the source as well? -Yonik - To

Re: javadoc lookup

2006-03-01 Thread Daniel Noll
Larry Ogrodnek wrote: Hey, I put together a little ajax / lucene javadoc lookup site that I just wanted to share I've found it pretty useful to be able to just type a few letters instead of navigating through the standard javadoc frames... http://jdk.representqueens.com:9090/s/jdk/ htt

How to intergrate lucene with my web application

2006-03-01 Thread Haritha_Parvatham
Hi, Thanks to all, I have sucessfully installed lucene 1.4.3 & able to run it. Now ,I would like to intergrate lucene with my web application.Please guide me the steps for doing intergration. Iam using snowball stemmer for my application Since i want lucene to support multilingualism. But i don

Re: How to intergrate lucene with my web application

2006-03-01 Thread gekkokid
hi, i would download the 1.9 version as your starting fresh (unless you need the 1.4.3 version for some reason), what is your web application? and what should lucene be doing when intergrated with your web app? there is a simple example in the binary 1.9 download, /src/jsp, look at "results.js

RE: How to intergrate lucene with my web application

2006-03-01 Thread Haritha_Parvatham
Hi, Actually,I have created front-end design page in html.From the front-end I want to search the text. So how I will intergrate lucene with my frontend page.How I will store my own text in lucene.I believe in lucene it contains default java docs text in index file.How I will replace it with my ne

Re: sentences and paragraph search requirement

2006-03-01 Thread Chris Hostetter
: Grammatical Connectors /s Search terms in the same sentence: : /p Search terms in the same paragraph: : +s The first term preceding the second within the same sentence: : +p The first term preceding the second within the same paragraph: : I am not sure lucene support thi