Re: Search Across All Fields

2009-01-16 Thread Jamie
Hi Erick Thanks for the pointer. I dont know how I missed that. Our index sizes are absolutely huge so its not really practical in putting an all_text field. It would great if you could introduce a macro or something that one could use to specify all fields. Thanks anyway! Jamie Erick Eri

Re: term offsets info seems to be wrong...

2009-01-16 Thread Koji Sekiguchi
Mark, This is exactly what I want and It worked perfectly. Thanks! I'll post my highlighter to JIRA in a few days (hopegully). It uses term offsets with positions (WITH_POSITIONS_OFFSETS) to support PhraseQuery. Thanks again, Koji Mark Miller wrote: Okay, Koji, hopefully I'll be more luckily

Re: term offsets info seems to be wrong...

2009-01-16 Thread Mark Miller
Okay, Koji, hopefully I'll be more luckily suggesting this this time. Have you tried http://issues.apache.org/jira/browse/LUCENE-1448 yet? I am not sure if its in an applyable state, but I hope that covers your issue. On Fri, Jan 16, 2009 at 7:15 PM, Koji Sekiguchi wrote: > Hello, > > I'm writi

term offsets info seems to be wrong...

2009-01-16 Thread Koji Sekiguchi
Hello, I'm writing a highlighter by using term offsets info (yes, I borrowed the idea of LUCENE-644). In my highlighter, I'm seeing unexpected term offsets info when getting multi-valued field. For example, if I indexed [" "," bbb "] (multi-valued), I got term info bbb(7,10). This is expected

Re: Search Across All Fields

2009-01-16 Thread Erick Erickson
I think you forgot a set of parentheses, a close paren right before the AND and an open paren right after AND Depending upon how big your index is, a MUCH easier way to do this is to index another field, call it all_text say, and add all your terms to that field as well as to the individual one, t

Re: Words that need protection from stemming, i.e., protwords.txt

2009-01-16 Thread patrick o'leary
Porter is a little outdated I've found KStem much better http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem You'll still need a good protected word list, but KStem is just a little nicer On Fri, Jan 16, 2009 at 6:20 PM, David Woodward wrote: > Hi. > > Any good protwords.txt out t

Words that need protection from stemming, i.e., protwords.txt

2009-01-16 Thread David Woodward
Hi. Any good protwords.txt out there? In a fairly standard solr analyzer chain, we use the English Porter analyzer like so: For most purposes the porter does just fine, but occasionally words come along that really don't work out to well, e.g., "maine" is stemmed to "main" - clearly goofing

RE: Search Across All Fields

2009-01-16 Thread Zhang, Lisheng
Hi, Inside (priority:beauty ..) there is an AND, is that operator what you want? Best regards, Lisheng -Original Message- From: Jamie [mailto:ja...@stimulussoft.com] Sent: Friday, January 16, 2009 3:02 PM To: java-user@lucene.apache.org Subject: Search Across All Fields Hi Everyone I

Search Across All Fields

2009-01-16 Thread Jamie
Hi Everyone I have two queries: Query 1 == (attachments:"beauty supply") AND sentdate:[d2008111701 TO d20090117235900] Query 2 == (priority:beauty attach:beauty score:beauty size:beauty sentdate:beauty archivedate:beauty receiveddate:beauty from:beauty to:beauty subject:beauty

Re: Nightly source builds of Lucene ..

2009-01-16 Thread Ryan McKinley
maybe try: http://hudson.zones.apache.org/hudson/view/Solr/job/Solr-trunk/ On Jan 16, 2009, at 4:47 PM, Kay Kay wrote: I am trying to access the nightly lucene builds here at - http://people.apache.org/builds/lucene/java/nightly/ . It does not seem to be available for sometime. Just cur

Nightly source builds of Lucene ..

2009-01-16 Thread Kay Kay
I am trying to access the nightly lucene builds here at - http://people.apache.org/builds/lucene/java/nightly/ . It does not seem to be available for sometime. Just curious if that is the right source to access the same. ---

Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread Mark Miller
Welcome Patrick! +1 for LocalLucene. patrick o'leary wrote: Thanks Folks I'm in the business well over a decade now; Started my career in my country of origin in Ireland, and have since lived & worked in UK and the US. I've also traveled extensively establishing development groups in remote of

Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread Shalin Shekhar Mangar
Welcome Patrick! On Sat, Jan 17, 2009 at 1:22 AM, patrick o'leary wrote: > Thanks Folks > > I'm in the business well over a decade now; Started my career in my country > of origin in Ireland, and have since lived & worked in UK and the US. I've > also traveled extensively establishing developmen

Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread Michael McCandless
Welcome aboard Patrick! Mike patrick o'leary wrote: Thanks Folks I'm in the business well over a decade now; Started my career in my country of origin in Ireland, and have since lived & worked in UK and the US. I've also traveled extensively establishing development groups in remote of

Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread patrick o'leary
Thanks Folks I'm in the business well over a decade now; Started my career in my country of origin in Ireland, and have since lived & worked in UK and the US. I've also traveled extensively establishing development groups in remote offices for my company in a few countries. I've worked in several

Re: Term Frequency and IndexSearcher

2009-01-16 Thread Chris Hostetter
: References: : : <1998.130.159.185.12.1232021837.squir...@webmail.cis.strath.ac.uk> : Date: Thu, 15 Jan 2009 04:49:49 -0800 (PST) : Subject: Term Frequency and IndexSearcher http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion

Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread Ryan McKinley
dooh, never hit paste in the subject line On Jan 16, 2009, at 1:54 PM, Ryan McKinley wrote: The PMC is pleased to announce that Patrick O'Leary has been voted to be a a Lucene-Java Contrib committer. Patrick has contributed a great foundation for integrating spatial search with lucene. I

ANNOUNCE: Welcome as Contrib Committer

2009-01-16 Thread Ryan McKinley
The PMC is pleased to announce that Patrick O'Leary has been voted to be a a Lucene-Java Contrib committer. Patrick has contributed a great foundation for integrating spatial search with lucene. I look forward to future development in this area. Patrick - traditionally we ask you to send o

RE: clustering with compass & terracotta

2009-01-16 Thread Angel, Eric
Glen, Thanks for the links. I'll try these out and see. -Original Message- From: Glen Newton [mailto:glen.new...@gmail.com] Sent: Thursday, January 15, 2009 12:06 PM To: java-user@lucene.apache.org Subject: Re: clustering with compass & terracotta There is a discussion here: http://ww

Re: Lucene index updation and performance

2009-01-16 Thread Erick Erickson
You should look over the FAQ, lots of information there. See: http://wiki.apache.org/lucene-java/LuceneFAQ You can index and search in parallel, but a searcher doesn't see additions to an indexer until the underlying IndexReader is closed/reopened (see the FAQ section: Does Lucene allow searchin

RE: Lucene index updation and performance

2009-01-16 Thread Angel, Eric
You can simply call IndexWriter.addDocument() for new jobs and IndexWriter.updateDocument http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/index/IndexWri ter.html Also, don't forget to optimize your index. Depending on your volume, you might want to optimize during slow traffic. Eric A

Maximum boost factor

2009-01-16 Thread mitu2009
Does anyone know the maximum boost factor value for a field in Lucene? Thanks! -- View this message in context: http://www.nabble.com/Maximum-boost-factor-tp21504717p21504717.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --

Lucene index updation and performance

2009-01-16 Thread mitu2009
I am working on a job portal site and have been using Lucene for job search functionality. Users will be posting a number jobs on our site on a daily basis.We need to make sure that new job posted is searchable on the site as soon as possible. In this context, how do I update Lucene index when a

Re: Google finance-like suggestible search field

2009-01-16 Thread Shalin Shekhar Mangar
Also look at ConstantScorePrefixQuery in Solr source. In the past I've used Solr with shingles and prefix queries to solve similar problems. On Thu, Jan 15, 2009 at 7:29 AM, Hayes, Peter wrote: > Hi all, > > We are trying to implement a Google finance-like suggest as you type > search field. T

Re: Google finance-like suggestible search field

2009-01-16 Thread Asbjørn A . Fellinghaug
Hi again. You can find additional info regarding this Bigram index here: http://asbjorn.fellinghaug.com/blog/master-thesis/ The source code was available, from the same site but it has disappeared. However, it can be downloaded from the computer science department at NTNU in Norway: http://dai