Re: Query time boosting with dismax

2009-12-05 Thread Uri Boness
You can actually define boost queries to do that (bq parameter). Boost queries accept the standard Lucene query syntax and eventually appended to the user query. Just make sure that the default operator is set to OR other wise these boost queries will not only influence the boosts but also

Embedded for write, HTTP for read - cache aging

2009-12-05 Thread Peter 4U
Hello, Does anyone know of a way to tell an http SolrServer to reload its back-end index (mark cache as dirty) periodically? I have a scenario where an EmbeddedSolrServer is used for writing (for fast indexing), and an CommonsHttpSolrServer for reading (for remote access). If the

Re: Sanity check on numeric types and which of them to use

2009-12-05 Thread Marc Sturlese
And what about: fieldtype name=sint class=solr.SortableIntField sortMissingLast=true/ vs. fieldtype name=bcdint class=solr.BCDIntField sortMissingLast=true/ Wich is the differenece between both? It's just bcdint always better? Thanks in advance Yonik Seeley-2 wrote: On Fri, Dec 4, 2009 at

Re: Sanity check on numeric types and which of them to use

2009-12-05 Thread Yonik Seeley
On Sat, Dec 5, 2009 at 7:02 AM, Marc Sturlese marc.sturl...@gmail.com wrote: And what about: fieldtype name=sint class=solr.SortableIntField sortMissingLast=true/ vs. fieldtype name=bcdint class=solr.BCDIntField sortMissingLast=true/ Wich is the differenece between both? It's just bcdint

Re: Query time boosting with dismax

2009-12-05 Thread Erik Hatcher
Are you sure about the default operator and bq? I assume we're talking about the setting in schema.xml. I think boosting queries are OR'd in automatically to the main query: From DismaxQParser#addBoostQuery() ... query.add(f, BooleanClause.Occur.SHOULD);... There is one case where

Re: Query time boosting with dismax

2009-12-05 Thread Uri Boness
Well.. this is mainly based on some experiments I did (not based on the code base). It appeared as if the boost queries were appended to the generated dismax query and if the default operator (in the schema) was set to AND it actually filtered out the request. For example, here's a dismax

Re: Query time boosting with dismax

2009-12-05 Thread Uri Boness
Checking it further by looking at the code, it seems that in most cases it indeed adds the boost queries as SHOULD. But if you define *one* bq parameter which contains a boolean query, then each clause in this boolean query will be added to the query as is. Therefore: This set up will filter

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-05 Thread Koji Sekiguchi
Robin Wojciki wrote: I am running a search in Solr 1.4 and I am getting the StringIndexOutOfBoundsException pasted below. The spell check field uses HTMLStripCharFilterFactory. However, the search works fine if I do not use the HTMLStripCharFilterFactory. If I set a breakpoint at

Re: solr 1.4: multi-select for statscomponent

2009-12-05 Thread gunjan_versata
Is there any update on this requirement?? Britske wrote: Is there way to exclude filters from a stats field, like it is possible to exclude filters from a facet.field? It didn't work for me. i.e: I have a field price, and although I filter on price, I would like to be able to get the

Re: HTML Stripping slower in Solr 1.4?

2009-12-05 Thread Koji Sekiguchi
Yonik Seeley wrote: Is BaseCharFilter required for the html strip filter? -Yonik http://www.lucidimagination.com It could be if HTMLStripCharFilter is reverted to first version. The first version of HTMLStripCharFilter, for example, if we have paaa, it producesaaa (3 space chars prior

Re: Retrieving large num of docs

2009-12-05 Thread Raghuveer Kancherla
Hi Otis, I think my experiments are not conclusive about reduction in search time. I was playing around with various configurations to reduce the time to retrieve documents from Solr. I am sure that making the two multi valued text fields from stored to un-stored, retrieval time (query time + time

Re: WELCOME to solr-user@lucene.apache.org

2009-12-05 Thread Raghuveer Kancherla
2 ways I can think of ... - ExtractingRequestHandler (this is what I am guessing you are using now) Set extractOnly=true while making a request to the extractingRequestHandler and get the parsed content back. Now make a post request on update request handler with what ever fields and field

Re: Document Decay

2009-12-05 Thread Grant Ingersoll
On Dec 4, 2009, at 1:56 AM, brad anderson wrote: Hi, I'm looking for a way to have the score of documents decay over time. I want older documents to have a lower score than newer documents. I noted the ReciprocalFloatFunction class. In an example it seemed to be doing just this when you

Re: WELCOME to solr-user@lucene.apache.org

2009-12-05 Thread khalid y
Thanks a lot for you response !! For the first solution : I need to index all the content of my websites and I want just tika ignore meta name=id because I have already an id I'll try monday and tell you if it works The second solution : Are your sure Tika use the HTML Tokenizer ? I'll check

parsing the raw query string?

2009-12-05 Thread regany
I've just found solr and am looking at what's involved to work with it. All the examples I've seen only ever use 1 word search terms being implemented as examples, which doesn't help me trying to see how multiple word queries work. It also looks like a hell of a lot of processing needs to be done