Re: Solr like for autocomplete field?

2010-11-03 Thread truebner
Have a look at ajax-solr (http://evolvingweb.github.com/ajax-solr/); the tutorial includes an example of an autocompletion widget. tob From: Amit Nithian To: solr-user@lucene.apache.org Date: 03.11.2010 07:36 Subject: Re: Solr like for autocomplete field? I implemented the edge ngrams solution a
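
For readers unfamiliar with the edge n-gram approach mentioned in the quoted message, here is a minimal Python sketch of the idea. It is not Amit's implementation (the message is cut off before any details), and it does not call Solr; it only imitates what an edge n-gram filter roughly does at index time. The function names and the min_gram/max_gram defaults are hypothetical.

```python
def edge_ngrams(term, min_gram=1, max_gram=15):
    """Return the leading-edge n-grams of a term, shortest first."""
    term = term.lower()
    upper = min(max_gram, len(term))
    return [term[:n] for n in range(min_gram, upper + 1)]


def build_index(terms, min_gram=1, max_gram=15):
    """Map every edge n-gram to the set of full terms it is a prefix of."""
    index = {}
    for term in terms:
        for gram in edge_ngrams(term, min_gram, max_gram):
            index.setdefault(gram, set()).add(term)
    return index


def suggest(index, prefix):
    """Look up the typed prefix as an exact key, as a query against n-grams would."""
    return sorted(index.get(prefix.lower(), set()))
```

Because the prefixes are expanded when the index is built, a lookup at query time is a plain exact match on what the user has typed so far, which is the main appeal of the technique.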

RE: Searching Across Multiple Cores

2010-11-03 Thread Lohrenz, Steven
Sorry about the late response to this, but was on holidays. No, as of right now there is not the same schema in each shard. I need to be able to search a set of data resources with manually defined data fields. All of those fields are searchable. Any one of these resources can be added to an

Re: Updating last_modified field when using DIH

2010-11-03 Thread Stefan Matheis
Juan, that's correct .. solr will not touch your database, that's part of your application-code. solr uses an updated timestamp (which is available through dataimporter.last_index_time). so, imagine the following situation, solr import runs every 10 minutes .. last run at 11:00, your entity gets up

RE: Updating last_modified field when using DIH

2010-11-03 Thread Ephraim Ofir
Also, your deltaImportQuery should be: deltaImportQuery='SELECT * FROM "Entities" WHERE "ent_id"=${dataimporter.delta.id}' Otherwise you're just importing the ids and not the rest of the data. If performance is important to you, you might also want to check out http://mail-archives.apache.org/mo

Re: Query question

2010-11-03 Thread Ahmet Arslan
> My impression was that > > city:Chicago^10 +Romantic +View > > would do what you want (with the standard lucene query > parser and default operator OR), and I'm not sure about > this, but I have a feeling that the version with "Boolean" > operators AND/OR and parens might actually net out to th

Re: Query question

2010-11-03 Thread kenf_nc
Unfortunately the default operator is set to AND and I can't change that at this time. If I do (city:Chicago^10 OR Romantic OR View) it returns way too many unwanted results. If I do (city:Chicago^10 OR (Romantic AND View)) it returns fewer unwanted results, but still a lot. iorixxx's solution o

Core status uptime and startTime

2010-11-03 Thread Marc Sturlese
As far as I know, on the core admin page you can find the last time an index was modified and committed by checking lastModified. But what do startTime and uptime mean? Thanks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/Core-status-uptime-and-s

Re: Query question

2010-11-03 Thread Mike Sokolov
Another alternative (prettier to my eye), would be: (city:Chicago AND Romantic AND View)^10 OR (Romantic AND View) -Mike On 11/03/2010 09:28 AM, kenf_nc wrote: Unfortunately the default operator is set to AND and I can't change that at this time. If I do (city:Chicago^10 OR Romantic OR Vi

Corename after Swap in MultiCore

2010-11-03 Thread sivaram
Hi everyone, Long question but please hold on. I'm using a multicore Solr instance to index different documents from different sources (around 4) and I'm using a common config for all the cores. So, for each source I have a core and a temp core like 'doc' and 'doc-temp'. So, every time I want to get n

RE: Query question

2010-11-03 Thread cbennett
Another option is to override the default operator in the query. {!lucene q.op=OR}city:Chicago^10 +Romantic +View Colin. > -Original Message- > From: Mike Sokolov [mailto:soko...@ifactory.com] > Sent: Wednesday, November 03, 2010 9:42 AM > To: solr-user@lucene.apache.org > Cc: kenf_nc >
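
A small Python sketch of how the local-params form above could be sent as a request parameter. The base URL and the helper name are assumptions for illustration; only the `{!lucene q.op=OR}` prefix comes from the message. The point is that the literal `+` signs in the query must be percent-encoded, otherwise they arrive at the server as spaces.

```python
from urllib.parse import urlencode, parse_qs


def build_select_url(base_url, query, default_op="OR"):
    """Prefix the query with {!lucene q.op=...} and URL-encode it."""
    if default_op not in ("AND", "OR"):
        raise ValueError("default_op must be 'AND' or 'OR'")
    q = "{!lucene q.op=%s}%s" % (default_op, query)
    # urlencode percent-encodes '+' as %2B and turns spaces into '+'.
    return base_url + "?" + urlencode({"q": q})


# Hypothetical endpoint; adjust to the actual Solr instance.
url = build_select_url(
    "http://localhost:8983/solr/select",
    "city:Chicago^10 +Romantic +View",
)
```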

Re: Influencing scores on values in multiValue fields

2010-11-03 Thread Jonathan Rochkind
Be careful of multi-term queries and String types. By multi-term here, I mean multi-term according to the 'pre-tokenization' that dismax and standard parsers do -- basically on whitespace. If you have a string with whitespace as a single (non-tokenized field) in a Solr String type, and you h

Re: Searching Across Multiple Cores

2010-11-03 Thread Jonathan Rochkind
Basically, Solr doesn't do that. It seems to be a frequent topic on the listserv, people wanting Solr to be able to do something like that. But, as far as I know, it doesn't -- and I don't have a good idea of alternate ways to solve that kind of problem either. Try putting everything in the same c

Re: Possible memory leaks with frequent replication

2010-11-03 Thread Jonathan Rochkind
I hadn't looked at the code, am not familiar with Solr code, and can't say what that code does. But I have experienced issues that I _believe_ were caused by too frequent commits causing overlapping searcher preparation. And I've definitely seen Solr documentation that suggests this is an iss

Re: Possible memory leaks with frequent replication

2010-11-03 Thread Jonathan Rochkind
Ah, but reading Peter's email message I reference more carefully, it seems that Solr already DOES provide an info-level log warning you about overlapping warming, awesome. (But again, I'm pretty sure it does NOT throw an HTTP error in that condition, based on my and others' experience). > To

Re: A bug in ComplexPhraseQuery ?

2010-11-03 Thread jmr
iorixxx wrote: > > > I added this change to SOLR-1604, can you test it and give us feedback? > > Hi, Sorry for the delay. We have tested the change and it is OK for this. However, we have found that this query is crashing when using ComplexPhraseQuery: "sulfur-reducing bacteria" It is due to

Override SynonymFilterFactory to load synonyms from alternate data source

2010-11-03 Thread Will Milspec
Hi all, Can anyone comment on the ease/merit of overriding the shipped SynonymFilterFactory with a version that could load the synonyms from an alternate data source? Our application currently maintains synonyms in its database ; we could export this data to 'synonyms.txt', but would prefer a db

Re: Override SynonymFilterFactory to load synonyms from alternate data source

2010-11-03 Thread Ahmet Arslan
> Our application currently maintains synonyms in its > database; we could > export this data to 'synonyms.txt', but would prefer a db > aware > implementation of SynonymFilterFactory, i.e. avoiding that > middle step. > > From the looks of the class (private instances, static > methods), it doe

Negative or zero value for fieldNorm

2010-11-03 Thread Markus Jelsma
Hi all, I've got a puzzling issue here. During tests I noticed a document at the bottom of the results where it should not be. I query using DisMax on title and content field and have a boost on title using qf. Out of 30 results, only two documents also have the term in the title. Using deb

blacklist docs by uniqueKey

2010-11-03 Thread Ravi Kiran
Hello, I have a single core servicing 3 different applications. One of the applications doesn't want some specific docs to show up (driven by an editorial decision). Over time the number of blacklisted docs could grow, hence I do not want to restrict them in a query as it the query c

Question about morelikethis and multiple fields

2010-11-03 Thread ahammad
Hello, I'm trying to implement a "Related Articles" feature within my search application using the mlt handler. To give you a little background information, my Solr index contains a single core that is created by merging 10+ other cores. Within this core is my main data item known as an "article

Re: Negative or zero value for fieldNorm

2010-11-03 Thread Yonik Seeley
Regarding "Negative or zero value for fieldNorm", I don't see any negative fieldNorms here... just very small positive ones? Anyway the fieldNorm is the product of the lengthNorm and the index-time boost of the field (which is itself the product of the index time boost on the document and the inde
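
To make the arithmetic concrete, here is a hedged Python sketch of the product described above. It assumes the classic Lucene default length norm of 1/sqrt(number of terms in the field), and it assumes the second factor cut off in the message is a field-level boost; both assumptions are marked in the comments. It also ignores the fact that Lucene stores the norm in a single byte, so values seen in debug output are coarser than what this computes.

```python
import math


def length_norm(num_terms):
    """Assumed default: 1 / sqrt(number of terms in the field)."""
    if num_terms <= 0:
        raise ValueError("num_terms must be positive")
    return 1.0 / math.sqrt(num_terms)


def field_norm(num_terms, doc_boost=1.0, field_boost=1.0):
    """fieldNorm = lengthNorm * index-time boost.

    The index-time boost is modelled as doc_boost * field_boost; the
    field_boost factor is an assumption, since the source text is truncated.
    """
    return doc_boost * field_boost * length_norm(num_terms)
```

Under these assumptions a 4-term field with no boosts gets a norm of 0.5, while a 10,000-term field gets 0.01, which shows how a very long field ends up with a very small but still positive norm.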

Re: blacklist docs by uniqueKey

2010-11-03 Thread Erick Erickson
How dynamic is this list? Is it feasible to add a field to your docs like blacklisteddocs, and at editorial's discretion add values to that field like "app1", "app2"? At that point you can just filter them out via a filter query... Best Erick On Wed, Nov 3, 2010 at 2:40 PM, Ravi Kiran wrote: >
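
A minimal Python sketch of the filter-query idea suggested above: each application adds a negative fq on the blacklist field, so documents tagged for that application are excluded without lengthening the main query. The field name blacklisteddocs and the values "app1"/"app2" come from the message; the helper name and the example query are hypothetical.

```python
from urllib.parse import urlencode, parse_qsl


def build_params(user_query, app_name, blacklist_field="blacklisteddocs"):
    """Return request parameters with a negative filter query for one app."""
    return [
        ("q", user_query),
        # Exclude every document whose blacklist field contains this app.
        ("fq", "-%s:%s" % (blacklist_field, app_name)),
    ]


params = build_params("budget vote", "app1")
query_string = urlencode(params)
```

Since the filter is a fixed string per application, it stays the same size no matter how many documents editorial blacklists.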

Re: blacklist docs by uniqueKey

2010-11-03 Thread Yonik Seeley
On Wed, Nov 3, 2010 at 3:05 PM, Erick Erickson wrote: > How dynamic is this list? Is it feasible to add a field to your docs like > blacklisteddocs, and at editorial's discretion add values to that field > like "app1", "app2"? > > At that point you can just filter them out via a filter query... R

Re: Question about morelikethis and multiple fields

2010-11-03 Thread darren
Try adding TFV's (term frequency vectors) to the title field as well as the body. On Wed, 3 Nov 2010 11:41:35 -0700 (PDT), ahammad wrote: > Hello, > > I'm trying to implement a "Related Articles" feature within my search > application using the mlt handler. > > To give you a little background

How to display the synonyms

2010-11-03 Thread jayant
Hi, If the synonym.txt file defines the following: castle,fort I am able to match "fort" when the user wants to search for "castle". However, I would like to tell the user that "castle" is a synonym for "fort". It is for those users that may wonder why they got a different search result when they w

Re: Negative or zero value for fieldNorm

2010-11-03 Thread Markus Jelsma
> Regarding "Negative or zero value for fieldNorm", I don't see any > negative fieldNorms here... just very small positive ones? Of course, you're right. The E-# got twisted in my mind and became negative. Silly me. > Anyway the fieldNorm is the product of the lengthNorm and the > index-time bo

Re: Question about morelikethis and multiple fields

2010-11-03 Thread ahammad
I don't quite understand what you mean by that. Did you mean TermVector Components? Also, I did some more digging and I found some messages on this mailing list about filtering. From what I understand, using the standard query handler (solr/select/?q=...) with a qt parameter allows you to filter

Filter by relevance

2010-11-03 Thread Jason Brown
Is it possible to filter my search results by relevance? For example, anything below a certain value shouldn't be returned? I also retrieve facet counts in my search queries, so it would be useful if the facet counts also respected the filter on the relevance. Thank You. Jason. If you wish to

Re: blacklist docs by uniqueKey

2010-11-03 Thread Jonathan Rochkind
I don't believe there is, but it occurs to me that the additional feature that Tom Burton-West contemplates in the thread "filter query from external list of Solr unique IDs" could potentially address your problem too, if it existed. I think that feature could also address a variety of problems

phrase boost on dismax query

2010-11-03 Thread Jason Brown
I have 3 fields in my index that I use in a dismax query with boosts and phrase boosts. I've realised that 1 field I'm not really interested in at all, unless the search term is in that field as a phrase. Is it realistic to set the normal boost to zero for this field, but the phrase boost to

Re: blacklist docs by uniqueKey

2010-11-03 Thread Jan Høydahl / Cominvent
How does the exclude="true" option in elevate.xml perform with large number of excludes? Then you could have a separate elevate config for that client. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 3. nov. 2010, at 20.11, Yonik Seeley wrote: > On Wed, Nov 3, 2010

Re: Filter by relevance

2010-11-03 Thread Ahmet Arslan
> Is it possible to filter my search > results by relevance? For example, anything below a certain > value shouldn't be returned? > http://search-lucene.com/m/4AHNF17wIJW1/

RE: blacklist docs by uniqueKey

2010-11-03 Thread Andrew Cogan
A filter that could accept a list of SOLR document IDs as articulated by Tom Burton-West would enable some important features for our application. So if anyone is wondering if this would be a useful feature, consider this a yes vote. -Original Message- From: Jonathan Rochkind [mailto:roch

Re: Possible memory leaks with frequent replication

2010-11-03 Thread Lance Norskog
Do you use EmbeddedSolr in the query server? There is a memory leak that shows up when taking a lot of replications. On Wed, Nov 3, 2010 at 8:28 AM, Jonathan Rochkind wrote: > Ah, but reading Peter's email message I reference more carefully, it seems > that Solr already DOES provide an info-level

Re: Filter by relevance

2010-11-03 Thread Erick Erickson
Be aware, though, that relevance isn't absolute, it's only interesting #within# a query. And it's then normed between 0 and 1. So picking "a certain value" is rarely doing what you think it will. Limiting to the top N docs is usually more reasonable. But this may be an XY problem. What is it yo

ZendCon 2010 - Slides on Building Intelligent Search Applications with Apache Solr and PHP 5

2010-11-03 Thread Israel Ekpo
Due to popular demand, the link to my slides @ ZendCon is now available here in case anyone else is looking for it. http://slidesha.re/bAXNF3 The sample code will be uploaded shortly. Feedback is also appreciated http://joind.in/2261 -- °O° "Good Enough" is not good enough. To give anything

Re: blacklist docs by uniqueKey

2010-11-03 Thread Ravi Kiran
Mr.Rochkind pointed out the exact requirement I had in mind i.e. "filter query from external list of Solr unique IDs". On the flip side, even filter queries can be dicey for me as I could very easily blow past the 1024 bytes URL GET limit as my original queries itself are very long..just adding 100

Re: blacklist docs by uniqueKey

2010-11-03 Thread Ravi Kiran
Yes I also did see the exclude="true" in an example elevate.xml...was wondering what it does precisely and if "text" MUST have a value ? I couldnt find any documentation explaining it Ravi Kiran Bhaskar Principal Software Engineer Washington Post 1150 15th Street NW, Washington, DC

Re: A bug in ComplexPhraseQuery ?

2010-11-03 Thread Ahmet Arslan
> However, we have found that this query is crashing when > using > ComplexPhraseQuery: > "sulfur-reducing bacteria" > > It is due to the dash inside the phrase. > Here is the trace: > java.lang.IllegalArgumentException: Unknown query type > "org.apache.lucene.search.PhraseQuery" found in phrase

Does Solr support Natural Language Search

2010-11-03 Thread jayant
Does Solr support Natural Language Search? I did not find anything about this in the reference manual. Please let me know. Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Does-Solr-support-Natural-Language-Search-tp1839262p1839262.html Sent from the Solr - User maili

Problem escaping question marks

2010-11-03 Thread Stephen Powis
I'm having difficulty properly escaping ? in my search queries. It seems as though it matches any character. Some info, a simplified schema and query to explain the issue I'm having. I'm currently running Solr 1.4.1. Schema: I want to return any first name with a question mark in it. Query: first_
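
As background to the question above, here is a hedged Python sketch of client-side escaping for Lucene query syntax: each special character is prefixed with a backslash before the value is placed in the query. The helper name is hypothetical and the character set is the commonly documented one; whether this alone resolves the behaviour described depends on details cut off in the snippet (for example how the field is analysed).

```python
import re

# Characters with special meaning in the classic Lucene query syntax.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\&|])')


def escape_query_chars(text):
    """Prefix every special character with a backslash."""
    return _SPECIAL.sub(r"\\\1", text)


escaped = escape_query_chars("who?")
```

Note that the backslash itself must survive any further encoding on the way to the server, so the escaped value should be URL-encoded as a whole rather than assembled by hand.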

Re: replication not working between 1.4.1 and 3.1-dev

2010-11-03 Thread Shawn Heisey
On 10/29/2010 4:33 PM, Shawn Heisey wrote: The recommended method of safely upgrading Solr that I've read about is to upgrade slave servers, keeping your production application pointed either at another set of slave servers or your master servers. Then you test it with a dev copy of your appli