Re: Search based on images

2010-12-09 Thread Jak Akdemir
The LIRE (Lucene Image REtrieval) library focuses on searching images according to their texture and color properties. http://www.semanticmetadata.net/lire/ http://portal.acm.org/citation.cfm?id=1459577 Sincerely, Jak Akdemir On Thu, Dec 9, 2010 at 7:57

dismax: limiting term match to one field

2010-12-09 Thread davidbrai
Hello, I'm trying to use dismax to search for several terms in several fields with different weights. It works well, but I want to limit the matching of terms to a single field. Example: given the documents: doc1= name: pulp fiction category: thriller doc2= name: pulp category: fiction query:

RE: dismax: limiting term match to one field

2010-12-09 Thread jan.kurella
Try setting the tiebreaker above 1.0; this will increase the score for dismax matches in fields other than the best one. But this may lead to strange side effects. -Original Message- From: ext davidbrai [mailto:davidb...@gmail.com] Sent: Thursday, 9 December 2010 09:55 To:

Solr replication, HAproxy and data management

2010-12-09 Thread Paolo Castagna
Hi, we are using Solr v1.4.x with multi-cores and a master/slaves configuration. We also use HAProxy [1] to load balance search requests amongst slaves. Finally, we use MapReduce to create new Solr indexes. I'd like to share with you what I am doing when I need to: 1. add a new index 2.

Triggering a reload of replicated configuration files

2010-12-09 Thread Ophir Adiv
Hi, I added a configuration file which is updated on one of the master cores' conf directory, and also added the file name to the list of confFiles. As expected, after index change and commit, this file gets replicated to the slave core. However, the problem that remains is how to reload this

Solr on Google App Engine

2010-12-09 Thread Praveen Agrawal
Hi, I was wondering if Solr can be deployed/run on Google App Engine. GAE has some restrictions, notably no local file write access is allowed, instead applications must use JDO/JPA etc. I believe Solr can be deployed/run on Amazon EC2. Has anyone tried Solr on these two hosts? Thanks. Praveen

Re: Map size must not be negative with spatial results + php serialized

2010-12-09 Thread Markus Jelsma
Well, in that case I'd open a ticket for this one. The problem is, for now, that I can only replicate the behaviour using the spatial plugin. On Wednesday 08 December 2010 21:58:06 Chris Hostetter wrote: : That's fine - it could be a Solr bug too. it definitely looks like a generic solr

Highlighting Issue

2010-12-09 Thread Shaun Campbell
I'm trying to highlight a field and I'm getting an exception thrown, only on certain search terms though. I am fairly certain that the cause of the problem is through having synonyms on the highlighted field as I have had highlighting working in the past on other fields. The added complication

Re: Triggering a reload of replicated configuration files

2010-12-09 Thread Upayavira
On Thu, 09 Dec 2010 13:34 +0200, Ophir Adiv firt...@gmail.com wrote: Hi, I added a configuration file which is updated on one of the master cores' conf directory, and also added the file name to the list of confFiles. As expected, after index change and commit, this file gets

Re: dismax: limiting term match to one field

2010-12-09 Thread davidbrai
Thanks for the reply. I've tried to add a tie breaker with different values, but in most cases it didn't change anything, and sometimes caused weird things to have higher scores, like you suggested might happen. Is there another way? On Thu, Dec 9, 2010 at 1:58 PM, jan.kurella [via Lucene]

Re: Search based on images

2010-12-09 Thread Michael Zach
Hello Scott, can you please provide more details on this? Any links to projects combining OpenCV and Solr? Regards Michael - Original Mail - From: scott chu (朱炎詹) scott@udngroup.com To: solr-user@lucene.apache.org Sent: Thursday, 9 December 2010 08:01:52 Subject: Re:

Re: Triggering a reload of replicated configuration files

2010-12-09 Thread Ophir Adiv
On Thu, Dec 9, 2010 at 2:25 PM, Upayavira u...@odoko.co.uk wrote: On Thu, 09 Dec 2010 13:34 +0200, Ophir Adiv firt...@gmail.com wrote: Hi, I added a configuration file which is updated on one of the master cores' conf directory, and also added the file name to the list of confFiles. As

Re: Solr on Google App Engine

2010-12-09 Thread Mauricio Scheffer
Solr on GAE has been discussed a couple of times, see these threads: http://www.mail-archive.com/java-u...@lucene.apache.org/msg26010.html http://www.mail-archive.com/solr-user@lucene.apache.org/msg24473.html

Re: Highlighting Issue

2010-12-09 Thread Koji Sekiguchi
(10/12/09 21:22), Shaun Campbell wrote: I'm trying to highlight a field and I'm getting an exception thrown, only on certain search terms though. I am fairly certain that the cause of the problem is through having synonyms on the highlighted field as I have had highlighting working in the past

Re: Highlighting Issue

2010-12-09 Thread Shaun Campbell
Koji, thanks a lot, it's stopped crashing now. Can I ask one other question about synonym highlighting which looks a bit puzzling? I enter "asset" as my criteria and, through synonym matching, it returns other terms highlighted as well. My debug output is: DEBUG:

Re: Highlighting Issue

2010-12-09 Thread Koji Sekiguchi
(10/12/09 22:13), Shaun Campbell wrote: Koji, thanks a lot, it's stopped crashing now. Can I ask one other question about synonym highlighting which looks a bit puzzling? I enter "asset" as my criteria and, through synonym matching, it returns other terms highlighted as well. My debug output is:

Re: dismax: limiting term match to one field

2010-12-09 Thread Jan Kurella
Short: No. Long: the concept is different from what you are looking for. The DisMaxQHandler splits up the tokens and searches each token across the given fields with a DisMaxQuery. All DisMaxQueries are combined in a BooleanQuery as SHOULD clauses (with a minShouldMatch according to your mm
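
The scoring model described in this reply can be illustrated with a rough sketch. It assumes the standard DisjunctionMaxQuery rule (best field score plus tie times the sum of the remaining field scores) and ignores coord and query normalization. The function names and the per-field scores are hypothetical, chosen only for illustration.

```python
from typing import List


def dismax_term_score(field_scores: List[float], tie: float = 0.0) -> float:
    """Score one term across fields: best field plus tie * the other fields."""
    if not field_scores:
        return 0.0
    best = max(field_scores)
    return best + tie * (sum(field_scores) - best)


def dismax_doc_score(per_term_field_scores: List[List[float]], tie: float = 0.0) -> float:
    """Sum the per-term scores, as a BooleanQuery of SHOULD clauses would."""
    return sum(dismax_term_score(scores, tie) for scores in per_term_field_scores)


# Hypothetical per-field scores for a two-term query against one document:
# term 1 matches only in "name"; term 2 matches in both "name" and "category".
example = [[2.0], [2.0, 1.0]]
print(dismax_doc_score(example, tie=0.0))  # second field of term 2 is ignored
print(dismax_doc_score(example, tie=1.0))  # second field of term 2 is fully added
```

With tie=0 only the best field counts for each term; raising tie lets the weaker fields contribute, which is why changing it shifts scores for every term rather than restricting a term to one field.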

Re: Highlighting Issue

2010-12-09 Thread Shaun Campbell
OK. I'd switched to FastVectorHighlighter, which cured the exceptions and gives me highlighting, so I assumed that you could use this instead of the standard highlighter on n-grammed fields. I guess my query was how does the highlighter now highlight synonym terms? Thanks Shaun As I said in my

Re: Problem with loading a class

2010-12-09 Thread Maciej Lisiewski
Just checked logs: Dec 9, 2010 3:12:42 PM org.apache.solr.core.SolrResourceLoader replaceClassLoader INFO: Adding 'file:/var/www/solr/searchPr/lib/lucene-stempel-3.1-2010-12-06_10-23-49.jar' to classloader [..] Dec 9, 2010 3:12:42 PM org.apache.solr.common.SolrException log SEVERE:

Re: Highlighting Issue

2010-12-09 Thread Koji Sekiguchi
(10/12/09 22:50), Shaun Campbell wrote: OK. I'd switched to FastVectorHighlighter, which cured the exceptions and gives me highlighting, so I assumed that you could use this instead of the standard highlighter on n-grammed fields. I guess my query was how does the highlighter now highlight synonym

Re: A bug in ComplexPhraseQuery ?

2010-12-09 Thread jmr
iorixxx wrote: I added Terje Eggestad's fix[1], can you test it and give us feedback? Hi, Sorry for the delay. The fix was working well but we discovered another query crashing the parser: a63b27/00:IC org.apache.lucene.search.PhraseQuery found in phrase query string a63b27/00 at

RE: Triggering a reload of replicated configuration files

2010-12-09 Thread Jonathan Rochkind
To the slave, a replication is more or less the same as a commit for these purposes. Can you use the newSearcher action on slave? newSearcher will happen after a replication is received. From: Ophir Adiv [firt...@gmail.com] Sent: Thursday, December 09,

Re: Highlighting Issue

2010-12-09 Thread Shaun Campbell
Sorry, see what you mean about fixed-length (minGramSize==maxGramSize). I see mine aren't.:( On 9 December 2010 14:26, Koji Sekiguchi k...@r.email.ne.jp wrote: (10/12/09 22:50), Shaun Campbell wrote: OK. I'd switch to FastVectorHighlighter which cured the exceptions and gives me

Re: Delete by query or Id very slow

2010-12-09 Thread Ravi Kiran
Thank you Tom for responding. On average the docs are around 25-35 KB. The code is as follows. Kindly let me know if you see anything weird, a second pair of eyes always helps :-) public List<String> deleteDocs(List<String> ids) throws SolrCustomException { CommonsHttpSolrServer server

Re: Warming searchers/Caching

2010-12-09 Thread Mark
Our machines have around 8 GB of RAM and our index is 25 GB. What are some good values for those cache settings? Looks like we have the defaults in place... size=16384 initialSize=4096 autowarmCount=1024 You are correct, I am just removing the health-check file and our loadbalancer prevents

SolrHome and Solr Data Dir in solrconfig.xml

2010-12-09 Thread Bing Li
Dear all, I am a new user of Solr. When using Solr, SolrHome is set to /home/libing/Solr. When Tomcat is started, it must read solrconfig.xml to get Solr data dir, which is used to contain indexes. However, I have no idea how to associate SolrHome with Solr data dir. So a mistake occurs. All the

Re: SolrHome and Solr Data Dir in solrconfig.xml

2010-12-09 Thread Markus Jelsma
What's the context file for Solr under Catalina? Should read something like this: <Context path="/solr" docBase="/usr/share/solr" debug="0" privileged="true" allowLinking="true" crossContext="true"> <!-- make symlinks work in Tomcat --> <Resources className="org.apache.naming.resources.FileDirContext"

Re: Delete by query or Id very slow

2010-12-09 Thread Tom Hill
I'd bet it's the optimize that's taking the time, and not the delete. You don't really need to optimize these days, and you certainly don't need to do it on every delete. And you can give solr a list of ids to delete, which would be more efficient. I don't believe you can tell which ones have
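
The suggestion to send a list of ids in one request can be sketched as follows. This assumes Solr's XML update message format, in which a single delete element may carry several id children; the helper name and the sample ids are hypothetical. The resulting message would be posted to the core's /update handler and followed by one commit, with no optimize per delete.

```python
import xml.etree.ElementTree as ET
from typing import Iterable


def build_delete_message(ids: Iterable[str]) -> str:
    """Build one <delete> update message holding an <id> element per document."""
    delete = ET.Element("delete")
    for doc_id in ids:
        # ElementTree escapes special characters such as '&' in the text.
        ET.SubElement(delete, "id").text = str(doc_id)
    return ET.tostring(delete, encoding="unicode")


# Hypothetical document ids, batched into a single request body.
print(build_delete_message(["doc-1", "doc-2", "doc-3"]))
```

Batching this way replaces many round trips (and many optimizes) with a single request and a single commit.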

(No subject)

2010-12-09 Thread alessandro.ri...@virgilio.it
Dear all, I'm trying to call our Solr server with the json parameter in order to parse the response on the client side, which is JavaScript. My problem is that when I try the call (see the code below) using the wiki instructions (http://wiki.apache.org/solr/SolJSON) the XMLHttpRequest object gets blank when

SolJSON

2010-12-09 Thread alessandro.ri...@virgilio.it
Dear all, First of all, sorry for the previous email with the missing subject. I'm trying to call our Solr server with the json parameter in order to parse the response on the client side, which is JavaScript. My problem is that when I try the call (see the code below) using the wiki instructions

Re: Triggering a reload of replicated configuration files

2010-12-09 Thread Tom Hill
On Thu, Dec 9, 2010 at 4:49 AM, Ophir Adiv firt...@gmail.com wrote: On Thu, Dec 9, 2010 at 2:25 PM, Upayavira u...@odoko.co.uk wrote: On Thu, 09 Dec 2010 13:34 +0200, Ophir Adiv firt...@gmail.com wrote: Hi, I added a configuration file which is updated on one of the master cores' conf

Re: SolJSON

2010-12-09 Thread lee carroll
Hi Alessandro, Can you use a JavaScript library which handles ajax and json / jsonp? You will end up with much cleaner client code. For example, a jQuery implementation looks quite nice using Solr's neat jsonp support: queryString = *:* $.getJSON(

Re: dismax: limiting term match to one field

2010-12-09 Thread davidbrai
Thanks for the answer Jan, I noticed that my example doesn't correctly describe the problem I'm facing. here's a better example: doc1 is name=A B category=B doc2 is name=A category=B when searching for the terms A and B I want doc2 to get a higher score. to be more specific, I don't want the

Concurrent DIH calls

2010-12-09 Thread Juan Manuel Alvarez
Hello!!! I am working with Solr on my first project and I am really happy so far, both with the product and with the community. I am having some doubts about how DIH works. How does DIH handle concurrent requests from different users? Does it queue them? Or if an import is in progress it

Re: Warming searchers/Caching

2010-12-09 Thread Chris Hostetter
: Our machines have around 8gb of ram and our index is 25gb. What are some good : values for those cache settings. Looks like we have the defaults in place... : : size=16384 : initialSize=4096 : autowarmCount=1024 my personal opinion: set all three options to the same value (16384) it will make

Re: dismax: limiting term match to one field

2010-12-09 Thread Chris Hostetter
: doc1 is name=A B category=B : doc2 is name=A category=B : : when searching for the terms A and B I want doc2 to get a higher score. : to be more specific, I don't want the term B to influence doc1's score in : both name and category, only in one of them. if you set the boost value of category

[Multiple] RSS Feeds and Source Field

2010-12-09 Thread Adam Estrada
All, I am indexing RSS feeds from several sources so I have a couple questions. 1. There is only 1 source for each RSS feed which is typically the name of the feed, I get an error in my app stating *Value cannot be null. Parameter name: source* I look at the index in Luke and there are data

Re: [Multiple] RSS Feeds and Source Field

2010-12-09 Thread Jonathan Rochkind
You look at what index in Luke? I bet you $10 there is no index called source* in your index. With an asterisk in it. On 12/9/2010 3:23 PM, Adam Estrada wrote: All, I am indexing RSS feeds from several sources so I have a couple questions. 1. There is only 1 source for each RSS feed which

Re: [Multiple] RSS Feeds and Source Field

2010-12-09 Thread Adam Estrada
In Luke I looked at the available fields and term counts per field and there is a source field without an asterisk beside it. The source value is CNN.com which is what I would expect it to be. I still get a null value in my app which is probably a bug somewhere in my application. Any more of your

Re: Problem with dismax mm

2010-12-09 Thread Chris Hostetter
: Personally, the mm parameter makes my head hurt. : As I read it, there are actually 4 buckets that rules apply to, not three : in your mm definition, see below. If you have Smiley and Pugh's Solr book, check out pages 138-140... http://wiki.apache.org/solr/PacktBook2009 I created the mm

Open source Solr UI with multiple select faceting?

2010-12-09 Thread Andy
Hi, Any open source Solr UI's that support selecting multiple facet values (OR faceting)? For example allowing a user to select red or blue for the facet field Color. I'd prefer libraries in javascript or Python. I know about ajax-solr but it doesn't seem to support multiple selects.
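
Independent of any particular UI library, OR faceting of this kind is usually expressed on the request itself with Solr's tag and exclude local params, so that the facet counts ignore the filter the user has already applied. The sketch below only builds the request parameters; the function name, the tag naming scheme, and the field values are assumptions for illustration.

```python
from typing import List, Tuple
from urllib.parse import urlencode


def multiselect_facet_params(field: str, selected: List[str], q: str = "*:*") -> List[Tuple[str, str]]:
    """Build Solr params for a multi-select (OR) facet on a single field."""
    tag = field.lower()  # hypothetical convention: tag named after the field
    params = [
        ("q", q),
        ("facet", "true"),
        # Exclude the tagged filter when counting, so unselected values keep their counts.
        ("facet.field", "{!ex=%s}%s" % (tag, field)),
    ]
    if selected:
        values = " OR ".join('"%s"' % v.replace('"', '\\"') for v in selected)
        params.append(("fq", "{!tag=%s}%s:(%s)" % (tag, field, values)))
    return params


params = multiselect_facet_params("Color", ["red", "blue"])
print(urlencode(params))  # query string to append to the select URL
```

A client in JavaScript or Python can then render checkboxes from the facet counts and rebuild the fq value whenever the selection changes.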

Re: how to recover when indexing with proxy shards

2010-12-09 Thread Shawn Heisey
On 12/9/2010 12:56 AM, patrick wrote: I'm considering using more than 3 Solr shards and assigning a (separate) proxy to do the load balancing when indexing. Using SolrJ is my way to do the indexing. The question is if I get any information about the whereabouts of the shard in which the

Re: Open source Solr UI with multiple select faceting?

2010-12-09 Thread Adam Estrada
SolrNet has a great example application that you can use...There is a great Javascript project called SolrAjax but I don't know what the state of it is. Adam On Thu, Dec 9, 2010 at 4:53 PM, Andy angelf...@yahoo.com wrote: Hi, Any open source Solr UI's that support selecting multiple facet

Re: SOLR Thesaurus

2010-12-09 Thread Chris Hostetter
: a term can have a Preferred Term (PT), many Broader Terms (BT), Many Narrower : Terms (NT) Related Terms (RT) etc ... : User supplied Term is say : Ski : : Preferred term: Skiing : Broader terms could be : Ski and Snow Boarding, Mountain Sports, Sports : Narrower terms: down hill

Re: problems with custom SolrCache.init() - fails on startup

2010-12-09 Thread Chris Hostetter
: I believe the problem is that I attempt to access the core in the init process. : I currently use the deprecated SolrCore.getSolrCore(), but had the same problem : when attempting to use CoreContainer. During some initialization process, I need : access to the IndexSchema object. I assume

Re: Solr 3x segments file and deleting index

2010-12-09 Thread Chris Hostetter
: If I want to delete an entire index and start over, in previous versions of : Solr, you could stop Solr, delete all files in the index directory and : restart Solr. Solr would then create empty segments files and you could ... : You have to delete the index directory entirely. This

search problem after using EdgeNGramFilter

2010-12-09 Thread Jason, Kim
I am using EdgeNGramFilter for wildcard search. But the search result is the same whether or not the term is followed by an asterisk. When I search without an asterisk, I just want to retrieve matches on the original terms (excluding ngram terms). [example] - doc1 : enterprise search server - doc2 : enter key When I query 'enter*',
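
The behaviour described here follows from what an edge n-gram filter puts into the index. The sketch below is a plain-Python approximation, not Solr's implementation, and the gram sizes are assumed values; it shows that the prefixes of "enterprise" include "enter", so a query for the plain term already matches doc1 without any asterisk.

```python
from typing import List


def edge_ngrams(term: str, min_gram: int = 1, max_gram: int = 25) -> List[str]:
    """Return the leading-edge n-grams of a term, shortest first."""
    upper = min(max_gram, len(term))
    return [term[:n] for n in range(min_gram, upper + 1)]


# Tokens from the example documents in the question.
for token in ["enterprise", "enter"]:
    grams = edge_ngrams(token, min_gram=3, max_gram=6)
    print(token, grams, "enter" in grams)
```

Because both tokens produce the gram "enter", an exact-term query on the n-grammed field cannot tell the two documents apart; distinguishing them needs a field that is indexed without the n-gram filter.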

RE: entire farm fails at the same time with OOM issues

2010-12-09 Thread Chris Hostetter
I'm not sure if you resolved this issue, but... : It has typically been when query traffic was lowest! We are at 12 GB ...that doesn't mean it couldn't have been query load related. it's possible that some unusual query (ie: trying to sort on many fields at the same time?) could have

Re: search problem after using EdgeNGramFilter

2010-12-09 Thread Ahmet Arslan
I am using EdgeNGramFilter for wildcard search. But the search result is the same whether or not the term is followed by an asterisk. When I search without an asterisk, I just want to retrieve matches on the original terms (excluding ngram terms). [example] - doc1 : enterprise search server - doc2 : enter key When I query

solr immediate response on data import

2010-12-09 Thread Tri Nguyen
Hi, I do a data import with commit=false. I get the response back saying it's idle and Total number of rows skipped = -1 Total number of rows processed = -1 This is the very first time after I start Solr. Subsequent times it doesn't return -1 but the rows it read from the datasource.

Re: [Multiple] RSS Feeds and Source Field

2010-12-09 Thread Erick Erickson
Hmmm, you say you get an error in your app. I'm a bit confused. Is this before you try to send it to Solr or as a result of sending it to Solr? If the latter, I'd wager source is required in your schema and you're not sending it in your document. Try instrumenting your app to check that every

Re: [Multiple] RSS Feeds and Source Field

2010-12-09 Thread Adam Estrada
I ended up copying the source field to another, which seems to have fixed the problem... I still have so much to learn when it comes to using Solr... Thanks for all the great feedback, Adam On Thu, Dec 9, 2010 at 11:03 PM, Erick Erickson erickerick...@gmail.com wrote: Hmmm, you say you get

Re: search problem after using EdgeNGramFilter

2010-12-09 Thread Jason, Kim
Hi, iorixxx I thought that I had to use NGramFilter for wildcard search. But it was the wrong idea. Thanks, iorixxx -- View this message in context: http://lucene.472066.n3.nabble.com/search-problem-after-using-EdgeNGramFilter-tp2060966p2061961.html Sent from the Solr - User mailing list

Re: my index has 500 million docs, how to improve solr search performance?

2010-12-09 Thread Dennis Gearon
Late reply on this, but how is that big installation working out? Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so you do not have to make them yourself. from