Re: How to update solr index.

2010-12-07 Thread Anurag
Can you clarify ur question? - Kumar Anurag -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-update-solr-index-tp2038480p2038580.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: complex boolean filtering in fq queries

2010-12-07 Thread Andy
--- On Wed, 12/8/10, Tom Hill wrote: > > fq=location:national OR (location:CA AND city:"San > Francisco") > Do you mean URL encoding it? You can just type your query > into the > search box in the admin UI, and copy from the resulting > URL. Thanks Tom. I wasn't referring to URL encoding. I w

Re: Index version on slave nodes

2010-12-07 Thread Tom Hill
Just off the top of my head, aren't you able to use a slave as a repeater, so it's configured as both a master and a slave? http://wiki.apache.org/solr/SolrReplication#Setting_up_a_Repeater This would seem to require that the slave return the same values as its master for indexversion. What happe

Re: complex boolean filtering in fq queries

2010-12-07 Thread Tom Hill
For one thing, you wouldn't have fq= in there, except at the beginning. fq=location:national OR (location:CA AND city:"San Francisco") more below... On Tue, Dec 7, 2010 at 10:25 PM, Andy wrote: > Forgot to add, my defaultOperator is "AND". > > --- On Wed, 12/8/10, Andy wrote: > >> From: Andy

Re: complex boolean filtering in fq queries

2010-12-07 Thread Andy
Forgot to add, my defaultOperator is "AND". --- On Wed, 12/8/10, Andy wrote: > From: Andy > Subject: complex boolean filtering in fq queries > To: solr-user@lucene.apache.org > Date: Wednesday, December 8, 2010, 1:21 AM > I have a facet query that requires > some complex boolean filtering. Some

complex boolean filtering in fq queries

2010-12-07 Thread Andy
I have a facet query that requires some complex boolean filtering. Something like: fq=location:national OR (fq=location:CA AND fq=city:"San Francisco") 1) How do I turn the above filters into a REST query string? 2) Do I need the double quotes around "San Francisco"? 3) Will complex boolean filt

Re: Terms component with shards?

2010-12-07 Thread bbarani
Hey Shawn, Thanks for your reply. I tried using shards and shards qt parameter, its working like charm.. I included both these component in Terms request handler and it seems to work fine even in SOLR 1.4.. Thanks, Barani -- View this message in context: http://lucene.472066.n3.nabble.com/Te

Re: Warming searchers/Caching

2010-12-07 Thread Mark
Maybe I should explain my problem a little more in detail. The problem we are experiencing is after a delta-import we notice a extremely high load time on the slave machines that just replicated. It goes away after a min or so production traffic once everything is cached. I already have a bef

Re: customer ping response

2010-12-07 Thread Tom Hill
Hi Tri, Well, I wouldn't really recommend this, but I just tried making a custom XMLReponseWriter that wrote the response you wanted. So you can use it with any request handler you want. Works fine, but it's pretty hack-y. The downside is, you are writing code, and you have to modify SolrCore. Bu

Re: customer ping response

2010-12-07 Thread Erick Erickson
That's the query term being sent to the server. On Tue, Dec 7, 2010 at 8:50 PM, Tri Nguyen wrote: > Hi, > > I'm reading the wiki. > > What does q=apache mean in the url? > > > http://localhost:8983/solr/select/?stylesheet=&q=apache&wt=xslt&tr=example.xsl > > thanks, > > tri > > > > > > _

Re: customer ping response

2010-12-07 Thread Tri Nguyen
Hi, I'm reading the wiki. What does q=apache mean in the url? http://localhost:8983/solr/select/?stylesheet=&q=apache&wt=xslt&tr=example.xsl thanks, tri   From: Markus Jelsma To: Tri Nguyen Cc: solr-user@lucene.apache.org Sent: Tue, December 7, 2010 4:35:

Re: customer ping response

2010-12-07 Thread Markus Jelsma
Well, you can go a long way with xslt but i wouldn't know how to embed the server name in the response as Solr simply doesn't return that information. You'd have to patch the response Solr's giving or put a small script in front that can embed the server name. > I need to return this: > > >

Re: customer ping response

2010-12-07 Thread Tri Nguyen
I need to return this: Server ok From: Markus Jelsma To: solr-user@lucene.apache.org Cc: Tri Nguyen Sent: Tue, December 7, 2010 4:27:32 PM Subject: Re: customer ping response Of course! The ping request handler behaves like any other request handler and

Re: customer ping response

2010-12-07 Thread Markus Jelsma
Of course! The ping request handler behaves like any other request handler and accepts at last the wt parameter [1]. Use xslt [2] to transform the output to any desirable form or use other response writers [1]. Why anyway, is it a load balancer that only wants an OK output or something? [1]: ht

Re: Warming searchers/Caching

2010-12-07 Thread Markus Jelsma
XInclude works fine but that's not what your looking for i guess. Having the 100 top queries is overkill anyway and it can take too long for a new searcher to warmup. Depending on the type of requests, i usually tend to limit warming to popular filter queries only as they generate a very high h

customer ping response

2010-12-07 Thread Tri Nguyen
Can I have a custom xml response for the ping request? thanks, Tri

Re: Warming searchers/Caching

2010-12-07 Thread Erick Erickson
Warning: I haven't used this personally, but Xinclude looks like what you're after, see: http://wiki.apache.org/solr/SolrConfigXml#XInclude Best Erick On Tue, Dec 7, 2010 at 6:33 PM, Mark wrote: > Is there any plugin or easy way to auto-warm/cache a new searcher with a > bunch of searches rea

Re: Spatial search - Solr 4.0

2010-12-07 Thread Erick Erickson
What version of solr are you using? What is your configuration? What query are you using? Best Erick On Tue, Dec 7, 2010 at 5:40 PM, Jae Joo wrote: > Hi, > > I am implementing spatial search and found some odd things. As I know that > the returning distance is still being implemented, so I have

Warming searchers/Caching

2010-12-07 Thread Mark
Is there any plugin or easy way to auto-warm/cache a new searcher with a bunch of searches read from a file? I know this can be accomplished using the EventListeners (newSearcher, firstSearcher) but I rather not add 100+ queries to my solrconfig.xml. If there is no hook/listener available, is

Re: only index synonyms

2010-12-07 Thread lee carroll
That's ace tom Will give it a go but sounds spot on On 7 Dec 2010 20:49, "Tom Hill" wrote: > Hi Lee, > > Sorry, I think Erick and I both thought the issue was converting the > synonyms, not removing the other words. > > To keep only a set of words that match a list, use the > KeepWordFilterFactory

Spatial search - Solr 4.0

2010-12-07 Thread Jae Joo
Hi, I am implementing spatial search and found some odd things. As I know that the returning distance is still being implemented, so I have implement algorithm to calculate the actual distance based on lat and long returned. when I do it, I have found the sort is not working properly. Any thing I

Re: highlighting encoding issue

2010-12-07 Thread Koji Sekiguchi
(10/12/08 1:12), getagrip wrote: Hi, when I query solr (trunk) I get "numeric character references" instead of regular UTF-8 strings in case of special characters in the highlighting section, in the result section the characters are presented fine. e.g instead of the German Umlaut Ä I get ä E

Re: Terms component with shards?

2010-12-07 Thread Shawn Heisey
On 12/7/2010 12:53 PM, bbarani wrote: Hi, Will terms component work along with Shards? I have 3 cores and I am using shards to to distributed search. Yes - but not in Solr 1.4.x. You'll need branch_3x or trunk. https://issues.apache.org/jira/browse/SOLR-1177 Shawn

Re: only index synonyms

2010-12-07 Thread Tom Hill
Hi Lee, Sorry, I think Erick and I both thought the issue was converting the synonyms, not removing the other words. To keep only a set of words that match a list, use the KeepWordFilterFactory, with your list of synonyms. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.KeepWord

Re: only index synonyms

2010-12-07 Thread lee carroll
ok thanks for your response To summarise the solution then: To only index synonyms you must only send words that will match the synonym list. If words with out synonym ,atches are in the field to be indexed these words will be indexed. No way to avoid this by using schema.xml config. thanks lee

Terms component with shards?

2010-12-07 Thread bbarani
Hi, Will terms component work along with Shards? I have 3 cores and I am using shards to to distributed search. I have a autosuggest feature implemented using terms component (when I had just one core before) and its working fine as long as I have just one core. It doesnt seems to work fine wh

Re: Out of memory error

2010-12-07 Thread Fuad Efendi
Related: SOLR-846 Sent on the TELUS Mobility network with BlackBerry -Original Message- From: Erick Erickson Date: Tue, 7 Dec 2010 08:11:41 To: Reply-To: solr-user@lucene.apache.org Subject: Re: Out of memory error Have you seen this page? http://wiki.apache.org/solr/DataImportHandler

Re: solrj & http client 4

2010-12-07 Thread Yonik Seeley
On Tue, Dec 7, 2010 at 12:32 PM, Stevo Slavić wrote: > Hello solr users and developers, > > Are there any plans to upgraded http client dependency in solrj from 3.x to > 4.x? I'd certainly be for moving to 4.x (and I think everyone else would too). The issue is that it's not a drop-in replacement

solrj & http client 4

2010-12-07 Thread Stevo Slavić
Hello solr users and developers, Are there any plans to upgraded http client dependency in solrj from 3.x to 4.x? Found this ticket - judging by comments in it upgrade might help fix the issue. I have a project in jar hell, getting different version

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Peter Karich
Am 07.12.2010 13:01, schrieb Hamid Vahedi: Hi Peter Thanks a lot for reply. Actually I need real time indexing and query at the same time. Here told: "You can run multiple Solr instances in separate JVMs, with both having their solr.xml configured to use the same index folder." Now Q1: I'm

Re: Index version on slave nodes

2010-12-07 Thread Markus Jelsma
Yes, i read that too in the replication request handler's source comments. But i would find it convenient if it would just use the same values as we see using the details command. Any devs agree? Then i'd open a ticket for this one. On Tuesday 07 December 2010 17:14:09 Xin Li wrote: > I read it

Re: Index version on slave nodes

2010-12-07 Thread Xin Li
I read it somewhere (sorry for not remembering the source).. the indexversion command gets the "replicable" index version #. Since it is a slave machine, so the result is 0. Thanks, On Tue, Dec 7, 2010 at 11:06 AM, Markus Jelsma wrote: > But why? I'd expect valid version numbers although the rep

highlighting encoding issue

2010-12-07 Thread getagrip
Hi, when I query solr (trunk) I get "numeric character references" instead of regular UTF-8 strings in case of special characters in the highlighting section, in the result section the characters are presented fine. e.g instead of the German Umlaut Ä I get ä Example: Vielfachmessgerät

Re: Index version on slave nodes

2010-12-07 Thread Markus Jelsma
But why? I'd expect valid version numbers although the replication handler's source code seems to agree with you judging from the comments. On Monday 06 December 2010 17:49:16 Xin Li wrote: > I think this is expected behavior. You have to issue the "details" > command to get the real indexversion

DataDevRoom at the 2011 edition of the FOSDEM

2010-12-07 Thread Isabel Drost
Hello, We (Olivier, Nicolas and I) are organizing a Data Analytics DevRoom that will take place during the next edition of the FOSDEM in Brussels on Feb. 5. Here is the CFP: http://datadevroom.couch.it/CFP You might be interested in attending the event and take the opportunity to speak about y

Severe NoClassDefFoundError Spell StringDistance Nightly 20101207

2010-12-07 Thread Dan Hertz (Insight 49, LLC)
Whilst running java -jar start.jar from the latest nightly build example directory, I get the following...any ideas how to fix this? Thanks! Dan. Dec 7, 2010 8:46:56 AM org.apache.solr.common.SolrException log SEVERE: java.lang.NoClassDefFoundError: org/apache/lucene/search/spell/StringDistance

Re: Solr Newbie - need a point in the right direction

2010-12-07 Thread Mark
Thanks to everyone who responded, no wonder I was getting confused, I was completely focusing on the wrong half of the equation. I had a cursory look through some of the Nutch documentation available and it is looking promising. Thanks everyone. Mark On Tue, Dec 7, 2010 at 10:19 PM, webdev1977

Re: Field Collapsing - sort by group count, get total groups

2010-12-07 Thread Yonik Seeley
On Tue, Dec 7, 2010 at 9:07 AM, ssetem wrote: > Thanks for the reply, > > How would i get the total amount of possible facets(non zero), I've searched > around but have no luck. Only current way would be to request them all. Just like field collapsing, this is a number we don't (generally) have.

Re: Field Collapsing - sort by group count, get total groups

2010-12-07 Thread ssetem
Thanks for the reply, How would i get the total amount of possible facets(non zero), I've searched around but have no luck. Cheers -- View this message in context: http://lucene.472066.n3.nabble.com/Field-Collapsing-sort-by-group-count-get-total-groups-tp2033086p2033645.html Sent from the Solr

Re: Solr Newbie - need a point in the right direction

2010-12-07 Thread webdev1977
I my experience, the hardest (but most flexible part) is exactly what was mentioned.. processing the data. Nutch does have a really easy plugin interface that you can use, and the example plugin is a great place to start. Once you have the raw parsed text, you can do what ever you want with it.

Re: Taxonomy and Faceting

2010-12-07 Thread webdev1977
That would be AMAZING!! And much appreciated ;-) -- View this message in context: http://lucene.472066.n3.nabble.com/Taxonomy-and-Faceting-tp2028442p2033657.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: how to config DataImport Scheduling

2010-12-07 Thread Ahmet Arslan
> I want to config DataImport Scheduling, but not know, how > to do it. I do it with a cronjob. curl "http://localhost:8080/solr/dataimport?command=delta-import&optimize=false";

Re: Taxonomy and Faceting

2010-12-07 Thread Tommaso Teofili
Hi, as I made the patch I can guide you through the Solr-UIMA integration configuration, just give me some more time as I am really busy at the moment and can't deepen it. There was a mini tutorial but it's outdated, I'll update it and let you know here in a few hours. Cheers, Tommaso 2010/12/7 we

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Upayavira
On Tue, 07 Dec 2010 04:01 -0800, "Hamid Vahedi" wrote: > Hi Peter > > Thanks a lot for reply. Actually I need real time indexing and query at > the same > time. What do you mean by real time? The answer to that is going to heavily influence your architecture and the amount of effort you are

Re: Taxonomy and Faceting

2010-12-07 Thread webdev1977
Can someone enlighten me on how to get started with this patch? I am running solr 1.4.1 and I need to download the latest trunk and apply the patch obviously.. But after that, I am sort of clueless.. I am assuming there are some things that have to happen in solr config and schema files. Readin

Re: MultiCore config less stable than SingleCore?

2010-12-07 Thread Erick Erickson
Could you tell us what version of Solr you're running? And what OS you're concerned about? And what file system you're operating on? And anything else you can think of that'd help us help you? Best Erick On Tue, Dec 7, 2010 at 4:56 AM, Jan Simon Winkelmann < jansimon.winkelm...@newsfactory.de> wr

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Erick Erickson
Your autocommit is unrealistic. You're telling the server to commit every 5 seconds and you're overloading the system. At least that's my guess. Every time you do this, you're causing all the caches to be thrown away, any autowarming to be triggered, etc, etc, etc. There have been significant impr

Re: only index synonyms

2010-12-07 Thread Erick Erickson
OK, the light finally dawns *If* you have a defined list of words to remove, you can put them in with your stopwords and add a stopword filter to the field in schema.xml. Otherwise, you'll have to do some pre-processing and only send to solr words you want. I'm assuming you have a list of val

Re: Solr Newbie - need a point in the right direction

2010-12-07 Thread Erick Erickson
Solr is downstream of what I think you want. There's nothing in Solr that allows you to take an arbitrary page and extract specific info from it. I suspect the Nutch folks have dealt with this kind of question, looking over the user's list there might give some insight. Basically, once you have th

Re: Field Collapsing - sort by group count, get total groups

2010-12-07 Thread Yonik Seeley
On Tue, Dec 7, 2010 at 7:03 AM, ssetem wrote: > I wondered if it is possible to sort groups by the total within the group, > and to bring back total amount groups? That is planned, but not currently implemented. You can use faceting to get both totals and sort by highest total though. Total numb

Re: Out of memory error

2010-12-07 Thread Erick Erickson
Have you seen this page? http://wiki.apache.org/solr/DataImportHandlerFaq See especially batchsize, but it looks like you're already on to that. Do you have any idea how big the records are in the database? You might try adjusting the rambuffersize

MultiCore config less stable than SingleCore?

2010-12-07 Thread Jan Simon Winkelmann
Hi, i have recently moved Solr at one of our customers to a MultiCore environment running 2 indexes. Since then, we seem to be having problems with locks not being removed properly, .lock files keep sticking around in the index directory. Hence, any updates to the index keep returning 500 erro

Field Collapsing - sort by group count, get total groups

2010-12-07 Thread ssetem
Hi, I wondered if it is possible to sort groups by the total within the group, and to bring back total amount groups? Trying to build reporting system, which shows highest aggregates first, then allows pagination through this list. Cheers -- View this message in context: http://lucene.472066.

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Hamid Vahedi
Hi Peter Thanks a lot for reply. Actually I need real time indexing and query at the same time. Here told: "You can run multiple Solr instances in separate JVMs, with both having their solr.xml configured to use the same index folder." Now Q1: I'm using Tomcat now, Could you please tell

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Upayavira
Also, reduce your commit frequency, if you are doing an initial import. You only need to commit (manually) once all of your content has been imported. I gave a talk about this sort of thing last week at the Online Information Show in London, and am attempting to get the slides put online, when I c

Re: only index synonyms

2010-12-07 Thread lee carroll
Hi tom This seems to place in the index This is a scenic line of words I just want scenic and words in the index I'm not at a terminal at the moment but will try again to make sure. I'm sure I'm missing the obvious Cheers lee On 7 Dec 2010 07:40, "Tom Hill" wrote: > Hi Lee, > > > On Mon, Dec 6,

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Hamid Vahedi
hi Sven no, only auto commit 1000 1000 From: Sven Almgren To: solr-user@lucene.apache.org Sent: Tue, December 7, 2010 1:54:40 PM Subject: Re: Solr & JVM performance issue after 2 days Have you run any optimize requests yet? /Sven O

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Peter Karich
Hi Hamid, try to avoid autowarming when indexing (see solrconfig.xml: caches->autowarm + newSearcher + maxSearcher). If you need to query and indexing at the same time, then probably you'll need one read-only core and one for writing with no autowarming configured. See: http://wiki.apache.or

Re: Solr & JVM performance issue after 2 days

2010-12-07 Thread Sven Almgren
Have you run any optimize requests yet? /Sven On Tue, Dec 7, 2010 at 08:40, Hamid Vahedi wrote: > Hi, > > I am using multi-core tomcat on 2 servers. 3 language per server. > > I am adding documents to solr up to 200 doc/sec. when updating process is > started, every thing is fine (update perform

Re: Solr Newbie - need a point in the right direction

2010-12-07 Thread Gora Mohanty
On Tue, Dec 7, 2010 at 9:12 AM, Mark wrote: [...] > What I'm trying to do is extract some (presumably) structured information > from non-uniform data (eg, prices from a nutch crawl) that needs to show in > search queries, and I've come up against a wall. > > I've been unable to figure out where is

Re: DIH - rdbms to index confusion

2010-12-07 Thread Stefan Matheis
Hi, have a look at the mysql-query-log - it will tell you what queries are executed from the solr dih. so you'll see which variables are empty/not set as expected and therefore maybe missing in the result. otherwise (for the rest of the list) it would be easier to help you, when you're using real