Couple of question about Virtualization and Load Balancer

2016-01-21 Thread Gian Maria Ricci - aka Alkampfer
Hi, I've a couple of quick question about production setup. The first one is about virtualization, I'd like to know if there are any official test on loss of performance in virtualization environment. I think that the loss of performance is negligible, and quick question on test

How get around solr's spellcheck maxEdit limit of 2?

2016-01-21 Thread Nitin Solanki
I am using Solr for spell Correction. Solr is limited to maxEdit of 2. Does there is anyway to increase that maxEdit without using phonetic mapping ? Please any suggestions

Re: How to achieve exact string match query which includes spaces and quotes

2016-01-21 Thread Binoy Dalal
You should use the client utils method. That way you're set even if you encounter special characters like + or () etc. On Thu, 21 Jan 2016, 13:40 Alok Bhandari wrote: > Thanks Erick for your reply. Because of some medical reason I was out of > office for a week.

Re: Restricting/Filtering facet results

2016-01-21 Thread Binoy Dalal
The facet.query parameter is what you're looking for. Use it like so: =field:(UserB OR UserC) Check out the wiki for more details On Thu, 21 Jan 2016, 14:41 Achim Domma wrote: > Hi, > > is there some way to restrict/filter the facetting results? Our use case > is the

Re: Scaling SolrCloud

2016-01-21 Thread Yago Riveiro
Is not a typo. I was wrong, for zookeeper 2 nodes still count as majority. It's not the desirable configuration but is tolerable. Thanks Erick. \-- /Yago Riveiro > On Jan 21 2016, at 4:15 am, Erick Erickson erickerick...@gmail.com wrote: > > bq: 3 are to risky, you lost one you

Re: How to achieve exact string match query which includes spaces and quotes

2016-01-21 Thread Alok Bhandari
Thanks Erick for your reply. Because of some medical reason I was out of office for a week. ClientUtils.escapeQueryChars method from solrj client should be used? or you think its better to escape only quote " character. -- View this message in context:

Restricting/Filtering facet results

2016-01-21 Thread Achim Domma
Hi, is there some way to restrict/filter the facetting results? Our use case is the following: Our documents have a multi value field, which holds user ids, so the values might be like this: doc1 = ['UserA', 'UserB', 'UserC'] doc2 = ['UserA', 'UserB'] doc3 = ['UserA', 'UserC'] Now I execute a

Recall: schemaless vs schema based core

2016-01-21 Thread Prateek Jain J
Prateek Jain J would like to recall the message, "schemaless vs schema based core".

RE: schemaless vs schema based core

2016-01-21 Thread Prateek Jain J
Thanks Erick, Yes, I took same approach as suggested by you. The issue is some developers started with schemaless configuration and now they have started liking it and avoiding restrictions (including increased time to deploy application, in managed enterprise environment). I was more

Re: Couple of question about Virtualization and Load Balancer

2016-01-21 Thread Jack Krupansky
Official numbers? There are none. If for no other reason than that performance is completely dependent on your specific hardware and your specific data and your specific data model. The standard recommendation is that you should do a proof of concept implementation with a reasonable subset of your

Re: Restricting/Filtering facet results

2016-01-21 Thread Achim Domma
If I specify the query like this, I will get only one facet back. Or am I wrong? I would have to specify one query for UserB and one for UserC. My filter list can contain thousands of users, so specifying individual queries is not an option. On 21.01.2016 11:43, Binoy Dalal wrote: > The

RE: Couple of question about Virtualization and Load Balancer

2016-01-21 Thread Davis, Daniel (NIH/NLM) [C]
> The first one is about virtualization, I'd like to know if there are any > official test > on loss of performance in virtualization environment. I think that the loss of > performance is negligible, and quick question on test infrastructure is > confirming > this, but I'd like to know if there

Read time out exception - exactly 10 minutes after starting committing

2016-01-21 Thread adfel70
I am running soft commit on 100 solr docs (the index itself has 3 Billion docs). After EXACTLY 10 minutes (for example, start committing on 15:52:55.932, exception on 16:02:55.976) I am getting several exception of the sort: org.apache.solr.client.solrj.SolrServerException: Timeout occured while

Re: schemaless vs schema based core

2016-01-21 Thread Shawn Heisey
On 1/21/2016 2:22 AM, Prateek Jain J wrote: > Thanks Erick, > > Yes, I took same approach as suggested by you. The issue is some developers > started with schemaless configuration and now they have started liking it and > avoiding restrictions (including increased time to deploy application, in

Best practices for Solr highlighter for CJK

2016-01-21 Thread Vu Hieu
Sent from Windows Mail

Re: Scaling SolrCloud

2016-01-21 Thread Erick Erickson
NP. My usual question though is "how often do you expect to lose a second ZK node before you can replace the first one that died?" My tongue-in-cheek statement is often "If you're losing two nodes regularly, you have problems with your hardware that you're not really going to address by adding

[ANNOUNCE] Apache Nutch 2.3.1 Release

2016-01-21 Thread lewis john mcgibbney
Hi Folks, !!Apologies for cross posting!! The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.3.1, we advise all current users and developers of the 2.X series to upgrade to this release. Nutch is a well matured, production ready Web crawler. Nutch 2.X branch

Re: Restricting/Filtering facet results

2016-01-21 Thread Erick Erickson
Yes, you'll get one facet back, but you can specify multiple facet.query clauses, as =field:UserB =field:UserC and get two back. Best, Erick On Thu, Jan 21, 2016 at 7:43 AM, Achim Domma wrote: > If I specify the query like this, I will get only one facet back. Or am > I

Re: How get around solr's spellcheck maxEdit limit of 2?

2016-01-21 Thread Erick Erickson
bq: ...is anyway to increase that maxEdit IIUC, increasing maxEdit beyond 2 increases the space/time required unacceptably, that limit is there on purpose, put there by people who know their stuff. Best, Erick On Thu, Jan 21, 2016 at 12:39 AM, Nitin Solanki wrote: > I

Re: collection aliasing,solrctl

2016-01-21 Thread Erick Erickson
I suggest you ask on the Cloudera support forums. But you have access to a browser connected to the system, right? Just issue the command in a browser window or CURL is one option. Best, Erick On Wed, Jan 20, 2016 at 10:59 PM, vidya wrote: > Hi > > I am using solr with

Re: schemaless vs schema based core

2016-01-21 Thread Erick Erickson
I guess it's all about whether schemaless really supports 1> all the docs you index. 2> all the use-cases for search. 3> the assumptions it makes scale to you needs. If you've established rigorous tests and schemaless does all of the above, I'm all for shortening the cycle by using schemaless.

Re: Scaling SolrCloud

2016-01-21 Thread Walter Underwood
Alternatively, do you still want to be protected against a single failure during scheduled maintenance? With a three node ensemble, when one Zookeeper node is being updated or moved to a new instance, one more failure means it does not have a quorum. With a five node ensemble, three nodes

Re: Rolling upgrade to 5.4 from 5.0 - "bug" caused by leader changes - is there a workaround?

2016-01-21 Thread Ishan Chattopadhyaya
Perhaps you could stay on 5.4.1 RC2, since that is what 5.4.1 will be (unless there are last moment issues). On Wed, Jan 20, 2016 at 7:50 PM, Michael Joyner wrote: > Unfortunately, it really couldn't wait. > > I did a rolling upgrade to the 5.4.1RC2 then downgraded

sharedLib node in solr.xml

2016-01-21 Thread Bob Lawson
I am working with the gettingstarted example from the Solr Quick Start guide. I developed a couple of Solr plugins and I want to tell Solr where to find them, and I'm following this guide to do it. For a single core it was easy, I just put them in a lib

Re: sharedLib node in solr.xml

2016-01-21 Thread Shawn Heisey
On 1/21/2016 11:13 AM, Bob Lawson wrote: > I am working with the gettingstarted example from the Solr Quick Start > guide. I developed a couple of Solr plugins and I want to tell Solr where > to find them, and I'm following this > guide to do it. For a

Re: sharedLib node in solr.xml

2016-01-21 Thread Bob Lawson
Thanks. I tried what you said and sill have problems. I removed sharedLib from solr.xml, so solr.xml is back to its original state. I then placed a lib directory containing my jar into the solr home directory, which is /solr-5.4.0/server/solr. I then ran bin/solr start -e cloud -noprompt

RE: How get around solr's spellcheck maxEdit limit of 2?

2016-01-21 Thread Dyer, James
But if you really need more than 2 edits, I think IndexBasedSpellChecker supports it. James Dyer Ingram Content Group -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, January 21, 2016 11:29 AM To: solr-user Subject: Re: How get around solr's

Re: Read time out exception - exactly 10 minutes after starting committing

2016-01-21 Thread Shawn Heisey
On 1/21/2016 8:16 AM, adfel70 wrote: > I am running soft commit on 100 solr docs (the index itself has 3 Billion > docs). > After EXACTLY 10 minutes (for example, start committing on 15:52:55.932, > exception on 16:02:55.976) I am getting several exception of the sort: >

Re: sharedLib node in solr.xml

2016-01-21 Thread Bob Lawson
I just wanted to give an update on this problem. I decided to reinstall Solr 5.4.0 and try again, following the simplest procedure I could. First I created a single core collection called 'test'. I modified solrconfig.xml to use my plugin. I created a lib directory under the test folder. The

Re: Rolling upgrade to 5.4 from 5.0 - "bug" caused by leader changes - is there a workaround?

2016-01-21 Thread Michael Joyner
On 01/21/2016 01:22 PM, Ishan Chattopadhyaya wrote: Perhaps you could stay on 5.4.1 RC2, since that is what 5.4.1 will be (unless there are last moment issues). On Wed, Jan 20, 2016 at 7:50 PM, Michael Joyner wrote: Unfortunately, it really couldn't wait. I did a rolling

Re: sharedLib node in solr.xml

2016-01-21 Thread Shawn Heisey
On 1/21/2016 12:21 PM, Bob Lawson wrote: Thanks. I tried what you said and sill have problems. I removed sharedLib from solr.xml, so solr.xml is back to its original state. I then placed a lib directory containing my jar into the solr home directory, which is /solr-5.4.0/server/solr. I then

Re: sharedLib node in solr.xml

2016-01-21 Thread Shawn Heisey
On 1/21/2016 1:56 PM, Bob Lawson wrote: I then copied the exact same solrconfig.xml file to /solr-5.4.0/server/solr/configsets/data_driven_schema_configs/conf. I copied the exact same lib directory to /solr-5.4.0/server/solr. I tried to launch the cloud gettingstarted example by executing:

Re: sharedLib node in solr.xml

2016-01-21 Thread Bob Lawson
Shawn, that was it! I copied my lib directory to both /solr-5.4.0/example/cloud/node1/solr and /solr-5.4.0/example/cloud/node2/solr. Everything ran perfectly. Maybe the documentation should be updated with this clarification, because it currently makes it sound like all you have to do is put

One complex wildcard query lead solr OOM

2016-01-21 Thread Jian Mou
We are using Solr as our search engine, and recently notice some user input wildcard query can lead to Solr dead loop in org.apache.lucene.util.automaton.Operations.determinize() , and it also eats memory and finally OOM. the wildcard query seems like **?-???o·???è??**。 Although we

Re: One complex wildcard query lead solr OOM

2016-01-21 Thread Jack Krupansky
The Lucene WildcardQuery class does have an additional constructor that has a maxDeterminizedStates parameter to limit the size of the FSM generated by a wildcard queery, and the QueryParserBase class does have a method to set that parameter, setMaxDeterminizedStates, but there is no Solr support

Re: solr error

2016-01-21 Thread Midas A
Hi, Please find attached detail logs, Please help me to figure it out. On Fri, Jan 15, 2016 at 2:50 AM, Shawn Heisey wrote: > On 1/14/2016 12:08 AM, Midas A wrote: > > we are continuously getting the error > > "null:org.eclipse.jetty.io.EofException" > > on slave . > > >

Close and open vendor sorting

2016-01-21 Thread irshad siddiqui
Hi, I am using solr 5.1 in my search . my search are working fine today i got one new requirement that show first all open vendor and then close vendor. All vendor have different time oping and closing for a day Example:*"Royal China"* vendor on Friday have 3 time slot for opening and

Mix Solr 4 and 5?

2016-01-21 Thread jimi.hullegard
Hi, Long story short, we use a CMS that is integrated with Solr 4.6, with the solrj jar file in the global/common Tomcat classpath. We currently use a Google Search Appliance machine for our own freetext search needs, but plan to replace that with some other solution in the near future. Since

Re: sharedLib node in solr.xml

2016-01-21 Thread Bob Lawson
Correction: Thanks. I tried what you said and sill have problems. I removed sharedLib from solr.xml, so solr.xml is back to its original state. I then placed a lib directory containing my jar into the solr home directory, which is /solr-5.4.0/server/solr. I then ran bin/solr start -e cloud