Re: newbie question re solr.PatternReplaceFilterFactory

2017-05-10 Thread Erick Erickson
First use PatternReplaceCharFilterFactory. The difference is that PatternReplaceCharFilterFactoryworks on the entire input whereas PatternReplaceFilterFactory works only on the tokens emitted by the tokenizer. Concrete example using WhitespeceTokenizerFactory would be this [is some ] text PatternRe

newbie question re solr.PatternReplaceFilterFactory

2017-05-10 Thread Michael Tobias
I am sure this is very simple but I cannot get the pattern right. How can I use solr.PatternReplaceFilterFactory to remove all words in brackets from being indexed? eg [ignore this] thanks Michael

SolrSpellChecker not showing suggestions when the first character of a word is wrong

2017-05-10 Thread aruninfo100
Hi All, I am trying to do spell check with Solr.I am able to get suggestions when the word is incorrectly spelled. Eg:-word entered(incorrectly) :*maintaan* I am getting *"maintain" *as suggestion,but if I provide *naintain*,it doesnt provide suggestions. *solrConfig:* text_general default sp

Re: Automatic conversion to Range Query

2017-05-10 Thread Chris Hostetter
: I'm facing a issue when i'm querying the Solr : my query is "xiomi Mi 5 -white [64GB/ 3GB]" ... : +(((Synonym(nameSearch:xiaomi nameSearch:xiomi)) (nameSearch:mi) : (nameSearch:5) -(Synonym(nameSearch:putih : nameSearch:white))*(nameSearch:[64gb/ TO 3gb])*)~4) ... : Now due to aut

Re: Underlying file changed by an external force

2017-05-10 Thread Erick Erickson
bq: All the core.properties files are each in their own directory with no overlap Not quite what I was asking. By definition, all core.properties are in their own directory. In fact Solr stops looking down the tree when it finds the first directory with core.properties in it and immediately moves

Re: Recommended index-size per core

2017-05-10 Thread Toke Eskildsen
S G wrote: > *Rough estimates for an initial size:* > > 50gb index is best served if all of it is in memory. Assuming you need low latency and/or high throughput, yes. I mention this because in many cases the requirements for number of simultaneous users and response times are known (at least

Re: solrcloud collections restore documentation is confusing about the restored configset

2017-05-10 Thread Shawn Heisey
On 5/10/2017 12:26 PM, Webster Homer wrote: > Looking at the solrcloud restore API I am confused about the solr > configuration > It seems to me that the restore should create a new configset in Zookeeper > from the backed up configset and use the new collection name as the name of > the config se

solrcloud collections restore documentation is confusing about the restored configset

2017-05-10 Thread Webster Homer
Looking at the solrcloud restore API I am confused about the solr configuration What configuration gets loaded into the restored collection? The one in Zookeeper or the one from the backup? Say I have a collection, BAZ which has a configuration BAZ.config Now I create a backup of BAZ I make chang

RE: Underlying file changed by an external force

2017-05-10 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
> You need to look at all of your core.properties files and see if any of them > point to the same data directory. All the core.properties files are each in their own directory with no overlap. > Second: if you issue a "kill -9" you can leave write locks lingering. We manage our Solr instances

Recommended index-size per core

2017-05-10 Thread S G
Hi, Is there a recommendation on the size of index that one should host per core? Idea is to come up with an *initial* shard/replica setting for a load test. And then arrive at a good cluster size based on that testing. *Example: * Num documents: 100 million Average document size: 1kb So total

Re: SOLR as nosql database store

2017-05-10 Thread Bharath Kumar
Thanks Walter and Mike. In our use case we have same schema on both source and target sites. The idea is if we can avoid mysql replication on the target site for a particular table in our mysql schema. Currently, we index some of the fields in that table in solr, we want to move all the fields to s

Re: SOLR as nosql database store

2017-05-10 Thread Bharath Kumar
Yes Mike we have CDCR replication as well. On Wed, May 10, 2017 at 1:15 AM, Mike Drob wrote: > > The searching install will be able to rebuild itself from the data > storage install when that > is required. > > Is this a use case for CDCR? > > Mike > > On Tue, May 9, 2017 at 6:39 AM, Shawn Heise

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Alexandre Rafalovitch
Just FYI. The average number of Solr answers on SO is probably less than one. Whether they get accepted even when answered is a different issue. The questions do seem to have a larger back-and-forth (in comments) than other topics though. That's something Mailing List is much better for. Finally

Create core with bin/solr where BasicAuth is setup

2017-05-10 Thread bay chae
Hi, I have basic auth implemented in solr and can create a core with 'curl —user…’ and through the web interface with username and password entered. I can create a core: bin/solr create -c bore with this in solr.in.sh: SOLR_AUTH_TYPE="basic" SOLR_AUTHENTICATION_OPTS="-Dbasicauth=solr:SolrRoc

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Walter Underwood
Sure, here is an example. The accepted answer doesn’t really answer the question. Mine finally got an equal number of votes, but is not accepted. Essentially, this is voting on physics, which is not a good way to find engineering solutions. https://ham.stackexchange.com/questions/337/why-do-con

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Alexandre Rafalovitch
On 10 May 2017 at 11:50, Karl-Philipp Richter wrote: > Would you (all) consider it useful to cross-post a SO question on this > list? I think most of the times, it would be more efficient to post Solr-only questions on this mailing list in the first place. But people who find SO somehow do not fi

Solr 6.5.0 sql select * issue because of invalid "score" field?

2017-05-10 Thread kringe
Hello, Running solr 6.5.0 I have a collection called TestIndex with a schema that has a few fields that are all, among other things, set up as docValues fields so I can perform unlimited sql queries against the collection. After ingesting 10 documents I try to use the Solr admin UI to perf

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Alexandre Rafalovitch
I think I am the only person answering both SO Solr tag and Mailing List questions comparatively frequently. Less now than before, but I still track SO it by subscribing to the solr tag newsletter. There are some other strong users answering SO tag, but I don't think they are on the Mailing List. O

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Karl-Philipp Richter
Hi, Am 10.05.2017 um 15:01 schrieb Shawn Heisey: > I don't seek out questions on SO, but if one happens to come my way that > I can answer, there's a good chance I will post. Most of the time I see > SO posts via some other medium, though -- like this list or the #solr > IRC channel. That kind o

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Karl-Philipp Richter
Hi, Am 10.05.2017 um 17:03 schrieb Walter Underwood: > I have contributed some answers in the amateur radio group. Stack Overflow > has a bad > tendency to get stuck on the earliest “might be right” answer, even if it is > wrong. Very > frustrating. This happens a lot with questions about antenn

Re: Replicating the master node for a fail over scenario

2017-05-10 Thread Erick Erickson
This is really what SolrCloud was built for, particularly CDCR (Cross Data Center Replication) for remote DCs. For the master/slave situation there's nothing automatic, it's a roll-your-own type thing. People have done things like: 1> any replica can be "promoted" to master with configuration cha

Replicating the master node for a fail over scenario

2017-05-10 Thread Dominik Niziński
Hello, we're successfully using solr in our application for a few months now. Recently we've got asked if there is a possibility of having multiple master nodes (in case of some disaster happening in one of server locations). Basically what we want to do is having a few master nodes running at the

Re: Solrcloud collection restore puts 2 replicas on the same node

2017-05-10 Thread Erick Erickson
bq; When will 6.6 be released Real Soon Now. The release process has started, the first RC will probably be cut sometime next week. After that, the process will take 3-4 days. Any issues found will reset that "3-4 days" as another RC is spun. Best, Erick On Wed, May 10, 2017 at 8:04 AM, Webster

Re: distribution of leader and replica in SolrCloud

2017-05-10 Thread Erick Erickson
Bernd: Short form: Worrying about which node is the leader is wasting your time. Details below: Why do you care what nodes the leaders are on? There has to be some concern you have about co-locating the leaders on the same node or you wouldn't be spending the time on it. Please articulate that co

Re: Solrcloud collection restore puts 2 replicas on the same node

2017-05-10 Thread Webster Homer
Yes that looks like the issue I'm seeing. When will 6.6 be released? On Wed, May 10, 2017 at 9:42 AM, Erick Erickson wrote: > Possibly https://issues.apache.org/jira/browse/SOLR-9527? > > On Wed, May 10, 2017 at 7:34 AM, Webster Homer > wrote: > > I am running Solr 6.2 on a 4 node cluster > > >

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Walter Underwood
I just checked, and it has been 3.5 years since I’ve answered anything about solr on Stack Overflow. It’s been 30 minutes since I answered something here. I have contributed some answers in the amateur radio group. Stack Overflow has a bad tendency to get stuck on the earliest “might be right”

Re: Search substring in field

2017-05-10 Thread Emir Arnautovic
Hi, Solr works on top of data structure called inverted index . You can misuse it and do not invert your documents and use regex or wildcards to find matches, but that is not the way to use it - it'll be significantly slower. Solr does support su

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Erick Erickson
Personally I have all I can do to keep up with this list and the dev list and, you know, do my day job ;) I've seen quite a few references to SO for Solr questions, and the times I've perused them the answers I've been impressed. Just don't have time. On Wed, May 10, 2017 at 6:01 AM, Shawn Heisey

Re: How to Speed Up Solr ResposeWriter

2017-05-10 Thread Erick Erickson
You need to describe your problem more fully. The response writer is rarely a bottleneck, so I'm guessing there are things you aren't telling us. Are you returning thousands of rows? Are the documents huge? Details matter. Best, Erick On Wed, May 10, 2017 at 5:34 AM, Prashobh Chandran wrote: > H

Re: Solrcloud collection restore puts 2 replicas on the same node

2017-05-10 Thread Erick Erickson
Possibly https://issues.apache.org/jira/browse/SOLR-9527? On Wed, May 10, 2017 at 7:34 AM, Webster Homer wrote: > I am running Solr 6.2 on a 4 node cluster > > Each collection has 2 shards and a replication factor of 2 > > Normally when I create a collection I see a replica on each node, which is

Re: SOLR as nosql database store

2017-05-10 Thread Walter Underwood
CDCR doesn’t rebuild it so much as copy it. To change the schema, you’ll need to reindex. I’ve worked on two NoSQL databases (Objectivity and MarkLogic) and I’ve worked on Solr. They are utterly different designs, intended to do different things. wunder Walter Underwood wun...@wunderwood.org ht

Solrcloud collection restore puts 2 replicas on the same node

2017-05-10 Thread Webster Homer
I am running Solr 6.2 on a 4 node cluster Each collection has 2 shards and a replication factor of 2 Normally when I create a collection I see a replica on each node, which is what I would expect. However when I restore a backup to a new collection I see that one node has two replicas on it. The

How to Speed Up Solr ResposeWriter

2017-05-10 Thread Prashobh Chandran
Hi, Currently we are using solr 5.3.1 engine, Im getting json format results from engine. But it's taking time to getting results, So i need to speed up solr response writer. Is there anyway? Please reply asap... Regards, Prasobh

Re: Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Shawn Heisey
On 5/10/2017 6:31 AM, Karl-Philipp Richter wrote: > Do developers and power users (which are famous on the mailing > list(s)) support the `solr` tag on stackoverflow.com? There's no > definite answer, I know, but someone might do an educated guess. I don't seek out questions on SO, but if one hap

Re: Solr Query Limits

2017-05-10 Thread Shawn Heisey
On 5/10/2017 12:33 AM, Adnan Shaikh wrote: > Thanks Alexandre for the update. > > Please help me to understand the other part of the query as well , if there > is any limit to how many values we can pass for a key. The limit is not the number of values, but the size of the request in bytes. A ty

Do developers and power users support stackoverflow.com solr tag?

2017-05-10 Thread Karl-Philipp Richter
Hi, Do developers and power users (which are famous on the mailing list(s)) support the `solr` tag on stackoverflow.com? There's no definite answer, I know, but someone might do an educated guess. -Kalle signature.asc Description: OpenPGP digital signature

Re: SOLR as nosql database store

2017-05-10 Thread Shawn Heisey
On 5/10/2017 2:15 AM, Mike Drob wrote: >> The searching install will be able to rebuild itself from the data > storage install when that > is required. > > Is this a use case for CDCR? Does CDCR require an identical schema between locations? If not, then I think CDCR can keep a searching install

Re: distribution of leader and replica in SolrCloud

2017-05-10 Thread Rick Leir
Myself, I am still in the old camp. For critical machines, I want to know that it is my machine, with my disks, and what software is installed exactly. But maybe the cloud provider's fast network is more important? Cheers--Rick On May 10, 2017 6:13:27 AM EDT, Bernd Fehling wrote: >Hi Rick, > >

RE: 6.5.1. cloud went partially down

2017-05-10 Thread Markus Jelsma
I am not this is directly related but we also sometimes see clients losing connections on 6.5.1, this with the problem described below are unique to 6.5.1, i have not seen this many issues with cloud in a short time for a very long time. 2017-05-09 21:30:36.661 ERROR (Document compiler) [c:log

Re: distribution of leader and replica in SolrCloud

2017-05-10 Thread Bernd Fehling
Hi Rick, yes I have distributed 5 virtual server accross 5 physical machines. So each virtual server is on a separate physical machine. Splitting each virtual server (64GB RAM) into two (32GB RAM), which then will be 10 virtual server accross 5 physical machines, is no option because there is no

Need help in understanding solr clustering component

2017-05-10 Thread yauza
I was looking(in process of making my own) into solr's default clustering component for carrot2. In the clustering component class there are 2 methods where the clustering algorithms are called: in the overridden process method SolrDocumentList solrDocList = SolrPluginUtils.docListToSolrDocumentL

Re: Solr Query Limits

2017-05-10 Thread Alexandre Rafalovitch
How many values are you trying to pass in? And in which format? And what issues are you facing? There are too many variables here to give a generic advice. Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 10 May 2017 at 02:33, Adnan Shaikh wro

Re: distribution of leader and replica in SolrCloud

2017-05-10 Thread Rick Leir
Bernd, Yes, cloud, ahhh. As you say, the world changed. Do you have any hint from the cloud provider as to which physical machine your virtual server is on? If so, you can hopefully distribute your replicas across physical machines. This is not just for reliability: in a sharded system, each

Re: SOLR as nosql database store

2017-05-10 Thread Mike Drob
> The searching install will be able to rebuild itself from the data storage install when that is required. Is this a use case for CDCR? Mike On Tue, May 9, 2017 at 6:39 AM, Shawn Heisey wrote: > On 5/9/2017 12:58 AM, Bharath Kumar wrote: > > Thanks Hrishikesh and Dave. We use SOLR cloud with

Re: SOLR as nosql database store

2017-05-10 Thread Bharath Kumar
Thanks Shawn and Rick for your suggestions. We will surely look at these options. On Tue, May 9, 2017 at 4:39 AM, Shawn Heisey wrote: > On 5/9/2017 12:58 AM, Bharath Kumar wrote: > > Thanks Hrishikesh and Dave. We use SOLR cloud with 2 extra replicas, > will that not serve as backup when somethi