Faceting Word Count

2017-11-05 Thread Wael Kader
Hello, I am having an index with around 100 Million documents. I have a multivalued column that I am saving big chunks of text data in. It has around 20 GB of RAM and 4 CPU's. I was doing faceting on it to get word cloud but it was taking around 1 second to retrieve when the data was 5-10 Million

Long blocking during indexing + deleteByQuery

2017-11-05 Thread Chris Troullis
Hi, I am experiencing an issue where threads are blocking for an extremely long time when I am indexing while deleteByQuery is also running. Setup info: -Solr Cloud 6.6.0 -Simple 2 Node, 1 Shard, 2 replica setup -~12 million docs in the collection in question -Nodes have 64 GB RAM, 8 CPUs, spinni

Re: SolrClould 6.6 stability challenges

2017-11-05 Thread Rick Dig
hi Emir - the document size would be an average of less than 1.5kb. it is actually 2000 queries / min - queries are primarily autocomplete + highlighting (on a multivalued field with different payloads), search and faceting . what should we watch for that would indicate that we are overloading th

Re: SolrClould 6.6 stability challenges

2017-11-05 Thread Rick Dig
hi Shawn, all, answers inline. also, another discovery, not sure if completely useful. even when we increase the autocommit values to say an hour, the nodes go "down" in 10-15 minutes. so either we are doing something wrong with autocommit settings and commits are continuing to happen frequently (h

solr cloud updatehandler stats mismatch

2017-11-05 Thread Wei
Hi, I use the following api to track the number of update requests: /solr/collection1/admin/mbeans?cat=UPDATE&stats=true&wt=json Result: - class: "org.apache.solr.handler.UpdateRequestHandler", - version: "6.4.2.1", - description: "Add documents using XML (with XSLT), CSV, JSON, or

Re: SolrClould 6.6 stability challenges

2017-11-05 Thread Erick Erickson
Check the leader and follower logs for anything like "leader initiated recovery" (LIR). One thing I have seen where followers go into recovery is if, for some reason, the time it takes to respond to an update exceeds the timeout. The scenario is this: > leader sends an update > follower fails to re

Re: SolrClould 6.6 stability challenges

2017-11-05 Thread Shawn Heisey
On 11/3/2017 10:15 PM, Rick Dig wrote: we are trying to run solrcloud 6.6 in a production setting. here's our config and issue 1) 3 nodes, 1 shard, replication factor 3 2) all nodes are 16GB RAM, 4 core 3) Our production load is about 2000 requests per minute 4) index is fairly small, index size

Re: match in order

2017-11-05 Thread Emir Arnautović
Hi Vincenco, Since it is about boosting, you might also take a look at edismax and pf2 and pf3 fields. It also supports slop (ps2 and ps3). HTH, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 5 Nov

Re: SolrClould 6.6 stability challenges

2017-11-05 Thread Emir Arnautović
Hi Rick, I quickly looked at GC logs and didn’t see obvious issues. You mentioned that batch processing takes ~20s and it is 500 documents. With 5-7 indexing thread it is ~150 documents/s. Are those big documents? With 200 queries/min (~3-4 queries/s - what sort of queries?) and 5-7 indexing thr