Re: Migration: SOLR8-Java8 -> SOLR8-JAVA11 indexing issue.

2019-10-24 Thread Shawn Heisey
On 10/24/2019 11:50 AM, Junagade, Anup wrote: * Attachment 1: OpenJDK 8 vs OpenJDK 8 key metrics * Attachment 2: OpenJDK 8 vs OpenJDK 8 waiting QTP Threads * Attachment 3: OpenJDK 11 Thread dump There are no attachments. Apache mailing lists swallow almost all attachments. Yo

Re: solr configuration issue

2019-10-24 Thread Shawn Heisey
On 10/24/2019 1:52 AM, Danilo Tomasoni wrote: For every document processed, a soft commit is performed to make the update visible to other concurrent update processes. This is not the way to do things. Doing a commit after every document means that Solr will spend more time doing commits than
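A minimal sketch of the alternative to committing after every document: send many documents in one update request and let Solr decide when to commit through the commitWithin parameter. The base URL, collection name, and the 10-second window are illustrative assumptions, not values from the thread. The function only builds the request so it can be inspected without a running server.

```python
import json
import urllib.parse
import urllib.request


def build_batch_update(base_url, collection, docs, commit_within_ms=10000):
    """Build one POST request that carries a whole batch of documents.

    commit_within_ms asks Solr to make the batch visible within that many
    milliseconds, instead of the client issuing a commit per document.
    """
    query = urllib.parse.urlencode({"commitWithin": commit_within_ms})
    url = f"{base_url}/{collection}/update?{query}"
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical documents and collection name, for illustration only.
    docs = [{"id": str(i), "title": f"doc {i}"} for i in range(1000)]
    req = build_batch_update("http://localhost:8983/solr", "mycollection", docs)
    print(req.full_url)
    # To actually send it: urllib.request.urlopen(req)
```

Whether 10 seconds is an acceptable visibility delay depends on the application; the point of the sketch is only that one request and one deferred commit replace a thousand commits.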

Re: WordDelimiter in extended way.

2019-10-23 Thread Shawn Heisey
On 10/23/2019 9:41 AM, servus01 wrote: Hey, thank you for helping me: Thanks in advance for any help, really appreciate it. It is not the WordDelimiter filter th

Re: Solr Prod stopped yesterday - saya "insufficient memory for the Java Runtime Environment"

2019-10-23 Thread Shawn Heisey
On 10/23/2019 9:08 AM, Vignan Malyala wrote: Ok. I have around 500 cores in my solr. So, how much heap I should allocate in solr and jvm? (Currently as I see, in solr.in.sh shows heap as - Xms 20g -Xmx 20g. And my system jvm heap shows -Xms 528m -Xmx 8g. I've re-checked it.) We have no way of

Re: WordDelimiter in extended way.

2019-10-23 Thread Shawn Heisey
On 10/23/2019 7:43 AM, servus01 wrote: Now Solr behaves in such a way that on the one hand the hyphens which have a blank before and after are not indexed and also the search as soon as blank - blank is searched does not return any results. With the WordDelimiter I have already covered the cases

Re: Solr Prod stopped yesterday - saya "insufficient memory for the Java Runtime Environment"

2019-10-23 Thread Shawn Heisey
On 10/23/2019 4:09 AM, Vignan Malyala wrote: *Solr prod stopped yesterday. How to prevent this.* Solr heap info is : -Xms20g -Xmx20g JVM Heap info. : -Xms528m -Xmx8g There is no such thing as a Solr heap separate from the JVM heap. There are multiple environment variables that can specify t

Re: Document Update performances Improvement

2019-10-23 Thread Shawn Heisey
On 10/22/2019 1:12 PM, Nicolas Paris wrote: We, at Auto-Suggest, also do atomic updates daily and specifically changing merge factor gave us a boost of ~4x Interesting. What kind of change exactly on the merge factor side ? The mergeFactor setting is deprecated. Instead, use maxMergeAtOnce,

Re: using the df parameter to set a default to search all fields

2019-10-22 Thread Shawn Heisey
On 10/22/2019 11:42 AM, Audrey Lorberfeld - audrey.lorberf...@ibm.com wrote: I think you actually can search over all fields, but not in the df parameter. We have a big list of fields we want to search over. So, we just put a dummy one in the df param field, and then we use the fl parameter. Wi

Re: using the df parameter to set a default to search all fields

2019-10-22 Thread Shawn Heisey
On 10/22/2019 11:17 AM, rhys J wrote: How would I enter qf parameters in the solrconfig.xml? In the handler definition you would simply add the parameter like you did when you added df. <str name="defType">edismax</str> <str name="qf">field1 field2 field3</str> I included the parameter in the above example that sw
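The same parameters can also be sent per request instead of being set in the handler definition. A small sketch, assuming the field names field1, field2, and field3 from the message above; the helper name is made up for illustration.

```python
from urllib.parse import urlencode


def edismax_query(user_query, fields):
    """Return a query string that searches several fields with edismax.

    qf takes a space-separated list of field names; every field still has
    to be listed explicitly, there is no wildcard for "all fields".
    """
    params = {
        "q": user_query,
        "defType": "edismax",
        "qf": " ".join(fields),
    }
    return urlencode(params)


if __name__ == "__main__":
    print(edismax_query("foo", ["field1", "field2", "field3"]))
```

Putting the parameters in the handler defaults, as the reply describes, keeps client requests short; sending them per request makes it easier to experiment with different field lists.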

Re: using the df parameter to set a default to search all fields

2019-10-22 Thread Shawn Heisey
On 10/22/2019 10:26 AM, rhys J wrote: How do I make Solr search on all fields in a document? Solr does not have a way to ask for all fields on a search. If you use the edismax query parser, you can specify multiple fields with the qf parameter, but there is nothing you can put in that parame

Re: Importing a csv file encapsulated by " creates a large copyField field of all fields combined.

2019-10-21 Thread Shawn Heisey
On 10/21/2019 11:24 AM, rhys J wrote: I am using this command: curl 'http://localhost:8983/solr/users/update/csv?commit=true&separator=%09&encapsulator=%20&escape=\&stream.file=/tmp/users.csv' The sequence %20 is a URL encoding of a space. If you intend the encapsulator character to be a do
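A quick way to check which percent-encoding corresponds to which character, so that separator and encapsulator carry the intended values. This is a generic URL-encoding sketch, not anything specific to Solr; the helper name is an assumption.

```python
from urllib.parse import quote, unquote


def encode_csv_param(char):
    """Percent-encode a single character for use in a URL parameter."""
    return quote(char, safe="")


if __name__ == "__main__":
    for label, char in [("space", " "), ("double quote", '"'), ("tab", "\t")]:
        print(f"{label}: {encode_csv_param(char)}")
    # Decoding shows what the command in the message actually sent.
    print(repr(unquote("%20")))
```

With these values, a file whose fields are wrapped in double quotes would use the encoding of the double quote for encapsulator rather than %20.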

Re: Solr Paryload example

2019-10-21 Thread Shawn Heisey
On 10/21/2019 7:59 AM, Vincenzo D'Amore wrote: But how many dynamic fields like this can I have? more than thousands? There are no actual limits on the number of fields, dynamic or not. You can have as many fields as you want per document, and as many fields as you want across the whole inde

Re: AEM 6.4 Compatibility

2019-10-18 Thread Shawn Heisey
On 10/18/2019 11:10 AM, Natalie Hannigan wrote: I am new to this group. I am working with a vendor to get Solr up and running with AEM 6.4. Has anyone had any experience with this? I am wanting to use Solr 8.1, but I cannot find documentation that says they are compatible. Does anyone know for

Re: Solr 8.2 docker image in cloud mode not connecting to Zookeeper on startup

2019-10-18 Thread Shawn Heisey
On 10/18/2019 9:28 AM, Drew Kidder wrote: I'm beginning to think that ZK is not setup correctly. I haven't uploaded any configuration files to ZK yet; my understanding was that I could start up a solr cloud node with no collections and upload the configuration from there. I was under the impressi

Re: Query on autoGeneratePhraseQueries

2019-10-16 Thread Shawn Heisey
On 10/16/2019 7:14 AM, Shubham Goswami wrote: I have implemented the sow=false property with eDismax Query parser but still it does not has any effect on the query as it is still parsing as separate terms instead of phrased one. We have seen reports that when sow=false, which is the default set

Re: Solr 7.6 frequent OOM with Java 9, G1 and large heap sizes - any tests with Java 13 and the new ZGC?

2019-10-15 Thread Shawn Heisey
On 10/15/2019 2:49 AM, Vassil Velichkov (Sensika) wrote: I've reduced the JVM heap on one of the shards to 20GB and then simulated some heavy load to reproduce the issue in a faster way. The solr.log ROOT was set to TRACE level, but I can't really see anything meaningful, the solr.log ends @ 07

Re: Minimum Tomcat version that supports latest Solr version

2019-10-15 Thread Shawn Heisey
On 10/15/2019 12:42 AM, vikas shinde wrote: Dear Solr team, Which is the latest Tomcat version that supports the latest Solr version 8.2.0? Also provide details about previous Solr versions & their compatible Tomcat versions. Dominique is correct. We do not officially support running under t

Re: Facet Advice

2019-10-15 Thread Shawn Heisey
On 10/14/2019 3:25 PM, Moyer, Brett wrote: Hello, looking for some advice, I have the suspicion we are doing Facets all wrong. We host financial information and recently "tagged" our pages with appropriate Facets. We have built a Flat design. Are we going at it the wrong way? In Solr we have a

Re: solr 8.1.1 many time slower returning query results than solr 4.10.4 or solr 6.5.1

2019-10-15 Thread Shawn Heisey
On 10/14/2019 1:36 PM, Russell Bahr wrote: Backend replacement of solr4 and hopefully Frontend replacement as well. solr-spec 8.1.1 lucene-spec 8.1.1 Runtime Oracle Corporation OpenJDK 64-Bit Server VM 12 12+33 1 collection 6 shards 5 replicas per shard 17,919,889 current documents (35 days wort

Re: Solr 7.6 frequent OOM with Java 9, G1 and large heap sizes - any tests with Java 13 and the new ZGC?

2019-10-14 Thread Shawn Heisey
On 10/14/2019 7:18 AM, Vassil Velichkov (Sensika) wrote: After the migration from 6.x to 7.6 we kept the default GC for a couple of weeks, then we started experimenting with G1 and we've managed to achieve less frequent OOM crashes, but not by much. Changing your GC settings will never pre

Re: Solr 7.6 frequent OOM with Java 9, G1 and large heap sizes - any tests with Java 13 and the new ZGC?

2019-10-14 Thread Shawn Heisey
On 10/14/2019 6:18 AM, Vassil Velichkov (Sensika) wrote: We have 1 x Replica with 1 x Solr Core per JVM and each JVM runs in a separate VMware VM. We have 32 x JVMs/VMs in total, containing between 50M to 180M documents per replica/core/JVM. With 180 million documents, each filterCache entry
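A rough back-of-the-envelope sketch of why large cores make the filterCache expensive. It assumes each cached filter is stored as a bitset with one bit per document in the core, which is the usual worst case; the cache size of 512 entries is a hypothetical value, not one taken from the thread.

```python
def filter_cache_entry_bytes(max_doc):
    """Approximate size of one bitset filter entry: one bit per document."""
    return (max_doc + 7) // 8


def filter_cache_bytes(max_doc, entries):
    """Approximate heap needed if the cache fills with bitset entries."""
    return filter_cache_entry_bytes(max_doc) * entries


if __name__ == "__main__":
    per_entry = filter_cache_entry_bytes(180_000_000)
    print(f"one entry: {per_entry / 1_000_000:.1f} MB")
    total = filter_cache_bytes(180_000_000, 512)  # 512 is an assumed cache size
    print(f"512 entries: {total / 1_000_000_000:.2f} GB")
```

Under these assumptions a full cache alone would need more than 11 GB of heap on a 180-million-document core, which is why cache sizes are worth checking before tuning the garbage collector.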

Re: Solr-8.2.0 Cannot create collection on CentOS 7.7

2019-10-11 Thread Shawn Heisey
On 10/10/2019 11:01 PM, Peter Davie wrote: I have just installed Solr 8.2.0 on CentOS 7.7.1908. Java version is as follows: openjdk version "11.0.4" 2019-07-16 LTS Caused by: java.time.format.DateTimeParseException: Text '2019-10-11T04:46:03.971Z' could not be parsed: null Note that

Re: Zk Status Error

2019-10-10 Thread Shawn Heisey
On 10/10/2019 9:00 AM, mdsholund wrote: I am also getting this error using ZK 3.5.5 and Solr 7.7.2. I have whitelisted mntr but still get a similar exception 2019-10-10 14:59:01.799 ERROR (qtp591391158-152) [ ] o.a.s.s.HttpSolrCall null:java.lang.ArrayIndexOutOfBoundsException: 1 at

Re: backup strategy

2019-10-04 Thread Shawn Heisey
On 10/4/2019 1:41 PM, Koen De Groote wrote: From what I've researched, backup and restore is pretty straightforward. BACKUP and RESTORE are collection commands and the backup is to be put on a shared filesystem. So far so good. I'm a bit concerned about the RESTORE action. A RESTORE command wi

Re: Updating Solr schema doesn't work

2019-10-04 Thread Shawn Heisey
On 10/4/2019 2:45 PM, Shawn Heisey wrote: It's probably not the way I would do it.  I would update a local copy of the config and then re-upload the entire config rather than dealing with a single file. You will also need to reload the collection or restart Solr, and then as al

Re: Updating Solr schema doesn't work

2019-10-04 Thread Shawn Heisey
On 10/4/2019 10:22 AM, amruth wrote: *- /opt/zookeeper/bin/zkCli.sh delete /configs/collection1/managed-schema - /opt/zookeeper/bin/zkCli.sh create /configs/collection1/managed-schema "`cat /var/solr/data/collection1/conf/managed-schema`"* I could see fields on managed-schema on Solr UI and when

Re: Turn off weighted search

2019-09-30 Thread Shawn Heisey
On 9/30/2019 9:06 AM, yuri.glad...@swisscom.com wrote: Is it possible to turn off the weighted search for Solr? I mean the results have to be presented in a pure alphabetical order, not by the default weighted order. So if a certain letter appears in a word 2 times, this word shouldn' t be rank

Re: Throughput does not increase in spite of low CPU usage

2019-09-30 Thread Shawn Heisey
On 9/29/2019 11:44 PM, Yasufumi Mizoguchi wrote: I am trying some tests to confirm if single Solr instance can perform over 1000 queries per second(!). In general, I would never expect a single instance to handle a large number of queries per second unless the index is REALLY small -- dozens

Re: How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-27 Thread Shawn Heisey
On 9/27/2019 10:39 AM, LEE Ween Jiann wrote: FYI, solr-zookeeper-headless resolves to 3 different IPs. And getZkRawResponse() method only connects to 1 of the 3. My suggestion is the following: - List zookeepers = Arrays.asList(zkHost.split("/")[0].split(",")); + List zookeeperHosts = Arrays.as
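The quoted Java line splits a zkHost string into its host list by cutting off the chroot and splitting on commas. The sketch below mirrors that logic in Python for illustration only; it is not the proposed patch, and the port 2181 in the example is an assumption.

```python
def parse_zk_host(zk_host):
    """Split a zkHost string into (list of host:port entries, chroot).

    Example input: "zk1:2181,zk2:2181,zk3:2181/solr"
    The part before the first "/" is the comma-separated ensemble,
    the rest (if any) is the chroot path.
    """
    hosts_part, sep, chroot = zk_host.partition("/")
    hosts = [h.strip() for h in hosts_part.split(",") if h.strip()]
    return hosts, (sep + chroot if sep else "")


if __name__ == "__main__":
    print(parse_zk_host("zk1:2181,zk2:2181,zk3:2181/solr"))
    # A single DNS name that resolves to several IPs yields one entry here,
    # which is the situation described in the message.
    print(parse_zk_host("solr-zookeeper-headless:2181"))
```

The second call shows the core of the problem: a headless service name is one string in the list, so any code that iterates over the list sees one server no matter how many addresses the name resolves to.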

Re: How to split a shard?

2019-09-26 Thread Shawn Heisey
On 9/26/2019 8:50 AM, Gael Jourdan-Weil wrote: We are trying to split a single shard into two but we are encountering some issues we don't understand. A) Create a new core "col_core2", then run the SPLIT (https://lucene.apache.org/solr/guide/7_6/coreadmin-api.html#coreadmin-split) If you

Re: URGENT Documents automatically getting deleted in SOLR 6.6.0

2019-09-26 Thread Shawn Heisey
On 9/26/2019 6:42 AM, Neha wrote: Today i have noticed that in my SOLR instance 6.6.0 documents are getting automatically deleted. In SOLR traces i found below lines and seems it is because of this. 2019-09-26 09:01:21.599 INFO  (qtp225493257-14) [   x:Ecotron] Also the "dataimport.propert

Re: How to resolve a single domain name to multiple zookeeper IP in Solr

2019-09-26 Thread Shawn Heisey
On 9/26/2019 4:12 AM, LEE Ween Jiann wrote: I'm trying to modify the helm chart for solr such that it works for kubernetes (k8s) deployment correctly. There needs to be a particular change in the way solr resolves zookeepers hostname in order for this to happen. This is the solr-user mailing

Re: Reloading after creating a collection

2019-09-19 Thread Shawn Heisey
On 9/19/2019 12:09 PM, Arnold Bronley wrote: I am not changing the the config to enable CDCR. I am just using the CDCR API to start it. Does that count as changing configuration? I would guess that there are no changes to the config from using the API, but I also admit that I have never tried

Re: Solr behaves wonky when zookeeper quorom is messed up.

2019-09-19 Thread Shawn Heisey
On 9/19/2019 9:22 AM, harjagsbby wrote: In our PROD SOLR cluster(7.6 and ZK:3.4.9) when Zookeeper leader fails Zookeeper enter an infinite leader election loop which makes SOLR instable. Solr Fails to index(as Expected with error "Remote error message: Cannot talk to ZooKeeper - Updates are disab

Re: Reloading after creating a collection

2019-09-19 Thread Shawn Heisey
On 9/19/2019 9:36 AM, Arnold Bronley wrote: Why is it that I need to reload collection after I created it? CDCR runs into issues if I do not do this. If the config doesn't change after creation, I would not expect that to be required. If you do change the config to enable CDCR after the coll

Re: Custom auth plugin for SolrCloud

2019-09-19 Thread Shawn Heisey
On 9/19/2019 6:18 AM, Zubovich Yauheni wrote: This class is wrapped into jar. Jar added to server lib directory and defined at solrconfig.xml: Where exactly is this "server lib" directory that you describe? If it's one of the locations that gets loaded automatically, you should NOT be load

Re: fq * vs [* TO *]

2019-09-19 Thread Shawn Heisey
On 9/19/2019 1:23 AM, Vincenzo D'Amore wrote: talking about how to write solr queries I was investigating if there is a difference of performance in these two filter queries: field:[* TO *] or field:* In other words: q=*:*&fq=field:[* TO *]&rows=0 q=*:*&fq=field:*&rows=0 The first one is a

Re: Moving a collection to a specific machine?

2019-09-18 Thread Shawn Heisey
On 9/18/2019 8:46 AM, Koen De Groote wrote: I'm looking into how I can move a collection to a specific machine. I have a set of machines, some with more/less CPU/RAM/Disk Space. Basically, I want to either move the specific collection to the appropriate machine(s). Or somehow say in the configu

Re: Question about "No registered leader" error

2019-09-18 Thread Shawn Heisey
On 9/18/2019 6:11 AM, Shawn Heisey wrote: On 9/17/2019 9:35 PM, Hongxu Ma wrote: My questions: * Is this error possible caused by "long gc pause"? my solr zkClientTimeout=6 It's possible. I can't say for sure that this is the issue, but it might be. A follo

Re: Question about "No registered leader" error

2019-09-18 Thread Shawn Heisey
On 9/17/2019 9:35 PM, Hongxu Ma wrote: My questions: * Is this error possible caused by "long gc pause"? my solr zkClientTimeout=6 It's possible. I can't say for sure that this is the issue, but it might be. * If so, how can I prevent this error happen? My thoughts: using G

Re: Unable to Create Collection on Solr Cloud 7.7.1

2019-09-17 Thread Shawn Heisey
On 9/17/2019 6:16 AM, iporritt wrote: I am trying to create a collection on Solr Cloud. I have firstly uploaded SolrConfig.xml and Schema.xml to Zookeeper. HTTP ERROR 500 \nProblem accessing /solr/admin/cores. Reason:\nServer Error Caused by: java.lang.NoClassDefFoundError: org/apache/lu

Re: Why I receive permission denied when running as root

2019-09-17 Thread Shawn Heisey
On 9/16/2019 9:06 PM, Raymond Xie wrote: [root@pocnnr1n1 solr]# bin/solr start -bash: bin/solr: Permission denied The first thing I'd check for this is whether the bin/solr script has execute permission. A lack of execute permissions can cause this problem. The first line of that script is
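A small sketch of the first check suggested above: whether the script has any execute permission bit set. It inspects the mode bits directly rather than calling os.access, so the answer does not depend on which user runs the check. The path bin/solr in the usage line is taken from the message; everything else is illustrative.

```python
import os
import stat
import sys


def has_execute_bit(path):
    """Return True if owner, group, or other has execute permission."""
    mode = os.stat(path).st_mode
    return bool(mode & (stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH))


if __name__ == "__main__":
    target = sys.argv[1] if len(sys.argv) > 1 else "bin/solr"
    if has_execute_bit(target):
        print(f"{target}: execute bit is set")
    else:
        print(f"{target}: no execute bit, try: chmod +x {target}")
```

Even root gets "Permission denied" when no execute bit is set at all, and also when the filesystem is mounted with noexec, which this check does not detect.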

Re: Cannot index into solr 8.1.1

2019-09-16 Thread Shawn Heisey
On 9/16/2019 10:18 AM, Bhuvanesh wrote: Recently I created a cloud Solr 8.1.1 with zookeeper similar to cloud Solr 6.6.2 which is in use. All configurations and schema files are exactly alike, but when I try to index the same documents Solr throws *cannot change field "FIELD_NAME" from* *index op

Re: In Place Updates: Can we filter on fields with only docValues="true"

2019-09-14 Thread Shawn Heisey
On 9/14/2019 4:29 PM, Mikhail Khludnev wrote: Shawn, would you mind to provide some numbers? I'm experimenting with lucene 8.0.0. I have 100 shard index of 100M docs with 2000 docVals only updateable fields. Searching for such field turns to be blazingly fast $ curl 'localhost:39200/books/_search

Re: JWT Auth plugin (https://github.com/cominvent/solr-auth-jwt) for Solr 7.2 Errors

2019-09-13 Thread Shawn Heisey
On 9/13/2019 10:30 AM, Tyrone Tse wrote: When I check the solr.log file I am seeing the following error 2019-09-13 15:09:27.816 ERROR (main) [ ] o.a.s.s.SolrDispatchFilter Could not start Solr. Check solr/home property and the logs 2019-09-13 15:09:27.838 ERROR (main) [ ] o.a.s.c.SolrCore nu

Re: OR and AND queries case sensitive in q param?

2019-09-12 Thread Shawn Heisey
On 9/12/2019 5:50 PM, Arnold Bronley wrote: in Solr 6.3, I was able to use OR and AND operators in case insensitive manner. The edismax query parser supports lowercase operators, if the lowercaseOperators parameter is set to true. I believe it defaults to false. Then if I pass 'rick OR mo

Re: Is it possible to skip scoring completely?

2019-09-12 Thread Shawn Heisey
On 9/12/2019 12:43 AM, Ashwin Ramesh wrote: I just tried a * query with filters with fl=id,score. I noticed that all scores were 1.0. Which I assume means no scoring was done. When I added a sort after that test, scores were still 1.0. I guess all I have to do is set q=* & set a sort. Don't us

Re: Is it possible to skip scoring completely?

2019-09-11 Thread Shawn Heisey
On 9/11/2019 5:27 PM, Ashwin Ramesh wrote: I was wondering if there is a way we can tell solr (7.3+) to run none of it's scoring logic. We would like to simply add a set of filter queries and order on a specific docValue field. e.g. "Give me all fq=color:red documents ORDER on popularityScore DE

Re: In Place Updates: Can we filter on fields with only docValues="true"

2019-09-10 Thread Shawn Heisey
On 9/10/2019 7:15 AM, Doss wrote: 4 to 5 million documents. For an NTR index, we need a field to be updated very frequently and filter results based on it. Will In-Place updates help us? Although you CAN search on docValues-only fields, the performance is terrible. So the answer I have for

Re: upgrading from solr4 to solr8 searches taking 4 to 10 times as long to return

2019-09-05 Thread Shawn Heisey
On 9/4/2019 12:48 PM, Russell Bahr wrote: Thank you for the feedback and advise. I have loaded the 2 screenshots up to drop box. Here is the link. https://www.dropbox.com/s/c5b41a61za0ojw7/solr4_Screen%20Shot%202019-09-03%20at%203.37.08%20PM.png?dl=0 Just one screenshot there. Looking at th

Re: upgrading from solr4 to solr8 searches taking 4 to 10 times as long to return

2019-09-03 Thread Shawn Heisey
On 9/3/2019 4:46 PM, Russell Bahr wrote: Hi Shawn, Here is a screenshot of one of the master nodes solr4 Screen Shot 2019-09-03 at 3.37.08 PM.png solr8 Screen Shot 2019-09-03 at 3.45.46 PM.png Email attachments do not make it to the list. I cannot see those pictures. You will need to use a

Re: upgrading from solr4 to solr8 searches taking 4 to 10 times as long to return

2019-09-03 Thread Shawn Heisey
On 9/3/2019 1:22 PM, Russell Bahr wrote: Yes, some of our queries are quite complex due to a lot of very specific positive as well as negative boosts, however, the query that I ran as the base test after we found our queries were taking so long is just " http://solr.obscured.com:8990/solr/content

Re: subscription request

2019-09-03 Thread Shawn Heisey
On 9/3/2019 11:47 AM, dev beautiful wrote: I want to subscribe solr mailing list. When I sent a request, I got the following message. Can you add this email address to the mailing list please? Thank you. Louis Choi --- This is the mail system at host n3.nabble.com. Nabble is a website th

Re: Query regarding Solr Cloud Setup

2019-09-03 Thread Shawn Heisey
On 9/3/2019 7:22 AM, Porritt, Ian wrote: We have a schema which I have managed to upload to Zookeeper along with the Solrconfig, how do I get the system to recognise both a lib/.jar extension and a custom core.properties file? I bypassed the issue of the core.properties by amending the update.a

Re: ZooKeeper error in solr.log

2019-09-02 Thread Shawn Heisey
On 9/2/2019 6:04 AM, Gell-Holleron, Daniel wrote: The error I get in the log is below: 2019-09-02 09:17:22.812 ERROR (qtp1192171522-16) [ ] o.a.s.h.RequestHandlerBase java.lang.ArrayIndexOutOfBoundsException: 1 at org.apache.solr.handler.admin.ZookeeperStatusHandler.monitorZookeeper(

Re: SolrNet...multiValued.

2019-09-02 Thread Shawn Heisey
On 9/2/2019 8:58 AM, Britto Raj wrote: I am working MNC and doing Prototype using SolrNet and Solr. I have few questions and got stuck not able to move forward.. 4. When i try to access using SolrQueryResults results = solr.Query(new SolrQuery("title:\"changeme4\"")); It throws e

Re: ZooKeeper error in solr.log

2019-09-02 Thread Shawn Heisey
On 9/2/2019 4:03 AM, Gell-Holleron, Daniel wrote: I’m having trouble retrieving ZK status in SolrCloud. As far as I know, I’ve followed all configuration instructions. The ZK_HOST, ZK_CLIENT_TIMEOUT and SOLR_HOST settings are configured along with a myid file in the data directory. The zoo.cfg

Re: Undefined field - solr 7.2.1 cloud

2019-08-29 Thread Shawn Heisey
On 8/29/2019 1:22 PM, Antony A wrote: I do restart Solr after changing schema using "solr zk upconfig". I am yet to confirm but I do have a daily cron that does "delta" import. Does that process have any bearing on some cores losing the field? Did you restart all the Solr servers? If the colle

Re: Undefined field - solr 7.2.1 cloud

2019-08-29 Thread Shawn Heisey
On 8/29/2019 11:26 AM, Antony A wrote: Hi, I am running on Solr cloud 7.2.1. I have 4 core collection. The fields are available in the schema.xml in solr admin UI. This tells me zookeeper has the correct schema. But unfortunately only the leader core has the correct response to the query with th

Re: Question: Solr perform well with thousands of replicas?

2019-08-29 Thread Shawn Heisey
On 8/28/2019 9:27 PM, Hongxu Ma wrote: I have a solr-cloud cluster, but it's unstable when collection number is big: 1000 replica/core per solr node. To solve this issue, I have read the performance guide: https://cwiki.apache.org/confluence/display/SOLR/SolrPerformanceProblems I noted there i

Re: What are the risk of running into "Unmap hack not supported on this platform"

2019-08-28 Thread Shawn Heisey
On 8/28/2019 1:58 PM, Pushkar Raste wrote: What does this exception really affect. I believe it is related in some way to how Lucene uses Java's MMAP capability to access data on disk. The MMAP functionality that Lucene uses required changes to properly support later Java versions. There w

Re: Turn off CDCR for only selected target clusters

2019-08-28 Thread Shawn Heisey
On 8/28/2019 1:42 PM, Arnold Bronley wrote: I have configured the SolrCloud collection-wise only and there is no other way. The way you have defined 3 zkHosts (comma separated values for zkHost property), I tried that one before as it was more intuitive. But it did not work for me. I had to use 3

Re: What are the risk of running into "Unmap hack not supported on this platform"

2019-08-28 Thread Shawn Heisey
On 8/27/2019 8:22 AM, Pushkar Raste wrote: I am trying to run Solr 4 on JDK11, although this version is not supported on JDK11 it seems to be working fine except for the error/exception "Unmap hack not supported on this platform". What the risks/downsides of running into this. The first version

Re: Problem of Shutdown Process for Windows Server

2019-08-28 Thread Shawn Heisey
On 8/28/2019 4:01 AM, Kayak28 wrote: I use Solr with Windows servers, and cannot shutdown Solr successfully. When I try to stop Solr using solr.cmd, which is kicked from Windows Task Manager, it "looks" like Solr stops without any problem. Here "looks" means that at least log file that Solr wrote

Re: Require searching only for file content and not metadata

2019-08-28 Thread Shawn Heisey
On 8/27/2019 7:18 AM, Khare, Kushal (MIND) wrote: Basically, what problem I am facing is - I am getting the textual content + other metadata in my _text_ field. But, I want only the textual content written inside the document. I tried various Request Handler Update Extract configurations, but n

Re: Max number of cores in solr

2019-08-28 Thread Shawn Heisey
On 8/28/2019 12:55 AM, Vignan Malyala wrote: Im planning to create separate core for each of my client in solr. Can I create around 500 cores in solr. Is it a good idea? For each client i have around 10 records on average currently. There is no limit that I know of to the number of cores.

Re: Query number of Lucene documents using Solr?

2019-08-26 Thread Shawn Heisey
On 8/26/2019 2:19 PM, Bram Van Dam wrote: Possibly somewhat unusual question: I'm looking for a way to query the number of *lucene documents* from within Solr. This can be different from the number of Solr documents (because of unmerged deletes/updates/ etc). As a bit of background; we recently
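For illustration, the difference the question describes can be read from index statistics that report both numDocs (live documents) and maxDoc (live plus deleted-but-not-yet-merged documents). The sketch below assumes a dictionary with those two keys, such as the index section of a core status response; the sample numbers are invented.

```python
def deleted_doc_count(index_info):
    """Documents still held in segments but marked deleted."""
    return index_info["maxDoc"] - index_info["numDocs"]


if __name__ == "__main__":
    # Invented sample values, shaped like an "index" statistics section.
    sample = {"numDocs": 17_500_000, "maxDoc": 19_000_000}
    print("documents visible to searches:", sample["numDocs"])
    print("documents held at the Lucene level:", sample["maxDoc"])
    print("deleted but unmerged:", deleted_doc_count(sample))
```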

Re: How to enter a search bar in the results screen

2019-08-23 Thread Shawn Heisey
On 8/23/2019 1:08 PM, alee2 wrote: maybe this thank you later better Whatever that is, it's not part of Solr. You're going to need to talk to whoever wrote the software that drives that website or program. Thanks, Shawn

Re: How to enter a search bar in the results screen

2019-08-23 Thread Shawn Heisey
On 8/23/2019 12:59 PM, alee2 wrote: I need to insert a search bar on the screen that generates the list of results, could anyone help? Can you be more specific? If you're talking about Solr's admin UI, that is not designed to be exposed to end users. End users should never have direct access

Re: Solr process getting killed suddenly

2019-08-21 Thread Shawn Heisey
On 8/21/2019 4:44 PM, Adriano Rogério de O. Carolino de Melo wrote: Hi, does anybody know why Solr Java process is terminated with no reason? What OS is it running on? Maybe it's the OS killing it. How did you install/start Solr, and what version is it? The OOM script does not run. In the

Re: Solr crash | GC issue

2019-08-16 Thread Shawn Heisey
On 8/16/2019 11:59 AM, Rohan Kasat wrote: I see the OOM file getting created with "not much heap space" as the error Can you get the precise error cause? I haven't ever seen that particular text before. If you can paste the entire error (which will be many lines), that can be helpful. Sh

Re: Solr cloud questions

2019-08-16 Thread Shawn Heisey
On 8/15/2019 8:14 AM, Kojo wrote: I am starting to think that my setup has more than one problem. As I said before, I am not balancing my load to Solr nodes, and I have eight nodes. All of my web application requests go to one Solr node, the only one that dies. If I distribute the load across the

Re: Solr crash | GC issue

2019-08-16 Thread Shawn Heisey
On 8/16/2019 8:23 AM, Rohan Kasat wrote: I have a Solr Cloud setup of 3 solr servers 7.5 version. 24GB heap memory is allocated to each solr server and i have around 655 GB of data in indexes to be searched for. Few last 2-3 days, the solr servers are crashing and am able to see the heap memory

Re: HttpShardHandlerFactory

2019-08-16 Thread Shawn Heisey
On 8/16/2019 3:51 AM, Mark Robinson wrote: I am trying to understand the socket time out and connection time out in the HttpShardHandlerFactory:- 10 20 The shard handler is used when that Solr instance needs to make connections to another Solr i

Re: SOLR / Zookeeper Compatibility

2019-08-14 Thread Shawn Heisey
On 8/14/2019 11:46 AM, Paul Russell wrote: We have staged SOLR 6.x and 7.x environments with ZK V3.5.5 as requested and it does not appear to work. If we back off to ZK 3.4.x it works fine. The ZK project has documentation about backward compatibility: https://cwiki.apache.org/confluence/displ

Re: Solr cloud questions

2019-08-13 Thread Shawn Heisey
On 8/13/2019 9:28 AM, Kojo wrote: Here are the last two gc logs: https://send.firefox.com/download/6cc902670aa6f7dd/#Ee568G9vUtyK5zr-nAJoMQ Thank you for that. Analyzing the 20MB gc log actually looks like a pretty healthy system. That log covers 58 hours of runtime, and everything looks ver

Re: Solr cloud questions

2019-08-13 Thread Shawn Heisey
On 8/13/2019 6:19 AM, Kojo wrote: -- tail -f node1/logs/solr_oom_killer-8983-2019-08-11_22_57_56.log Running OOM killer script for process 38788 for Solr on port 8983 Killed process 38788 -- Based on what I can see, a 6GB heap is not big enough for the setup you've got

Re: Moving to solrcloud from single instance

2019-08-12 Thread Shawn Heisey
On 8/12/2019 1:42 PM, Erie Data Systems wrote: I am starting the planning stages of moving from a single instance of solr 8 to a solrcloud implementation. Currently I have a 148GB index on a single dedicated server w 96gb ram @ 16 cores /2.4ghz ea. + SSD disk. The search is fast but obviously th

Re: Solr cloud questions

2019-08-12 Thread Shawn Heisey
On 8/12/2019 5:47 AM, Kojo wrote: I am using Solr cloud on this configuration: 2 boxes (one Solr in each box) 4 instances per box Why are you running multiple instances on one server? For most setups, this has too much overhead. A single instance can handle many indexes. The only good reas

Re: solr: java.nio.file.accessdeniedexception

2019-08-12 Thread Shawn Heisey
On 8/12/2019 6:44 AM, Rathor, Piyush wrote: We are facing following issue in data update on solr: java.nio.file.accessdeniedexception in solr cloud https://cwiki.apache.org/confluence/display/solr/UsingMailingLists We will need considerably more detail. Exceptions from Java are MANY lines l

Re: java.nio.file.accessdeniedexception

2019-08-11 Thread Shawn Heisey
On 8/11/2019 8:52 AM, Rathor, Piyush wrote: We are facing following issue in data update on solr:  java.nio.file.accessdeniedexception Attachments almost never make it to the list. We cannot see the picture you sent. https://www.dropbox.com/s/6axm2g1if4cwffs/solr-user-message-view.png?dl=0

Re: Enumerating cores via SolrJ

2019-08-09 Thread Shawn Heisey
On 8/9/2019 3:07 PM, Mark H. Wood wrote: Did I miss something, or is there no way, using SolrJ, to enumerate loaded cores, as: curl 'http://solr.example.com:8983/solr/admin/cores?action=STATUS' does? This code will do so. I tested it. public static void main(String[] args) throws SolrS
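The SolrJ code in the reply is cut off here, so the following is not a reconstruction of it. It is a separate sketch, in Python rather than Java, of what the curl command in the question does: request the CoreAdmin STATUS action and read the core names from the "status" section of the JSON response. The sample payload is trimmed and invented.

```python
import json
from urllib.parse import urlencode


def core_status_url(base_url):
    """URL for the CoreAdmin STATUS action, asking for a JSON response."""
    return f"{base_url}/admin/cores?" + urlencode({"action": "STATUS", "wt": "json"})


def core_names(status_json):
    """Extract the loaded core names from a STATUS response body."""
    return sorted(json.loads(status_json).get("status", {}))


if __name__ == "__main__":
    print(core_status_url("http://solr.example.com:8983/solr"))
    # Invented, heavily trimmed response body for illustration.
    sample = '{"responseHeader": {"status": 0}, "status": {"users": {}, "books": {}}}'
    print(core_names(sample))
```

Fetching the URL with urllib.request.urlopen and passing the body to core_names would list the cores of a live server.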

Re: Indexed Data Size

2019-08-09 Thread Shawn Heisey
On 8/9/2019 12:17 PM, Moyer, Brett wrote: The biggest is /data/solr/system_logs_shard1_replica_n1/data/index, files with the extensions I stated previously. Each is 5gb and there are a few hundred. Dated by to last 3 months. I don’t understand why there are so many files with such small indexe

Re: [jira] [Commented] (SOLR-9952) S3BackupRepository can install Solr keeping S3 as storage(neither OS filesystem not hdfs).

2019-08-09 Thread Shawn Heisey
On 8/9/2019 10:16 AM, Suryakant Jadhav wrote: I am trying to configure Solr with S3. Could you please guide me step by step configuration for setting this up. Can you see if we can install Solr keeping S3 as storage(neither OS file system not hdfs). Changes for that issue have not yet been com

Re: Indexed Data Size

2019-08-09 Thread Shawn Heisey
On 8/9/2019 6:12 AM, Moyer, Brett wrote: Thanks! We update each index nightly, we don’t clear, but bring in New and Deltas, delete expired/404. All our data are basically webpages, so none are very large. Some PDFs but again not too large. We are running Solr 7.5, hopefully you can access the

Re: Indexed Data Size

2019-08-08 Thread Shawn Heisey
On 8/8/2019 3:17 PM, Moyer, Brett wrote: In our data/solr//data/index on the filesystem, we have files that go back 1 year. I don’t understand why and I doubt they are in use. Files with extensions like fdx,cfe,doc,pos,tip,dvm etc. Some of these are very large and running us out of server spac
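When an index directory is unexpectedly large, a per-extension size summary shows which kinds of segment files take the space. A minimal sketch; the directory path is supplied by the caller and nothing here is specific to a Solr version.

```python
import os
import sys
from collections import defaultdict


def size_by_extension(index_dir):
    """Sum file sizes under index_dir, grouped by file extension."""
    totals = defaultdict(int)
    for root, _dirs, files in os.walk(index_dir):
        for name in files:
            ext = os.path.splitext(name)[1].lstrip(".") or "(none)"
            totals[ext] += os.path.getsize(os.path.join(root, name))
    return dict(totals)


if __name__ == "__main__":
    directory = sys.argv[1] if len(sys.argv) > 1 else "."
    report = size_by_extension(directory)
    for ext, size in sorted(report.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{ext:10s} {size / 1_000_000:12.1f} MB")
```

The script only reads file metadata. Files in a live index directory should not be deleted by hand, even when they look old.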

Re: Query regarding Solr Search

2019-08-07 Thread Shawn Heisey
On 8/7/2019 6:39 AM, Khare, Kushal (MIND) wrote: Hello people! Hope you are all doing well! I am new to the Solr server and want to use it for content search in one of my applications. I have already been working on it for quite a few days, and have the basics done. The issue that

Re: Issue : Solr Server becoming Un-responsive

2019-08-07 Thread Shawn Heisey
On 8/7/2019 7:22 AM, Abhimeet, Kumar wrote: We are facing an issue of Solr becoming unresponsive after every 2-3 Hrs. We recently increased the allocated JVM size on Solr up to 11GB, since then the issue is very frequent. Find the attached thread dumps. I'm amazed that the attachment made

Re: NRT for new items in index

2019-08-06 Thread Shawn Heisey
On 7/31/2019 6:47 AM, profiuser wrote: We have about 400 000 000 items in a Solr collection. We have set the autoCommit property for this collection to 15 minutes. It is a big collection and we are using some caches etc. Therefore we have a big autoCommit value. I would set autoCommit to 60 se

Re: SOLR 8.1.1 index on pdate field included in search results

2019-08-05 Thread Shawn Heisey
On 8/5/2019 10:37 AM, Hodder, Rick wrote: ExpirationDate is supposed to be there, but IDX_ExpirationDate should not. I know that I can probably keep using date, but it is deprecated, and part of the reason for upgrading to 8.1.1 is to use the latest non-deprecated stuff ;-) The DatePointField

Re: Difference between search results from Solr 5 and 8

2019-08-05 Thread Shawn Heisey
On 8/5/2019 7:34 AM, Alexander Sherbakov wrote: Which stopped working in Solr 8, so we moved this to solrconfig.xml as: AND Now, this search gives 0 results while previously it worked fine and returned 2 records: [ path=select parameters={fq: ["type:Member"], sort: "score desc", q: "u...@gmail.

Re: Solr 7.6.0: PingRequestHandler - Changing the default query (*:*)

2019-08-05 Thread Shawn Heisey
On 8/4/2019 10:15 PM, dinesh naik wrote: My question is regarding the custom query being used. Here I am querying for the field _root_, which is available in all of my cluster and defined as a string field. The result for _root_:abc might not get me any match as well (I am OK with not finding any match

Re: SolrCloud recommended I/O RAID level

2019-08-04 Thread Shawn Heisey
On 8/4/2019 8:53 AM, Kaminski, Adi wrote: Erick - thanks a lot for answering and sharing the below article, it's very helpful ! I have another follow-up question - assuming we have 400 vCPUs across our SolrCloud cluster nodes, will it be better to have 400 shards with replication factor 2 or

Re: SolrCloud recommended I/O RAID level

2019-07-30 Thread Shawn Heisey
On 7/30/2019 12:12 PM, Kaminski, Adi wrote: Indeed RAID10 with both mirroring and striping should satisfy the need, but per some benchmarks in the network there is still an impact on write performance on it compared to RAID0 which is considered as much better (attaching a table that summarizes

Re: Solr Backup

2019-07-30 Thread Shawn Heisey
On 7/30/2019 7:11 AM, Jayadevan Maymala wrote: We will need the *FULL* error message. It is probably dozens of lines long and MIGHT contain multiple "Caused by" sections. { "responseHeader":{ "status":500, "QTime":22}, "Operation backup caused exception:":"java.nio.file.NoSuchF

Re: Solr Backup

2019-07-30 Thread Shawn Heisey
On 7/30/2019 5:41 AM, Jayadevan Maymala wrote: We have a 3-node Solr cluster running on google cloud platform. I would like to schedule a backup and have been trying the backup API and getting java.nio.file.NoSuchFileException:java.nio.file.NoSuchFileException error. I suspect it is because a sha

Re: [SOLR] - Best Practices/node down

2019-07-26 Thread Shawn Heisey
On 7/26/2019 7:16 AM, Rodrigo Oliveira wrote: I have a Solr cluster with ZooKeeper (5 nodes - 48 GB each node - Xms: 28 GB - Xmx: 32 GB). The bigger problem is my environment, because I am in the process of migrating from MySQL to Solr. Solr is not intended as a primary data store. There are

Re: CDCR Queues API invocation with CloudSolrclient

2019-07-24 Thread Shawn Heisey
On 7/24/2019 3:50 PM, Natarajan, Rajeswari wrote: Hi, With the below API, the QueryResponse sometimes has the "lastTimestamp" and sometimes does not. protected static QueryResponse getCdcrQueue(CloudSolrClient client) throws SolrServerException, IOException { ModifiableSolrParams params = ne

Re: SOLR 6.6 with MS-SQL.

2019-07-24 Thread Shawn Heisey
On 7/24/2019 11:31 AM, Fiz Ahmed wrote: We are using Apache Solr 6.6 stand-alone currently in a number of locations. Most indexes are holding 250,000 to 400,000 documents. Our data comes from MS-SQL. We’re using a front-end JavaScript solution to communicate with Solr to perform querie

Re: Apache Solr 8 behaviour when ZK data is deleted

2019-07-24 Thread Shawn Heisey
On 7/24/2019 12:08 PM, Prince Manohar wrote: If ZooKeeper is wiped of its data, it looks like Solr also deletes all the indexes. I wanted to know if it is normal that if the configsets related to a collection are not inside ZooKeeper, then its Solr indexes are deleted from the file sy
