Re: Regarding indexing data in different cores or same core with different entities.

2022-04-10 Thread Shawn Heisey
On 4/10/2022 2:31 PM, Neha Gupta wrote: I want to query these tables differently in Solr as they don't have any relation between them. Could you please tell whether i should create different cores for each table or should i indexed them in one core with different entities. If latter is the case

Re: Timeouts with Replicas in Solrcloud

2022-04-11 Thread Shawn Heisey
On 4/11/2022 3:15 AM, Nik Osvalds wrote: When we were designing the system, we played around autoCommit and softCommit settings following this guide . This was mainly to optimize the initial bulk l

Re: Upgrading log4j from v2.16.0 to 2.17.0

2022-04-14 Thread Shawn Heisey
On 4/14/22 03:47, Kunal Tidake wrote: We have upgraded solr to v8.11.1 which includes log4j v2.16.0. We want to upgrade log4j to v2.17.1. So is it possible to upgrade log4j from v2.16.0 to v2.17.0 , Please let us know. Yes.  Just replace all the log4j jars with newer ones. We are not releasi

Re: Apache Solr 8.11.1 and Log4J Vulnerability

2022-04-14 Thread Shawn Heisey
On 4/14/2022 11:59 AM, Tate, Justina (DTMB) wrote: Can you please explain how we can go about upgrading Log4J to greater than 2.16.0. Just replace the jars in the Solr install directory with newer versions obtained directly from the log4j project. But there's no need.  Solr is not vulnerabl

Re: Apache Solr 8.11.1 and Log4J Vulnerability

2022-04-14 Thread Shawn Heisey
On 4/14/2022 6:14 PM, Shawn Heisey wrote: If you need to check a compliance box saying you dealt with a nonexistent vulnerability, just replace the jars as I already said. If you want to get really adventurous, you could clone the git repo, check out branch_8_11, and build it yourself.  That

Re: Dismax query parser and wildcards

2022-04-14 Thread Shawn Heisey
On 4/14/2022 7:12 PM, Jeremy Buckley - IQ-C wrote: qt=/select {!lucene df=mpno v='fo4a*'} returns lots of results {!dismax df=mpno v='fo4a*'} returns *zero *results {!edismax df=mpno v='fo4a*'} returns lots of results What is wrong with dismax? The dismax parser doesn't do everything.  Wh

Re: Apache Solr 8.11.1 and Log4J Vulnerability

2022-04-14 Thread Shawn Heisey
On 4/14/2022 7:18 PM, Shawn Heisey wrote: https://paste.elyograg.org/view/ed0f1b1e The required steps are found in the first 33 lines.  The remaining 43000 lines is the whole build. To be very specific, the commands I did are on lines 1, 9, 10, 14, 32, and 33.

Re: Solr - Spring cloud – CVE-2022-22963

2022-04-15 Thread Shawn Heisey
On 4/15/22 00:41, Rajath Banagi Ravindra wrote: We are using Solr 7.5 version and Solr 6.4 version in our applications. Wanted to check if there is any impact due to vulnerability - CVE-2022-22963. I did googling and didn’t find any clear answers regarding same. Are Solr instances vulnerable a

Re: Is solr what I want, or something else?

2022-04-16 Thread Shawn Heisey
On 4/16/2022 8:25 PM, Jay Scott wrote: I hope that the search software lets me search for combinations of words; I've been assuming that's built in. Yes, most likely Solr will handle this need. I want to do all of this locally -- not use the cloud or anything like that. mnogoSearch worked ok

Re: Is solr what I want, or something else?

2022-04-17 Thread Shawn Heisey
On 4/17/2022 10:55 AM, Jay Scott wrote: i'm going to watch some tutorials and see if they'll show me what i need. i still have a feeling solr is much more than what i need, but, oh, well. it won't hurt me to learn something new. let me do some homework; if i need more help i'll ask. thanks to

Re: property substitution in the solrconfig.xml

2022-04-21 Thread Shawn Heisey
On 4/21/22 03:24, Stefan wrote: After setting all up like this and querying documents on the name field, the scores of the documents are wrong because the weight from the core.properties is used instead of the weight from the configoverlay.json. As far as I am aware, the configoverlay.json file

Re: SOLR Service down issue - NSSM.

2022-04-25 Thread Shawn Heisey
On 4/25/22 09:36, Fiz N wrote: Hello Experts, Currently we have installed SOLR 8.11.1 services through NSSM Service manager (version 2.24). NSSM Service manager is mainly used for managing windows services which includes automatically starting and stopping the SOLR Services. As a part of SOL

Re: SOLR Service down issue - NSSM.

2022-04-25 Thread Shawn Heisey
On 4/25/22 11:49, dmitri maziuk wrote: On 2022-04-25 12:35 PM, Shawn Heisey wrote: On 4/25/22 09:36, Fiz N wrote: *SOLR-8.11.1: Unexpected status SERVICE_STOP_PENDING in response to STOP control.* Have you talked to the NSSM project to see what they can tell you about what their software

Re: I can index documents when running Solr 8.11.1 in stand-alone mode, but not when using solr in cloud mode

2022-04-25 Thread Shawn Heisey
On 4/25/2022 9:51 AM, Heller, George A III CTR (USA) wrote: Error 404 Not Found HTTP ERROR 404 Not Found URI:/solr/xlm_core_pAp/update/extract This 404 error indicates that either the collection does not exist or the /update/extract handler is not defined in the solrconfig.xml file tha

Re: I can index documents when running Solr 8.11.1 in stand-alone mode, but not when using solr in cloud mode

2022-04-25 Thread Shawn Heisey
On 4/25/2022 8:33 PM, Shawn Heisey wrote: and it looks like your message actually never made it to the users list Or it DID make it and I am just blind. :)

Re: Solr GC Tuning causes issues and doesn't start Solr url

2022-04-27 Thread Shawn Heisey
On 4/27/22 05:08, Deeksha Shrivastava wrote: The "etc" itself is included in the setting format. Please refer screenshot below: The screenshot says "set GC_TUNE="-XX:NewRatio=3 -XX:SurvivorRatio=4 etc." ... it does not say what the "etc" is, and we are going to need that information.

Re: Problem with indexing a String field in SOLR.

2022-04-29 Thread Shawn Heisey
On 4/28/22 14:16, Neha Gupta wrote: I am not using dynamic fields and schema is as below. That isn't the schema.  It's a screenshot of the Schema Browser in the admin UI, showing information from one field. A better way of asking for what is needed:  We need the XML file containing the sche

Re: Solr - frequent OOM

2022-05-03 Thread Shawn Heisey
On 5/3/2022 5:01 PM, Vincenzo D'Amore wrote: I'm tuning a solrcloud 5.4.1 deployment (3 nodes, 12 cores each, 18GB ram) that is experiencing frequent OutOfMemoryError (20 a day in total) exceptions during the execution of a group query. Looking at query group.limit=1 but the rows range between 1

Re: Solr GC Tuning causes issues and doesn't start Solr url

2022-05-04 Thread Shawn Heisey
On 5/4/22 02:50, Deeksha Shrivastava wrote: We have java installed on our dev environment/local environment but its giving the same issue on this particular environment as well. We need more information to be able to help.  This will be the starting point: Detailed information about how you

Re: Solr GC Tuning causes issues and doesn't start Solr url

2022-05-16 Thread Shawn Heisey
On 5/16/22 05:45, Deeksha Shrivastava wrote: Even after installing java/jre on the machine, solr GC Tuning setting is not working. Can you please provide some resolution on the same and please let us know what information is needed to investigate this issue further. You could start with the i

Re: Why OR in query does not work sometimes?

2022-05-16 Thread Shawn Heisey
On 5/16/2022 2:32 PM, WU, Zhiqing wrote: (Note: prodAllGeneric_txt_sort is a field, X is the details of a query) If X is "(prodAllGeneric_txt_sort:\"Phi; \")", Solr finds 3 documents If X is "(-prodAllGeneric_txt_sort:*)", Solr finds 8 documents If X is "(prodAllGeneric_txt_sort:\"Phi; \") OR (-

Re: Question about Zookeeper architecture

2022-05-17 Thread Shawn Heisey
On 4/29/22 02:17, DAVID MARTIN NIETO wrote: I make a similar question time ago and get this answer: Expanding a little bit on what I wrote before: If you have three ZK servers, you can lose one and remain fully operational. If you have four ZK servers, you can still only lose one -- if you l

Re: Shard Split and composite id

2022-05-18 Thread Shawn Heisey
On 5/18/22 08:42, Hasmik Sarkezians wrote: Have a question about shard splitting and compositeId usage. We are starting a solr collection with X number of shards for our multi-tenant application. We are assuming that the number of shards will increase over time as the number of customers grows as

Re: [External Email] Re: Shard Split and composite id

2022-05-18 Thread Shawn Heisey
On 5/18/22 10:56, Hasmik Sarkezians wrote: Thanks for the reply. It doesn't matter to me which shard the document ends up in, just matters how many shards the document ends up with: And seems like I wouldn't have control over that as the number of shards grows. I've been thinking about some d

Re: Solr GC Tuning causes issues and doesn't start Solr url

2022-05-20 Thread Shawn Heisey
On 5/20/22 01:46, Ruchi Gupta wrote: Even after installing java/jre on the machine, solr GC Tuning setting is not working. We tried to enable it but then our solr service is not getting start. Can you please provide some resolution on the same and please let us know what information is needed

Re: Schema field type property - uninvertible

2022-05-20 Thread Shawn Heisey
On 5/19/22 01:13, Vincenzo D'Amore wrote: As far as I understand, we should always set the property uninvertible=false to avoid that Solr builds "up large in memory data structure to serve in place of DocValues" and this is good for "stability", not explaining exactly what it means. Could anyone

Re: Solr latest 9.0.0 issue

2022-05-21 Thread Shawn Heisey
On 5/21/2022 1:05 AM, dev disoza wrote: We have added a host ip with SOLR_HOST in the solr configuration file as before. $ bin/init.d/solr status -p 8988 Solr at http://192.168.100.222:8988/solr not online. Running bin/init.d/solr could be problematic depending on exactly how things were inst

Re: Solr latest 9.0.0 issue

2022-05-21 Thread Shawn Heisey
On 5/21/2022 8:02 AM, Shawn Heisey wrote: If you have installed the solr service, then you would want to do "service solr status" instead, replacing solr with whatever you actually named the service.  Did you install the service with the service installer script?  What options d

Re: Solr latest 9.0.0 issue

2022-05-21 Thread Shawn Heisey
On 5/21/2022 9:04 AM, Shawn Heisey wrote: SOLR_JETTY_HOST=192.168.100.2 I didn't type the address right.  Noticed the error half a second after I clicked Send.

Re: Facing issue in enabling SSL on solr v8.11.1

2022-05-23 Thread Shawn Heisey
On 5/23/2022 9:47 AM, Anchal Sharma2 wrote: ERROR ON BROWSER An error occurred during connection to hostname:solrport. Cannot communicate securely with peer . No common encryption algorithm(s). Error Code: SSL_ERROR_NO_CYPHER_OVERLAP It means that the browser and the server (Jetty in this case

Re: Schema field type property - uninvertible

2022-05-24 Thread Shawn Heisey
On 5/24/22 07:40, Vincenzo D'Amore wrote: Just another question, so having a new schema.xml, for the "id" field and the other fields that are pint/plong/string/etc.. (i.e. have "docValues=true") should I apply uninvertible=false ? Yes.  If the field has docValues then there is no reason for it

Re: Solr GC Tuning causes issues and doesn't start Solr url

2022-05-24 Thread Shawn Heisey
On 5/24/22 06:17, Deeksha Shrivastava wrote: Solr set up was done long time back. Currently Solr is deployed on our QA and PROD servers, and our website's search functionality is working fine as of now. The Solr version installed is 7.2.1, previously java/jre was not installed but we did the i

Re: How could I get the bucket (or group) which value of the grouping field is null by using Facet API?

2022-05-24 Thread Shawn Heisey
On 5/24/22 13:48, WU, Zhiqing wrote: Is there any setting in Facet API I could change to get the 'field = null' group? I found a "facet.missing" parameter which might do what you're after. https://solr.apache.org/guide/8_8/faceting.html#field-value-faceting-parameters Thanks, Shawn

Re: SOLR 7-8 Upgrade. Now getting OOM when updating content

2022-05-24 Thread Shawn Heisey
On 5/24/2022 5:42 PM, Paul Russell wrote: --A: Upgraded SOLR on the exiting servers utilizing the current indexes. We have plans to reindex and flatten the structure but we're not ready for that yet. With the release of SOLR 9 we need to get to a supported version now. We had planned on reindexin

Re: SOLR 7-8 Upgrade. Now getting OOM when updating content

2022-05-25 Thread Shawn Heisey
On 5/24/2022 10:33 PM, Paul Russell wrote: Thanks for the link. We’ll see what the patch provides for additional info. We are aware of the need to reindex for solr 9. We will reindex to a separate instance. Are there performance benefits to using the core upgrade utility when going from 7.7.x to

Re: Create a core via SolrClient, single server

2022-05-25 Thread Shawn Heisey
On 5/25/2022 3:25 PM, Christopher Schultz wrote: I have a non-clustered/ZK Solr instance and I'd like to create a core using the Java SolrClient library. Is that currently possible? I only see methods for working with documents in the current core (selected when the client object is initially c

Re: Regarding solr field cache

2022-05-26 Thread Shawn Heisey
On 5/26/22 08:54, Poorna Murali wrote: I would like to know when will a solr field cache be invalidated or cleared? Lucene will throw the existing fieldCache away whenever a new searcher is created.  A new searcher is created by autoSoftCommit, by explicit hard commits unless you disable it

Re: Regarding solr field cache

2022-05-26 Thread Shawn Heisey
On 5/26/22 10:14, Poorna Murali wrote: In our configuration, we are not using soft commit at all and autocommit is done by hard commit with opensearcher = false which will not create a new searcher. If you never do a commit that opens a new searcher, then you will never see the result of adds,

Re: Regarding solr field cache

2022-05-26 Thread Shawn Heisey
On 5/26/22 11:34, Poorna Murali wrote: Thanks Shawn for the information. Is it possible that the segment merging which happens during hardcommit can open a new searcher thereby making index updates visible during search? No, segment merging does not create a new searcher.  In order to utilize

Re: Solr Release Roadmap - any advice on when to upgrade from 8.11.1 to 9.x.x?

2022-05-27 Thread Shawn Heisey
On 5/27/22 05:41, Sebastian Riemer wrote: I'm working out upgrade strategies for all the libraries and components we use and wonder what would be a good time to upgrade from 8.11.1 to 9.x.x? We use it in conjunction with SolrJ. Currently running Solr 8.11.1 on Ubuntu 20.04 and Java 11. Is th

Re: Apache Solr 8.11. 1 on CentOS Linux release 7.9.2009 (Core)

2022-05-29 Thread Shawn Heisey
On 5/28/2022 10:19 PM, Kaushal Shriyan wrote: I am running Apache Solr 8.11.1 on CentOS Linux release 7.9.2009 (Core) using zookeeper to make it highly available. I have a couple of questions. 1. Is there a performance tuning to be carried out as I am using the default settings? Withou

Re: SolrJ connection handling, best practice(s)?

2022-05-30 Thread Shawn Heisey
On 5/30/2022 12:30 AM, Clemens WYSS (Helbling Technik) wrote: As you can see * we keep one SolrClient-connection per core in a map and make use of these connections * making use of HttpSolrClient ( Http2SolrClient is denoted "experimental" and has cause some problem ) * no special tweaking of u

Re: SolrJ connection handling, best practice(s)?

2022-05-30 Thread Shawn Heisey
On 5/30/2022 7:06 AM, Shawn Heisey wrote: What version of SolrJ are you using in your code?  For a long time, SolrJ defaulted to an HttpClient that only allowed two threads per destination.  Then that was greatly increased.  Now from what I can decipher in current 8.x code using the Builder

Re: SolrJ compatibility

2022-05-31 Thread Shawn Heisey
On 5/31/22 03:37, Gohlke, Alexander wrote: Is SolrJ client in version 4.10.4 is compatible with a Solr server 8.11.1 or 9.0.0 ? Any information would be very appreciated. If you're not running in SolrCloud mode, that will probably work. When running SolrCloud, the client is MUCH more integrat

Re: SolrJ connection handling, best practice(s)?

2022-05-31 Thread Shawn Heisey
On 5/31/2022 9:36 AM, Christopher Schultz wrote: Thread pool or connection pool? Usually, you want your application threads to be waiting on that data, not handing execution off to another thread, only to wait for it to complete. Any asynchronous environment should be providing its own threads.

Re: concatinating fields via schema

2022-06-01 Thread Shawn Heisey
On 6/1/2022 3:27 AM, Yirmiyahu Fischer wrote: I tried BrandName_s ManufacturerNo_s BrandMfgConcat_t BrandMfgConcat_t However, after indexing, the field BrandMfgConcat_t do

Re: Create a core via SolrClient, single server

2022-06-01 Thread Shawn Heisey
On 6/1/2022 11:41 AM, Christopher Schultz wrote: How can I provide the schema for the core once it's been created? Can I use the API for that, or do I have to resort to pushing the config file directly similar to these kindx of curl commands: curl -d "{ ... config }" \    ${SCHEME}://localhost

Re: Create a core via SolrClient, single server

2022-06-01 Thread Shawn Heisey
On 6/1/2022 3:34 PM, Christopher Schultz wrote: So I tried this with configSet=_default and I /did/ get a core created. I didn't get the same thing I got from the CLI: This is what I get from "solr create -c test_core": Using bin/solr to create a core does it in multiple steps.  It creates t

Re: Create a core via SolrClient, single server

2022-06-01 Thread Shawn Heisey
On 6/1/2022 6:31 PM, Shawn Heisey wrote: The end result is the same ... except in the second case, it references the configset by name, which will be in the created core.properties file.  If you were to change the config in the configset directory and then reload each core, test_core would

Re: JDK version to build source

2022-06-03 Thread Shawn Heisey
On 6/3/2022 8:29 AM, Rahul Goswami wrote: I am trying to build 8.11.1 from source. Which JDK version should I use? I have successfully built it with OpenJDK 8 and OpenJDK 11.  I have not tried anything newer and would not be surprised if it doesn't work.  JDK version 8 is the absolute minimum

Re: JDK version to build source

2022-06-03 Thread Shawn Heisey
On 6/3/2022 1:18 PM, Rahul Goswami wrote: Thanks for the detailed response Shawn. I intend to run the code on JRE 11. However the build machine I have access to has JDK 8. Are there any known concerns (performance or otherwise) with code built on Java 8 and running on Java 11? Building on 8 and

Re: Failing Solr tests during build

2022-06-06 Thread Shawn Heisey
On 6/6/2022 8:29 PM, Rahul Goswami wrote: I am trying to build Solr 8.11.1 on Windows (using GitBash). Most tests execute fine, until I hit the S3 suite of testcases under contrib/s3-repository with the below exception: You do not need to run tests to build Solr.  Running tests usually results

Re: Solr indexing performance tips

2022-06-08 Thread Shawn Heisey
On 6/8/2022 3:35 AM, Marius Grigaitis wrote: * 9 different cores. Each weighs around ~100 MB on disk and has approximately 90k documents inside each. * Updating is performed using update method in batches of 1000, around 9 processes in parallel (split by core) This means that indexing within ea

Re: deleteByQuery is killing Solr servers with OOM

2022-06-08 Thread Shawn Heisey
On 6/8/2022 6:45 AM, Parag Ninawe wrote: We are using Solr 7.7.3 We have observed that deleteByQuery parameter causing sudden spikes in JVM causing OOM. What do you know about the OOM?  Is it an OS-level OOM or a Java level OOM?  If it's Java, have you seen the actual exception?  A whole bunch

Re: Solr deletes all collections after recreating (docker)

2022-06-08 Thread Shawn Heisey
On 6/8/2022 4:01 AM, Yurii Aleshchenko wrote: How can I save my data in docker volume and why solr deleted all collections, cores on startup ? It sounds like when you are recreating the setup, that you are starting with a brand new and empty ZK database. All the collection configuration is i

Re: Core Admin API - rename core issue on 8.11

2022-06-08 Thread Shawn Heisey
On 6/8/2022 5:06 PM, Shamik Bandopadhyay wrote: "msg": "Not supported in SolrCloud", Using the CoreAdmin API when running in cloud mode is a REALLY bad idea.  The CoreAdmin API cannot touch information in zookeeper. With part of the information for a SolrCloud collection being in zookeep

Re: Re-index after upgrade

2022-06-13 Thread Shawn Heisey
On 6/13/2022 10:14 AM, Christopher Schultz wrote: 1. Re: regular re-indexes. I've just built this into my web application so it's literally a one-click administrative background-process kick-off. I've been trying to get automatic schema-provisioning as well (see my recent posts to users@) just

Re: Re-index after upgrade

2022-06-13 Thread Shawn Heisey
On 6/13/2022 1:19 PM, Christopher Schultz wrote: Okay. So if I do what I initially proposed: 1. delete *:* 2. re-index everything If you do this but do not optimize the index (which will happen practically instantaneously because the index will consist of only deleted docs), then I can make

Re: Re-index after upgrade

2022-06-14 Thread Shawn Heisey
On 6/14/2022 10:22 AM, Christopher Schultz wrote: Does that mean I need to: 1. delete *:* 2. optimize 3. re-index everything Is #2 something available via the SolrJ client, or do I have to issue a REST call for that? This code should delete everything, commit, and optimize, all with a singl

Re: Solr indexing performance tips

2022-06-16 Thread Shawn Heisey
On 6/16/22 02:59, Marius Grigaitis wrote: In the end what caught our eye is a few deleteByQuery lines in stacks of running threads while Solr is overloaded. We temporarily removed deleteByQuery and it had around 10x performance improvement on indexing speed. I do not understand all the low-leve

Re: Commit behavior when softcommit is set at the server-level

2022-06-16 Thread Shawn Heisey
On 6/16/22 09:16, Christopher Schultz wrote: We have our server set to soft-commit every 30 seconds. 30 seconds is very aggressive.  It's far better than 1 second, but still quite frequent. If we know we are going to be sending a lot of documents at once, if we set "commitWithin" to someth

Re: Auto recovery of solr

2022-06-21 Thread Shawn Heisey
On 6/21/2022 4:54 AM, Poorna Murali wrote: If we have an alerting mechanism which says if a solr node is down, is it possible to configure the alert with some action which will restart that solr node and recover automatically? Please advise. I agree with Dave. When the environment is valid (OS

Re: Update/Reindex

2022-06-21 Thread Shawn Heisey
On 6/21/2022 8:29 AM, Mike wrote: Did I understand that correctly, with version 8 you cannot update a dataset, only reindex it. In the past, however, this was possible, e.g. with version 4 That really depends on exactly what you mean by "update a dataset" ... 8.x should have all the indexing c

Re: Regarding Solr auto recovery

2022-06-22 Thread Shawn Heisey
On 6/22/22 04:40, Poorna Murali wrote: Thanks everyone for the inputs on my post yesterday! It did help me to clarify the doubts. @David - I believe that the clearcache.sh you mentioned in the link ( https://www.tecmint.com/clear-ram-memory-cache-buffer-and-swap-space-on-linux/) will clear the s

Re: Facing an issue with solr 5.3.0 basic auth enablement

2022-06-23 Thread Shawn Heisey
On 6/23/2022 1:42 AM, Tamal Choudhury wrote: This worked fine, but the zookeeper is throwing the following error when we are trying to query from admin UI or from code - { "responseHeader": { "status": 401, "QTime": 5, "params": { "indent": "true", "q": "*:*", "_": "1655969026930", "wt": "json"

Re: Solr 8.1.1 - OpenJDK 11

2022-06-30 Thread Shawn Heisey
On 6/30/22 00:34, Paul Ryder wrote: Various docs seem ambiguous on Java version required for Solr 8.1.1 Solr docs says Java 8 or higher and Solr 8.1.1 was continually tested with JDK11 Sitecore docs say use only Java8 for Solr 8.4 or earlier I'm trying OpenJDK 11 with Solr 8.1.1 - mostly works

Re: Solr 8.1.1 - OpenJDK 11

2022-07-04 Thread Shawn Heisey
On 7/4/22 02:05, Paul Ryder wrote: Can anyone give me pointers to updating the jetty version on Solr 8.1.1? Is it just a case of overwriting the jetty jars in dist? If you upgrade all the jetty jars in all locations under the program server directory, that will update Jetty. But I need to c

Re: Solr eats up all the memory

2022-07-04 Thread Shawn Heisey
On 7/4/22 03:01, Mike wrote: My Solr index size is around 500GB and I have 64GB of RAM. Solr eats up all the memory and because of that PHP works very, very slowly. What can I do? Solr is a Java program.  A Java program will never directly use more memory than you specify for the max heap size

Re: Solr eats up all the memory

2022-07-05 Thread Shawn Heisey
On 7/5/2022 3:11 PM, Christopher Schultz wrote: Well, if you need more than 32GiB, I think the recommendation is to go MUCH HIGHER than 32GiB. If you have a 48GiB machine, maybe restrict to 31GiB of heap, but if you have a TiB, go for it :) I remember reading somewhere, likely for a different

Re: Data Import Handler problem in Solr 8

2022-07-06 Thread Shawn Heisey
On 7/6/22 04:32, Michał Świątkowski wrote: I checked that and collection data will be erased only when I will use clean=true and optimize=true (first query). 1. clean=true ; optimize=true webapp=/solr path=/dataimport params={core=example_collection&optimize=true&indent=on&commit=true&name=dat

Re: Solr eats up all the memory

2022-07-06 Thread Shawn Heisey
On 7/6/22 10:59, dmitri maziuk wrote: mmap() doesn't side-step disk access though, dep. on the number of of mmap'ed chunks and chunk size, it can be slow. Especially if your "disk" is an iSCSI volume on a gigabit link to a slow underprovisioned NAS. If the mounted filesystem is one that the O

Re: Solr eats up all the memory

2022-07-07 Thread Shawn Heisey
On 7/6/22 18:33, dmitri maziuk wrote: This way lieth dark magick and madness, of course, but I'm curious what the optimal config would be for a container infra. For bare metal a large PCIe SSD should be the best bang for the buck, but on kube the "disk" is probably iSCSI volumes and you may not

Re: Solr eats up all the memory

2022-07-07 Thread Shawn Heisey
On 7/6/22 16:38, Christopher Schultz wrote: Anecdotal data point: elyograg@bilbo:/usr/local/src$ ps aux | grep '\(java\|PID\)' USER PID %CPU %MEM    VSZ   RSS TTY  STAT START   TIME COMMAND solr  852288  1.0  9.5 3808952 771204 ?  Sl   Jul03  59:32 java -server -Xms512m -Xm

Re: Transfer to a new server

2022-07-11 Thread Shawn Heisey
On 7/11/22 05:28, Mike wrote: How can I transfer a 500 GB Solr index to a new server? If it's the same Solr version, just copy the index directory to the new server.  Ideally you will either shut Solr down during the copy, or make VERY sure that there are no index changes happening during th

Re: Solr Core Initialization Failure

2022-07-13 Thread Shawn Heisey
On 7/13/22 14:39, Bejoy Thankachan wrote: SolrCore Initialization Failures I have a 3 node solr 8.11.1 setup (port 8983) with 3 node external zookeeper 3.7.1 (p 2181). After I did a clean setup, I see Solr Core Initialization error in the admin console. The

Re: Autoscaling

2022-07-17 Thread Shawn Heisey
On 7/17/22 07:40, Ronen Nussbaum wrote: We are planning to migrate our Solr Cloud clusters to the cloud. Currently it is installed on-prem for each customer. It is already deployed as Docker containers. Instead of estimating in advance what is the number of shards needed, or the number of pods, w

Re: Autoscaling

2022-07-17 Thread Shawn Heisey
On 7/17/22 11:25, Kaminski, Adi wrote: For example, if we have 10 shards each 100k (1M total) documents size for best and optimized ingestion/query performance...adding more documents will make sense to have 11th shard, and reaching 1.1M total will make sense to add 12th one eventually. One

Re: Help with new install

2022-07-21 Thread Shawn Heisey
On 7/21/22 13:12, jorge hernandez wrote: SimplePostTool: WARNING: IOException while reading response: java.io.FileNotFoundException: http://localhost:8983/solr/mynescore/update/extract?resource.name=%3cpath_of_the_files> The problem here is that the _default configset does NOT create the /upda

Re: BufferUnderFlowException

2022-07-22 Thread Shawn Heisey
On 7/22/22 08:16, Hasmik Sarkezians wrote: Does anyone know what would be the reason for BufferUnderFlowException while solr is reading? We have a profiler setup and at times we are seeing a lot of exceptions related to buffer underflow exception: The image did not make it through.  The mailin

Re: solr backup location 8.11.1

2022-08-05 Thread Shawn Heisey
On 8/5/22 07:00, Thomas Woodard wrote: optimize optimize 2 00:00:20 /var/i8s/backup/solr/${i8s.environment}/${ solr.core.name} The backups after optimize are happening, but they are going to the default locations, not the configured location. For

Re: solr backup location 8.11.1

2022-08-05 Thread Shawn Heisey
On 8/5/22 07:42, Shawn Heisey wrote: I've confirmed that it isn't a path security issue, by verifying that all paths are allowed: 2022-08-05 12:29:03.873 INFO  (main) [   ] o.a.s.c.CoreContainer Allowing use of paths: [_ALL_] I missed this part of your email until after I had alrea

Re: solr backup location 8.11.1

2022-08-05 Thread Shawn Heisey
On 8/5/22 10:06, Thomas Woodard wrote: That is exactly what I was afraid of. Not being able to configure where automated backups go seems like a pretty major oversight, though. Is anyone aware of a solution other than creating a bunch of soft links? The symlink idea I had (but haven't mentioned

Re: solr backup location 8.11.1

2022-08-05 Thread Shawn Heisey
On 8/5/22 11:56, Thomas Woodard wrote: Yup, I absolutely did typo when I tried to do it as a default. I'll update my issue to correct that. It will be interesting to see whether fixing the typo makes it work.  Sometimes the code is hard to decipher, and it is always possible that it does appl

Re: Solr 8.11 ( Queries taking longer to respond on startup)

2022-08-05 Thread Shawn Heisey
On 8/5/22 14:21, Surya R wrote: When the solr dameon is restarted, the cores do appear on the admin console, but when a query is hit against the core immediately, We dont get a response, it spins for like 20 seconds and then only after i see the below message in the log, i get the results. Why is

Optimize not working, Solr version 9.1.0-SNAPSHOT

2022-08-05 Thread Shawn Heisey
While looking into a problem described on the #solr slack channel, I tried to have Solr optimize my core. It seems to have completely ignored the command.  I am running 9.1.0-SNAPSHOT, compiled from branch_9x. The user on slack also tried to optimize their index, running version 8.11.2, and th

Test with DMARC reject enabled

2022-08-05 Thread Shawn Heisey
I have just enabled DMARC rejection for my domain.  Hoping that messages to the list can still get through.

Re: Solr 6 vs Solr 8 considerable performance gap

2022-08-26 Thread Shawn Heisey
On 8/26/22 02:55, Sidharth Negi wrote: We set up Solr 6 and Solr 8 on two identical AWS instances (16 cores, 128 GB of which Solr was given Xmx=50GB) and indexed the same data on them and tested under the same load of traffic. The schema and solrconfig.xml are exactly identical - the schema fil

Re: Solr 6 vs Solr 8 considerable performance gap

2022-08-26 Thread Shawn Heisey
On 8/26/22 14:18, Sidharth Negi wrote: The disk space taken by the index of both Solr versions was about ~35 GB and the number of docs ~30 million in both. Unless that system is handling insanely complex queries that chew up lots of memory, I would not expect it to need more than about 8GB of

Re: How to change solr.maxBooleanClauses limit in Solr 8.11 in Solr cloud mode

2022-08-26 Thread Shawn Heisey
On 8/26/22 09:44, Su, Weiling wrote: I checked the documentation, it seems that Solr Cloud is using zookeeper to store solr.xml. (https://solr.apache.org/guide/8_11/format-of-solr-xml.html#defining-solr-xml). Any ideas on how to make the above configuration changes to reach Solr query? SolrC

Re: Forcing solr to run query on replica Nodes

2022-08-30 Thread Shawn Heisey
On 8/30/22 08:08, Satya Nand wrote: For querying, we have used *shard.preference as PULL *so that all queries are returned from pull replicas. How can I force solr to use only pull replicas? in case one of the pull replicas is not available then I want partial results to be returned from 7 repli

Re: SOLR API - modifying the schema for one collection modifies it for ALL collections ?

2022-08-31 Thread Shawn Heisey
On 8/30/22 16:22, Christopher Schultz wrote: I've been playing-around with trying to programmatically alter the schema for a Solr core as well, and this explains a LOT of weirdness I was experiencing. Jan, thanks for explaining the root problem, but your solution of just "cloning your Config

Re: Allow anonymous search on otherwise Basic Auth-protected Solr instance?

2022-09-02 Thread Shawn Heisey
On 9/1/22 19:06, Victoria Stuart (VictoriasJourney.com) wrote: I am moving from client-side (my personal local host environment; Linux) to Solr running as a standalone backend server on a cloud VPS. The web domain (mine) is SSL-only, and Solr is SSL-enabled with a signed (Let's Encrypt) certif

Re: how to use solr streaming expressions without using zkhost

2022-09-02 Thread Shawn Heisey
On 9/2/22 10:30, Eric Pugh wrote: "SolrJ streaming expressions: don’t depend on ZK anymore" For my own education... does streaming expressions require SolrCloud, or can it work in standalone mode? Thanks, Shawn

Re: Compatibility with windows 2019

2022-09-03 Thread Shawn Heisey
On 8/31/22 07:25, Ashish gade wrote: I like to check the compatibility of the Solr version on Windows 2019 Server. We are planning to upgrade windows from 2012 to 2019 server and currently we are using Solr v4.4.0.0. Please let me know the impact asap Solr requires Java, not a particular opera

Re: Forcing solr to run query on replica Nodes

2022-09-07 Thread Shawn Heisey
On 8/31/22 23:52, Satya Nand wrote: Thank you Shawn. If I eliminate this indexing node and create 8 NRT shards on these 8 query nodes. Meaning indexing will be happening on all 8 nodes and queries too. Will it create any impact on response time? currency commit interval is 15 minus. Heavy ind

Re: Configure Jetty custom error page under Solr

2022-09-13 Thread Shawn Heisey
On 9/13/22 14:08, David Ahia wrote: As a result of a security scan of Solr, I am being asked to block the display of error messages. Is there a recommended guide for configuring Jetty with a custom error page to ensure error messages are not displayed through the web UI? If anything other th

Re: Can I force-delete a field?

2022-09-13 Thread Shawn Heisey
On 9/13/22 18:38, dmitri maziuk wrote: I've a field whose name I typo-ed in a data importer script, so Solr dutifully created the new TYPOed_field and a TYPOed_field_str as per my directive. Now it won't let me delete either because "it's referred to by at least one copyfield directive". I

Re: SolrCloud nodes do not attempt to rediscover Zookeeper nodes in a kubernetes environment?

2022-09-13 Thread Shawn Heisey
On 9/13/22 19:26, Jonathan Tan wrote: So what I'm trying to verify... It looks like SOLR doesn't attempt to reconnect to ZK if it has previously failed. Is that intentional? Is there a way to get it to do so? What version of Solr, and what version of ZK? And something that may be important ...

Re: SolrCloud nodes do not attempt to rediscover Zookeeper nodes in a kubernetes environment?

2022-09-13 Thread Shawn Heisey
On 9/13/22 20:07, Jonathan Tan wrote: The SolrZkClient takes a startUpTimeOut and a startUpZkTimeOut property, and only checks if ZK is available within that period. Once that timeout has exceeded, then it declares that the SOLR node was unable to load the cores, and then it does nothing else. Su

Re: Get and run another fieldType's analyzer?

2022-09-17 Thread Shawn Heisey
On 9/17/22 15:54, Bill Dueber wrote: I’m writing some custom filters in java and am trying to figure out if it’s possible to get the string output from running another fieldTypes analysis chain. I get an IndexSchema in the FieldType.init that I can hang onto, and thought that maybe in my toInter

  1   2   3   4   5   6   7   >