Re: Cannot start solr because oom

2021-01-24 Thread Luke
SinceI changed heap size to 10G, I found that solr always uses around 6G-6.5G. Just wondering where I can set to limit memory usage, for example, I just want to give solr 6G. On Sun, Jan 24, 2021 at 1:51 PM Luke wrote: > looks like the solr-8983-console.log was overridden after I restarted Solr

Re: Cannot start solr because oom

2021-01-24 Thread Luke
looks like the solr-8983-console.log was overridden after I restarted Solr with 10G memory, I cannot find it anymore. as for how I install and start solr, I did as below 1. download binary file(8.7.0) 2. change configuration in solr.in.sh(setup external zk) 3. start it by ./bin/solr start &

Re: Cannot start solr because oom

2021-01-23 Thread Shawn Heisey
On 1/23/2021 6:41 PM, Luke wrote: I don't see any log in solr.log, but there is OutOfMemory error in solr-8983-console.log file. Do you have the entire text of that exception? Can you share it? That is the real information that I am after here. I only asked how Solr was installed and

Re: Cannot start solr because oom

2021-01-23 Thread Luke
Shawn, What version of Solr? 8.7.0 How is it installed and started? I download binary file and change configuration in solr.in.sh, then start it by ./bin/solr start & What OS? centos 7 Java version? openJDK 8 I don't see any log in solr.log, but there is OutOfMemory error in

Re: Cannot start solr because oom

2021-01-23 Thread Shawn Heisey
On 1/23/2021 6:29 AM, Luke Oak wrote: I use default settings to start solr , I set heap to 6G, I created 10 collections with 1node and 1 replica, however, there is not much data at all, just 100 documents. My server is 32 G memory and 4 core cpu, ssd drive 300g It was ok when i created 5

Cannot start solr because oom

2021-01-23 Thread Luke Oak
Hi there, I use default settings to start solr , I set heap to 6G, I created 10 collections with 1node and 1 replica, however, there is not much data at all, just 100 documents. My server is 32 G memory and 4 core cpu, ssd drive 300g It was ok when i created 5 collections. It got oom killed

Possible bug report—ICU Tokenizer: letter-space-number-letter tokenized inconsistently

2021-01-22 Thread Trey Jones
Hi all.. Following the advice at https://issues.apache.org/jira I'm explaining my situation here before creating an issue. The short version is that the ICU tokenizer can split tokens differently after a space depending on what comes *before* the space. For example, *x 14th* is tokenized as x |

Re: Exact and non exact highlighting

2021-01-22 Thread David Smiley
I'm very familiar with using the Unifier Highligher on a project with this requirement. The main "trick" we used was using only one field but analyzing both ways with a term differentiator (e.g. a leading symbol), and then coupled with a custom query parser that knows a phrase query is to be

Exact and non exact highlighting

2021-01-22 Thread df2832368_...@amberoad.de df2832368_...@amberoad.de
Hello folks, I am currently working on an issue where we need to enable exact highlighting on a text field. Only problem is that it should also be possible to have also parts of the query which don't need to be exact.(e.g. "Hello World" Test, so "Hello World" needs to be an exact match, but

RE: Getting Solr's statistic using SolrJ

2021-01-22 Thread Gael Jourdan-Weil
Hello Steven, I believe what you are looking for cannot be accessed using SolrJ (I didn't really check though). But you can easily access it either via the Collections APIs and/or the Metrics API depending on what you need exactly. See

Re: SOLR 8.6 Synonyms search and out of context results

2021-01-22 Thread Colvin Cowie
Hello, Do you mean that you want searches for "gain" to match documents with "revenue" on them, but do *not* want searches for "revenue" to match documents with "gain" on them? If that's what you mean, how have you defined your synonyms? If you're using the SynonymGraphFilterFactory

Getting Solr's statistic using SolrJ

2021-01-22 Thread Steven White
Hi everyone, Is there a SolrJ API that I can use to collect statistics data about Solr (everything that I see on the dashboard if possible)? I am in need to collect data about Solr instances, those same data that I see on the dashboard such as swap-memory, jvm-memory, list of cores, info about

Change uniqueKey using SolrJ

2021-01-22 Thread Timo Grün
Hi All, I’m currently trying to change the uniqueKey of my Solr Cloud schema using Solrj. While creating new Fields and FieldDefinitions is pretty straight forward, I struggle to find any solution to change the Unique Key field with Solrj. Any advice here? Best Regards, Timo Gruen

SOLR 8.6 Synonyms search and out of context results

2021-01-22 Thread Iram Tariq
Hi All, Using SOLR default Synonyms search I am able to search Synonyms but for some cases it is giving ambiguous results. For example one of Synonyms of "Revenue" is "Gain" Input Keyword for search: Revenue and Company Irrelevant Output: Our company doesn't want to gain success through

Re: "timeAllowed" param with "numFound" having a count value but doc list is empty

2021-01-22 Thread Aleksandr Yaroslavskiy
Hi, I am using timeAllowed and a collection with shards. Docs list contains results only if QTime(query time) < timeAllowed. In other cases (partialResults=true in responseHeader) docs are empty. /solr/collection_with_shards/select?etimeAllowed=1000=some_slow_query docs is empty If access a

Re: leader election stuck after hosts restarts

2021-01-22 Thread Pierre Salagnac
Thanks Alessandro. We found this Jira ticket that may be the root cause of this issue: https://issues.apache.org/jira/browse/SOLR-14356 I'm not sure whether it is the reason of the leader election initially failing, but it prevents Solr from exiting this error loop. Le mer. 13 janv. 2021 à

Re: Queries Regarding Cold searcher

2021-01-22 Thread Shawn Heisey
On 1/21/2021 3:42 AM, Parshant Kumar wrote: Do value(true or false) of cold searcher play any role during the completion of replication on slave server.If not please tell in which process in solr its applied? The setting to use a cold searcher applies whenever a new searcher is opened. It

Re: NullPointerException in Graph Traversal nodes streaming expression

2021-01-21 Thread Mike Drob
Can you provide a sample expression that would be able to reproduce this? Are you able to try a newer version by chance - I know we've fixed a few NPEs recently, maybe https://issues.apache.org/jira/browse/SOLR-14700 On Thu, Jan 21, 2021 at 4:13 PM ufuk yılmaz wrote: > Solr version 8.4. I’m

NullPointerException in Graph Traversal nodes streaming expression

2021-01-21 Thread ufuk yılmaz
Solr version 8.4. I’m getting an unexplanetory NullPointerException when executing a simple 2 level nodes stream, do you have any idea what may cause this? I tried setting /stream?partialResults=true=true and shards.tolerant=true in nodes expressions, with no luck. I also tried reading source

Re: Events on updating documents

2021-01-21 Thread Walter Underwood
Solr is not a database. I strongly recommend that you NOT use it as a data store. You will lose data. Solr does not have transactions. Don’t think of a Solr “commit” as a database commit. It is a command to start indexing the queued updates. It does not even attempt to meet ACID properties.

Re: DIH

2021-01-21 Thread dmitri maziuk
On 2021-01-20 6:26 PM, Joshua Wilder wrote: Please reconsider the removal of the DIH from future versions. The repo it's been moved to is a ghost town with zero engagement from Rohit (or anyone). Not sure how 'moving' it caused it to now only support MariaDB but that appears to be the case. The

Re: Exact matching without using new fields

2021-01-21 Thread Alexandre Rafalovitch
If, during index time, your "information" and "informed" are tokenized into the same root (inform?), then you will not be able to distinguish them without storing original forms somewhere, usually with copyField. Same with information vs INFORMATION. The search happens based on indexed tokens.

Re: Exact matching without using new fields

2021-01-21 Thread Doss
Hi, You can try search query -> "+information +retrieval" Meaning the document should have both the keywords. Doc 5 will also be in the results. https://lucene.apache.org/solr/guide/8_7/the-standard-query-parser.html#the-boolean-operator - Mohandoss. On Wed, Jan 20, 2021 at 1:38 AM gnandre

Re: read/write on different node?

2021-01-21 Thread Doss
Hi, Master/Slave concept is not there in SolrCloud, but similar thing we can achive by choosing TLOG + PULL replicas, where writes will happen in TLOG nodes and the PULL replicas gets the updated segments. Ensure minimum 2 TLOG nodes as PULL replica notes can't became leader of a shard. You can

How to change the JVM Threads of SolrCloud

2021-01-21 Thread Issei Nishigata
Hello All, I'm running SolrCloud(1 shard,9 replicas) on Amazon EKS. The other day, when I accidentally stopped CoreDNS of EKS, the entire Solr cluster went down due to the inability to resolve names of each node. I restarted CoreDNS shortly afterwards, but the Solr node just repeated down and

Re: read/write on different node?

2021-01-21 Thread Luke Oak
Thanks Mohandoss, It is realtime inserting and updating if a query is executed. My environment is three nodes, 3 shards and 2 replicas. I noticed there was master slave mode in the old version, but for solr cloud, I don’t know whether it is doable. Derrick Sent from my iPhone > On Jan

RE: Parallel streaming expression java.lang.IndexOutOfBoundsException

2021-01-21 Thread ufuk yılmaz
Looked at the source code of the parallel stream and it seems I need equal number of SHARDS and workers count parameter. I thought I needed as many replicas, it was shards. Maybe helps someone. Sent from Mail for Windows 10 From: ufuk yılmaz Sent: 21 January 2021 11:16 To:

Re: Queries Regarding Cold searcher

2021-01-21 Thread Parshant Kumar
Adding more queries :- Do value(true or false) of cold searcher play any role during the completion of replication on slave server.If not please tell in which process in solr its applied? On Thu, Jan 21, 2021 at 3:11 PM Parshant Kumar wrote: > Hi all, > > Please help me in below queries: > >

RE: Parallel streaming expression java.lang.IndexOutOfBoundsException

2021-01-21 Thread ufuk yılmaz
It only works when I set workers to 1, which defeats the point of parallel. Sent from Mail for Windows 10 From: ufuk yılmaz Sent: 21 January 2021 11:16 To: solr-user@lucene.apache.org Subject: Parallel streaming expression java.lang.IndexOutOfBoundsException Hello all,

Queries Regarding Cold searcher

2021-01-21 Thread Parshant Kumar
Hi all, Please help me in below queries: 1) what is the impact of making cold searcher false,true? 2)After full replication completion of data on slave server, new searcher is opened or not? 3)If opensearcher is false in autocommit and cold searcher is true , what does this conclude , Is their

Parallel streaming expression java.lang.IndexOutOfBoundsException

2021-01-21 Thread ufuk yılmaz
Hello all, https://lucene.apache.org/solr/guide/8_4/stream-decorator-reference.html#parallel I’m sending the same query in the docs, (just collection names changed) to my Solr but always getting the exception: { "result-set":{ "docs":[{

Re: read/write on different node?

2021-01-21 Thread Doss
Hi, You haven't shared information about your environment and how frequently you are commiting the changes, whether your user searching collection gets real time inserts / updates etc., but if you are not doing any real time analysis with the user query information, you can store the information

Events on updating documents

2021-01-21 Thread haris . khan
Hello, We at VNC are using Solr for search and as a data store. We have a use-case in which we want to hit a REST endpoint whenever documents are inserted, updated, or deleted in Solr with the documents under consideration as well. When exploring the Solr documentation, we found Event

DIH

2021-01-20 Thread Joshua Wilder
Please reconsider the removal of the DIH from future versions. The repo it's been moved to is a ghost town with zero engagement from Rohit (or anyone). Not sure how 'moving' it caused it to now only support MariaDB but that appears to be the case. The current implementation is fast, easy to work

Streaming expressions, what is the effect of collection name in the request url

2021-01-20 Thread ufuk yılmaz
Do collection names in request url affect how the query works in any way? A streaming expression is sent to http://mySolrHost/solr/col1,col2/stream (notice multiple collections in url) Col1 has 2 shards, each have 3 replicas. * Shard1 has replicas on nodes A, B, C * Shard2 has replicas on D,E,F

read/write on different node?

2021-01-20 Thread Luke
Hi, I have one data collection on 3 shards and 2 replicas, user searches on it. Also I log all user queries and save to another collection on the same solr cloud, but user queries are very slow when there are a lot of logs to be written to the log collection. any solution for me, please advise.

Solr Cloud freezes during scheduled backup

2021-01-20 Thread Paweł Róg
Hello everyone, I have a nasty problem with the scheduled Solr collections backup. From time to time when a scheduled backup is triggered (backup operation takes around 10 minutes) Solr freezes for 20-30 seconds. The freeze happens on one Solr instance at time but this affects all queries latency

Incorrect distance returned for indexed polygone shape

2021-01-20 Thread Famas
I am using `geodist()` in solr query. Following this `select?==*,_dist_:geodist()={!geofilt d=30444}=on=50.53,-9.5722616=*:*=geo=true=json` However, it seems like distance calculations aren’t working. Here’s an example query where the pt is several hundred kilometers away from the POLYGON. The

Different Edismax Behavior with user params vs Solr config params on Solr 8.

2021-01-19 Thread Kerwin
Hi, I am upgrading from Solr 6.5.1 to solr 8.6.1 and have noticed a change in the Edismax parser behavior which is affecting our search results. If user operators are present in the search query, the Solr 6 behavior was to take mm parameters from the user query string which was 0% by default if

Re: Effects of shards and replicas on performance

2021-01-19 Thread Shawn Heisey
On 1/19/2021 4:19 PM, ufuk yılmaz wrote: Lets say I had only 1 replica for each collection but I split it to 6 shards, 1 for every node. Or I had 2 shards (1 shard is too big for a single node I think) but I had 3 replicas, 3x2=6, 1 on every node. How would it affect the performance? It all

Effects of shards and replicas on performance

2021-01-19 Thread ufuk yılmaz
I’m trying to learn all I can on Solr for a year now and I still scratch my head when it comes to effects of shards and replicas on performance. - info about my setup We have a SolrCloud setup with 6 nodes. Each collection has 2 shards and 2 replicas. 1 shard’s size is about 100GB.

Re: Exact matching without using new fields

2021-01-19 Thread gnandre
Thanks for replying, Dave. I am afraid that I am looking for non-index time i.e. query time solution. Actually in my case I am expecting both documents to be returned from your example. I am just trying to avoid returning of documents which contain a tokenized versions of the provided search

Re: Leading wildcard searches very slow

2021-01-19 Thread matthew sporleder
https://lucene.apache.org/solr/4_6_0/solr-core/org/apache/solr/analysis/ReversedWildcardFilterFactory.html ? On Tue, Jan 19, 2021 at 4:01 AM mosheB wrote: > > Hi, is there any sophisticated way [using the schema] to block brutal regex > queries? > > > Thanks > > > > -- > Sent from:

Re: Exact matching without using new fields

2021-01-19 Thread David R
We had the same requirement. Just to echo back your requirements, I understand your case to be this. Given these 2 doc titles: doc 1: "information retrieval" doc 2: "Advanced information retrieval with Solr" You want a phrase search for "information retrieval" to find both documents, but an

Exact matching without using new fields

2021-01-19 Thread gnandre
Hi, I am aware that to do exact matching (only whatever is provided inside double quotes should be matched) in Solr, we can copy existing fields with the help of copyFields into new fields that have very minimal tokenization or no tokenization (e.g. using KeywordTokenizer or using string field

Re: Solr Slack Workspace

2021-01-19 Thread Charlie Hull
Relevance Slack is open to anyone working on search & relevance - #solr is only one of the channels, there's lots more! Hope to see you there. Cheers Charlie https://opensourceconnections.com/slack On 16/01/2021 02:18, matthew sporleder wrote: IRC has kind of died off,

Re: Leading wildcard searches very slow

2021-01-19 Thread mosheB
Hi, is there any sophisticated way [using the schema] to block brutal regex queries? Thanks -- Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Issues upgrading from Solr 6.5.1 to Solr 8.6.1

2021-01-18 Thread Kerwin
I further checked that BM25Similarity class until solr 7.7 has a null check for norms in the explainTFNorm method but this is removed in Solr 8 onwards. Does omitNorms work in Solr8? Can someone send me what the debug output looks like with omitNorms="true"? Here is my config: On Mon, Jan 18,

[Solr8.5.2] Sudden increase in cpu usage.

2021-01-18 Thread raj.yadav
Hi Everyone, We are using solr8.5.2 (Solr cloud mode), external zookeeper ensemble (hosted on the separate node) All of a sudden we are seeing sudden spike in CPU but at the same same time neither any heavy indexing is performed nor any sudden increase in request rate. Collection info:

Re: Solrcloud - Reads on specific nodes

2021-01-18 Thread Shawn Heisey
On 1/17/2021 11:12 PM, Doss wrote: Thanks Michael Gibney , Shawn Heisey for pointing in the right direction. 1. Will there be any performance degrade if we use shards.preference? 2. How about leader election if we decided to use NRT + PULL ? TLOG has the advantage of participating in leader

Issues upgrading from Solr 6.5.1 to Solr 8.6.1

2021-01-18 Thread Kerwin
Hi eveybody, I am migrating from solr 6.5.1 to solr 8.6.1 and am having a couple of issues for which I need your help. There is a significant change in ranking between Solr 6 and 8 search results which I need to fix before using Solr8 in our live environment. I noticed a couple of changes upfront

Re: Solrcloud - Reads on specific nodes

2021-01-17 Thread Doss
Thanks Michael Gibney , Shawn Heisey for pointing in the right direction. 1. Will there be any performance degrade if we use shards.preference? 2. How about leader election if we decided to use NRT + PULL ? TLOG has the advantage of participating in leader election correct? 3. NRT + TLOG is there

Re: Solr Slack Workspace

2021-01-15 Thread matthew sporleder
IRC has kind of died off, https://lucene.apache.org/solr/community.html has a slack mentioned, I'm on https://opensourceconnections.com/slack after taking their solr training class and assume it's mostly open to solr community. On Fri, Jan 15, 2021 at 8:10 PM Justin Sweeney wrote: > > Hi all, >

Solr Slack Workspace

2021-01-15 Thread Justin Sweeney
Hi all, I did some googling and didn't find anything, but is there a Slack workspace for Solr? I think this could be useful to expand interaction within the community of Solr users and connect people solving similar problems. I'd be happy to get this setup if it does not exist already. Justin

Re: [Solr8.7] Performance of group.ngroups ?

2021-01-15 Thread Joel Bernstein
You can try collapse as well. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jan 15, 2021 at 4:51 AM Bruno Mannina wrote: > Hello, > > > > I found a temporary solution to my problem. > > > > I do a request without ngroups=true => result is quickly > > And just after, I do a simple

Re: Solrcloud - Reads on specific nodes

2021-01-15 Thread Shawn Heisey
On 1/15/2021 7:56 AM, Doss wrote: 1. Suppose we have 10 node SOLR Cloud setup, is it possible to dedicate 4 nodes for writes and 6 nodes for selects? 2. We have a SOLR cloud setup for our customer facing applications, and we would like to have two more SOLR nodes for some backend jobs. Is it

Re: Handling acronyms

2021-01-15 Thread Michael Gibney
EDIT: "the equivalent terms are separated by commas (as they should be)" => "the equivalent terms are _not_ separated by commas (as they should be)" On Fri, Jan 15, 2021 at 10:09 AM Michael Gibney wrote: > Shaun, > > I'm not 100% sure, but don't give up on this just yet: > > > For example if I

Re: Solrcloud - Reads on specific nodes

2021-01-15 Thread Michael Gibney
I know you're asking about nodes, not replicas; but depending on what you're trying to achieve you might be as well off routing requests based on replica. Have you considered the various options available via the `shards.preference` param [1]? For instance, you could set up your "write" replicas

Re: Replicaton SolrCloud

2021-01-15 Thread Shawn Heisey
On 1/15/2021 7:20 AM, Jae Joo wrote: Is non CDCR replication in SolrCloud still working in Solr 9.0? Solr 9 doesn't exist yet. Probably won't for at least a few months. The latest version is 8.7.0. Solr's replication feature is used by SolrCloud internally for recovery operations, but the

Re: Handling acronyms

2021-01-15 Thread Michael Gibney
Shaun, I'm not 100% sure, but don't give up on this just yet: > For example if I enter diabetes it finds the acronym DM for diabetes mellitus I think the behavior you're observing may simply be a side-effect of a misconfiguration of synonyms.txt. In the example you posted, the equivalent terms

Solrcloud - Reads on specific nodes

2021-01-15 Thread Doss
Dear All, 1. Suppose we have 10 node SOLR Cloud setup, is it possible to dedicate 4 nodes for writes and 6 nodes for selects? 2. We have a SOLR cloud setup for our customer facing applications, and we would like to have two more SOLR nodes for some backend jobs. Is it good idea to form these

Re: Handling acronyms

2021-01-15 Thread Shaun Campbell
Hi Michael Thanks for that I'll have a study later. It's just reminded me of the expand option which I meant to have a look at. Thanks Shaun On Fri, 15 Jan 2021 at 14:33, Michael Gibney wrote: > The equivalent terms on the right-hand side of the `=>` operator in the > example you sent should

Re: Handling acronyms

2021-01-15 Thread Shaun Campbell
Hi Charlie I was indexing at index time only. The synonyms/acronyms were coming from the published journals xml files so I wasn't expecting to maintain them myself. If it worked, I was expecting, hopefully, to update the synonyms file automatically. As I just explained to Bernd I'm finding that

Re: Handling acronyms

2021-01-15 Thread Shaun Campbell
Hi Bernd Thanks for that. I think it is working, but I think unfortunately what I'm trying to do is impossible/not logical. When I enter a term it goes off and searches using all the matching acronyms, because I'm finding a term used in more than one synonym eg diabetes. I think at the end of

Re: Handling acronyms

2021-01-15 Thread Michael Gibney
The equivalent terms on the right-hand side of the `=>` operator in the example you sent should be separated by a comma. You mention you already tried only-comma-separated (e.g. one line: `SRN,Stroke Research Network`) and that that yielded unexpected results as well. I would recommend

Replicaton SolrCloud

2021-01-15 Thread Jae Joo
Is non CDCR replication in SolrCloud still working in Solr 9.0? Jae

Unicode Normalization and ICUNormalizer2Filter

2021-01-15 Thread Bernd Fehling
Hello list, cloud it be that Apache Solr Reference Guide of all versions is wrong? Example: https://lucene.apache.org/solr/guide/8_7/filter-descriptions.html#icu-normalizer-2-filter NFC: (name="nfc" mode="compose") Normalization Form C, canonical decomposition NFD: (name="nfc"

SolrCloud 8.7.0 with Zookeeper 3.4.5

2021-01-15 Thread Subhajit Das
Hi There, I am planning to implement Solr cloud 8.7.0 with existing Zookeeper 3.4.5. This is cloudera provided zookeeper. Is there any red flags, for such configuration, as I couldn’t find any compatibility matrix? Many thanks in advance. Regards, Subhajit

RE: Query over migrating a solr database from 7.7.1 to 8.7.0

2021-01-15 Thread Flowerday, Matthew J
Hi Jim Thanks for looking into it for me. I did some more testing and if I created a base solr 7.7.1 database using the 'out of the box' schema.xml and solrconfig and add this item manually using the Solr Admin tool documents/XML ABCD-N1 A test And then update it using

Re: Handling acronyms

2021-01-15 Thread Charlie Hull
I'm wondering if you should be using these acronyms at index time, not search time. It will make your index bigger and you'll have to re-index to add new synonyms (as they may apply to old documents) but this could be an occasional task, and in the meantime you could use query-time synonyms

Fieldname alias for Highlighter results

2021-01-15 Thread Michael Aleythe, Sternwald
Hi everybody, I'm looking for a way to replace solr index field names in the highlighting response. For the query part there is the param fl=substitute:REAL_FIELD_NAME which substitutes the field name REAL_FIELD_NAME by "substitute". Sadly the substitution is not applied to the highlighter

Re: Handling acronyms

2021-01-15 Thread Bernd Fehling
If you are using multiword synonyms, acronyms, ... Your should escape the space within the multiwords. As synonyms.txt: SRN, Stroke\ Research\ Network IGBP, isolated\ gastric\ bypass ... Redards Bernd Am 15.01.21 um 10:48 schrieb Shaun Campbell: I have a medical journals search application

RE: [Solr8.7] Performance of group.ngroups ?

2021-01-15 Thread Bruno Mannina
Hello, I found a temporary solution to my problem. I do a request without ngroups=true => result is quickly And just after, I do a simple request with my query and this param: ….={x:"unique(fid)"} Where the field « fid » is my group field name. 88 sec => 3~4 sec for both requests.

Handling acronyms

2021-01-15 Thread Shaun Campbell
I have a medical journals search application and I've a list of some 9,000 acronyms like this: MSNQ=>MSNQ Multiple Sclerosis Neuropsychological Screening Questionnaire SRN=>SRN Stroke Research Network IGBP=>IGBP isolated gastric bypass TOMADO=>TOMADO Trial of Oral Mandibular Advancement Devices

Re: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-15 Thread Shawn Heisey
On 1/14/2021 2:31 AM, Abhay Kumar wrote: I am trying to post below query to Solr but getting error as “Bad Message 414reason: URI Too Long”. I am sending query using SolrNet library. Please suggest how to resolve this issue. *Query :*

RE: QueryResponse ordering

2021-01-14 Thread Srinivas Kashyap
Hi Alessandro, I'm trying to retrieve party id 'abc' 'def' 'ghi' in the same order I pass to filter query. Is this possible? The sorting field which I want to get results is not in solr schema for party core. The sorting field Is outside solr. I want to able to fetch the QueryResponse(SolrJ)

Re: Cursor Performance Issue

2021-01-14 Thread Ajay Sharma
Hi Mike, Thanks for your reply. I remember DocValues is enabled by default since solr 6. If it is not and I reindex the data with DocValues= true for id field. How much my index size will increase due to this. Currently I have 90 GB as index size On Wed, 13 Jan, 2021, 9:14 pm Mike Drob,

[Solr8.7] Performance of group.ngroups ?

2021-01-14 Thread Matheo Software
Hi All, I have more than 130 million documents, with an index size of more than 400GB on Solr8.7. I do a simple query and it takes around 1400ms, it’s ok but when I use ngroups=true, I get an answer in 88sec. I know it’s because Solr calculates the number of groups on a specific field but

RE: [Solr8.7] UI request reply empty after 8s

2021-01-14 Thread Bruno Mannina
Hi, Perfect ! it works when I increase the config.timeout (row 597). The file app.js can be found here: /opt/solr/server/solr-webapp/webapp/js/angular -Message d'origine- De : ufuk yılmaz [mailto:uyil...@vivaldi.net.INVALID] Envoyé : mercredi 13 janvier 2021 14:57 À :

Re: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Bernd Fehling
AFAIK, that could be a limit in Jetty and be raised in jetty.xml. You might check the Jetty docs and look for something like BufferSize. At least for Solr 6.6.x Regards Bernd Am 14.01.21 um 13:19 schrieb Abhay Kumar: Thank you Nicolas. Yes, we are making Post request to Solr using SolrNet

RE: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Abhay Kumar
Thank you Nicolas. Yes, we are making Post request to Solr using SolrNet library. The current request length is approx. 32K characters, I have tested with 10K characters length request and it works fine. Any suggestion to increase request length size in Solr configuration. Thanks. Abhay

Re: solrcloud with EKS kubernetes

2021-01-14 Thread Abhishek Mishra
Hi Jonathan, it was really helpful. Some of the metrics were crossing threshold like network bandwidth etc. Regards, Abhishek On Sat, Dec 26, 2020 at 7:54 PM Jonathan Tan wrote: > Hi Abhishek, > > Merry Christmas to you too! > I think it's really a question regarding your indexing speed NFRs.

Re: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Nicolas Franck
Euh, sorry: I did not read your message well enough. You did actually use a post request, with the parameters in the body (your example suggests otherwise) > On 14 Jan 2021, at 10:37, Nicolas Franck wrote: > > I believe you can also access this path in a HTTP POST request. > That way you do no

Re: Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Nicolas Franck
I believe you can also access this path in a HTTP POST request. That way you do no hit the URI size limit cf. https://stackoverflow.com/questions/2997014/can-you-use-post-to-run-a-query-in-solr-select I think some solr libraries already use this approach (e.g. WebService::Solr in perl) On 14

Getting error "Bad Message 414 reason: URI Too Long"

2021-01-14 Thread Abhay Kumar
Hello, I am trying to post below query to Solr but getting error as "Bad Message 414reason: URI Too Long". I am sending query using SolrNet library. Please suggest how to resolve this issue. Query :

dedupe doen't work on solr cloud with router field

2021-01-13 Thread Luke
I have one collection, 3 shards, 2 replicas, I defined route field: title, and ID is the unique key. I index two document with same ID and different title, I configured dedupe chain and I can see signature is generated, but the old document was removed by solr, please help, thanks

Re: Apache Solr in High Availability Primary and Secondary node.

2021-01-13 Thread Kaushal Shriyan
Hi, Checking in again if someone can pitch in for my earlier post to this mailing list? Thanks in Advance. Best Regards, On Tue, Jan 12, 2021 at 8:30 AM Kaushal Shriyan wrote: > > > On Tue, Jan 12, 2021 at 12:10 AM Dmitri Maziuk > wrote: > >> On 1/11/2021 12:30 PM, Walter Underwood wrote: >>

Re: Re:Interpreting Solr indexing times

2021-01-13 Thread Alessandro Benedetti
I agree, documents may be gigantic or very small, with heavy text analysis or simple strings ... so it's not possible to give an evaluation here. But you could make use of the nightly benchmark to give you an idea of Lucene indexing speed (the engine inside Apache Solr) :

Re: leader election stuck after hosts restarts

2021-01-13 Thread Alessandro Benedetti
I faced these problems a while ago, but at the time I created a blog post which I hope could help: https://sease.io/2018/05/solrcloud-leader-election-failing.html - --- Alessandro Benedetti Search Consultant, R Software Engineer, Director Sease Ltd. - www.sease.io -- Sent from:

Re: QueryResponse ordering

2021-01-13 Thread Alessandro Benedetti
Hi Srinivas, Filter queries don't impact scoring but only matching. So, what is the ordering you are expecting? A bq (boost query) parameter will add a clause to the query, impacting the score in an additive way. The query you posted is a bit confused, what was your intent there? To boost search

RE: Query over migrating a solr database from 7.7.1 to 8.7.0

2021-01-13 Thread Dyer, Jim
I think if you have _root_ in schema.xml you should look elsewhere. My memory is merely adding this one line to schema.xml took care of our problem. From: Flowerday, Matthew J Sent: Tuesday, January 12, 2021 3:23 AM To: solr-user@lucene.apache.org Subject: RE: Query over migrating a solr

Re: different score from different replica of same shard

2021-01-13 Thread Walter Underwood
Yes, check performance before turning on the stats cache in prod. When we tested the LRUStatsCache in 6.6.2, searches were 11X slower. It should be possible to do distributed IDF with little extra overhead. Infoseek was doing that in 1995 and the patent on the technique has expired. wunder

Re: Cursor Performance Issue

2021-01-13 Thread Mike Drob
You should be using docvalues on your id, but note that switching this would require a reindex. On Wed, Jan 13, 2021 at 6:04 AM Ajay Sharma wrote: > Hi All, > > I have used cursors to search and export documents in solr according to > >

Re: different score from different replica of same shard

2021-01-13 Thread Vincent Brehin
Hallo Bernd und Markus, A very instructive article, by the creator of TLOG mode (introduced in 7.0, btw): https://medium.com/@caomanhdat317/indexing-flow-of-solrcloud-sharding-distributed-systems-1-bba411bf8994 It helped me when architecting our replication policy. Not an easy matter, it's a

Re: different score from different replica of same shard

2021-01-13 Thread Markus Jelsma
Hallo Bernd, I see the different replica types in the 7.1 [1] manual but not in the 6.6. ExactStatsCache should work in 6.6, just add it to solrconfig.xml, not the request handler [1]. It will slow down searches due to added overhead. Regards, Markus [1]

SockerTimeoutException in long running streaming queries

2021-01-13 Thread ufuk yılmaz
When I performa a long running streaming expression, sometimes I get: { "error": { "metadata": [ "error-class", "org.apache.solr.common.SolrException", "root-error-class", "java.net.SocketTimeoutException" ], "msg":

Re: different score from different replica of same shard

2021-01-13 Thread Bernd Fehling
Hello Markus, thanks a lot. Is TLOG also for SOLR 6.6.6 or only 8.x and up? I will first try ExactStatsCache. Should be added as invariant to request handler, right? Comparing the replica index directories they have different size and the index version and generation is different. Also Max

RE: [Solr8.7] UI request reply empty after 8s

2021-01-13 Thread ufuk yılmaz
Hi, A while ago I asked the same thing here. Looking at the source javascript code of the frontend app, I saw a 10k millisecond timeout config in httpInterceptor inside app.js. I changed it to something much larger and results of long queries began to show. Hope it helps Sent from Mail for

Re: different score from different replica of same shard

2021-01-13 Thread Markus Jelsma
Hello Bernd, This is normal for NRT replicas, because the way segments are merged and deletes are removed is not synchronized between replicas. In that case counts for TF and IDF and norms become slightly different. You can either use ExactStatsCache that fetches counts for terms before scoring,

QueryResponse ordering

2021-01-13 Thread Srinivas Kashyap
Hello, I have a scenario where I'm using filter query to fetch the results. Example: Filter query(fq) - PARTY_ID:(abc OR def OR ghi) Now I'm getting query response through solrJ in different order. Is there a way I can get the results in same order as specified in filter query? Tried dismax

<    1   2   3   4   5   6   7   8   9   10   >