Re: Monitoring Solr for currently running queries

2020-12-29 Thread Markus Jelsma
Hello Ufuk,

You can log slow queries [1].

If you would want to see currently running queries you would have to extend
SearchHandler and build the custom logic yourself. Watch out for SolrCloud
because the main query as well as the per-shard queries can pass through
that same SearchHandler. You can distinguish between then reading the
shard=true parameter.

Regards,
Markus

[1] https://lucene.apache.org/solr/guide/6_6/configuring-logging.html

Op di 29 dec. 2020 om 16:49 schreef ufuk yılmaz :

> Hello All,
>
> Is there a way to see currently executing queries in a SolrCloud? Or a
> general strategy to detect a query using absurd amount or resources?
>
> We are using Solr for not only simple querying, but running complex
> streaming expressions, facets with large data etc. Sometimes, randomly, CPU
> usage gets so high that it starts to respond very slowly to even simple
> queries, or don’t respond at all. I’m trying to determine if it’s a result
> of simple overloading of the system by many “normal” queries, or someone
> sends Solr an unreasonably compute-heavy request.
>
> A few days ago when this occured, I stopped every service that can send
> Solr a query. After that, for about an hour, nodes were reading from the
> disk at 1GB/s which is the maximum of our disks. Then everything went back
> to the normal as I started the other services.
>
> One (bad) idea I had is to build a proxy service which proxies every
> request to our SolrCloud and monitors current running requests, but scaling
> this to the size of SolrCloud may be reinventing the wheel.
>
> For now all I can detect is that Solr is struggling, but I have no idea
> what causes that and when.
>
> -Chees and happy new year
>


Upgrade httpclient version due to CVE-2020-13956?

2020-12-29 Thread ??????
CVE-2020-13956: Apache HttpClient incorrect handling of malformed
authority component in request URIs


Severity: Medium


Vendor:
The Apache Software Foundation


Versions Affected:
Apache HttpClient 4.5.12 and prior
Apache HttpClient 5.0.2 and prior


Description:


Apache HttpClient versions prior to version 4.5.13 and 5.0.3 can
misinterpret malformed authority component in request URIs passed to
the library as java.net.URI object and pick the wrong target host for
request execution.


Mitigation:


As of release 4.5.13 and 5.0.3 HttpClient will reject URIs with
ambiguous malformed authority component as invalid. Users of HttpClient
are advised to upgrade to version 4.5.13 or 5.0.3 and sanitize request
URIs when using java.net.URI as input.


Credit:
This issue was discovered and reported by Priyank Nigam

Monitoring Solr for currently running queries

2020-12-29 Thread ufuk yılmaz
Hello All,

Is there a way to see currently executing queries in a SolrCloud? Or a general 
strategy to detect a query using absurd amount or resources?

We are using Solr for not only simple querying, but running complex streaming 
expressions, facets with large data etc. Sometimes, randomly, CPU usage gets so 
high that it starts to respond very slowly to even simple queries, or don’t 
respond at all. I’m trying to determine if it’s a result of simple overloading 
of the system by many “normal” queries, or someone sends Solr an unreasonably 
compute-heavy request.

A few days ago when this occured, I stopped every service that can send Solr a 
query. After that, for about an hour, nodes were reading from the disk at 1GB/s 
which is the maximum of our disks. Then everything went back to the normal as I 
started the other services.

One (bad) idea I had is to build a proxy service which proxies every request to 
our SolrCloud and monitors current running requests, but scaling this to the 
size of SolrCloud may be reinventing the wheel.

For now all I can detect is that Solr is struggling, but I have no idea what 
causes that and when.

-Chees and happy new year


Re: Why do I get different results for the same query with two Solr versions?

2020-12-29 Thread Tulsi Das
Can you post the managed schema and solrconfig content here ?

Do try the solr admin analysis screen
once as well to see the behaviour for this field.

https://lucene.apache.org/solr/guide/7_6/index.html

On Sun, 27 Dec, 2020, 6:54 pm nettadalet,  wrote:

> Thank you, that was helpful!
>
> For Solr 4.6 I get
> "parsedquery": "PhraseQuery(TITLE_ItemCode_t:\"ki 7\")"
>
> For Solr 7.5 I get
> "parsedquery":"+(+(TITLE_ItemCode_t:ki7 (+TITLE_ItemCode_t:ki
> +TITLE_ItemCode_t:7)))"
>
> So this is the cause of the difference in the search result, but I still
> don't know why the parsedquery is different between the two versions.
> Any idea/guess?
> Is it some internal implementation that changed sometime between 4.6 and
> 7.5?
>
>
>
> --
> Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html
>


Re: Re:Re: Why do I get different results for the same query with two Solr versions?

2020-12-29 Thread nettadalet
Hi,
thank for the comment, but I tried to use both "sow=false" and "saw=true"
and I still get the same result. For query (TITLE_ItemCode_t:KI_7) I still
see:
Solr 4.6: "parsedquery": "PhraseQuery(TITLE_ItemCode_t:\"ki 7\")"
Solr 7.5: "parsedquery":"+(+(TITLE_ItemCode_t:ki7 (+TITLE_ItemCode_t:ki
+TITLE_ItemCode_t:7)))"



Tulsi wrote
> Hi ,
> Yes this look like related to sow (split on whitespace) param default
> behaviour change in solr 7.
> 
> The sow parameter (short for "Split on Whitespace") now defaults to
> false, which allows support for multi-word synonyms out of the box.
> This parameter is used with the eDismax and standard/"lucene" query
> parsers. If this parameter is not explicitly specified as true, query
> text will not be split on whitespace before analysis.
> 
> https://lucene.apache.org/solr/guide/7_0/major-changes-in-solr-7.html
> 
> 
> On Sun, 27 Dec, 2020, 8:25 pm nettadalet, 

> nsteinberg@

>  wrote:
> 
>> I added "defType=lucene" to both searches to make sure I use the same
>> query
>> parser, but it didn't change the results.
>>
>>
>>
>> --
>> Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html
>>





--
Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Atomic Update Failures with Nested Schema and Lazy Field Loading

2020-12-29 Thread Nussbaum, Ronen
Hi,

I've encountered another issue that might be related to nested schema.
Not always, but many times atomic updates fails for some shards with the 
message "TransactionLog doesn't know how to serialize class 
org.apache.lucene.document.LazyDocument$LazyField".
I checked both options:

  1.  Set false.
  2.  Set true but removed 
child documents.
In both cases atomic update worked without any errors.
This might suggest that there is an issue with this combination.

Thanks in advance,
Ronen.



This electronic message may contain proprietary and confidential information of 
Verint Systems Inc., its affiliates and/or subsidiaries. The information is 
intended to be for the use of the individual(s) or entity(ies) named above. If 
you are not the intended recipient (or authorized to receive this e-mail for 
the intended recipient), you may not use, copy, disclose or distribute to 
anyone this message or any information contained in this message. If you have 
received this electronic message in error, please notify us by replying to this 
e-mail.