Re: how to get rid of double quotes in solr

2020-04-13 Thread sefty nindyastuti
the data that I use is log from hadoop, my problem is hadoop log from cluster, the schema I use is filebeat --> logstash --> solr, I use logstash config to parse the hadoop log, the hadoop log is inputted to the logstash via filebeat then output from the logstash indexed to the solr Pada tanggal S

Re: Fuzzy search not working

2020-04-13 Thread Deepu
Corrected Typo mistake. Hi Team, We have 8 text fields (*_txt_en) in schema and one multi valued text field which is copy field of other text fields, like below. tittle_txt_en, configuration_summary_txt_en, all_text_txt_ens (multi value field) Observed one issue with Fuzzy match, same term with

Re: Fuzzy search not working

2020-04-13 Thread Deepu
Hi Walter, It's type, actual input term was "probl" sorry for the typo. Thanks, Pradeep On Mon, Apr 13, 2020 at 3:46 PM Walter Underwood wrote: > You need to add three letters to “prob” to get “problem”, so it is edit > distance 3. > Fuzzy only works to distance 2. > > If you want to match pre

Re: Fuzzy search not working

2020-04-13 Thread Walter Underwood
You need to add three letters to “prob” to get “problem”, so it is edit distance 3. Fuzzy only works to distance 2. If you want to match prefixes, edge n-grams are a better approach. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Apr 13, 2020, at 2

Re: SolrJ connection leak with SolrCloud and Jetty Gzip compression enabled

2020-04-13 Thread Samuel Garcia Martinez
Reading again the last two paragraphs I realized that, those two specially, are very poorly worded (grammar 😓). I tried to rephrase them and correct some of the errors below. Here I can see three different problems: * HttpSolrCall should not use HttpServletResponse#setCharacterEncoding to set

Fuzzy search not working

2020-04-13 Thread Deepu
Hi Team, We have 8 text fields (*_txt_en) in schema and one multi valued text field which is copy field of other text fields, like below. tittle_txt_en, configuration_summary_txt_en, all_text_txt_ens (multi value field) Observed one issue with Fuzzy match, same term with distance of two(~2) is w

Fuzzy match issue

2020-04-13 Thread Pradeep Kumar Kolluri (V)
We have 8 text fields (*_txt_en) in schema and one multi valued text field which is copy field of other text fields, like below. tittle_txt_en, configuration_summary_txt_en, all_text_txt_ens (multi value field) Observed one issue with Fuzzy match, same term with distance of two(~2) is working on

SolrJ connection leak with SolrCloud and Jetty Gzip compression enabled

2020-04-13 Thread Samuel Garcia Martinez
Hi! Today, I've seen a weird issue in production workloads when the gzip compression was enabled. After some minutes, the client app ran out of connections and stopped responding. The cluster setup is pretty simple: Solr version: 7.7.2 Solr cloud enabled Cluster topology: 6 nodes, 1 single coll

Re: Proper way to manage managed-schema file

2020-04-13 Thread Alexandre Rafalovitch
If you are using API (which AdminUI does), the regenerated file will loose comments and sort everything in particular order. That's just the implementation at the moment. If you don't like that, you can always modify the schema file by hand and reload the core to notice the changes. You can even s

Re: how to use multiple update process chain?

2020-04-13 Thread Alexandre Rafalovitch
You can only have one chain at the time. You can, however, create your custom URP chain to contain configuration from all three. Or, if you do use multiple chains that are configured similarly, you can pull each URP into its own definition and then mix and match then either in the chain or even p

Re: Required operator (+) is being ignored when using default conjunction operator AND

2020-04-13 Thread Chris Hostetter
On Sat, 11 Apr 2020, Eran Buchnick wrote: : Date: Sat, 11 Apr 2020 23:34:37 +0300 : From: Eran Buchnick : Reply-To: solr-user@lucene.apache.org : To: solr-user@lucene.apache.org : Subject: Re: Required operator (+) is being ignored when using default : conjunction operator AND : : Hoss, than

Re: Nested Document with replicas slow

2020-04-13 Thread Michael Gibney
Depending on how you're measuring performance (and whether your use case benefits from caching), it might be worth looking into stable replica routing (configured with the "replica.base" sub-parameter of the shards.preference

Nested Document with replicas slow

2020-04-13 Thread Jae Joo
I have multiple 100 M documents using Nested Document for joining. It is the fastest way for joining in a single replica. By adding more replicas (2 or 3), the performance is slow down significantly. (about 100x times). Does anyone have same experience? Jae

Re: how to get rid of double quotes in solr

2020-04-13 Thread sefty nindyastuti
the picture is the output from logstash, so I use config logstash to accept input from the file and then output to the solr Pada tanggal Sen, 13 Apr 2020 19.07, Erick Erickson menulis: > I don’t quite know what you’re asking about. Is that input or intput to > Solr? Or is it output from logstas

Re: how to get rid of double quotes in solr

2020-04-13 Thread Erick Erickson
I don’t quite know what you’re asking about. Is that input or intput to Solr? Or is it output from logstash? What are you indexing? Because that doesn't look like data from a solr log. You might want to review: https://wiki.apache.org/solr/UsingMailingLists Best, Erick > On Apr 13, 2020, at 12

Queries on adding headers to solrj Request

2020-04-13 Thread dinesh naik
Hi all, We are planning to add security to Solr using . For this we are adding few information in the headers of each SolrJ Request. These request will be intercepted by some application (proxy) in the Solr VM and then route it to Solr ( Considering Solr port as 8983 ) . Could you please answer bel