Re: Insert documents to a particular shard

2020-06-02 Thread Jörn Franke
Hint: you can easily try out streaming expressions in the admin UI > Am 03.06.2020 um 07:32 schrieb Jörn Franke : > >  > You are trying to achieve data locality by having parents and children in the > same shard? > Does document routing address it? > >

Re: Insert documents to a particular shard

2020-06-02 Thread Jörn Franke
You are trying to achieve data locality by having parents and children in the same shard? Does document routing address it? https://lucene.apache.org/solr/guide/8_5/shards-and-indexing-data-in-solrcloud.html#document-routing On a side node, I don’t know your complete use case, but have you

Re: solr 8.4.1 with ssl tls1.2 creating an issue with non-leader node

2020-06-02 Thread Jörn Franke
Why in the jetty-ssl.xml? Should this not be configured in the solr.in.sh? > Am 03.06.2020 um 00:38 schrieb yaswanth kumar : > > Thanks Franke, but yes for all these questions I did configured it > properly, I made sure to include > > default="JKS"/> > default="JKS"/> > in the

Re: solr 8.4.1 with ssl tls1.2 creating an issue with non-leader node

2020-06-02 Thread yaswanth kumar
also forgot to update before that I have enabled basicauthentication and provided the details in security.json and uploaded it via zookeeper. Thanks, On Tue, Jun 2, 2020 at 6:42 PM yaswanth kumar wrote: > also I am seeing the below error as a parent one from solr.log > > at

Re: solr 8.4.1 with ssl tls1.2 creating an issue with non-leader node

2020-06-02 Thread yaswanth kumar
also I am seeing the below error as a parent one from solr.log at org.apache.solr.util.CryptoKeys$RSAKeyPair.encrypt(CryptoKeys.java:366) org.apache.solr.common.SolrException: javax.crypto.BadPaddingException: RSA private key operation failed at

Re: solr 8.4.1 with ssl tls1.2 creating an issue with non-leader node

2020-06-02 Thread yaswanth kumar
Thanks Franke, but yes for all these questions I did configured it properly, I made sure to include in the jetty-ssl.xml along with the path keystore and truststore. Also I have made sure that trusstore exists on all nodes and also I am using the same file for both keystore and truststore as

Insert documents to a particular shard

2020-06-02 Thread sambasivarao giddaluri
Hi All, I am running solr in cloud mode in local with 2 shards and 2 replica on port 8983 and 7574 and figuring out how to insert document in to a particular shard , I read about implicit and composite route but i don't think it will work for my usecase. shard1 :

Re: solr 8.4.1 with ssl tls1.2 creating an issue with non-leader node

2020-06-02 Thread Jörn Franke
Have you looked in the logfiles? Keystore Type correctly defined on all nodes? Have you configured the truststore on all nodes correctly? Have you set clusterprop urlScheme to htttps in ZK? https://lucene.apache.org/solr/guide/7_5/enabling-ssl.html#configure-zookeeper > Am 02.06.2020 um

Solr8 improvements to SolrCloud leader election

2020-06-02 Thread Danny Shih
Are there any significant (or not so significant) changes? I have browsed the release notes and searched JIRA, but the latest news seems to be in 7.3 (where the old Leader-In-Recovery logic was replaced). Context: We are currently running Solr 7.4 in production. In the past year, we’ve seen

Autoscaling using SolrCloud8.5 on AWS EKS - issue with Node Added trigger

2020-06-02 Thread Mangla,Kirti
Hi, I have been trying to enable autoscaling on SolrCloud 8.5, with Node Added trigger and Node Lost trigger. The SolrCloud is running on AWS EKS pods, with 2 nodes minimum. I have added NodeAddedTrigger. My autoscaling API response looks like as in the attached file. Whenever I scale up the

Re: Building a web based search engine

2020-06-02 Thread Jim Anderson
Markus, Thank for your replies. I will review them and experiment more and see if I can get everything working. Jim On Tue, Jun 2, 2020 at 2:36 PM Markus Jelsma wrote: > Hello, see inline. > > Markus > > -Original message- > > From:Jim Anderson > > Sent: Tuesday 2nd June 2020 19:59 >

RE: Building a web based search engine

2020-06-02 Thread Markus Jelsma
Hello, see inline. Markus -Original message- > From:Jim Anderson > Sent: Tuesday 2nd June 2020 19:59 > To: solr-user@lucene.apache.org > Subject: Re: Building a web based search engine > > Hi Markus, > > Thanks for your response. I appreciate you giving me the bullet list of >

Re: Building a web based search engine

2020-06-02 Thread Jim Anderson
Hi Markus, Thanks for your response. I appreciate you giving me the bullet list of things to do. I can take that list and work from it and hopefully make progress, but I don't think it will get me where I want to be - just a bit closer. You say, "We have been building precisely that for over ten

Re: solr 8.4.1 with ssl tls1.2 creating an issue with non-leader node

2020-06-02 Thread yaswanth kumar
team, can someone help me on the above topic? On Mon, Jun 1, 2020 at 10:00 PM yaswanth kumar wrote: > Trying to setup solr 8.4.1 + open jdk 11 on centos , enabled the ssl > configurations with all the certs in place, but the issue what I am seeing > is when trying to hit /update api on

RE: Building a web based search engine

2020-06-02 Thread Markus Jelsma
Hello, We have been building precisely that for over ten years now. The '10,000 foot level overview' is basically: * forget about Lucene for now, Solr uses it under the hood; * get Solr, and start it with the schema.xml file that comes with Nutch; * get Nutch, give it a set of domains or hosts

Re: Not all EML files are indexing during indexing

2020-06-02 Thread Walter Underwood
> On Jun 2, 2020, at 7:40 AM, Charlie Hull wrote: > > If it was me I'd probably build a standalone indexer script in Python that > did the file handling, called out to a separate Tika service for extraction, > posted to Solr. I would do the same thing, and I would base that script on Scrapy

Re: Not all EML files are indexing during indexing

2020-06-02 Thread Charlie Hull
Ah OK. I haven't used SimplePostTool myself and I note the docs say "View this not as a best-practice code example, but as a standalone example built with an explicit purpose of not having external jar dependencies." I'm wondering if it's some kind of synchronisation issue between new files

Building a web based search engine

2020-06-02 Thread Jim Anderson
Hi, I have been looking at solr, lucene and nutch websites and tutuorials for over a week now, experimenting and learning, but also frustrated be the fact the I am totally missing the 'how to' do what I want. I see a lot of examples of how to use each of the tools, but not how to put them all

Re: Not all EML files are indexing during indexing

2020-06-02 Thread Zheng Lin Edwin Yeo
Hi Charlie, The main code that is doing the indexing is from the Solr's SimplePostTools, but we have done some modification to it. The walking through a folder is done by PowerShell script, the extracting of the content from .eml file is from Tika that comes with Solr, and the images in the .eml

RE: Multiple Solr instances using same ZooKeepers

2020-06-02 Thread Gell-Holleron, Daniel
Many thanks for this information! -Original Message- From: Colvin Cowie Sent: 02 June 2020 09:46 To: solr-user@lucene.apache.org Subject: Re: Multiple Solr instances using same ZooKeepers You can specify a different "chroot" directory path in zookeeper for each cloud

Re: Highlighting values of non stored fields

2020-06-02 Thread Erick Erickson
Why do you think even variants need to be stored/highlighted? Usually when you store variants for ranking purposes those extra copies are invisible to the user. So most often people store exactly one copy of a particular field and highlight _that_ field in the return. So say my field is f1 and I

Re: Multiple Solr instances using same ZooKeepers

2020-06-02 Thread Colvin Cowie
You can specify a different "chroot" directory path in zookeeper for each cloud https://lucene.apache.org/solr/guide/8_5/setting-up-an-external-zookeeper-ensemble.html#using-a-chroot On Tue, 2 Jun 2020 at 09:33, Gell-Holleron, Daniel < daniel.gell-holle...@gb.unisys.com> wrote: > Hi there, > >

Multiple Solr instances using same ZooKeepers

2020-06-02 Thread Gell-Holleron, Daniel
Hi there, We are in the process of deploying Solr Cloud with CDCR. I would like to know if multiple instances of Solr (4 Solr servers for one instance, 4 for another instance) can use the same ZooKeeper servers? This would prevent us from needing multiple ZooKeepers servers to serve each

Highlighting values of non stored fields

2020-06-02 Thread mosheB
Our use case is as follow: We are indexing free text documents. Each document contains metadata fields (such as author, creation date...) which are kinda small, and one "big" field that holds the document's text itself. For ranking purpose each field is indexed in more then one "variation" and