solr 7.3 Combining Replica Types in a Cluster

2018-04-25 Thread kumar gaurav
Hi I am setting up new cluster in solr cloud 7.3 . i am just confused to make a combination of replica types . Real time indexing update/delete request will coming in cluster continuously but very less than select query . Cluster will need to handle high concurrent select queries than update. I a

Re: Solr 7.3 debug/explain with boost applied

2018-04-25 Thread Ryan Yacyshyn
Typically you would use a function query there to manipulate the score rather than a constant of 2. This doesn't do much but simply multiply all scores by that value. You can do something like boost=sqrt(popularity) if you wanted to boost on the popularity field for example. In both cases, however,

Re: Modify data-conf.xml on the runtime

2018-04-25 Thread Shawn Heisey
On 4/25/2018 4:12 AM, rameshkjes wrote: Actually I am trying to approach this problem from another way. I am taking user input from gui which is direcotory of dataset, and saving that path in properties file. Since I am using Maven, so I am able to access that path in my pom file using properties

Re: solr cell: write entire file content binary to index along with metadata

2018-04-25 Thread Rahul Singh
Lucene ( the major underlying Tech in SolR ) can handle any data, but it’s optimized to be an index , not a file store. Better to put that in another DB or file system like Cassandra, S3, etc. (better than SolR). In our experience , leveraging the tika binary / microservice as a pre-index proce

Re: CDCR broken for Mixed Replica Collections

2018-04-25 Thread Amrit Sarkar
Pardon, * I have added extensive tests for both the use-cases. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2 On Thu, Apr 26, 2018 at 3:

Re: CDCR broken for Mixed Replica Collections

2018-04-25 Thread Amrit Sarkar
Webster, I have patch uploaded to both Cdcr supporting Tlog: https://issues.apache.org/jira/browse/SOLR-12057 and core not getting failed while initializing for Pull type replicas: https://issues.apache.org/jira/browse/SOLR-12071 and awaiting feedback from open source community. The solution for p

Re: CDCR broken for Mixed Replica Collections

2018-04-25 Thread Erick Erickson
CDCR won't really ever make sense for PULL replicas since the PULL replicas have no tlog and don't do any indexing and can't ever become a leader seamlessly. As for plans to address TLOG replicas, patches are welcome if you have a need. That's really how open source works, people add functionality

Re: Preventing solr cache flush when committing

2018-04-25 Thread Erick Erickson
Had this typed up yesterday and forgot to send. "Is there no way to ensure that the top level filter caches are not expunged when some documents are added to the index and have the changes available at the same time?" no. And it's not something that you can do without major architectural changes.

System collection - lazy loading mechanism not working for custom UpdateProcessors?

2018-04-25 Thread Johannes Brucher
Hi all, I'm facing an issue regarding custom code inside a .system-collection and starting up a Solr Cloud cluster. I thought, like its stated in the documentation, that in case using the .system collection custom code is lazy loaded, because it can happen that a collection that uses custom cod

Re: Modify data-conf.xml on the runtime

2018-04-25 Thread rameshkjes
Actually I am trying to approach this problem from another way. I am taking user input from gui which is direcotory of dataset, and saving that path in properties file. Since I am using Maven, so I am able to access that path in my pom file using properties tag. So, now is it possible to use that

Re: solr cell: write entire file content binary to index along with metadata

2018-04-25 Thread Shawn Heisey
On 4/25/2018 4:02 AM, Lee Carroll wrote: *We don't recommend using solr-cell for production indexing.* Ok. Are the reasons for: Performance. I think we have rather modest index requirement (1000 a day... on a busy day) Security. The index workflow is, upload files to public facing server w

How does the stopwords file work?

2018-04-25 Thread lina Zhang
Hello, I am trying to create a domain specific search engine. As most of the collected information contains ‘NDIS', I added 'NDIS' to stopwords_en.txt. The field type is text_en, so it should use StopFilterFactory based on the schema. If I search for ‘ipad’, I can get the results I need. When I

CDCR broken for Mixed Replica Collections

2018-04-25 Thread Webster Homer
I was looking at SOLR-12057 According to the comment on the ticket, CDCR can not work when a collection has PULL Replicas. That seems like a MAJOR limitation to CDCR and PULL Replicas. Is this likely to be addressed in the future? CDCR currently is broken for TLOG replicas too. https://issues.apa

Re: SolrCloud DIH (Data Import Handler) MySQL 404

2018-04-25 Thread Mikhail Khludnev
Can you share more log lines around this odd NPE? It might be necessary to restart jvm with -verbose:class and look through its' output to find why it can't load this class. On Wed, Apr 25, 2018 at 11:42 AM, msaunier wrote: > Hello Shawn, > > I have install SolrCloud 7.3 on an other server and t

System collection - lazy loading mechanism not working for custom UpdateProcessors

2018-04-25 Thread Johannes Brucher
Hi all, I'm facing an issue regarding custom code inside a .system-collection and starting up a Solr Cloud cluster. I thought, like its stated in the documentation, that in case using the .system collection custom code is lazy loaded, because it can happen that a collection that uses custom cod

Re: SolrCloud cluster does not accept new documents for indexing

2018-04-25 Thread Denis Demichev
Shawn, Mikhail, Chris, Thank you all for your feedback. Unfortunately I cannot try your recommendations right away - this week is busy. Will post my results here next week. Regards, Denis On Tue, Apr 24, 2018 at 11:33 AM Shawn Heisey wrote: > On 4/24/2018 6:30 AM, Chris Ulicny wrote: > > I ha

Re: solr cell: write entire file content binary to index along with metadata

2018-04-25 Thread Lee Carroll
> > > > > *That's not usually the kind of information you want to have in a > Solrindex. Most of the time, there will be an entry in the Solr index > thattells the system making queries how to locate the actual data -- > afilename, a URL, a database lookup key, etc.* Agreed. The app will have a

Re: SolrCloud cluster does not accept new documents for indexing

2018-04-25 Thread Emir Arnautović
Hi Denis, Merge works on segments and depending on merge strategy it is triggered separately so there is no some queue between update executor and merge threads. Re SPM - I am using it on a daily bases for most of my consulting work and if you have SPM app you can invite me to it and I’ll take a

RE: SolrCloud DIH (Data Import Handler) MySQL 404

2018-04-25 Thread msaunier
Hello Shawn, I have install SolrCloud 7.3 on an other server and the problem not apear. I create a Jira Ticket ? But I have an other problem: Full Import failed:org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to PropertyWriter implementation:ZKPropertiesWriter at