Re: SolrCloud is sick.

2019-11-02 Thread Noble Paul
Solr has to do more than Lucene. A Lucene user is mostly a developer who reads javadocs. A Solr user's touch points are * Public API * Ref guide * publicly visible files (in ZK as well as file system) * What to see/look for in the log files to debug issues Then we have more nuanced touch points s

Re: The Gradle Build is HERE Oct 23 2019

2019-11-02 Thread David Smiley
What's the status of this Mark? Sorry to ask but someone had to. It's obvious your attention is on a big Solr effort right now but it's a shame this didn't get committed, even if imperfect. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed, Oct 2

Re: SolrCloud is sick.

2019-11-02 Thread David Smiley
Yeah we do a bad job of the things you listed Noble. :-( My colleagues want pointers to internal docs but the sad reality is there isn't any. You may notice I'm a stickler in my code reviews for requiring javadocs on all top level classes. I think more javadocs and code comments would be very

Re: SolrCloud is sick.

2019-11-02 Thread Mark Miller
Things are also counterintuitive. The more you fix and the faster things work the more things fail. It’s like rings of hell. Mark On Sat, Nov 2, 2019 at 10:29 PM Mark Miller wrote: > And it didnt get any easier. What I did about it is kill myself multiple > times over 2 years for weeks on end o

Re: SolrCloud is sick.

2019-11-02 Thread Mark Miller
And it didnt get any easier. What I did about it is kill myself multiple times over 2 years for weeks on end of torturing my wife. And I found a million problems, a million bugs, a million terrible inefficiencies. And I fixed and lost countless of them friggen twice. And didnt lose tons of the work

Re: SolrCloud is sick.

2019-11-02 Thread Mark Miller
I mean the reality is - why do we not have just a single watcher per node pulling in state. We are we not tracking and minimizing state transfers and changes? Why are we not measuring the time it takes to round trip a state.json and adjusting? Looking at load to adjust overseerish duties and leader

Re: SolrCloud is sick.

2019-11-02 Thread Mark Miller
Not much. Something you can understand. How about tests < 10 seconds fail or not. Good logging and as a backup good debug logging. Docs on how things are designed to work? Tracking of all important operations and how long they take with tight cutoffs? Proper response to interruption 100% of the tim

Re: SolrCloud is sick.

2019-11-02 Thread Mark Miller
I am not personally attacking anyone. Everyone here does good work in one way or another. However, if you cross a bar of prolific vs attention to detail, tests, doc, and things beyond just you, I do hold you to a higher standard. You are super prolific and your care for others following your foo

Re: SolrCloud is sick.

2019-11-02 Thread Noble Paul
Hi, I believe there is a consensus on what is wrong with the way we have built the cluster state and overseer. We need to focus a bit more on the design aspect. Design, according to me, has the following elements: * How does it work? * What are the performance characteristics? Can it be done mor

[ANNOUNCE] Apache Solr 8.3.0 released

2019-11-02 Thread Ishan Chattopadhyaya
## 2 November 2019, Apache Solr™ 8.3.0 available The Lucene PMC is pleased to announce the release of Apache Solr 8.3.0. Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, fac

[ANNOUNCE] Apache Lucene 8.3.0 released

2019-11-02 Thread Ishan Chattopadhyaya
## 2 November 2019, Apache Lucene™ 8.3.0 available The Lucene PMC is pleased to announce the release of Apache Lucene 8.3.0. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires

Re: SolrCloud is sick.

2019-11-02 Thread Scott Blum
Very much agreed. I've been trying to figure out for a long time what is the point in having a replica DOWN state that has to be toggled (DOWN and then UP!) every time a node restarts. Considering that we could just combine ACTIVE and `live_nodes` to understand whether a replica is available. It

Re: SolrCloud is sick.

2019-11-02 Thread Mark Miller
Give me a short bit to follow up and I will lay out my case and proposal. Everyone is then free to decide that we need to do something drastic or that I'm wrong and we should just continue down the same road. If that's the case, a lot of your work will get a lot easier and less impeded by me and w

(Solr) Can we include an example dataset under CC license?

2019-11-02 Thread Alexandre Rafalovitch
Hi, Somebody more familiar with license situation may be able to help me here. I think it would be nice if Solr shipped with a more comprehensive example. I have used and love the generated names from https://www.fakenamegenerator.com/ . They have good fields, but also support different languages