Re: DIH: Create Child Documents in ScriptTransformer

2019-09-18 Thread Mikhail Khludnev
Hello, Jörn. Have you tried to find a parent doc in the context which is passed as a second argument into ScriptTransformer? On Wed, Sep 18, 2019 at 9:56 PM Jörn Franke wrote: > > Hi, > > I load a set of documents. Based on these documents some logic needs to be > applied to split them into chapt

Re: Question about "No registered leader" error

2019-09-18 Thread Hongxu Ma
@Shawn @Erick Thanks for your kindle help! No OOM log and I confirm there was no OOM happened. My ZK ticktime is set to 5000, so 5000*20 = 100s > 60s, and I checked solr code: the leader waiting time: 4000ms is a const variable, is not configurable. (why it isn't a configurable param?) My solr

Re: DIH: Create Child Documents in ScriptTransformer

2019-09-18 Thread Jörn Franke
I fully agree. However, I am just curious to see the limits. > Am 18.09.2019 um 23:33 schrieb Erick Erickson : > > When it starts getting complex, I usually move to SolrJ. You say > you're loading documents, so I assume Tika is in the mix too. > > Here's a blog on the topic so you an see how to

Solr query fetching different results

2019-09-18 Thread Jayadevan Maymala
Hi all, There is something "strange' happening in our Solr cluster. If I execute a query from the server, via solarium client, I get one result. If I execute the same or similar query from admin Panel, I get another result. If I go to Admin Panel - Collections - Select Collection and click "Reloa

Re: Custom update processor not kicking in

2019-09-18 Thread Rahul Goswami
Eric, Markus, Thank you for your inputs. I made sure that the jar file is found correctly since the core reloads fine and also prints the log lines from my processor during update request (getInstane() method of the update factory). The reason why I want to insert the processor between distributed

Re: DIH: Create Child Documents in ScriptTransformer

2019-09-18 Thread Erick Erickson
When it starts getting complex, I usually move to SolrJ. You say you're loading documents, so I assume Tika is in the mix too. Here's a blog on the topic so you an see how to get started... https://lucidworks.com/post/indexing-with-solrj/ Best, Erick On Wed, Sep 18, 2019 at 2:56 PM Jörn Franke

Re: Custom update processor not kicking in

2019-09-18 Thread Erick Erickson
It Depends (tm). This is a little confused. Why do you have distributed processor in stand-alone Solr? Stand-alone doesn't, well, distribute updates so that seems odd. Do try switching it around and putting it on top, this should be OK since distributed is irrelevant. You can also just set a break

RE: Custom update processor not kicking in

2019-09-18 Thread Markus Jelsma
Hello Rahul, I don't know why you don't see your logs lines, but if i remember correctly, you must put all custom processors above Log, Distributed and Run, at least i remember i read it somewhere a long time ago. We put all our custom processors on top of the three default processors and they

Custom update processor not kicking in

2019-09-18 Thread Rahul Goswami
Hello, I am using solr 7.2.1 in a standalone mode. I created a custom update request processor and placed it between the distributed processor and run update processor in my chain. I made sure the chain is invoked since I see log lines from the getInstance() method of my processor factory. But I d

DIH: Create Child Documents in ScriptTransformer

2019-09-18 Thread Jörn Franke
Hi, I load a set of documents. Based on these documents some logic needs to be applied to split them into chapters (this is done). One whole document is loaded as a parent. Chapters of the whole document + metadata should be loaded as child documents of this parent. I want to now collect informati

Re: Looking for some guide on how to do the index on a folder of data in Solr 7.2

2019-09-18 Thread Angie Rabelero
post.jar was removed in solr 5 I think. There are ways to index your files you can use the post tool https://lucene.apache.org/solr/guide/8_1/post-tool.html , or you can try tika to extract text from docuements, or you can use curl https:

Looking for some guide on how to do the index on a folder of data in Solr 7.2

2019-09-18 Thread Raymond Xie
I remember there is a post.jar in Cloudera's Solr (very old version) that allows indexing doc like: java -Dtype=application/json -Drecursive -Durl=" http://localhost:8983/solr/indexer_odac/update/json/docs"; -jar post.jar /tmp/solr_data/data I don't see the post.jar in Solr 7.2 anymore, it is jus

Clarification on Solr BlendedInfixLookupFactory with context filtering (suggest.cfq)

2019-09-18 Thread Baskar K
Hi Team, In our project we are using the Apache Solr and we really thankful to the framework. We need a clarification on solr's Blended Infix Lookup Factory with context filtering. We checked the documentation seems its supporting the context filtering and we tried to integrate in our applicatio

Re: Moving a collection to a specific machine?

2019-09-18 Thread Shawn Heisey
On 9/18/2019 8:46 AM, Koen De Groote wrote: I'm looking into how I can move a collection to a specific machine. I have a set of machines, some with more/less CPU/RAM/Disk Space. Basically, I want to either move the specific collection to the appropriate machine(s). Or somehow say in the configu

Moving a collection to a specific machine?

2019-09-18 Thread Koen De Groote
Hello, I'm looking into how I can move a collection to a specific machine. I have a set of machines, some with more/less CPU/RAM/Disk Space. Basically, I want to either move the specific collection to the appropriate machine(s). Or somehow say in the configuration "collection X is to only exist

Re: Question about "No registered leader" error

2019-09-18 Thread Erick Erickson
Check whether the oom killer script was called. If so, there will be log files obviously relating to that. I've seen nodes mysteriously disappear as a result of this with no message in the regular solr logs. If that's the case, you need to increase your heap. Erick On Wed, Sep 18, 2019 at 8:21 AM

Re: Question about "No registered leader" error

2019-09-18 Thread Shawn Heisey
On 9/18/2019 6:11 AM, Shawn Heisey wrote: On 9/17/2019 9:35 PM, Hongxu Ma wrote: My questions:    *   Is this error possible caused by "long gc pause"? my solr zkClientTimeout=6 It's possible.  I can't say for sure that this is the issue, but it might be. A followup. I was thinking a

Re: Question about "No registered leader" error

2019-09-18 Thread Shawn Heisey
On 9/17/2019 9:35 PM, Hongxu Ma wrote: My questions: * Is this error possible caused by "long gc pause"? my solr zkClientTimeout=6 It's possible. I can't say for sure that this is the issue, but it might be. * If so, how can I prevent this error happen? My thoughts: using G