Re: Latency in updates.

2018-10-12 Thread Erick Erickson
Oh, and the other thing that occurs is if too many merges stack up, incoming updates are blocked which may be another source of delays. But still, updates to existing docs and adding new docs shouldn't be any different. Best, Erick On Fri, Oct 12, 2018 at 10:42 AM Shawn Heisey wrote: > > On

Re: Solr JVM Memory settings

2018-10-12 Thread Christopher Schultz
Hendrik, On 10/12/18 02:36, Hendrik Haddorp wrote: > Those constraints can be easily set if you are using Docker. The > problem is however that at least up to Oracle Java 8, and I believe > quite a bit further, the JVM is not at all aware about

Re: Latency in updates.

2018-10-12 Thread Shawn Heisey
On 10/12/2018 8:46 AM, root23 wrote: We are having an issue where we are seeing latency in updates. We are on Solr 6. The new documents are reflected right away, but updates to existing documents take anywhere from 30 seconds to a couple of minutes. These are some relevant settings from our solrconfig.

Re: Default merge policy

2018-10-12 Thread Shawn Heisey
On 10/12/2018 8:32 AM, root23 wrote: We are on Solr 6, and as per the documentation I think Solr 6 uses TieredMergePolicyFactory. However we have not specified it in the following way 10 10 We still use 25, which I understand is not used by TieredMergePolicyFactory. Supplementing

Re: Latency in updates.

2018-10-12 Thread Erick Erickson
bq. The new documents are reflected right away, but updates to existing documents take anywhere from 30 seconds to a couple of minutes. Certainly your soft commit interval is far too short for comfort. If you absolutely can't lengthen it then you should not use any caching/autowarming. But I

highlighting more-like-this

2018-10-12 Thread Matt Work Coarr
I want to get highlighted results for more-like-this queries. More like this doesn't support highlighting. So what I did was run a more-like-this query (I have the source document A and say I get three similar documents back: A1, A2, and A3). I then create a second query where I use the contents

Re: Default merge policy

2018-10-12 Thread Erick Erickson
bq. However we have not specified it in the following way Is that a typo and you mean "have now specified"? There's code in SolrIndexConfig: if (policy instanceof TieredMergePolicy) { if (mergeFactor != -1) { tieredMergePolicy.setMaxMergeAtOnce(mergeFactor);
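The code quoted in the message above is cut off in the archive preview. As a minimal sketch of only the part that is visible (a legacy mergeFactor value, when set, is copied onto the tiered policy's maxMergeAtOnce), the following uses a stand-in class rather than Lucene's TieredMergePolicy. The names StubTieredPolicy, applyLegacyMergeFactor, and UNSET, and the default of 10, are assumptions made for illustration; whatever follows the visible call in SolrIndexConfig is not reproduced here.

```java
class MergeFactorSketch {
    /** Stand-in for a tiered merge policy. This is NOT Lucene's TieredMergePolicy. */
    static class StubTieredPolicy {
        int maxMergeAtOnce = 10; // hypothetical default, for illustration only

        void setMaxMergeAtOnce(int value) {
            maxMergeAtOnce = value;
        }
    }

    /** Sentinel meaning "mergeFactor was not configured", mirroring the -1 check in the quoted code. */
    static final int UNSET = -1;

    /** Copies a configured mergeFactor onto the policy; leaves the policy untouched when unset. */
    static void applyLegacyMergeFactor(StubTieredPolicy policy, int mergeFactor) {
        if (mergeFactor != UNSET) {
            policy.setMaxMergeAtOnce(mergeFactor);
        }
    }

    public static void main(String[] args) {
        StubTieredPolicy configured = new StubTieredPolicy();
        applyLegacyMergeFactor(configured, 25);
        System.out.println("mergeFactor=25 -> maxMergeAtOnce=" + configured.maxMergeAtOnce);

        StubTieredPolicy untouched = new StubTieredPolicy();
        applyLegacyMergeFactor(untouched, UNSET);
        System.out.println("mergeFactor unset -> maxMergeAtOnce=" + untouched.maxMergeAtOnce);
    }
}
```

If the real code behaves as the visible fragment suggests, a mergeFactor of 25 would still influence a tiered policy even though the setting predates it.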

Latency in updates.

2018-10-12 Thread root23
Hi all, We are having an issue where we are seeing latency in updates. We are on Solr 6. The new documents are reflected right away, but updates to existing documents take anywhere from 30 seconds to a couple of minutes. These are some relevant settings from our solrconfig. Our autosoft commit time is

Default merge policy

2018-10-12 Thread root23
Hi all, I am a little bit confused. We are on Solr 6, and as per the documentation I think Solr 6 uses TieredMergePolicyFactory. However we have not specified it in the following way 10 10 We still use 25, which I understand is not used by TieredMergePolicyFactory. So my confusion is that

Re: SolrJ does not use HTTP proxy anymore in 7.5.0 after update from 6.6.5

2018-10-12 Thread Michael Joyner
Would you supply the snippet for the custom HttpClient to get it to honor/use proxy? Thanks! On 10/10/2018 10:50 AM, Andreas Hubold wrote: Thank you, Shawn. I'm now using a custom HttpClient that I create in a similar manner as SolrJ, and it works quite well. Of course, a fix in a future

Something odd with async request status for BACKUP operation on Collections API

2018-10-12 Thread Shawn Heisey
I'm working on reproducing a problem reported via the IRC channel. Started a test cloud with 7.5.0. Initially with two nodes, then again with 3 nodes. Did this on Windows 10. Command to create a collection: bin\solr create -c test2 -shards 30 -replicationFactor 2 For these URLs, I dropped

Re: DIH for TikaEntityProcessor

2018-10-12 Thread Kamuela Lau
Glad to help :) On Fri, Oct 12, 2018 at 21:10 Martin Frank Hansen (MHQ) wrote: > You sir just made my day!!! > > It worked!!! Thanks a million! > > > Martin Frank Hansen, > > -Original message- > From: Kamuela Lau > Sent: October 12, 2018 11:41 > To: solr-user@lucene.apache.org > Subject: Re: DIH

Re: CMS GC - Old Generation collection never finishes (due to GC Allocation Failure?)

2018-10-12 Thread Dominique Bejean
Hi, 1/ As others have already said, my first action would be to understand why you need so much heap. The first step is to cap your heap size at 31 GB (or obviously less if possible). https://blog.codecentric.de/en/2014/02/35gb-heap-less-32gb-java-jvm-memory-oddities/ Can you
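The usual reason given for staying under roughly 32 GB is that HotSpot stops using compressed object pointers above that size, so a slightly larger heap can hold fewer objects than a slightly smaller one. A minimal sketch for checking what the running JVM actually chose, assuming a HotSpot-based JVM (the class and method names CompressedOopsCheck and compressedOopsSetting are made up for this example; on other JVMs the method simply reports "unknown"):

```java
import java.lang.management.ManagementFactory;
import com.sun.management.HotSpotDiagnosticMXBean;

class CompressedOopsCheck {
    /** Returns "true" or "false" for the UseCompressedOops VM option, or "unknown" if it cannot be read. */
    static String compressedOopsSetting() {
        try {
            HotSpotDiagnosticMXBean bean =
                    ManagementFactory.getPlatformMXBean(HotSpotDiagnosticMXBean.class);
            if (bean == null) {
                return "unknown"; // not a HotSpot JVM
            }
            return bean.getVMOption("UseCompressedOops").getValue();
        } catch (RuntimeException e) {
            // The option does not exist on this JVM (for example a 32-bit VM) or the bean is unavailable.
            return "unknown";
        }
    }

    public static void main(String[] args) {
        System.out.println("UseCompressedOops: " + compressedOopsSetting());
    }
}
```

Running this with the same heap flags as the Solr process shows whether a given -Xmx value has crossed the threshold on that machine.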

SV: DIH for TikaEntityProcessor

2018-10-12 Thread Martin Frank Hansen (MHQ)
You sir just made my day!!! It worked!!! Thanks a million! Martin Frank Hansen, -Original message- From: Kamuela Lau Sent: October 12, 2018 11:41 To: solr-user@lucene.apache.org Subject: Re: DIH for TikaEntityProcessor Also, just wondering, have you tried to specify

SV: DIH for TikaEntityProcessor

2018-10-12 Thread Martin Frank Hansen (MHQ)
Hi Kamuela, Thanks for your answer. I still get the same error, so I think I will try with the tech-products example to see if it works there, as Alexandre suggests in the mail above. Martin Frank Hansen, -Original message- From: Kamuela Lau Sent: October 12, 2018 11:38 To:

Re: DIH for TikaEntityProcessor

2018-10-12 Thread Alexandre Rafalovitch
Solr ships with DIH Tika example that seems 90% identical to yours. Can you get that to run? If it works, then you can focus on the 10% difference. Perhaps it is explicit dataSource=null in the outer entity? Or maybe format=text on the inner one. Regards, Alex On Fri, Oct 12, 2018, 3:11

Re: ComplexPhraseQParser vs phrase slop

2018-10-12 Thread Yogendra Kumar Soni
{!complexphrase inOrder=false}f: ( "cat jump"~2 ) This query will return more results because, with inOrder=false, it also considers "jump cat"~2. inOrder=true may give you the same results as f: ( "cat jump"~2), or try f: ( "cat jump"~2 OR "jump cat"~2) to get the same results as {!complexphrase

Re: DIH for TikaEntityProcessor

2018-10-12 Thread Kamuela Lau
Also, just wondering, have you tried to specify dataSource="bin" for read_file? On Fri, Oct 12, 2018 at 6:38 PM Kamuela Lau wrote: > Hi, > > I was unable to reproduce the error that you got with the information > provided. > Below are the data-config.xml and managed-schema fields I used;

Re: DIH for TikaEntityProcessor

2018-10-12 Thread Kamuela Lau
Hi, I was unable to reproduce the error that you got with the information provided. Below are the data-config.xml and managed-schema fields I used; the data-config is mostly the same (I think that BinFileDataSource doesn't actually require a dataSource, so I think it's safe to put

Is that a simple way to start a Mini Solr cluster in other project unittest?

2018-10-12 Thread zhenyuan wei
Hi all, I found it too troublesome to start a Solr mini cluster in my project; MiniSolrCloudCluster has too many properties tied to the folders of the Solr source project. Is there a simple way to start a mini Solr cluster outside of the Solr project, such as in my own custom project?

SV: DIH for TikaEntityProcessor

2018-10-12 Thread Martin Frank Hansen (MHQ)
Hi again, Can anybody help me? Any suggestions as to why I am getting the error below? Martin Frank Hansen, Senior Data Analytiker Data, IM & Analytics Lautrupparken 40-42, DK-2750 Ballerup E-mail m...@kmd.dk Web

Re: Solr JVM Memory settings

2018-10-12 Thread Hendrik Haddorp
Those constraints can be easily set if you are using Docker. The problem, however, is that at least up to Oracle Java 8, and I believe quite a bit further, the JVM is not at all aware of those limits. That's why when running Solr in Docker you really need to make sure that you set the memory
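One way to see what the message above describes is to print the limits the JVM itself believes it has and compare them with the container's configured limits. This is a minimal sketch using only standard java.lang.Runtime calls; the class name HeapCheck and the helper maxHeapMiB are made up for this example.

```java
class HeapCheck {
    /** Maximum heap the running JVM will attempt to use, in mebibytes. */
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        // Compare these numbers with the memory and CPU limits set on the container.
        System.out.println("Max heap (MiB): " + maxHeapMiB());
        System.out.println("Available processors: " + Runtime.getRuntime().availableProcessors());
    }
}
```

If the reported maximum heap is larger than the container's memory limit, the JVM sized itself from the host rather than from the container, and an explicit heap setting is the safer choice.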