Re: Long Running Data Import Handler - Notifications

2015-12-08 Thread Stefan Matheis
https://wiki.apache.org/solr/DataImportHandler#EventListeners might be worth a look -Stefan On Wed, Dec 9, 2015 at 2:51 AM, Walter Underwood wrote: > Not that I know of. I wrote a script to check the status and sleep until > done. Like this: > > SOLRURL='http://solr-master.prod2.cloud.cheggnet.

Re: capacity of storage a single core

2015-12-08 Thread Mugeesh Husain
@Upayavira, could you provice the any link, that issue has been resolved. >>So long as your joined-to collection is replicated across every box wher i can find this related link or example. -- View this message in context: http://lucene.472066.n3.nabble.com/capacity-of-storage-a-single-core-t

Re: Increasing Solr5 time out from 30 seconds while starting solr

2015-12-08 Thread Debraj Manna
. After failed attempt to start solr if I try to start solr again on same port it says solr is already running. Try running solr on different port. Can you let me know if it is possible to increase the timeout? So that I can observe how does it behave. On Dec 9, 2015 10:10 AM, "Rahul Ramesh" wrot

Re: Increasing Solr5 time out from 30 seconds while starting solr

2015-12-08 Thread Rahul Ramesh
Hi Debraj, I dont think increasing the timeout will help. Are you sure solr/ any other program is not running on 8789? Please check the output of lsof -i :8789 . Regards, Rahul On Tue, Dec 8, 2015 at 11:58 PM, Debraj Manna wrote: > Can someone help me on this? > On Dec 7, 2015 7:55 PM, "D" wro

Re: Long Running Data Import Handler - Notifications

2015-12-08 Thread Walter Underwood
Not that I know of. I wrote a script to check the status and sleep until done. Like this: SOLRURL='http://solr-master.prod2.cloud.cheggnet.com:6090/solr/textbooks/dataimport' while : ; do echo `date` checking whether Solr indexing is finished curl -s "${SOLRURL}" | fgrep '"status":"idle"

Long Running Data Import Handler - Notifications

2015-12-08 Thread Brian Narsi
Is there a way to receive notifications when a Data Import Handler finishes up and whether it succeeded or failed. (typically runs about an hour) Thanks

Re: secure solr 5.3.1

2015-12-08 Thread Ishan Chattopadhyaya
Right, as Bosco said, this has been tested well and supported on SolrCloud. It should be possible to run it in standalone mode, but it is not something that has been well test yet. On Tue, Dec 8, 2015 at 11:02 PM, Don Bosco Durai wrote: > It was tested and meant to work only in SolrCloud mode. >

Re: Solr memory usage

2015-12-08 Thread Erick Erickson
You're doing nothing wrong, that particular bit of advice has always needed a bit of explanation. Solr (well, actually Lucene) uses MMapDirectory for much of the index structure which uses the OS memory rather than the JVM heap. See Uwe's excellent: http://blog.thetaphi.de/2012/07/use-lucenes-mma

Re: solrconfig.xml - configuration scope

2015-12-08 Thread Erick Erickson
What specifically are you seeing? Most things are per-core as you surmised. There are a few things which, through interaction with Lucene global variables affect multiple cores, the one that comes to mind is maxBooleanClauses, where the value in the last core loaded "wins". There might be some ot

Solr memory usage

2015-12-08 Thread Steven White
Hi folks, My index size on disk (optimized) is 20 GB (single core, single index). I have a system with 64 GB of RAM. I start Solr with 24 GB of RAM. I have run load tests (up to 100 concurrent users) for hours where each user issuing unique searches (the same search is never executed again for

Re: Increasing Solr5 time out from 30 seconds while starting solr

2015-12-08 Thread Debraj Manna
Can someone help me on this? On Dec 7, 2015 7:55 PM, "D" wrote: > Hi, > > Many time while starting solr I see the below message and then the solr is > not reachable. > > debraj@boutique3:~/solr5$ sudo bin/solr start -p 8789 > Waiting to see Solr listening on port 8789 [-] Still not seeing Solr

Re: fuzzy searches and EDISMAX

2015-12-08 Thread Walter Underwood
You probably want to apply the patch for SOLR-629. We have this in production at Chegg. I’ve been trying to get this feature added to Solr for seven years. Not sure why it never gets approved. https://issues.apache.org/jira/browse/SOLR-629 wunde

fuzzy searches and EDISMAX

2015-12-08 Thread Felley, James
I am trying to build an edismax search handler that will allow a fuzzy search, using the "query fields" property (qf). I have two instances of SOLR 4.8.1, one of which has edismax "qf" configured with no fuzzy search ... ns_name^3.0 i_topic^3.0 i_object_type^3.0 ... And the other with a fuzzy

Re: Solr 5.2.1 deadlock on commit

2015-12-08 Thread Ali Nazemian
I did that already. The situation was worse. The autocommit part makes solr unavailable. On Dec 8, 2015 7:13 PM, "Emir Arnautovic" wrote: > Hi Ali, > Can you try without explicit commits and see if threads will still be > blocked. > > Thanks, > Emir > > On 08.12.2015 16:19, Ali Nazemian wrote: >

solrconfig.xml - configuration scope

2015-12-08 Thread Fitzpatrick, Adrian
Hi, This is probably a very basic question that has been asked many times before - apologies in advance if so! I'm looking to validate whether something I **think** I have observed when using Solr is a known behaviour: >From my read of the docs etc. it was my understanding that solrconfig.xml

Re: capacity of storage a single core

2015-12-08 Thread Upayavira
I understood that on later Solrs, those join issues have been (partially) resolved. So long as your joined-to collection is replicated across every box, you should be good. Upayavira On Tue, Dec 8, 2015, at 04:17 PM, Mugeesh Husain wrote: > Thanks Toke Eskildsen, > > Actually i need to join on

Re: secure solr 5.3.1

2015-12-08 Thread Don Bosco Durai
It was tested and meant to work only in SolrCloud mode. On Tue, Dec 8, 2015 at 9:30 AM -0800, "kostali hassan" wrote: - Kerberos authentication : work in SolrCloud or standalone mode but the documentation is not clear - https://cwiki.apache.org/confluence/display/s

Re: secure solr 5.3.1

2015-12-08 Thread kostali hassan
- Kerberos authentication : work in SolrCloud or standalone mode but the documentation is not clear - https://cwiki.apache.org/confluence/display/solr/Kerberos+Authentication+Plugin?focusedCommentId=61

Re: secure solr 5.3.1

2015-12-08 Thread Don Bosco Durai
Not sure exactly what you mean here. Even if you are running in SolrCloud, you can access it using URL. So there won't be any change on the client side. Bosco On Tue, Dec 8, 2015 at 2:03 AM -0800, "kostali hassan" wrote: if I run solr in SolrCloud mode , my web hosting shoud be C

Re: Issue with Querying Solr

2015-12-08 Thread Don Bosco Durai
You only have 261 documents. That shouldn't be a problem, unless your document size is huge. I feel, the problem still exists somewhere. You have just deferred it... Bosco On Tue, Dec 8, 2015 at 6:48 AM -0800, "Salman Ansari" wrote: Thanks Andrea and Alexandre for your responses.

Re: Highlighting large documents

2015-12-08 Thread Scott Stults
There are two things going on that you should be aware of. The first is, Solr Highlighting is mainly concerned about putting a representative snippet in a results listing. There are a couple of configuration changes you need to do if you want to highlight a whole document, like setting the fragList

Re: Solr 5.2.1 Most solr nodes in a cluster going down at once.

2015-12-08 Thread Erick Erickson
Phillippa: You simply cannot continue adding documents, increasing memory, adding more documents, increasing memory forever, if for no other reason than you'll eventually hit such large GC pauses that your query performance will suffer greatly. I'd _strongly_ advise you to pick a number of docs (l

Re: capacity of storage a single core

2015-12-08 Thread Jack Krupansky
Generally, you will be resource limited (memory, cpu) rather than by some arbitrary numeric limit (like 2 billion.) My personal general recommendation is for a practical limit is 100 million documents on a machine/node. Depending on your data model and actual data that number could be higher or lo

Re: capacity of storage a single core

2015-12-08 Thread Mugeesh Husain
Thanks Toke Eskildsen, Actually i need to join on my core, that why i am going to solrlcoud(join does not support in solrlcoud) Is there any alternate way to doing it ? -- View this message in context: http://lucene.472066.n3.nabble.com/capacity-of-storage-a-single-core-tp4244197p4244248.html

Re: Solr 5.2.1 deadlock on commit

2015-12-08 Thread Emir Arnautovic
Hi Ali, Can you try without explicit commits and see if threads will still be blocked. Thanks, Emir On 08.12.2015 16:19, Ali Nazemian wrote: The indexing load is as follows: - Around 1000 documents every 5 mins. - The indexing speed is slow because of the complicated analyzer which is applied

Re: Solr 5.2.1 deadlock on commit

2015-12-08 Thread Ali Nazemian
The indexing load is as follows: - Around 1000 documents every 5 mins. - The indexing speed is slow because of the complicated analyzer which is applied to each document. It takes around 60 seconds to index 1000 documents with applying this analyzer (It is really slow. However, based on the analyzi

Re: Solr 5.2.1 deadlock on commit

2015-12-08 Thread Ali Nazemian
Dear Emir, Hi, There are some cases that I have soft commit in my application. However, the bulk update part has only hard commit for a bulk of 2500 documents. Here are some information about the whole indexing/updating scenarios: - Indexing part uses soft commit. - In a single update cases soft co

Re: Issue with Querying Solr

2015-12-08 Thread Salman Ansari
Thanks Andrea and Alexandre for your responses. Indeed it was the problem that Solr.NET was returning many rows (as I captured this by fiddler). Currently, my setup has only 500MB of JVM (which I will definitely increase) but at least I found the culprit by reducing the number of rows returned. Re

Re: Issue with Querying Solr

2015-12-08 Thread Alexandre Rafalovitch
Solr by default only returns 10 rows. SolrNet by default returns many rows. I don't know why that would cause OOM, but that's definitely your difference unless you dealt with it: https://github.com/mausch/SolrNet/blob/master/Documentation/Querying.md#pagination Regards, Alex. Newsletter an

Re: Issue with Querying Solr

2015-12-08 Thread Andrea Gazzarini
I would set up logging in the admin console (queries should be logged out by default), and then check the difference between the two invocations. I don't believe the two clients are doing the *same* thing, there should be some difference. Another chance could be that the OOM is just behind the cor

Re: Solr 5.2.1 Most solr nodes in a cluster going down at once.

2015-12-08 Thread Emir Arnautovic
Hi Philippa, It's more likely that this is related to index size/content + queries than to Solr version. Did you experience issues immediately after upgrade? Check slow queries log and see if there are some extremely slow queries. Check cache sizes and calculate how much they take. Increasing

Re: capacity of storage a single core

2015-12-08 Thread Toke Eskildsen
On Tue, 2015-12-08 at 05:18 -0700, Mugeesh Husain wrote: > Capacity regarding 2 simple question: > > 1.) How many document we could store in single core(capacity of core > storage) There is hard limit of 2 billion documents. > 2.) How many core we could create in a single server(single node clus

Re: question: partialResults true with pagination

2015-12-08 Thread Toke Eskildsen
On Tue, 2015-12-08 at 18:42 +0530, Vibhor Goel wrote: > i am using single standalone solr instance. Some of my queries are taking > long time due to large number of result documents. I am using timeout > option and it returns me partial results. Don't request a large number of documents at a time.

question: partialResults true with pagination

2015-12-08 Thread Vibhor Goel
hey, i am using single standalone solr instance. Some of my queries are taking long time due to large number of result documents. I am using timeout option and it returns me partial results. i found out partial results returns a random set of resultant documents, in sorted order. My query is whe

Issue with Querying Solr

2015-12-08 Thread Salman Ansari
Hi, I have created a cluster of Solr and Zookeepers on 3 machines connected together. Currently, I am facing a weird problem. My collection has only 261 documents and when I try to query the documents using the browser such as http:// [ASolrServerInTheCluster]:8983/solr/sabrLocationsStore/select?

Re: Solr Auto-Complete

2015-12-08 Thread Salman Ansari
Thanks Alexandre. I think it is clear. On Sun, Dec 6, 2015 at 5:21 PM, Alexandre Rafalovitch wrote: > For suffix matches, you copy text the field and in the different type add > string reversal for both index and query portions. So you are doing prefix > matching algorithm but on reversed string

Re: Solr 5.2.1 Most solr nodes in a cluster going down at once.

2015-12-08 Thread philippa griggs
Hello Emir, The query load is around 35 requests per min on each shard, we don't document route so we query the entire index. We do have some heavy queries like faceting and its possible that a heavy queries is causing the nodes to go down- we are looking into this. I'm new to solr so this co

capacity of storage a single core

2015-12-08 Thread Mugeesh Husain
Capacity regarding 2 simple question: 1.) How many document we could store in single core(capacity of core storage) 2.) How many core we could create in a single server(single node cluster) Thanks, Mugeesh -- View this message in context: http://lucene.472066.n3.nabble.com/capacity-of-stora

Re: Solr 5.2.1 Most solr nodes in a cluster going down at once.

2015-12-08 Thread Emir Arnautovic
Hi Phillippa, My guess would be that you are running some heavy queries (faceting/deep paging/large pages) or have high query load (can you give bit details about load) or have misconfigured caches. Do you query entire index or you have query routing? You have big machine and might consider r

Re: Use multiple istance simultaneously

2015-12-08 Thread Emir Arnautovic
Can you tolerate having indices in different state or you plan to keep them in sync with controlled commits. DIH-ing content from source when new machine is needed will probably be slow and I am afraid that you will end up simulating master-slave model (copying state from one of healthy nodes

Re: secure solr 5.3.1

2015-12-08 Thread kostali hassan
if I run solr in SolrCloud mode , my web hosting shoud be Cloud web hosting? or dont need a web server having cloud..? 2015-12-08 1:58 GMT+00:00 Don Bosco Durai : > Have you considered running your Solr as SolrCloud with embedded zookeeper? > > If you do, you have multiple options. Basic Auth, Ke

multiword synonym and ManagedSynonymFilterFactory

2015-12-08 Thread Suad Kozlic
Hello, did someone tested this combination ? I am not getting any result. -- Suad Kozlić, mr.el.-dipl.ing.el.

Re: Solr 5.2.1 Most solr nodes in a cluster going down at once.

2015-12-08 Thread philippa griggs
Hello Erick, Thanks for your reply. We have one collection and are writing documents to that collection all the time- it peaks at around 2,500 per minute and dips to 250 per minute, the size of the document varies. On each node we have around 55,000,000 documents with a data size of 43G loc

Re: Solr 5.2.1 deadlock on commit

2015-12-08 Thread Emir Arnautovic
Hi Ali, This thread is blocked because cannot obtain update lock - in this particular case when doing soft commit. I am guessing that there others are blocked for the same reason. Can you tell us bit more about your setup and indexing load and procedure? Do you do explicit commits? Regards, E