Re: Partial sentence match with block join

2015-12-15 Thread Upayavira
Can you give an example? I cannot understand what you mean from your description below. Thx! On Wed, Dec 16, 2015, at 12:42 AM, Yangrui Guo wrote: > This will be a very common situation. Amazon and Google now display > keywords missing in the document. However it seems that Solr parent-child > s

Re: Solr High Availability

2015-12-15 Thread Jack Krupansky
There is no HA with a single replica for each shard. Replication factor must be at least 2 for HA. -- Jack Krupansky On Wed, Dec 16, 2015 at 12:38 AM, Peter Tan wrote: > Hi Jack, What happens when there is only one replica setup? > > On Tue, Dec 15, 2015 at 9:32 PM, Jack Krupansky > wrote: > >

Re: Highlighting large documents

2015-12-15 Thread Zheng Lin Edwin Yeo
Hi all, Thank you for all the information. I have set the parameter to -1, and the highlighting is working fine now. Regards, Edwin On 14 December 2015 at 18:03, Jens Brandt wrote: > Hi Edwin, > > you are limiting the portion of the document analyzed for highlighting in > your solrconfig.xm

Re: Solr High Availability

2015-12-15 Thread Peter Tan
Hi Jack, What happens when there is only one replica setup? On Tue, Dec 15, 2015 at 9:32 PM, Jack Krupansky wrote: > Solr Cloud provides HA when you configure at least two replicas for each > shard and have at least 3 zookeepers. That's it. No deck or detail document > is needed. > > > > -- Jack

Re: Solr High Availability

2015-12-15 Thread Jack Krupansky
Solr Cloud provides HA when you configure at least two replicas for each shard and have at least 3 zookeepers. That's it. No deck or detail document is needed. -- Jack Krupansky On Tue, Dec 15, 2015 at 9:07 PM, wrote: > Hi Team, > > Can you help me in understanding in achieving the Solr High
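
As a rough sketch of the "at least two replicas for each shard" rule, the snippet below builds a Collections API CREATE request with replicationFactor=2. The host, port, collection name, and shard count are assumptions made up for illustration; the ZooKeeper ensemble (three or more nodes) is configured separately and is not shown.

```python
from urllib.parse import urlencode


def create_collection_url(base_url, name, num_shards, replication_factor):
    """Build a Collections API CREATE URL, refusing non-HA replica counts."""
    if replication_factor < 2:
        # One replica per shard means losing a node loses the shard.
        raise ValueError("HA needs at least two replicas per shard")
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
    }
    return f"{base_url}/admin/collections?{urlencode(params)}"


# Hypothetical host and collection name.
url = create_collection_url("http://localhost:8983/solr", "products", 2, 2)
print(url)
```

The guard only encodes the rule from the message above; it does not check that the replicas actually land on different nodes.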

Re: Solr 5 upgrade

2015-12-15 Thread bharat jangid
richardg dvdempire.com> writes: > > Ubuntu 14.04.02 > Trying to install solr 5 following this: > https://cwiki.apache.org/confluence/display/solr/Upgrading+a+Solr+4.x+Cluster+to+Solr+5.0 > > I keep getting "this script requires extracting a war file with either the > jar or unzip utility, pl

Solr High Availability

2015-12-15 Thread k.viswanath.bharathi
Hi Team, Can you help me in understanding in achieving the Solr High Availability . Appreciate you have a detail document or Deck on more details. Thank you Viswanath Bharathi Accenture | Delivery Centres for Technology in India CDC 2, Chennai, India Mobile: +91 9886259010 www.accenture.com

Re: Partial sentence match with block join

2015-12-15 Thread Yangrui Guo
This will be a very common situation. Amazon and Google now display keywords missing in the document. However it seems that Solr parent-child structure requires to use "AND" to confine all terms appear inside a single child document, otherwise it will totally disregard the parent-child structure. I

Re: Partial sentence match with block join

2015-12-15 Thread Jack Krupansky
Set the default operator to OR and optionally set the mm parameter to 2 to require at least two of the query terms to match, and don't quote the terms as a phrase unless you want an exact (optionally sloppy) match. Interesting example since I'll bet there are a lot of us who still think of the com
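
A minimal sketch of the request parameters described above, assuming the edismax query parser (mm is a dismax/edismax parameter) and a made-up field name `name`. The terms are left unquoted so they are not treated as a phrase.

```python
from urllib.parse import urlencode

# "name" is a hypothetical field; the query text is only an example.
params = {
    "defType": "edismax",
    "q": "apple computer company",  # unquoted: terms, not a phrase
    "q.op": "OR",                   # default operator
    "mm": "2",                      # at least two terms must match
    "qf": "name",
}
query_string = urlencode(params)
print(query_string)
```

With mm=2, a document matching only "apple" would be excluded, while one matching "apple" and "company" would still be returned.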

Re: similarity as a parameter

2015-12-15 Thread Jack Krupansky
There should probably be some doc notes about this stuff, at a minimum alerting the user to the prospect that changing the similarity for a field (or the default for all fields) can require reindexing and when it is likely to require reindexing. The Lucene-level Javadoc should probably say these sa

Re: Solr Basic Configuration - Highlight - Begginer

2015-12-15 Thread Erick Erickson
No, that's not what I meant. The highlight component adds a special section to the return packet that will contain "snippets" of text with highlights. You control how big those snippets are via various parameters in the highlight component and they'll have the tags you specify for highlighting. Yo
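
To illustrate the snippet-size and tag parameters mentioned above, here is a sketch of a highlighting request. The field name `content`, the query term, and the specific values are assumptions; the parameter names (hl, hl.fl, hl.snippets, hl.fragsize, hl.simple.pre, hl.simple.post) are the standard highlighter ones.

```python
from urllib.parse import urlencode

# Field name and values are illustrative only.
hl_params = {
    "q": "content:solr",
    "hl": "true",              # turn the highlight component on
    "hl.fl": "content",        # field(s) to build snippets from
    "hl.snippets": 3,          # max snippets per field
    "hl.fragsize": 150,        # approximate snippet size in characters
    "hl.simple.pre": "<em>",   # tag placed before a highlighted term
    "hl.simple.post": "</em>", # tag placed after a highlighted term
}
hl_query = urlencode(hl_params)
print(hl_query)
```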

RE: similarity as a parameter

2015-12-15 Thread Markus Jelsma
Sweetspot does require reindexing but is that the only one? I have not investigated some exotic implementations, anyone to confirm sweetspot is the only one? In that case you could patch QueryComponent right, instead of having a custom component? M. -Original message- > From:Dmitry

RE: similarity as a parameter

2015-12-15 Thread Chris Hostetter
: Sweetspot does require reindexing but is that the only one? I have not : investigated some exotic implementations, anyone to confirm sweetspot is : the only one? In that case you could patch QueryComponent right, instead : of having a custom component? I'm not sure how where this thread deve

Re: similarity as a parameter

2015-12-15 Thread Dmitry Kan
Hi Hoss, Thanks for sharing the knowledge on dangerous zones, will try to avoid them. #2 is quite probable way of implementing this in my case, as many Query objects are custom (although not all). But #1 is compelling too and sounds like a bit less trouble. On Tue, Dec 15, 2015 at 8:13 PM, Chris

Re: Collection API migrate statement

2015-12-15 Thread Shalin Shekhar Mangar
The migrate is a long running operation. Please use it along with async= parameter so that it can execute in the background. Then you can use the request status API to poll and wait until the operation completes. If there is any error then the same request status API will return the response. See h
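
A sketch of the async pattern described above: submit MIGRATE with an async request id, then poll REQUESTSTATUS with the same id. The host, collection names, split key, and request id are hypothetical, and the set of terminal states checked in `is_finished` is an assumption about the status response rather than something stated in this thread.

```python
from urllib.parse import urlencode

BASE = "http://localhost:8983/solr/admin/collections"  # assumed host and port


def migrate_url(source, target, split_key, request_id):
    """URL that starts a background migrate under the given request id."""
    params = {
        "action": "MIGRATE",
        "collection": source,
        "target.collection": target,
        "split.key": split_key,
        "async": request_id,
    }
    return f"{BASE}?{urlencode(params)}"


def request_status_url(request_id):
    """URL to poll for the outcome of a previously submitted async request."""
    params = {"action": "REQUESTSTATUS", "requestid": request_id}
    return f"{BASE}?{urlencode(params)}"


def is_finished(state):
    """Assumed terminal states; anything else means keep polling."""
    return state in ("completed", "failed")


start = migrate_url("hot", "cold", "a!", "migrate-001")
poll = request_status_url("migrate-001")
print(start)
print(poll)
```

A caller would fetch `start` once, then fetch `poll` at intervals until `is_finished` returns true for the reported state.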

Re: similarity as a parameter

2015-12-15 Thread Dmitry Kan
Markus, Jack, I think Ahmet nails it pretty nicely: the similarity functions in question are compatible on the index level. So it is not necessary to create a separate search field. Ahmet, I like your idea. Will take a look, thanks. Rgds, Dmitry On Tue, Dec 15, 2015 at 7:58 PM, Ahmet Arslan wr

Re: similarity as a parameter

2015-12-15 Thread Ahmet Arslan
I wonder what solr-plugin would be best for this functionality. How about a custom search component, in its prepare method? I think we can access (Solr)IndexSearcher inside a SearchComponent. setSimilarity in the process method should work. Ahmet On Tuesday, December 15, 2015 7:43 PM, Ahmet

Re: similarity as a parameter

2015-12-15 Thread Chris Hostetter
: I think this is a legitimate request. Majority of the similarities are : compatible index wise. I think the only exception is sweet spot : similarity. I think you are grossly underestimating the risk of arbitrarily using diff Similarities between index time and query time -- particularly in h

Re: Solr Basic Configuration - Highlight - Begginer

2015-12-15 Thread Evert R.
Hi Erick, Thank you very much for the reply!! I do get back the full text, author, and a whole lot of stuff which doesn't really matter for my project. So, what you are saying is that the solr gets me back the full content and my application will fix the rest? Which means for me that all my book

Re: similarity as a parameter

2015-12-15 Thread Ahmet Arslan
Hi Dmitry, I think this is a legitimate request. Majority of the similarities are compatible index wise. I think the only exception is sweet spot similarity. In Lucene, it can be changed on the fly with a new Searcher. It should be possible to do so in solr. Thanks, Ahmet On Tuesday, Decemb

Re: NRT vs Redis for Dynamic Data in SOLR (like counts, viewcounts, etc) -

2015-12-15 Thread Charlie Hull
On 15/12/2015 14:13, Vikram Parmar wrote: Hi Mikhail, Hi, In case you're interested, several years ago we prototyped a Lucene codec using Redis for just this sort of application: http://www.flax.co.uk/blog/2012/06/22/updating-individual-fields-in-lucene-with-a-redis-backed-codec/ It's a sli

Re: Collection API migrate statement

2015-12-15 Thread Erick Erickson
You might look at collection aliasing, this is sometimes used for time-series data (which I'm guessing this is). But I have to ask whether migrating stuff around like that is really necessary. 2M docs isn't very many, have you stress tested with just indexing them all to a single collection? Is the
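
For reference, a sketch of what a collection alias request could look like: one alias name that queries can target, backed by several collections. The host, alias name, and collection names ("hot", "cold") are made up to match the scenario in this thread.

```python
from urllib.parse import urlencode

BASE = "http://localhost:8983/solr/admin/collections"  # assumed host and port


def create_alias_url(alias, collections):
    """Build a CREATEALIAS URL mapping one alias to a list of collections."""
    params = {
        "action": "CREATEALIAS",
        "name": alias,
        "collections": ",".join(collections),
    }
    return f"{BASE}?{urlencode(params)}"


alias_url = create_alias_url("all_docs", ["hot", "cold"])
print(alias_url)
```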

Re: Solr Basic Configuration - Highlight - Begginer

2015-12-15 Thread Erick Erickson
How are you trying to display the results? Highlighting is a bit of an odd beast. Assuming it's correctly configured, the response packet will have a separate highlight section, it's the application's responsibility to present that pleasingly. What _do_ you get back in the response? BTW, the mail
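
A small sketch of the application-side step described above: pairing each document with its entry in the separate highlight section. It assumes a JSON response whose highlight section is keyed by the document's unique key (`id` here) and a field called `content`; the sample response is hand-written, not real Solr output.

```python
def attach_snippets(solr_response, field, id_key="id"):
    """Pair each returned document id with its highlight snippets, if any."""
    highlighting = solr_response.get("highlighting", {})
    results = []
    for doc in solr_response["response"]["docs"]:
        # The highlight section is keyed by the document's unique key.
        snippets = highlighting.get(str(doc[id_key]), {}).get(field, [])
        results.append({"id": doc[id_key], "snippets": snippets})
    return results


# Hand-written sample in the assumed shape.
sample = {
    "response": {"docs": [{"id": "doc1", "title": "A"}, {"id": "doc2", "title": "B"}]},
    "highlighting": {
        "doc1": {"content": ["some <em>solr</em> text"]},
        "doc2": {},
    },
}
merged = attach_snippets(sample, "content")
print(merged)
```

Documents with no snippet for the field get an empty list, so the display code can fall back to something else, such as the title.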

Re: solrcloud used a lot of memory and memory keep increasing during long time run

2015-12-15 Thread Erick Erickson
Rahul's comments were spot on. You can gain more confidence that this is normal if you attach a memory reporting program (jconsole is one): you'll see the memory grow for quite a while, then garbage collection kicks in and you'll see it drop in a sawtooth pattern. Best, Erick On Tue, Dec

Re: solrcloud used a lot of memory and memory keep increasing during long time run

2015-12-15 Thread zhenglingyun
Thank you very much. I will try reducing the heap memory and check whether the memory still keeps increasing. > On Dec 15, 2015, at 19:37, Rahul Ramesh wrote: > > You should actually decrease solr heap size. Let me explain a bit. > > Solr requires very less heap memory for its operation and more memory f

Re: similarity as a parameter

2015-12-15 Thread Jack Krupansky
You would need to define an alternate field which copied a base field but then had the desired alternate similarity, using SchemaSimilarityFactory. See: https://cwiki.apache.org/confluence/display/solr/Other+Schema+Elements -- Jack Krupansky On Tue, Dec 15, 2015 at 10:02 AM, Dmitry Kan wrote:

RE: similarity as a parameter

2015-12-15 Thread Markus Jelsma
Hello Dmitry - this is currently not possible. Quickest way is to reconfigure and reload the cores. Some similarities also require you to reindex, so it is a bad idea anyway. Markus -Original message- > From:Dmitry Kan > Sent: Tuesday 15th December 2015 16:02 > To: solr-user@lucene.ap

Collection API migrate statement

2015-12-15 Thread philippa griggs
Hello, Solr 5.2.1. I'm using the collection API migrate statement in our test environment with the view to implement a Hot, Cold arrangement- newer documents will be kept on the Hot collection and each night the oldest documents will be migrated into the Cold collection. I've got it all work

[ANNOUNCE] Apache Solr Ref Guide for v5.4

2015-12-15 Thread Cassandra Targett
The Lucene PMC is pleased to announce the release of the Apache Solr Reference Guide for Solr 5.4. This 598 page PDF is the definitive guide for Solr, written and edited by the Solr committer community. You can download it from: https://www.apache.org/dyn/closer.cgi/lucene/solr/ref-guide/ - Cass

similarity as a parameter

2015-12-15 Thread Dmitry Kan
Hi guys, Is there a way to alter the similarity class at runtime, with a parameter? -- Dmitry Kan Luke Toolbox: http://github.com/DmitryKey/luke Blog: http://dmitrykan.blogspot.com Twitter: http://twitter.com/dmitrykan SemanticAnalyzer: www.semanticanalyzer.info

Re: Is DIH going to be removed from Solr future versions?

2015-12-15 Thread Erik Hatcher
With time shaken loose, IMO ideally what we do (under https://issues.apache.org/jira/browse/SOLR-7188 probably) is create an update processor that *forwards* to a _real_ Solr collection update handler, and fire up EmbeddedSolrServer in a client-

RE: Is DIH going to be removed from Solr future versions?

2015-12-15 Thread Davis, Daniel (NIH/NLM) [C]
I am aware of the problems with the implementation of DIH, but is there any problem with the XML driven data import capability? Could it be rewritten (using modern XPath) to run as a part of SolrJ? I've been interested in that, but I just haven't been able to shake loose the time. -Original

Re: NRT vs Redis for Dynamic Data in SOLR (like counts, viewcounts, etc) -

2015-12-15 Thread Vikram Parmar
Hi Mikhail, Thanks for chiming in. Looking forward to your post regarding updatable numeric DocValues. What would be the 2nd most promising approach for now, would you say EFF should be ok to go with? Updating and reloading the EFF external file (containing a millions lines) at very short interv

Re: Security Problems

2015-12-15 Thread Upayavira
I concur - this makes sense. On Tue, Dec 15, 2015, at 01:39 PM, Jan Høydahl wrote: > Yes, that’s why I believe it should be: > 1) if only authentication is enabled, all users must authenticate and all > authenticated users can do anything. > 2) if authz is enabled, then all users must still authen

Re: Security Problems

2015-12-15 Thread Jan Høydahl
Yes, that’s why I believe it should be: 1) if only authentication is enabled, all users must authenticate and all authenticated users can do anything. 2) if authz is enabled, then all users must still authenticate, and can by default do nothing at all, unless assigned proper roles 3) if a user is

Re: NRT vs Redis for Dynamic Data in SOLR (like counts, viewcounts, etc) -

2015-12-15 Thread Mikhail Khludnev
I believe https://issues.apache.org/jira/browse/SOLR-5944 is the most promising approach for such scenarios. Despite it's not delivered in distro. We are going to publish a post about it at blog.griddynamics.com. FWIW, I suppose EFF can be returned in result list. On Fri, Dec 11, 2015 at 1:48 P

Re: solrcloud used a lot of memory and memory keep increasing during long time run

2015-12-15 Thread Rahul Ramesh
You should actually decrease solr heap size. Let me explain a bit. Solr requires very less heap memory for its operation and more memory for storing data in main memory. This is because solr uses mmap for storing the index files. Please check the link http://blog.thetaphi.de/2012/07/use-lucenes-mm

Solr Basic Configuration - Highlight - Begginer

2015-12-15 Thread Evert R.
Hi there! It's my first installation, not sure if this is the right channel... Here are my steps: 1. Set up a basic install of solr 5.4.0 2. Create a new core through command line (bin/solr create -c test) 3. Post 2 files: 1 .docx and 2 .pdf (bin/post -c test /docs/test/) 4. Query over the bro

solrcloud used a lot of memory and memory keep increasing during long time run

2015-12-15 Thread zhenglingyun
Hi, list I’m new to solr. Recently I encounter a “memory leak” problem with solrcloud. I have two 64GB servers running a solrcloud cluster. In the solrcloud, I have one collection with about 400k docs. The index size of the collection is about 500MB. Memory for solr is 16GB. Following is "ps aux

Re: Issue in Geospatial Search

2015-12-15 Thread Shenbagarajan
Below is the configuration in my managedschema.xml -- View this message in context: http://lucene.472066.n3.nabble.com/Issue-in-Geospatial-Search-tp4245441p4245451.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Issue in Geospatial Search

2015-12-15 Thread davidphilip cherian
What is the fieldType of the field "latlon" in older schema as well as new schema? Have you confirmed that both are same? On Tue, Dec 15, 2015 at 3:18 PM, Shenbagarajan wrote: > Hello, > > I am trying to implement geo spatial search in solr by referring the below > site. > https://cwiki.apache.

Re: Partial sentence match with block join

2015-12-15 Thread Upayavira
"Apple Computer Company" is a phrase query, meaning all the words must appear, in that order. Adding phrase slop just allows adjustments to the location of those words, not making them optional. If you want them to be optional, then do it as a regular query. If you want a phrase to score better, t

Re: Is DIH going to be removed from Solr future versions?

2015-12-15 Thread Upayavira
I doubt DIH will be "removed". It more likely will be relegated - still there, but emphasised less. Another possibility that has been mooted is to extract it, so that it can run outside of Solr. This strikes me as the best option. Having it run inside Solr strikes me as architecturally wrong, and

Issue in Geospatial Search

2015-12-15 Thread Shenbagarajan
Hello, I am trying to implement geo spatial search in solr by referring the below site. https://cwiki.apache.org/confluence/display/solr/Spatial+Search Everytime i try to execute i am getting the same error as below. "msg":"The field latlon does not support spatial filtering", When i try to run
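
For context, a sketch of the kind of spatial filter the error message refers to, using the geofilt syntax from the Spatial Search page linked above. The point and distance are made-up values. The filter only works if `latlon` is declared with a spatial field type in the schema, which is the likely thing to check here.

```python
from urllib.parse import urlencode


def geofilt(field, lat, lon, distance_km):
    """Build a geofilt filter query for a spatial field."""
    return f"{{!geofilt sfield={field} pt={lat},{lon} d={distance_km}}}"


# Illustrative coordinates and radius.
fq = geofilt("latlon", 45.15, -93.85, 5)
geo_query = urlencode({"q": "*:*", "fq": fq})
print(fq)
print(geo_query)
```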

Re: Memory leak in SolrCloud 4.6

2015-12-15 Thread Emir Arnautovic
Hi Mark, Can you tell us a bit more about your index and load? Why do you think there is a leak? If you give that memory to JVM it will use it and you gave most of it to JVM. Only 4GB is left for OS and disk caches. Since swap is enabled, it might swap some JVM pages. It seems to me like complet