Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Erik Hatcher
What's the parsed query? debug=true On Dec 8, 2014, at 02:50, Dinesh Babu dinesh.b...@pb.com wrote: I just tried your suggestion {!complexphrase}displayName:RVN Viewpoint users Even the above did not work. Am I missing any configuration changes for this parser to work? Regards,

Re: Question on Solr Caching

2014-12-08 Thread Manohar Sripada
Thanks Shawn, Can you please re-direct me to any wiki which describes (in detail) the differences between MMapDirectoryFactory and NRTCachingDirectoryFactory? I found this blog http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html very helpful which describes about

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Mikhail Khludnev
On Fri, Dec 5, 2014 at 10:44 PM, Darin Amos dari...@gmail.com wrote: public Scorer scorer(){ TermsWithScoreCollector collector = new TermsWithScoreCollector(); JoinQuery.this.s.search(JoinQuery.this.q,

Re: Slow queries

2014-12-08 Thread melb
THnks for the answer A dedicated box will be a great solution but I will wait for that solution, I have restricted sources Is Optimze action can improve performance? Is using default servlet engine Jetty can be harmful for the performance, SHould I use an independant tomcat engine? rgds, --

Clearing SolrCaches

2014-12-08 Thread Manohar Sripada
Hi, Can anyone please let me know on how to clear caches associated with an IndexSearcher explicitly? In my project, I am creating a collection (say collection_1) which holds the data for my organizations dataset. I am using filterCache, queryResultCache and DocumentCache extensively and these

Re: Slow queries

2014-12-08 Thread Siegfried Goeschl
Hi, using Jetty is the recommended approach while using Tomcat is not recommend (unless you are a Tomcat shop). But any discussion comes back to the original question - why is it slow now? Are you I/O-bound, are CPU-bound, how many documents are committed/deleted over the time, do you having

Re: SegmentInfos exposed to /admin/luke

2014-12-08 Thread Dmitry Kan
Hi Alexey, In GUI luke there is an option to Just expunge deleted docs without re-merging. In case you want to give it a try. Dmitry On Wed, Dec 3, 2014 at 1:35 PM, Alexey Kozhemiakin alexey_kozhemia...@epam.com wrote: Dear All, We have a high percentage of deleted docs which do not go

Re: SegmentInfos exposed to /admin/luke

2014-12-08 Thread Mikhail Khludnev
Alexey, I've got that you need to get number of deleted docs in the index http://localhost:8983/solr/admin/mbeans?stats=truecat=CORE here it goes int name=numDocs27/int int name=maxDoc30/int int name=deletedDocs3/int if you need to get detailed segmentation, parse str

Re: CloudSolrServer, concurrency and too many connections

2014-12-08 Thread JoeSmith
We will need to update to 7u52, we are using 7u55. On the client side, this happens with zookeeper 3.4.6 and 4.10.2 solrj. And we will need to update both on the server side. What kind of config/setup information would you need to see if we do still have an issue after these updates? On Mon,

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Yonik Seeley
On Mon, Dec 8, 2014 at 2:50 AM, Dinesh Babu dinesh.b...@pb.com wrote: I just tried your suggestion {!complexphrase}displayName:RVN Viewpoint users Even the above did not work. Am I missing any configuration changes for this parser to work? What is the fieldType of displayName? The

Re: Question on Solr Caching

2014-12-08 Thread Shawn Heisey
On 12/8/2014 2:42 AM, Manohar Sripada wrote: Can you please re-direct me to any wiki which describes (in detail) the differences between MMapDirectoryFactory and NRTCachingDirectoryFactory? I found this blog http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html very helpful

Re: Clearing SolrCaches

2014-12-08 Thread Shawn Heisey
On 12/8/2014 3:02 AM, Manohar Sripada wrote: Can anyone please let me know on how to clear caches associated with an IndexSearcher explicitly? In my project, I am creating a collection (say collection_1) which holds the data for my organizations dataset. I am using filterCache,

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Michael Sokolov
I get the impression there was a concern that the caller could hold on to the query generated by JoinUtil for too long - eg across requests in Solr. I'm not sure why the OP thinks that would happen, though. -Mike On 12/08/2014 04:57 AM, Mikhail Khludnev wrote: On Fri, Dec 5, 2014 at 10:44

Length norm not functioning in solr queries.

2014-12-08 Thread S.L
I have two documents doc1 and doc2 and each one of those has a field called phoneName. doc1:phoneName:Details about Apple iPhone 4s - 16GB - White (Verizon) Smartphone Factory Unlocked doc2:phoneName:Apple iPhone 4S 16GB for Net10, No Contract, White Here if I search for

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Thanks a lot Jack. Will try this Solution. Regards, Dinesh Babu. -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: 07 December 2014 20:38 To: solr-user@lucene.apache.org Subject: Re: How to stop Solr tokenising search terms with spaces Thanks for the

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Hi Yonik, It is a text field ( all our search fields are of type text ). Very unlucky for me that it is not working. Will try the NGram solution provided by Jack. Regards, Dinesh Babu. -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Mikhail Khludnev
On Mon, Dec 8, 2014 at 5:38 PM, Michael Sokolov msoko...@safaribooksonline.com wrote: I get the impression there was a concern that the caller could hold on to the query generated by JoinUtil for too long - eg across requests in Solr. Michael, if you still bother, SOLR-6234

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Ahmet Arslan
Hi, May be you have omitTermFreqAndPositions=true set for your fields? Positions are necessary for phrase queries to work. Ahmet On Monday, December 8, 2014 5:20 PM, Dinesh Babu dinesh.b...@pb.com wrote: Hi Yonik, It is a text field ( all our search fields are of type text ). Very unlucky

Re: Length norm not functioning in solr queries.

2014-12-08 Thread Mikhail Khludnev
It's worth to look into explain to check particular scoring values. But for most suspect is the reducing precision when float norms are stored in byte vals. See javadoc for DefaultSimilarity.encodeNormValue(float) On Mon, Dec 8, 2014 at 5:49 PM, S.L simpleliving...@gmail.com wrote: I have two

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Darin Amos
Hi Mikhail, I was merely posing a thought in an effort to continue to learn and educate myself. Your point about Weight.scorer() being called per segment helps my understanding. I am in the middle of building a POC for a customer of mine that I pointed out in this thread on Dec 5th (shortly

AW: Keeping capitalization in suggestions?

2014-12-08 Thread Clemens Wyss DEV
Allthough making use of AnalyzingInfixSuggester I still getting either or. When lowercase-filter is active I always get suggestions, BUT they are lowercased (i.e. chamäleon). When lowercase-filter is not active I only get suggestions when querying Chamä my solrconfig.xml ... requestHandler

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Michael Sokolov
Right - allowing Solr to manage these queries (SOLR-6234) seems like the way to go ... OP == original poster (I lost track of who started the discussion) -Mike On 12/08/2014 10:19 AM, Mikhail Khludnev wrote: On Mon, Dec 8, 2014 at 5:38 PM, Michael Sokolov msoko...@safaribooksonline.com

Re: [ANN] Heliosearch 0.09 (JSON Request API + Distrib for Facet API)

2014-12-08 Thread Mike Murphy
Very nice, I like the integration of request parameters, parameter substitution, and the handling of multiple JSON snippets. This is starting to feel nicer than elasticsearch even! You just need to round it out a little more now. --Mike On Fri, Dec 5, 2014 at 12:45 PM, Yonik Seeley

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Hi Erik, 1. With search phrase in quotes {!complexphrase}displayName:RVN Viewpoint* debug: { rawquerystring: {!complexphrase}displayName:\RVN Viewpoint*\, querystring: {!complexphrase}displayName:\RVN Viewpoint*\, parsedquery: ComplexPhraseQuery(\RVN Viewpoint*\),

Re: Preferred Scema/Config for Chinese Language Cores?

2014-12-08 Thread Tom Zimmermann
I tracked down an example from a sample solr config of a CJK setup with bigrams and no CJK tokenizer: fieldType name=text_cjk class=solr.TextField positionIncrementGap=100 analyzer tokenizer class=solr.StandardTokenizerFactory/ !-- normalize width before bigram, as e.g. half-width dakuten

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Erik Hatcher
debug output tells a lot. Looks like in the last two examples that the second part (Viewpoint*) is NOT parsed with the complex phrase parser - the whitespace thwarts it. I’d recommend doing something like this to test that parser out to avoid the “meta” parsing issue. q={!complexphrase

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Thanks Erik Regards, Dinesh Babu. -Original Message- From: Erik Hatcher [mailto:erik.hatc...@gmail.com] Sent: 08 December 2014 17:02 To: solr-user@lucene.apache.org Subject: Re: How to stop Solr tokenising search terms with spaces debug output tells a lot. Looks like in the last two

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Yonik Seeley
On Mon, Dec 8, 2014 at 12:01 PM, Erik Hatcher erik.hatc...@gmail.com wrote: debug output tells a lot. Looks like in the last two examples that the second part (Viewpoint*) is NOT parsed with the complex phrase parser - the whitespace thwarts it. Actually, it looks like it is, but you're not

Re: AW: Keeping capitalization in suggestions?

2014-12-08 Thread Ahmet Arslan
Hi Clemens, There a a number of ways to implement auto complete/suggest. Some of them pull data from indexed terms, therefore they will be lowercased. Some pull data from stored values, therefore capitalisation is preserved. Here are great resources on this topic.

Custom Rollup (Join) Query

2014-12-08 Thread Darin Amos
Hello, I posted this question within another thread and I think it got lost so I wanted to start a new thread about it. I have built a small POC for a customization I am hoping to get some validation on in case what I have built is a really bad implementation. I have been doing a lot of

Re: Tika HTTP 400 Errors with DIH

2014-12-08 Thread Dan Davis
I would say that you could determine a row that gives a bad URL, and then run it in DIH admin interface (or the command-line) with debug enabled The url parameter going into tika should be present in its transformed form before the next entity gets going. This works in a similar scenario for me.

DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
When I have a forEach attribute like the following: forEach=/medical-topics/medical-topic/health-topic[@language='English'] And then need to match an attribute of that, is there any alternative to spelling it all out: field column=url

CLUSTERSTATUS timeout

2014-12-08 Thread Hutchins, Jonathan
We are currently running Solr 4.10.0 in production. We have run into an issue where we cannot have our monitoring system hit the CLUSTERSTATUS api command every five minutes (or even as long as every hour) without getting a significant number of timeouts on the command. Does this command

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
In experimentation with a much simpler and smaller XML file, it doesn't look like '//health-topic/@url will not work, nor will '//@url' etc.So far, only spelling it all out will work. With child elements, such as title, an xpath of //title works fine, but it is beginning to same dangerous.

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Alexandre Rafalovitch
I don't believe there are any alternatives. At least I could not get anything but the full path to work. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community:

Get matched Term in join query

2014-12-08 Thread Peter Sturge
Hi Forum, Is it possible for a Solr query to return the term(s) that matched a particular field/query? For example, let's say there's a field like this: raw=This is a raw text field that happens to contain some text that's also in the action field value... And another field in a different index

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
The problem is that XPathEntityProcessor implements Xpath on its own, and implements a subset of XPath. So, if the input document is small enough, it makes no sense to fight it. One possibility is to apply an XSLT to the file before processing ite This blog post

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
Yes, that worked quite well. I still need the //tagname but that is the only DIH incantation I need. This will substantially accelerate things. On Mon, Dec 8, 2014 at 5:37 PM, Dan Davis d...@danizen.net wrote: The problem is that XPathEntityProcessor implements Xpath on its own, and

Re: unable to build spellcheck in solr

2014-12-08 Thread Min L
Below is the solrconfig.xml. myfieldName is indexed in solr and is searchable. The spellcheck.build=true http://:8080/solr/mycore/suggestkeyword?spellcheck.build=true works in one env but not in another. searchComponent class=solr.SpellCheckComponent name=suggestk lst name=spellchecker

Re: CloudSolrServer, concurrency and too many connections

2014-12-08 Thread JoeSmith
Thanks, Shawn. I updated to 7u72 and was not able to reproduce the problem. That was good. But just to be sure about this, I backed back down to 7u55 and again was not able to reproduce. So at least for now, this has gone away even if the reason is inconclusive. On Mon, Dec 8, 2014 at 7:37

Re: Boosting the score using edismax for a non empty and non indexed field.

2014-12-08 Thread S.L
Anyone ? On Mon, Dec 8, 2014 at 2:45 AM, S.L simpleliving...@gmail.com wrote: Hi All, I have a situation where I need to boost the score of a query if a field (imageURL) in the given document is non empty , I am using edismax so I know that using bq parameter would solve the problem.

Re: Clearing SolrCaches

2014-12-08 Thread Manohar Sripada
How to edit the configuration that is linked to a collection?? I am using SolrCloud and I upload my config to Zookeeper. So, if I modify and upload the config, will that not impact the latest collection as well, if I don't reload the latest collection? Thanks, Manohar On Mon, Dec 8, 2014 at 7:45

SOLR shards stay down forever

2014-12-08 Thread Norgorn
I'm using SOLR 4.10.1 in cloud mode with 3 instances, 5 shards per instance without replication. I restarted one SOLR and now all shards from that instance are down, but there are no errors in logs. All I see is 09.12.2014, 11:13:40WARNUpdateLog Starting log replay

Re: Get matched Term in join query

2014-12-08 Thread Mikhail Khludnev
Hello Peter, Let's limit or just fix the problem definition. I've got that dealing with cross core join id mandatory. Is it right? Then, do you need facets (from all resultset) or just a snippets (just from result page)? 09.12.2014 1:23 пользователь Peter Sturge peter.stu...@gmail.com написал: