sort on facet.index?

2015-04-02 Thread Derek Poh
Is sorting on facet index supported? I would like to sort on the below facet index lst name=P_SupplierRanking int name=014/int int name=18/int int name=212/int int name=3349/int int name=481/int int

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-02 Thread avinash09
Alex, finally it worked for me found ctrl A separator ==( separator=%01escape=\) Thanks for your help -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-indexing-from-csv-file-having-28-cols-taking-lot-of-time-plz-help-i-m-new-to-solr-tp4196904p4197143.html Sent from the

Alphanumeric Wild card search

2015-04-02 Thread Palagiri, Jayasankar
Hello Team, Below is my field type fieldType name=text_en_splitting class=solr.TextField positionIncrementGap=100 autoGeneratePhraseQueries=true analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ !-- in this example, we will only use synonyms at query

Re: SolrCloud 5.0 cluster RAM requirements

2015-04-02 Thread Toke Eskildsen
Ryan Steele ryan.ste...@pgi.com wrote: Does a SolrCloud 5.0 cluster need enough RAM across the cluster to load all the collections into RAM at all times? Although Shawn is right about us not being able to answer properly, sometimes we can give qualified suggestions and guesses. At least to the

Question regarding enablePositionIncrements

2015-04-02 Thread Aman Tandon
Hi, I was using the enablePositionIncrements in solr 4.8.1 schema. But when I tries to use it in solr-5.0.0 it is giving error in creating the collection If I am correct it was useful in phrase queries. So is there any particular reasons for not supporting this option in solr 5? If so, then

Re: Restart solr failed after applied the patch in https://issues.apache.org/jira/browse/SOLR-6359

2015-04-02 Thread forest_soup
Thanks Ramkumar! Understood. We will try 100, 10. But with our original steps which we found the exception, can we say that the patch has some issue? 1, put the patch to all 5 running solr servers(tomcat) by replacing the tomcat/webapps/solr/WEB-INF/lib/solr-core-4.7.0.jar with the patched

ShardHandler semantics

2015-04-02 Thread Gregg Donovan
We're starting work on adding backup requests http://static.googleusercontent.com/media/research.google.com/en/us/people/jeff/Berkeley-Latency-Mar2012.pdf to the ShardHandler. Roughly something like: 1. Send requests to 100 shards. 2. Wait for results from 75 to come back. 3. Wait for either a)

Re: newbie questions regarding solr cloud

2015-04-02 Thread Upayavira
A couple of additions: I had a system that indexed log files. I created a new core each day (some 20m log events/day). I created collection aliases called today, week and month that aggregated the relevant collections. That way, accessing the “today” collection would always get you to the right

Re: sort param could not be parsed as a query, and is not a field that exists in the index: geodist()

2015-04-02 Thread Erick Erickson
What comes out int he Solr logs? Nothing's jumping out at me here. What version of Solr are you using? What is your GEOLOCATION field type? Best, Erick On Thu, Apr 2, 2015 at 2:20 PM, Niraj niroj.off...@gmail.com wrote: *Objective: To find out all locations those are present within 1 KM of the

newbie questions regarding solr cloud

2015-04-02 Thread Ben Hsu
Hello I am playing with solr5 right now, to see if its cloud features can replace what we have with solr 3.6, and I have some questions, some newbie, and some not so newbie Background: the documents we are putting in solr have a date field. the majority of our searches are restricted to

Re: Taking Solr 5.0 to Production on Windows

2015-04-02 Thread Upayavira
On Thu, Apr 2, 2015, at 04:23 PM, Shawn Heisey wrote: On 4/2/2015 8:20 AM, Steven White wrote: I'm reading Taking Solr 5.0 to Production https://cwiki.apache.org/confluence/display/solr/Taking+Solr+to+Production but I cannot find anything about Windows, is there some other link I'm

solr query latency spike when replicating index

2015-04-02 Thread wei
I noticed the solr query latency spike on slave node when replicating index from master. Especially when master just finished optimization, the slave node will copy the whole index, and the latency is really bad. Is there some way to fix it? Thanks, Wei

Re: Taking Solr 5.0 to Production on Windows

2015-04-02 Thread Shawn Heisey
On 4/2/2015 2:23 PM, Upayavira wrote: I think the point is more that the majority of developers use a Unix based system, and the majority of testing is done on Unix based systems. Also, there are ways in which the Windows memory model differs from a Unix one, meaning certain memory

Unable to update config file using zkcli or RELOAD

2015-04-02 Thread Shamik Bandopadhyay
Hi, I'm facing a weird issue. I've a solr cloud cluster with 2 shards having a replica each. I started the cluster using -Dbootstrap_confdir=./solr/collection1/conf -Dcollection.configName=myconf. After the cluster is up and running, I added a new request handler (newhandler) and wanted to push

DOcValues

2015-04-02 Thread William Bell
If I set indexed=true and docvalues=true, when I facet=truefacet.field=manu_exact will it use docValues or the Indexed version? Also, does it help with *Too many values for UnInvertedField faceting ?* *Do I need to set facet.method when using docvalues?* field name=manu_exact type=string

Re: SolrCloud 5.0 cluster RAM requirements

2015-04-02 Thread Ryan Steele
Thank you Shawn and Toke for the information and links! No, I was not the one on #solr IRC channel. :/ Here are the details I have right now: I'm building/running the operations side of this new SolrCloud cluster. It will be in Amazon, the initial cluster I'm planning to start with is 5

Problems with solr-cloud 4.8.0 and zookeeper 3.4.6

2015-04-02 Thread Vincenzo D'Amore
Hi, In my development I have 3 servers. Inside every server there are two running instance of zookeeper and solrcloud. zkHos There aren't connections or any other clients running but I have the zookeeper logs flooded by this annoying exceptions coming only from server 1 and 3. All solrcloud and

Facet sorting algorithm for index

2015-04-02 Thread yriveiro
Hi, I have an external application that use the output of a facet to join other dataset using the keys of the facet result. The facet query use index sort but in some point, my application crash because the order of the keys is not correct. If I do an unix sort over the keys of the result with

RE: Alphanumeric Wild card search

2015-04-02 Thread Palagiri, Jayasankar
Hello Team, Below is my field type fieldType name=text_en_splitting class=solr.TextField positionIncrementGap=100 autoGeneratePhraseQueries=true analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ !-- in this example, we will only use synonyms at query

Re: Question regarding enablePositionIncrements

2015-04-02 Thread Jack Krupansky
Position increments were considered problematic, especially for highlighting. Did you get this for the stop filter? There was a Jira for this - check CHANGES.TXT and the Jira for details. For some discussion, see: https://issues.apache.org/jira/browse/SOLR-6468 -- Jack Krupansky On Thu, Apr 2,

Re: Database vs Solr : ID based filtering

2015-04-02 Thread Aman Tandon
Thanks Mikhail for the explanation. With Regards Aman Tandon On Fri, Mar 27, 2015 at 3:40 PM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: for the single where clause RDBMS with index performs comparable same as inverted index. Inverted index wins on multiple 'where' clauses, where it

edismax operators

2015-04-02 Thread Mahmoud Almokadem
Hello, I've a strange behaviour on using edismax with multiwords. When using passing q=+(word1 word2) I got rawquerystring: +(word1 word2), querystring: +(word1 word2), parsedquery: (+(+(DisjunctionMaxQuery((title:word1)) DisjunctionMaxQuery((title:word2)/no_coord, parsedquery_toString:

Re: edismax operators

2015-04-02 Thread Jack Krupansky
The parentheses signal a nested query. Your plus operator applies to the overall nested query - that the nested query must match something. Use the plus operator on each of the discrete terms if each of them is mandatory. The plus and minus operators apply to the overall nested query - they do not

Re: Alphanumeric Wild card search

2015-04-02 Thread Simon Martinelli
Hi, Have a look at the generated terms to see how they look. Simon On Thu, Apr 2, 2015 at 9:43 AM, Palagiri, Jayasankar jayashankar.palag...@honeywell.com wrote: Hello Team, Below is my field type fieldType name=text_en_splitting class=solr.TextField positionIncrementGap=100

Re: Facet sorting algorithm for index

2015-04-02 Thread Yonik Seeley
On Thu, Apr 2, 2015 at 6:36 AM, yriveiro yago.rive...@gmail.com wrote: Hi, I have an external application that use the output of a facet to join other dataset using the keys of the facet result. The facet query use index sort but in some point, my application crash because the order of the

Re: Alphanumeric Wild card search

2015-04-02 Thread Jack Krupansky
This is caused by the word delimiter filter - it breaks multi-part terms (the hyphens trigger it) into multiple terms. Wildcards simply don't work consistently well in such a situation. The basic problem is that the presence of the wildcard causes all but the simplest token filtering stages to be

Re: SolrCloud 5.0 cluster RAM requirements

2015-04-02 Thread Shawn Heisey
On 4/2/2015 4:46 PM, Ryan Steele wrote: Thank you Shawn and Toke for the information and links! No, I was not the one on #solr IRC channel. :/ Here are the details I have right now: I'm building/running the operations side of this new SolrCloud cluster. It will be in Amazon, the initial

Re: SolrCloud 5.0 cluster RAM requirements

2015-04-02 Thread Shawn Heisey
On 4/2/2015 11:18 PM, Shawn Heisey wrote: On 4/2/2015 4:46 PM, Ryan Steele wrote: cluster. It will be in Amazon, the initial cluster I'm planning to start with is 5 r3.xlarge instances each using a general purpose SSD EBS volume for the SolrCloud related data (this will be separate from the

multi core faceting

2015-04-02 Thread Aman Tandon
Hi, I have two cores one contains the data of jeans and other core contains data of shirts available to user. I want to show count of shirts and jeans on my website from one solr request. Is there any functionality available i solr by which I can get the combined facet from both the cores (jeans

Re: multi core faceting

2015-04-02 Thread Shawn Heisey
On 4/2/2015 11:30 PM, Aman Tandon wrote: I have two cores one contains the data of jeans and other core contains data of shirts available to user. I want to show count of shirts and jeans on my website from one solr request. Is there any functionality available i solr by which I can get the

Re: Generating json response in custom requestHandler (xml is working)

2015-04-02 Thread Shalin Shekhar Mangar
The URL you are trying to access is wrong. You are using /solr/etr_base_core/trendswt=json but you should be using /solr/etr_base_core/trends?wt=json On Thu, Apr 2, 2015 at 9:51 AM, Christian Reuschling christian.reuschl...@gmail.com wrote: Hi, I managed it to create a small custom

Re: Facet sorting algorithm for index

2015-04-02 Thread Yonik Seeley
On Thu, Apr 2, 2015 at 9:44 AM, Yago Riveiro yago.rive...@gmail.com wrote: Where can I found the source code used in index sorting? I need to ensure that the external data has the same sorting that the facet result. If you step over the indexed terms of a field you get them in sorted order

Re: sort on facet.index?

2015-04-02 Thread Toke Eskildsen
Ryan Josal rjo...@gmail.com wrote: So maybe you are asking if you can sort by index, but reversed? I don't think this is possible, and it's a good question. It is not currently possible and the JIRA for the issue https://issues.apache.org/jira/browse/SOLR-1672 is 5 years old. On the plus

Re: How to recover a Shard

2015-04-02 Thread Erick Erickson
Matt: This seems dangerous, but you might be able to use the Collections API to 1 DELTEREPLICA an all but one. 2 RELOAD the collection 3 ADDREPLICA back. I don't _like_ this much mind you as when you added the replicas back it'd replicate the index from the leader, but at least you might not

Taking Solr 5.0 to Production on Windows

2015-04-02 Thread Steven White
Hi folks, I'm reading Taking Solr 5.0 to Production https://cwiki.apache.org/confluence/display/solr/Taking+Solr+to+Production but I cannot find anything about Windows, is there some other link I'm missing? This section in the doc is an important part for a successful Solr deployment, but it is

Re: sort on facet.index?

2015-04-02 Thread Ryan Josal
Sorting the result set or the facets? For the facets there is facet.sort=index (lexicographically) and facet.sort=count. So maybe you are asking if you can sort by index, but reversed? I don't think this is possible, and it's a good question. I wanted to chime in on this one because I wanted

Re: Taking Solr 5.0 to Production on Windows

2015-04-02 Thread Shawn Heisey
On 4/2/2015 8:20 AM, Steven White wrote: I'm reading Taking Solr 5.0 to Production https://cwiki.apache.org/confluence/display/solr/Taking+Solr+to+Production but I cannot find anything about Windows, is there some other link I'm missing? This section in the doc is an important part for a

RE: How to recover a Shard

2015-04-02 Thread Matt Kuiper
Thanks Erick! Understand your warning. Next time it occurs, I will plan to give it a try. I am currently in a dev environment, so it is a safe place to experiment. Matt -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, April 02, 2015 9:40 AM

Re: edismax operators

2015-04-02 Thread Shawn Heisey
On 4/2/2015 8:35 AM, Mahmoud Almokadem wrote: Thank you Jack for your clarifications. I used regular defType and set q.op=AND so all terms without operators are must. How can I use this with edismax? The edismax parser is capable of much more granularity than simply AND/OR on the default

RE: edismax operators

2015-04-02 Thread Davis, Daniel (NIH/NLM) [C]
Can the mm parameter be set per clause?I guess I've ignored it in the past aside from setting it once to what seemed like a reasonable value. That is probably replicated across every collection, which cannot be ideal for relevance. -Original Message- From: Shawn Heisey

Re: Question regarding enablePositionIncrements

2015-04-02 Thread Jack Krupansky
That's my understanding - but use the Solr Admin UI analysis page to confirm exactly what happens, for both index and query analysis. -- Jack Krupansky On Thu, Apr 2, 2015 at 10:04 AM, Aman Tandon amantandon...@gmail.com wrote: Hi Jack, I read that jira, i understand the concern of heaven.

Re: Question regarding enablePositionIncrements

2015-04-02 Thread Aman Tandon
Hi Jack, I read that jira, i understand the concern of heaven. So does it mean that, no hole will be left when we will use the stop filter? With Regards Aman Tandon On Thu, Apr 2, 2015 at 6:01 PM, Jack Krupansky jack.krupan...@gmail.com wrote: Position increments were considered problematic,

Re: Facet sorting algorithm for index

2015-04-02 Thread Yago Riveiro
The result is a custom responseWriter, I found a bug in my code that append de \ to “. The JSON response shows the data without the \. Where can I found the source code used in index sorting? I need to ensure that the external data has the same sorting that the facet result. — /Yago

Re: edismax operators

2015-04-02 Thread Mahmoud Almokadem
Thank you Jack for your clarifications. I used regular defType and set q.op=AND so all terms without operators are must. How can I use this with edismax? Thanks, Mahmoud On Thu, Apr 2, 2015 at 2:14 PM, Jack Krupansky jack.krupan...@gmail.com wrote: The parentheses signal a nested query. Your

RE: edismax operators

2015-04-02 Thread Davis, Daniel (NIH/NLM) [C]
Thanks Shawn, This is what I thought, but Solr often has features I don't anticipate. -Original Message- From: Shawn Heisey [mailto:apa...@elyograg.org] Sent: Thursday, April 02, 2015 12:54 PM To: solr-user@lucene.apache.org Subject: Re: edismax operators On 4/2/2015 9:59 AM, Davis,

RE: Generating json response in custom requestHandler (xml is working)

2015-04-02 Thread Davis, Daniel (NIH/NLM) [C]
Use XSLT to generate JSON?But you probably actually do want both, and ruby/python, etc. -Original Message- From: Christian Reuschling [mailto:christian.reuschl...@gmail.com] Sent: Thursday, April 02, 2015 12:51 PM To: solr-user@lucene.apache.org Subject: Generating json response in

Re: edismax operators

2015-04-02 Thread Mahmoud Almokadem
Thanks all for you response, But the parsed_query and number of results still when changing MM parameter the following results for mm=100% and mm=0%

Re: sort on facet.index?

2015-04-02 Thread Yonik Seeley
On Thu, Apr 2, 2015 at 10:25 AM, Ryan Josal rjo...@gmail.com wrote: Sorting the result set or the facets? For the facets there is facet.sort=index (lexicographically) and facet.sort=count. So maybe you are asking if you can sort by index, but reversed? I don't think this is possible, and

RE: Generating json response in custom requestHandler (xml is working)

2015-04-02 Thread Davis, Daniel (NIH/NLM) [C]
I mean that you could use XSLTResponseWriter to generate exactly the format you want. However, I anticipate that if you already have a custom response, getting it to automatically generate XML/JSON/Python/Ruby was an expectation, and may be a requirement. Maybe you should look at the code -

Re: edismax operators

2015-04-02 Thread Erick Erickson
The MM parameter is specific to the handler you set up/use, so it's really on a per collection basis. Different collections can specify this however they want. Or I misunderstand what you're asking.. Best, Erick On Thu, Apr 2, 2015 at 8:59 AM, Davis, Daniel (NIH/NLM) [C] daniel.da...@nih.gov

Re: edismax operators

2015-04-02 Thread Shawn Heisey
On 4/2/2015 9:59 AM, Davis, Daniel (NIH/NLM) [C] wrote: Can the mm parameter be set per clause?I guess I've ignored it in the past aside from setting it once to what seemed like a reasonable value. That is probably replicated across every collection, which cannot be ideal for relevance.

Generating json response in custom requestHandler (xml is working)

2015-04-02 Thread Christian Reuschling
Hi, I managed it to create a small custom requestHandler, and filled the response parameter with some static values in the structure I want to have later. I can invoke the requestHander from the browser and get nicely xml with the data and structure I had specified - so far so good. Here is

Re: sort on facet.index?

2015-04-02 Thread Ryan Josal
Awesome, I didn't know this feature was going to add so much power! Looking forward to using it. On Thursday, April 2, 2015, Yonik Seeley ysee...@gmail.com wrote: On Thu, Apr 2, 2015 at 10:25 AM, Ryan Josal rjo...@gmail.com javascript:; wrote: Sorting the result set or the facets? For the

Re: edismax operators

2015-04-02 Thread Jack Krupansky
Personally, I am not convinced how the q.op and mm parameters are really handled within nested queries. There have been bugs in edismax and some oddities for how it does work. I have personally given up on figuring out how the code works. At one stage, back in the days when I did feel that I had a

Re: newbie questions regarding solr cloud

2015-04-02 Thread Erick Erickson
See inline: On Thu, Apr 2, 2015 at 12:36 PM, Ben Hsu ben@criticalmedia.com wrote: Hello I am playing with solr5 right now, to see if its cloud features can replace what we have with solr 3.6, and I have some questions, some newbie, and some not so newbie Background: the documents we

sort param could not be parsed as a query, and is not a field that exists in the index: geodist()

2015-04-02 Thread Niraj
*Objective: To find out all locations those are present within 1 KM of the specified reference point, sorted by the distance from the reference* curl -i --globoff --negotiate -u XXX:XXX -XGET -H Accept: application/json \ -X GET