Re: JSON Facet Syntax Sorting

2016-10-26 Thread Zheng Lin Edwin Yeo
Thanks for the update Yonik. Regards, Edwin On 26 October 2016 at 20:07, Yonik Seeley wrote: > On Wed, Oct 26, 2016 at 3:16 AM, Zheng Lin Edwin Yeo > wrote: > > Hi, > > > > I'm using Solr 6.2.1. > > > > For the JSON Facet Syntax, are we able to sort on

Re: Combine Data from PDF + XML

2016-10-26 Thread Erick Erickson
In that case you'll have to write an indexing client that (probably) uses Tika to parse the PDF file, some kind of XML parser to parse the metadata XML and combine the two into Solr documents that you send to Solr. Here's a skeletal program with some extra stuff in there for database connectivity,
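
The skeletal program mentioned in the message is not reproduced in this listing. As a separate, minimal sketch of the "combine the two" step only, the snippet below merges metadata from an XML string with text that is assumed to have already been extracted from the PDF (for example by Tika). The function name `build_solr_doc`, the field names `id` and `content`, and the sample XML layout are all hypothetical and not taken from the thread.

```python
import xml.etree.ElementTree as ET


def build_solr_doc(doc_id, pdf_text, metadata_xml):
    """Merge already-extracted PDF text with flat XML metadata into one dict.

    Assumes the metadata XML is a single root element whose direct children
    are simple <field>value</field> pairs (a hypothetical layout).
    """
    root = ET.fromstring(metadata_xml)
    # "id" and "content" are assumed field names, not ones from the thread.
    doc = {"id": doc_id, "content": pdf_text}
    for child in root:
        # Skip empty elements and never overwrite the id/content fields.
        if child.text and child.text.strip() and child.tag not in doc:
            doc[child.tag] = child.text.strip()
    return doc


example = build_solr_doc(
    "report-1",
    "Extracted body text of the PDF.",
    "<metadata><title>Annual Report</title><author>J. Doe</author></metadata>",
)
```

The resulting dictionary holds both the searchable body text and the metadata fields, so a single Solr document can answer queries against either source. Sending it to Solr is left out here on purpose, since the thread does not say which client is used.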

Re: Combine Data from PDF + XML

2016-10-26 Thread tesm...@gmail.com
Hi Erick, Thanks for your reply. Yes, XML files contain metadata about PDF files. I need to search from both XML and PDF files and to show search results from both sources. Regards, On Wed, Oct 26, 2016 at 1:47 AM, Erick Erickson wrote: > First you need to define

Re: Graph Traversal Question

2016-10-26 Thread Grant Ingersoll
On Wed, Oct 26, 2016 at 10:46 AM Joel Bernstein wrote: > Grant, can you describe your use case? Currently we can filter on the > relationship using a filter query. So I was wondering what use case would > involve retrieving the relationship. Are you looking to discover what >

Re: Solr Cloud A/B Deployment Issue

2016-10-26 Thread Pushkar Raste
Nodes will still go into recovery but only for a short duration. On Oct 26, 2016 1:26 PM, "jimtronic" wrote: It appears this has all been resolved by the following ticket: https://issues.apache.org/jira/browse/SOLR-9446 My scenario fails in 6.2.1, but works in 6.3 and

Re: Solr Cloud A/B Deployment Issue

2016-10-26 Thread Pushkar Raste
This is due to leader initiated recovery. When Take a look at https://issues.apache.org/jira/browse/SOLR-9446 On Oct 24, 2016 1:23 PM, "jimtronic" wrote: > We are running into a timing issue when trying to do a scripted deployment > of > our Solr Cloud cluster. > >

Re: Related Search

2016-10-26 Thread Trey Grainger
Yeah, the approach listed by Grant and Markus is a common approach. I've worked on systems that mined query logs like this, and it's a good approach if you have sufficient query logs to pull it off. There are a lot of linguistic nuances you'll encounter along the way, including how you

CodaHale metrics for Solr 6?

2016-10-26 Thread Walter Underwood
Anybody using the CodaHale metrics.jetty9.InstrumentedHandler? It looks a lot like something we built for our own use with Solr 4. http://metrics.dropwizard.io/3.1.0/manual/jetty/

RE: Related Search

2016-10-26 Thread Markus Jelsma
Indeed, we have similar processes running of which one generates a 'related query collection' which just contains a (normalized) query and its related queries. I would not know how this is even possible without continuously processing query and click logs. M. -Original message- >

Re: Solr Cloud A/B Deployment Issue

2016-10-26 Thread jimtronic
It appears this has all been resolved by the following ticket: https://issues.apache.org/jira/browse/SOLR-9446 My scenario fails in 6.2.1, but works in 6.3 and Master where this bug has been fixed. In the meantime, we can use our workaround to issue a simple delete command that deletes a

RE: Query formulation help

2016-10-26 Thread Prasanna S. Dhakephalkar
John, You are right, I am also looking for document fields as variables. That was going to be my next trial. I have been using the admin panel for trying out queries. Regards, Prasanna. -Original Message- From: John Bickerstaff [mailto:j...@johnbickerstaff.com] Sent: Wednesday, October

Re: Query formulation help

2016-10-26 Thread John Bickerstaff
For what it's worth, you can do some complex stuff - including using document fields as "variables" -- I did it on a Solr query endpoint (like /search) because I had stuff that was constant for every query. The syntax is challenging, but it can be done. I won't confuse the issue more unless you

Re: Query formulation help

2016-10-26 Thread Tom Evans
On Wed, Oct 26, 2016 at 4:00 PM, Prasanna S. Dhakephalkar wrote: > Hi, > > Thanks for reply, I did > > "q": "cost:[2 TO (2+5000)]" > > Got > > "error": { > "msg": "org.apache.solr.search.SyntaxError: Cannot parse 'cost:[2 to > (2+5000)]': Encountered

Re: Query formulation help

2016-10-26 Thread John Bickerstaff
Ahh - I see what you're after (I think). This page should be helpful for you: https://cwiki.apache.org/confluence/display/solr/Function+Queries Again, I'd try using the Admin UI as a test phase to get things right (and see the syntax in the URL that comes back on the response). Open the edismax

Re: Query formulation help

2016-10-26 Thread Shawn Heisey
On 10/26/2016 9:00 AM, Prasanna S. Dhakephalkar wrote: > Hi, Thanks for reply, I did "q": "cost:[2 TO (2+5000)]" Solr doesn't support doing math in that way in a query, except with dates. It's invalid syntax for a range query. Tom's reply was correct, but was phrased in a way that makes

Re: Solr Hit Highlighting

2016-10-26 Thread Bryan Bende
Hello, I think part of the problem is the mismatch between what you are highlighting on and what you are searching on. Your query has no field specified, so it must be searching a default field, which looks like it would be _text_ since the copyField was set up to copy everything to that

Re: Query formulation help

2016-10-26 Thread John Bickerstaff
It looks to me as if it's blowing up on syntax. I don't have access to the Admin UI right now, but I would suggest attempting to submit this query via the UI and examining the URL that comes back. That frequently solves my more frustrating syntax problems. I.E. try putting the cost:[...] in

RE: Query formulation help

2016-10-26 Thread Prasanna S. Dhakephalkar
Hi, Thanks for reply, I did "q": "cost:[2 TO (2+5000)]" Got "error": { "msg": "org.apache.solr.search.SyntaxError: Cannot parse 'cost:[2 to (2+5000)]': Encountered \" \"(2+5000) \"\" at line 1, column 18.\nWas expecting one of:\n\"]\" ...\n\"}\" ...\n",

Re: OOM Error

2016-10-26 Thread Susheel Kumar
Hi Toke, I think your guess is right. We have ingestion running in batches. We have 6 shards & 6 replicas on 12 VMs, with around 40+ million docs on each shard. Thanks everyone for the suggestions/pointers. Thanks, Susheel On Wed, Oct 26, 2016 at 1:52 AM, Toke Eskildsen

Re: Graph Traversal Question

2016-10-26 Thread Joel Bernstein
Grant, can you describe your use case? Currently we can filter on the relationship using a filter query. So I was wondering what use case would involve retrieving the relationship. Are you looking to discover what relationships are available? One of the assumptions I made was that users would know

Re: Graph Traversal Question

2016-10-26 Thread Grant Ingersoll
The other way to think about it is: I want to put labels on the edges. In my case, the label is the relationship; in your case, the label is the rating or author. On Wed, Oct 26, 2016 at 7:26 AM Yonik Seeley wrote: > On Wed, Oct 26, 2016 at 7:13 AM, Grant Ingersoll

Re: JSON Facet Syntax Sorting

2016-10-26 Thread Yonik Seeley
On Wed, Oct 26, 2016 at 3:16 AM, Zheng Lin Edwin Yeo wrote: > Hi, > > I'm using Solr 6.2.1. > > For the JSON Facet Syntax, are we able to sort on multiple values at one go? > > Like for example, if I want to sort by count, followed by the average price. > Is this the correct

Re: Graph Traversal Question

2016-10-26 Thread Yonik Seeley
On Wed, Oct 26, 2016 at 7:13 AM, Grant Ingersoll wrote: > On Tue, Oct 25, 2016 at 6:26 PM Yonik Seeley wrote: > > In your example below it would be akin to injecting the rating onto those > responses as well, not just in the 'fq'. Gotcha... Yeah, I

Re: Graph Traversal Question

2016-10-26 Thread Grant Ingersoll
On Tue, Oct 25, 2016 at 6:46 PM Joel Bernstein wrote: > Because the edges are unique on the subject->object there isn't currently a > way to capture the relationship. Aggregations can be rolled up on numeric > fields and as Yonik mentioned you can track the ancestor. > > It

Re: Graph Traversal Question

2016-10-26 Thread Grant Ingersoll
On Tue, Oct 25, 2016 at 6:26 PM Yonik Seeley wrote: > You can get the nodes that to came from by adding trackTraversal=true > Yeah, I've tried that. It's not quite what I want. That just gets me the "subject". What I'm trying to do is more akin to what a triple store does.

Re: Query formulation help

2016-10-26 Thread Tom Evans
On Wed, Oct 26, 2016 at 8:03 AM, Prasanna S. Dhakephalkar wrote: > Hi, > > > > May be very rudimentary question > > > > There is a integer field in a core : "cost" > > Need to build a query that will return documents where 0 < > "cost"-given_number < 500 >

Re: OOM Error

2016-10-26 Thread Tom Evans
On Wed, Oct 26, 2016 at 4:53 AM, Shawn Heisey wrote: > On 10/25/2016 8:03 PM, Susheel Kumar wrote: >> Agree, Pushkar. I had docValues for sorting / faceting fields from >> beginning (since I setup Solr 6.0). So good on that side. I am going to >> analyze the queries to find

JSON Facet Syntax Sorting

2016-10-26 Thread Zheng Lin Edwin Yeo
Hi, I'm using Solr 6.2.1. For the JSON Facet Syntax, are we able to sort on multiple values at one go? For example, if I want to sort by count, followed by the average price, is this the correct way to do it? json.facet={ categories:{ type : terms, field : cat, sort : {

Query formulation help

2016-10-26 Thread Prasanna S. Dhakephalkar
Hi, This may be a very rudimentary question. There is an integer field in a core: "cost". I need to build a query that will return documents where 0 < "cost" - given_number < 500. How can this be achieved? Thanks. Prasanna.
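
One way to read the condition is to rearrange 0 < cost - given_number < 500 into given_number < cost < given_number + 500 and do the arithmetic in the client before the query is sent, since the replies in this thread note that a range query does not accept expressions such as (2+5000) as a bound. The sketch below only builds the query string; the helper name `cost_window_query` and its parameters are hypothetical. It relies on standard Solr range syntax, where curly braces mark exclusive bounds.

```python
def cost_window_query(given_number, width=500, field="cost"):
    """Return a Solr range query for given_number < field < given_number + width.

    Curly braces make both bounds exclusive, matching the strict
    inequalities in the original question.
    """
    lower = given_number
    upper = given_number + width  # arithmetic done client-side, not by Solr
    return f"{field}:{{{lower} TO {upper}}}"


query = cost_window_query(2)
print(query)  # cost:{2 TO 502}
```

The string can then be passed as the `q` (or `fq`) parameter. If the bounds must depend on another field of each document rather than on a constant, that is a different problem, and the function-query approach mentioned elsewhere in the thread would be the place to look.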