Re: indexing error - 6.3.0

2017-01-18 Thread Joe Obernberger
Hi All - I've been trying to debug this, but it keeps occurring. Even if I do 100 at a time, or 50 at a time, eventually I get the below stack trace. I've also adjusted the autoSoftCommit and autoCommit times to a variety of values. It stills fails after a time; typically around 27-50

Sorting order of suggested words

2017-01-18 Thread Keiichi MORITA
When using the suggester, how to be sorted the default order of next suggested words? Example, input 'football' and next words are... football club football tv football manager football channel : In the above, I want to know the sorting order "club, tv, manager, channel". Kind,

Re: retrieve ids of all indexed docs efficiently

2017-01-18 Thread Erick Erickson
Added a tip on the CursorMark CWiki page, thanks for the suggestion! On Wed, Jan 18, 2017 at 5:21 PM, Pushkar Raste wrote: > I think we should add the suggestion about docValues to the cursormark wiki > (documentation), we too ran in the same problem. > > On Jan 18, 2017

Re: Maximum number of characters in highlighing

2017-01-18 Thread Keiichi MORITA
Hi Erick. Thanks a lot. I see the parameter's description, and want to get those high limits. Maybe it depends on target field's contents size and java heap size of my environment. Kind, Keiichi -- View this message in context:

Re: retrieve ids of all indexed docs efficiently

2017-01-18 Thread Pushkar Raste
I think we should add the suggestion about docValues to the cursormark wiki (documentation), we too ran in the same problem. On Jan 18, 2017 5:52 PM, "Erick Erickson" wrote: > Is your ID field docValues? Making it a docValues field should reduce > the amount of JVM heap

Re: Is it possible to rewrite part of the solr response?

2017-01-18 Thread Erick Erickson
The "bailout" bits are just a safety valve. Say the post filter is expensive. Say also that the query is *:*. That would mean that your post-filter would be called for each and every document in the corpus (absent other filters, of course). In order to avoid really _long_ queries in this case I do

Re: retrieve ids of all indexed docs efficiently

2017-01-18 Thread Erick Erickson
Is your ID field docValues? Making it a docValues field should reduce the amount of JVM heap you need. But the export is _much_ preferred, it'll be lots faster as well. Of course to export you need the values you're returning to be docValues... Erick On Wed, Jan 18, 2017 at 1:12 PM, Slomin,

huge amount of overseer queue entries

2017-01-18 Thread Hendrik Haddorp
Hi, I have a 6.2.1 solr cloud setup with 5 nodes containing close to 3000 collections having one shard and three replicas each. It looks like when nodes crash the overseer queue can go wild on grows until ZooKeeper is not working anymore correctly. This looks pretty much like SOLR-5961

Re: retrieve ids of all indexed docs efficiently

2017-01-18 Thread Slomin, David
The export feature sounds promising, although I'll have to talk with our deployment folks here about enabling it. The query I'm issuing is: http://:8983/solr/_shard1_replica1/select?q=*:*=id+asc=1000==id=true=false=json Thanks, Div. On 1/18/17, 3:54 PM, "Jan Høydahl"

Re: Solr Shard Splitting Issue

2017-01-18 Thread Anshum Gupta
Hi Ekta, Rule#1 - You shouldn't forcefully and manually change the state unless you know what you're doing and have performed all the checks. Seems like the child shards were still getting created i.e. copying the entire index from the parent shard when you manually switched. One of the reasons

Solr 6.0.0 Returns Blank Highlights for Certain Queries

2017-01-18 Thread Teague James
Hello everyone! I have a Solr 6.0.0 instance that is storing documents peppered with text like "1a", "2e", "4c", etc. If I search the documents for a word, "ms", "in", "the", etc., I get the correct number of hits and the results are highlighted correctly in the highlighting section. But when I

Re: retrieve ids of all indexed docs efficiently

2017-01-18 Thread Jan Høydahl
Don't know why you have mem problems. Can you paste in examples of full query strings during cursor mark querying? Sounds like you may be using it wrong. Or try exporting https://cwiki.apache.org/confluence/display/solr/Exporting+Result+Sets -- Jan Høydahl > Den 18. jan. 2017 kl. 21.44 skrev

retrieve ids of all indexed docs efficiently

2017-01-18 Thread Slomin, David
Hi -- I'd like to retrieve the ids of all the docs in my Solr 5.3.1 index. In my query, I've set rows=1000, fl=id, and am using the cursorMark mechanism to split the overall traversal into multiple requests. Not because I care about the order, but because the documentation implies that it's

How does using cacheKey and lookup behave?

2017-01-18 Thread Kaushik
I use the cacheKey, cacheLookup, SortedMapBackedCache in the Data Import Handler of Solr 5.x to join two or more entities. Does this give me an equivalent of Sql's inner join? If so, how can I get something similar to left join? Thank you, Kaushik

range facet query with /sql handler

2017-01-18 Thread radha krishnan
HI, can we do a range facet query with /sql handler in solr 6.3 ? something like the below one json.facet: { my_histogram: { type: range, field: timestamp, start: "2017-01-06T20:04:19.884Z", end: "2017-01-06T22:07:37.778Z", gap: "+1MINUTE", mincount: 0 } } q: timestamp:

Re: Is it possible to rewrite part of the solr response?

2017-01-18 Thread Mike Thomsen
I finally got a chance to deep dive into this and have a preliminary working plugin. I'm starting to look at optimization strategies for how to speed processing up and am wondering if you can give me some more information about your "bailout" strategy. Thanks, Mike On Wed, Dec 21, 2016 at 9:08

Re: HELP!!- Error on authentication for mongodb in data-source-config.xml

2017-01-18 Thread Shawn Heisey
On 1/18/2017 10:18 AM, Abhijit Pawar wrote: > One thing that popped in my mind is I saw your code wherein the > password for mysql is not included in quotes or double quotes. > password=REDACTED > Whereas mine were included. > > password="*<>*" > > Do you think that could be a possible issue

Re: HELP!!- Error on authentication for mongodb in data-source-config.xml

2017-01-18 Thread Abhijit Pawar
Hi Shawn, One thing that popped in my mind is I saw your code wherein the password for mysql is not included in quotes or double quotes. password=REDACTED Whereas mine were included. password="*<>*" Do you think that could be a possible issue here? Best Regards,

Re: HELP!!- Error on authentication for mongodb in data-source-config.xml

2017-01-18 Thread Abhijit Pawar
Thanks Shawn. As you rightly mentioned I will check with some who is familiar with JDBC for mongDB. Thank you and take care. Best Regards, Abhijit Pawar Office : +1 (469) 287 2005 x 110 Follow us on: On Wed, Jan 18, 2017 at 9:26

ApacheCon CFP closing soon (11 February)

2017-01-18 Thread Rich Bowen
Hello, fellow Apache enthusiast. Thanks for your participation, and interest in, the projects of the Apache Software Foundation. I wanted to remind you that the Call For Papers (CFP) for ApacheCon North America, and Apache: Big Data North America, closes in less than a month. If you've been

Re: Trouble boosting a field -solved-

2017-01-18 Thread Erick Erickson
bq: Which might be the same as saying nothing matched Right, a score of zero for a doc means it didn't match the query. It can be useful to specify can show the scoring for an arbitrary doc, even one with a zero score. https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters

Re: Maximum number of characters in highlighing

2017-01-18 Thread Erick Erickson
see: https://cwiki.apache.org/confluence/display/solr/Highlighting Particularly hl.fragsize and hl.maxAnalyzedChars Best, Erick On Wed, Jan 18, 2017 at 4:56 AM, Keiichi MORITA wrote: > Hello, > > When a document contains some highlight targets, what is the maximum number > of

Re: Lucene Merge Thread: skip too large

2017-01-18 Thread Kelly, Frank
Thanks Shawn - super helpful as always. -Frank Frank Kelly Principal Software Engineer HERE 5 Wayside Rd, Burlington, MA 01803, USA 42° 29' 7" N 71° 11' 32" W

Re: HELP!!- Error on authentication for mongodb in data-source-config.xml

2017-01-18 Thread Shawn Heisey
On 1/16/2017 1:04 PM, Abhijit Pawar wrote: > Hello, > > Need your help on one small problem I am facing in SOLR. > > I have added authentication for our mongodb database in data-source-config > file in SOLR. > rating,updatedAt,comparable,hide_price FROM > products':java.lang.RuntimeException: >

Re: Lucene Merge Thread: skip too large

2017-01-18 Thread Shawn Heisey
On 1/18/2017 6:51 AM, Kelly, Frank wrote: > We’re investigating a strange spike in Heap memory usage in our > Production Solr. > Heap is stable for days ~ 1.6GB and then suddenly spikes to 3.9 GB and > we get an OOM. > > Our app server behavior using Solr appears to unchanged (no new schema >

Re: Joining Across Collections

2017-01-18 Thread billnbell
Great question Bill Bell Sent from mobile > On Jan 18, 2017, at 1:32 AM, nabil Kouici wrote: > > Hi All, > I'm using join across collection feature to do an inner join between 2 > collections. It works fine. > Is it possible to use this feature to compare between

Lucene Merge Thread: skip too large

2017-01-18 Thread Kelly, Frank
Hello, We're investigating a strange spike in Heap memory usage in our Production Solr. Heap is stable for days ~ 1.6GB and then suddenly spikes to 3.9 GB and we get an OOM. Our app server behavior using Solr appears to unchanged (no new schema updates, no additional indexing or searching we

Maximum number of characters in highlighing

2017-01-18 Thread Keiichi MORITA
Hello, When a document contains some highlight targets, what is the maximum number of characters that could be highlighted? I want to know that limit of highlighting parameter and which parameter specifies it. Thanks, Keiichi MORITA -- View this message in context:

Re: Trouble boosting a field -solved-

2017-01-18 Thread Tom Chiverton
I 'solved' this by removing some of the 'AND' from my full query. AND should be optional but have no effect if there, right ? But for me it was forcing the score to 0. Which might be the same as saying nothing matched ? Tom On 13/01/17 15:10, Tom Chiverton wrote: I have a few hundred

Joining Across Collections

2017-01-18 Thread nabil Kouici
Hi All, I'm using  join across collection feature to do an inner join between 2 collections. It works fine. Is it possible to use this feature to compare between fields from different collections. For exemple: Collection1 Field1Collection2 Field2 search document from Collection1 where Field1 !=