RE: Search on field returns documents that should not match

2017-09-14 Thread Jagdish Vasani
Hi Chang Wang, Searching is worng here, q=title_lemma: fit you handgun It will search title_lemma: fit or _text:you or _text:handgun (if stop words are not excluded) Here _text is default search field , you might have different default search field. Try search like q=title_lemma:(fit you

Search on field returns documents that should not match

2017-09-14 Thread Chang Wang
Hello All, I am confused by the field search behavior of solr (6.6), and hope someone can help me understand the results. For example, I search "fit you handgun" on the field of "title_lemma". http://localhost:8983/solr/bkb4/select?fq=doc_type:

Re: How to remove control characters in stored value at Solr side

2017-09-14 Thread simon
looks as though the problem is in parsing some malformed XML, based on what I'm seeing: ... Caused by: com.ctc.wstx.exc.WstxUnexpectedCharException: Illegal character ((CTRL-CHAR, code 11)) ... ( char #11 is a vertical tab). This should be fixed outside Solr, but if that is not practical, and

Re: query with @ and *

2017-09-14 Thread Erick Erickson
See: https://lucidworks.com/2011/11/29/whats-with-lowercasing-wildcard-multiterm-queries-in-solr/ It discusses the general problem of particular filters being able to cope with wildcards or not. Generally any filter that could potentially produce more than one output token per input token is

Re: Two separate instances sharing the same zookeeper cluster

2017-09-14 Thread Mike Drob
When you specify the zk string for a solr instance, you typically include a chroot in it. I think the default is /solr, but it doesn't have to be, so you should be able to run with -z zk1:2181/sorl-dev and /solr-prod

Two separate instances sharing the same zookeeper cluster

2017-09-14 Thread James Keeney
I have a staging and a production solr cluster. I'd like to have them use the same zookeeper cluster. It seems like it is possible if I can set a different directory for the second cluster. I've looked through the documentation though and I can't quite figure out where to set that up. As a result

Re: How to remove control characters in stored value at Solr side

2017-09-14 Thread arnoldbronley
Thanks for information. Here is the full stack trace. I thought to handle it from client side but client apps are not under my control and I don't have access to them. org.apache.solr.common.SolrException: Illegal character ((CTRL-CHAR, code 11)) at [row,col {unknown-source}]: [1,413] at

Re: How to remove control characters in stored value at Solr side

2017-09-14 Thread simon
@Arnold: are these non UTF-8 control characters (which is what the Nutch issue was about) or otherwise legal UTF-8 characters which Solr for some reason is choking on ? If you could provide a full stack trace it would be really helpful. On Thu, Sep 14, 2017 at 2:55 PM, Markus Jelsma

Re: 2 Solr Instance with One Data Directory

2017-09-14 Thread Shawn Heisey
On 9/14/2017 10:18 AM, Ravi Kumar Taminidi wrote: > Hi Any one tried, have 2 solr Instance with One Data Directory. > > I get below Error when i try to point the 2nd solr to the first solr > directory. > > Any help ? > > org.apache.solr.common.SolrException:org.apache.solr.common.SolrException:

RE: How to remove control characters in stored value at Solr side

2017-09-14 Thread Markus Jelsma
Hello, You can not do this in Solr, you cannot even send non-character code points in the first place. For Apache Nutch we solved the problem by stripping those non-character code points from Strings before putting them in SolrDocument. Check the ticket, you can easily resuse the strip method.

Re: How to remove control characters in stored value at Solr side

2017-09-14 Thread simon
Sounds as though an update request processor will do that, and also eliminate the need to use the PatternReplaceFilterfactory downstream. Take a look at the documentation in https://lucene.apache.org/solr/guide/6_6/update-request-processors.html. I'm thinking that the RegexReplaceProcessorFactory

Highlighting in subqueries?

2017-09-14 Thread Peter Matthew Eichman
Hello all, Is it possible to highlight the results of subqueries? Thanks, -Peter -- Peter Eichman Senior Software Developer University of Maryland Libraries peich...@umd.edu

How to remove control characters in stored value at Solr side

2017-09-14 Thread Arnold Bronley
I know I can apply PatternReplaceFilterFactory to remove control characters from indexed value. However, is it possible to do similar thing for stored value? Because of some control characters included in indexing request, Solr throws Illegal Character Exception.

2 Solr Instance with One Data Directory

2017-09-14 Thread Ravi Kumar Taminidi
Hi Any one tried, have 2 solr Instance with One Data Directory. I get below Error when i try to point the 2nd solr to the first solr directory. Any help ? org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: Index dir '/engine/data/index/' of core 'engine' is already

Getting exception from /solr/admin/metrics

2017-09-14 Thread Shawn Heisey
I had never seen the /solr/admin/metrics endpoint, so I tried to access it on my dev server.  It threw an exception.  This is the first line: 2017-09-1414:27:11.629ERROR (qtp1394336709-480905)[]o.a.s.h.RequestHandlerBaseorg.apache.lucene.store.AlreadyClosedException:Alreadyclosed The rest of the

Re: query with @ and *

2017-09-14 Thread Susheel Kumar
You may want to use UAX29URLEmailTokenizerFactory tokenizer into your analysis chain. Thanks, Susheel On Thu, Sep 14, 2017 at 8:46 AM, Shawn Heisey wrote: > On 9/14/2017 5:06 AM, Mannott, Birgit wrote: > > I have a problem when searching on email addresses. > > @ seems to

Re: query with @ and *

2017-09-14 Thread Shawn Heisey
On 9/14/2017 5:06 AM, Mannott, Birgit wrote: > I have a problem when searching on email addresses. > @ seems to be handled as a special character but I don't find anything about > it in the documentation. > > This is my test data > t...@one.com > t...@two.com Chances are that have analysis

Re: query with @ and *

2017-09-14 Thread Atita Arora
Hi, Can you give us a little information about the query parser you using in your handler ? Thanks, Ati On Thu, Sep 14, 2017 at 4:36 PM, Mannott, Birgit wrote: > Hi, > > I have a problem when searching on email addresses. > @ seems to be handled as a special character

query with @ and *

2017-09-14 Thread Mannott, Birgit
Hi, I have a problem when searching on email addresses. @ seems to be handled as a special character but I don't find anything about it in the documentation. This is my test data t...@one.com t...@two.com searching for test* results both, ok. searching for t...@one.com results the correct one,

Re: Solr Spatial Index and Data

2017-09-14 Thread Rick Leir
hi Can Ezgi > First of all, i want to use spatial index for my data include polyghons and points. But solr indexed first 18 rows, other rows not indexed. Do all rows have a unique id field? Are there errors in the logfile? cheers -- Rick .

Re: SolrJ Java API examples

2017-09-14 Thread Leonardo Perez Pulido
Hi, This may help: https://github.com/leoperezpulido/lucene-solr/tree/master/solr/solrj/src/test/org/apache/solr/client/solrj Regards. On Thu, Sep 14, 2017 at 4:21 AM, Vishal Srivastava wrote: > Hi, > I'm a beginner at SolrJ , and am currently looking to implement

SolrJ Java API examples

2017-09-14 Thread Vishal Srivastava
Hi, I'm a beginner at SolrJ , and am currently looking to implement and integrate the same at my current organisation using Java . After a lot of research, I failed to find any good material / examples for SolrJ 's Java library that I could use as reference. Please suggest some good material.

Re: Provide suggestion on indexing performance

2017-09-14 Thread Sreenivas.T
I agree with Tom. Doc values and stored fields are present for different reasons. Doc values is another index that gets build for faster sorting/faceting. On Wed, Sep 13, 2017 at 11:30 PM Tom Evans wrote: > On Tue, Sep 12, 2017 at 4:06 AM, Aman Tandon

Solr Spatial Index and Data

2017-09-14 Thread Can Ezgi Aydemir
Hi everyone, First of all, i want to use spatial index for my data include polyghons and points. But solr indexed first 18 rows, other rows not indexed. I need sample datas include polyghons and points. Other problem, i will write spatial query this datas. This spatial query include

Re: Freeze Index

2017-09-14 Thread Toke Eskildsen
On Wed, 2017-09-13 at 11:56 -0700, fabigol wrote: > my problem is that my index freeze several time and i don't know why. > So i lost all the data of my index. > I have 14 million of documents from postgresql database. I have an > only node with 31 GO for my JVM and my server has 64GO. My index