RE: Solr Shingle is not working properly in solr 6.5.0

2017-04-05 Thread Aman Deep Singh
Thanks Steve , Markus. On 06-Apr-2017 3:26 AM, "Markus Jelsma" wrote: Hello Steve - that will do the job. I am sure it will be well documented in the reference docs/cwiki as well, so we all can look this up later. Many thanks, Markus -Original message- >

Re: Streaming expression with JDBC source returns no data for long running queries.

2017-04-05 Thread Joel Bernstein
The JdbcStream is pretty much just a wrapper for the JDBC driver. I don't believe the size of the result set would have much effect on the JdbcStream. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Apr 5, 2017 at 6:35 PM, Rishabh Patel < rishabh.mahendra.pa...@gmail.com> wrote: > Hello, >

Streaming expression with JDBC source returns no data for long running queries.

2017-04-05 Thread Rishabh Patel
Hello, I have created a Solr collection with 4 shards and 4 replicas. Now in the UI, I am trying to fetch data from Hive JDBC source using streaming expressions. The query executes successfully if I limit it to ~10 rows. With more rows, after about 10 seconds, the waiting spinner is gone and

RE: Solr Shingle is not working properly in solr 6.5.0

2017-04-05 Thread Markus Jelsma
Hello Steve - that will do the job. I am sure it will be well documented in the reference docs/cwiki as well, so we all can look this up later. Many thanks, Markus -Original message- > From:Steve Rowe > Sent: Wednesday 5th April 2017 23:50 > To:

Re: Solr Shingle is not working properly in solr 6.5.0

2017-04-05 Thread Steve Rowe
Hi Markus, Here’s what I included in 6.5.1’s CHANGES.txt (as well as on branch_6x and master, so it’ll be included in future releases’ CHANGES.txt too): - * SOLR-10423: Disable graph query production via schema configuration . This fixes broken queries for ShingleFilter-containing

RE: Solr Shingle is not working properly in solr 6.5.0

2017-04-05 Thread Markus Jelsma
Steve - please include a broad description of this feature in the next CHANGES.txt. I will forget about this thread but need to be reminded of why i could need it :) Thanks, Markus -Original message- > From:Steve Rowe > Sent: Wednesday 5th April 2017 23:26 > To:

Solr Warning

2017-04-05 Thread Webster Homer
I'm working on adding multi-language search to our application. We are multi-language by field, that is all of the languages are in the same collection. We have a lot of scientific data which often will have English terms mixed in with Chinese (or other language). The requirement is that for a

Re: Solr Shingle is not working properly in solr 6.5.0

2017-04-05 Thread Steve Rowe
Aman, In forthcoming Solr 6.5.1, this problem will be addressed by setting a new option named “enableGraphQueries” to “false". Your fieldtype will look like this: - - -- Steve www.lucidworks.com > On Apr 4, 2017, at 5:32 PM, Steve Rowe wrote: >

[ANNOUNCE] Apache Solr Reference Guide for Solr 6.5 released

2017-04-05 Thread Cassandra Targett
The Lucene PMC is pleased to announce that the Solr Reference Guide for Solr 6.5 has been released. This 782-page PDF is the definitive guide to using Apache Solr, the search server built on Apache Lucene. The Guide can be downloaded from:

DisMax search on field only if it exists otherwise fall-back to another

2017-04-05 Thread Georg Sorst
Hi list! The question was already asked by Neil Prosser sometime in 2015 but apparently never got a reply, so here's to better luck this time: At the moment I'm using a DisMax query which looks something like the following (massively cut-down): ?defType=dismax =some query =field_one^0.5

Expressing "not equals" in Block Join Parent Query

2017-04-05 Thread Zheng Lin Edwin Yeo
Hi, Is there any way which we can express not equals in Block Join Parent Query? For example, I want to find accountNo_s that is not equal to 123456 Currently, I am putting it in this way for it to work: ={!parent which="contentType_s:Header"}accountNo_s:* AND !accountNo_s :"123456" It does

Re: Implementing DIH - Using a non-datetime change tracking column to Identify delta

2017-04-05 Thread Shawn Heisey
On 4/4/2017 7:40 AM, subinalex wrote: > Can we use a non-datetime column to identify delta rows in deltaQuery for > DIH configuration. > Like for example in the below deltaQuery , > > deltaQuery="select ID from category where last_modified > '${dih.last_index_time}'" > > the delta rows are

Re: SOLR IndexSearcher Opening

2017-04-05 Thread Erick Erickson
This question is pretty open-ended. Why do you care? What behavior are you trying to understand? This blog has some information, you missed soft commits. https://lucidworks.com/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/ Best, Erick On Wed, Apr 5, 2017 at

Re: SQL with join column mismatch

2017-04-05 Thread Jordi Domingo Borràs
Thanks Joel! On Wed, Apr 5, 2017 at 6:30 PM, Joel Bernstein wrote: > Joins are not supported yet. > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Wed, Apr 5, 2017 at 12:14 PM, Jordi Domingo Borràs < > jordi.domi...@gmail.com> wrote: > > > Hi! > > > > I don't know

SOLR IndexSearcher Opening

2017-04-05 Thread Murari, Ramesh Babu
Hi All, Can you please tell me what all conditions can cause a SOLR Instance to Re-Open a searcher. I know Replication & Hard Commit are obvious answers to this, can you please help me understand what else can trigger reopening a searcher. Thanks in advance. Thanks & Regards,

Re: SQL with join column mismatch

2017-04-05 Thread Joel Bernstein
Joins are not supported yet. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Apr 5, 2017 at 12:14 PM, Jordi Domingo Borràs < jordi.domi...@gmail.com> wrote: > Hi! > > I don't know if I'm missing something. I did a simple join query asking for > one column in each table. The result contains

SQL with join column mismatch

2017-04-05 Thread Jordi Domingo Borràs
Hi! I don't know if I'm missing something. I did a simple join query asking for one column in each table. The result contains the same column from the right table. (I read about it in the streaming expressions). curl --data-urlencode 'stmt=select t1.type AS type1, t2.type AS type2 from iocs t1

Re: exapanding solr core over an additional ssd

2017-04-05 Thread Mikhail Khludnev
I would suggest an alias pointing to two collections or just shards param, if Christian followed mailing list policy. On Wed, Apr 5, 2017 at 1:49 PM, Rick Leir wrote: > Hi Christian, If you are using an LVM disk system, you can extend it to > use the additional disk. > >

Re: Number of shards - Best practice

2017-04-05 Thread Mikhail Khludnev
FWIW, you can pass ranges of arbitrary number of shards to SPLITSHARD. Thus you can split on any number of shards. On Wed, Apr 5, 2017 at 5:39 PM, Erick Erickson wrote: > You may be confusing the number of shards you configure and how they > expand using the SPLITSHARD

Re: Number of shards - Best practice

2017-04-05 Thread Erick Erickson
You may be confusing the number of shards you configure and how they expand using the SPLITSHARD command. That command creates two shards where there was one before, so in that sense Solr collections can grow by a factor of 2. But that doesn't mean anything about the number of shards you started

Re: distinct records based on a field

2017-04-05 Thread Emir Arnautovic
You cannot use field collapsing on these fields and get correct result. You need to collapse on BC pair. If you introduce field D that is BC you can use something like: q=A:"XYZ"={!collapse field=D}=true=C Collapse query parser will make sure only 1 BC pair is returned and faceting will

Re: Problems creating index for suggestions

2017-04-05 Thread Alexis Aravena Silva
Hi Erick, numDocs and MaxDocs = 8. This is the content of the field _sugerencia_: [cid:e03430ab-ff19-4955-a6da-d50b38e89b3d] I've noticed that the problem is when Solr builds the fuzzySuggester index, in this type of suggestion, the temp file grow greatly and when the process finish it

Re: distinct records based on a field

2017-04-05 Thread Joel Bernstein
In Solr 6 you can do a sql SELECT DISTINCT ... query as well. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Apr 5, 2017 at 9:11 AM, VJ wrote: > My document schema has fields like: > A,B,C > I am querying for documents with A="XYZ", suppose it returns 5 records > A

Re: Streaming Expression strange behaviour

2017-04-05 Thread Joel Bernstein
Yeah, that's a really common mistake. I think the plan is to make a separate "shuffle" expression that always uses the /export handler. That way people aren't dealing with Solr request handler decisions, they just choose the proper function for the task.Then we can also make the search expression

Re: Streaming Expression strange behaviour

2017-04-05 Thread Jordi Domingo Borràs
Hi there, The problem with my query is that it's using the select endpoint by default (instead of export). The select endpoint returns a limited number of rows so I don't get any results. The fastest option (not for production) is to include the number of desired rows in the search query:

Re: distinct records based on a field

2017-04-05 Thread VJ
My document schema has fields like: A,B,C I am querying for documents with A="XYZ", suppose it returns 5 records AB C XYZFoo cat1 XYZFoo cat2 XYZBar cat1 XYZBar cat1 XYZBar cat2 out

Re: distinct records based on a field

2017-04-05 Thread Emir Arnautovic
Hi VJ, You can use field collapsing feature to do distinct (https://cwiki.apache.org/confluence/display/solr/Result+Grouping) or maybe you can use facet pivoting and pivot on distinct field to get number of doc in each if needed

Re: distinct records based on a field

2017-04-05 Thread Binoy Dalal
Faceting will by default only get the distinct records. On Wed 5 Apr, 2017, 16:38 VJ, wrote: > Hi, > > > Is there any way to pass only distinct records (based on a field) out of a > solr query? > I want to facet the records based on a field but want to restrict the >

Streaming Expression strange behaviour

2017-04-05 Thread Jordi Domingo Borràs
Hi! I'm new to Streaming Expressions (running solr cloud 6.5.0) and I'm facing a weird behaviour that I don't understand. I'm able to make a join and retrieve results: select( innerJoin( select( search(iocs, q=ip:"10.153.9.175", fl="ip,indicator, type", sort="ip asc"), indicator AS

distinct records based on a field

2017-04-05 Thread VJ
Hi, Is there any way to pass only distinct records (based on a field) out of a solr query? I want to facet the records based on a field but want to restrict the results to distinct records before applying the facet. Thanks, VJ

Re: exapanding solr core over an additional ssd

2017-04-05 Thread Rick Leir
Hi Christian, If you are using an LVM disk system, you can extend it to use the additional disk. Otherwise, stop Solr, move the data directory to the larger disk, perhaps use a symbolic link. Or, there is a Solr config value which can be used to identify the data dir. Now restart Solr. Cheers

exapanding solr core over an additional ssd

2017-04-05 Thread FOTACHE CHRISTIAN
Hi everyone,I'm having this problem: I'm running a solr instance on an 120Gb ssd, but the solr core is growing quickly and I badly need extra-space. I have another 240Gb ssd that I can attach to my laptop but I don't know how how to make the solr core expand naturally on the newly attach

Re: Fq and termfrequency are not showing the correct results

2017-04-05 Thread Ayush Gupta
Thanks for the reply. Actaully I've always used termfreq to get word counts for 2- , 3- and 4- word keywords. we use shingles to accomplish this. (ShingleFactory) On Wed, Apr 5, 2017 at 12:17 AM, Erick Erickson wrote: > Functions like termfreq operate on single terms