Re: Solutions for Multi-word Synonyms

2016-06-09 Thread Bernd Fehling
As Doug said, you should really try to build your own solution for Multi-word Synonyms because every need is different and you can customize it for your special use case, like adding a Thesaurus. http://www.ub.uni-bielefeld.de/~befehl/base/solr/InsideBase_eurovocThesaurus.html Regards Bernd Am 0

RE: Returned number of result rows as a function of maxScore or numFound.

2016-06-09 Thread Prasanna Josium
Thanks Erick & Binoy, I will try out the 2 query technique. Guess this will work for the numFound-related issue. I guess I was not very clear in stating my problem. The problem I'm dealing with is mostly with maxScore. I have a collection (~500K docs) where I look for matches to the query. Because of t
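For illustration only, here is a minimal sketch of the row calculation behind a two-query approach, assuming the first query is sent with rows=0 just to read numFound and the second query then asks for a fraction of that count. The function name, the 5% fraction and the cap of 100 are hypothetical placeholders, not values from the thread.

```python
import math


def rows_to_request(num_found, fraction=0.05, cap=100):
    """Pick the rows value for the second query from the first query's numFound."""
    if num_found <= 0:
        return 0
    # Round up so a small non-empty result set still returns at least one row.
    wanted = math.ceil(num_found * fraction)
    # Cap the result so a very broad query cannot request a huge page.
    return min(wanted, cap)


# Hypothetical numFound values from a first query sent with rows=0.
print(rows_to_request(500_000))
print(rows_to_request(10))
```

The cap keeps a match-all query from turning into a very large page, which is the main risk of sizing rows from numFound alone.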

Bypassing ExtractingRequestHandler

2016-06-09 Thread Justin Lee
Has anybody had any experience bypassing ExtractingRequestHandler and simply managing Tika manually? I want to make a small modification to Tika to get and save additional data from my PDFs, but I have been procrastinating in no small part due to the unpleasant prospect of setting up a development

Re: Question about multiple fq parameters

2016-06-09 Thread Ahmet Arslan
Hi Mikhail, Can you please explain what this mysterious op parameter is? How is it related to range queries issued on date fields? Thanks, Ahmet On Thursday, June 9, 2016 11:43 AM, Mikhail Khludnev wrote: Shawn, I found "op" at org.apache.solr.schema.DateRangeField.parseSpatialArgs(QParser, S

Re: Scoring changes between 4.10 and 5.5

2016-06-09 Thread Ahmet Arslan
Hi, I wondered the same before and failed to decipher TFIDFSimilarity. Scoring looks like tf*idf*idf to me. I would appreciate it if someone could shed some light on this. Thanks, Ahmet On Friday, June 10, 2016 12:37 AM, Upayavira wrote: I've just done a very simple, single term query against a 4.10

Re: Solutions for Multi-word Synonyms

2016-06-09 Thread MaryJo Sminkey
Thanks, added my vote (which threw an error but looks like it did get added). MJ On Thu, Jun 9, 2016 at 5:41 PM, Upayavira wrote: > Here's a recently created ticket that covers this issue: > > https://issues.apache.org/jira/browse/SOLR-9185 > > Let's hope we see some traction on it soon, as m

Re: Solutions for Multi-word Synonyms

2016-06-09 Thread Upayavira
Here's a recently created ticket that covers this issue: https://issues.apache.org/jira/browse/SOLR-9185 Let's hope we see some traction on it soon, as many people suffer from this issue. Upayavira On Thu, 9 Jun 2016, at 09:10 PM, MaryJo Sminkey wrote: > On Thu, Jun 9, 2016 at 1:50 PM, Joe Laws

Scoring changes between 4.10 and 5.5

2016-06-09 Thread Upayavira
I've just done a very simple, single term query against a 4.10 system and a 5.5 system, each with much the same data. The score for the 4.10 system was essentially made up of the field weight, which is: score = tf * idf Whereas, in the 5.5 system, there is an additional "query weight", which

suggester stack overflow

2016-06-09 Thread Rick Leir
I know how to debug this, but am hoping someone can give me a tip before I dive in! Solr 6.0.0, I just started the server, hoping to build the suggester. from the log: 3625 INFO (searcherExecutor-7-thread-1-processing-x:blinkmon) [ x:blinkmon] o.a.s.s.s.SolrSuggester SolrSuggester.build(mySugge

Re: Question about multiple fq parameters

2016-06-09 Thread Mikhail Khludnev
Steve, It's hard to debug queries in this way. Try to experiment with debugQuery=true, pulling fq to q, just for explanation, etc. On Thu, Jun 9, 2016 at 5:08 PM, Steven White wrote: > Erick, Mikhail, and Shawn, thank you all for your help. > > > > Just a quick re-cap of what I’m trying to achie

Re: Solutions for Multi-word Synonyms

2016-06-09 Thread MaryJo Sminkey
On Thu, Jun 9, 2016 at 1:50 PM, Joe Lawson < jlaw...@opensourceconnections.com> wrote: > The auto-phrasing-token (APT) filter is a two-pronged solution that > requires index and query time processes versus hon-lucene-synonyms (HLS) > which is strictly a query time implementation. The primary take

Re: Nested vs Flattened Indexes

2016-06-09 Thread Rick Leir
Can you use Tika? https://tika.apache.org/0.9/formats.html On Wed, 2016-06-08 at 10:06 -0400, Aniruddh Sharma wrote: > Hi > > I am new to use Solr. > > I am running Solr 4.10.3 on CDH 5.5. > > My use case is , I have real time data ingestion in Hadoop on which I want > to implement search. >

Re: Checking performance of plugins, queryParser, edismax, etc

2016-06-09 Thread Rick Leir
On Wed, 2016-06-08 at 11:56 +0800, Zheng Lin Edwin Yeo wrote: > Hi, > > Would like to find out, is there a way to check the performance of > the queryParser and things like edismax in Solr? > > I have tried on the debug=true, but it only show general information like > the time taken for query,

Re: Solr 4.3.1 - Does not load the new data using the Java application

2016-06-09 Thread Shawn Heisey
On 6/9/2016 4:13 AM, SRINI SOLR wrote: > *Now the issue is - * > *If I index the new data in Solr - the same data is not getting loaded > through Java application until and un-less if I again load the Core > Container using **embeddedSolrServer.getCoreContainer().load().* This sounds like you are

Re: Solutions for Multi-word Synonyms

2016-06-09 Thread Joe Lawson
> > I'm wondering if anyone has experience using the autophrasing solution on > the Lucidworks blog: > > > https://lucidworks.com/blog/2014/07/12/solution-for-multi-term-synonyms-in-lucenesolr-using-the-auto-phrasing-tokenfilter/ > > The auto-phrasing-token (APT) filter is a two-pronged solution th

Re: Solutions for Multi-word Synonyms

2016-06-09 Thread MaryJo Sminkey
On Thu, Jun 9, 2016 at 11:06 AM, Doug Turnbull < dturnb...@opensourceconnections.com> wrote: > Honestly half the time I run into this problem, I end up creating a > QParserPlugin because I need to do something specific. With a QParserPlugin > I can run whatever analysis, slicing and dicing of the

Re: Questions regarding re-index when using Solr as a data source

2016-06-09 Thread Walter Underwood
In the HowToReindex page, under “Using Solr as a Data Store”, it says this: “Don't do this unless you have no other option. Solr is not really designed for this role.” So don’t start by planning to do this. Using a second copy of Solr is still using Solr as a repository. That doesn’t satisfy an

Re: Returned number of result rows as a function of maxScore or numFound.

2016-06-09 Thread Erick Erickson
Why do this at all? I have a hard time understanding what benefit this is to the _user_. And even returning 5% is risky. I mean what happens for a query of *:*? For a corpus of 100M docs that's still 5M documents, which would hurt. Sure, you say, well I'll cap it at XXX docs. The principle stil

RE: Questions regarding re-index when using Solr as a data source

2016-06-09 Thread Hui Liu
Hi Walter, Thank you for the reply. Sorry, I need to clarify what I mean by 'migrate tables' from Oracle to Solr: we are not literally moving existing records from Oracle to Solr; instead, we are building a new application that directly feeds data into Solr as documents and fields, in parallel with another

RE: [E] Re: Question about Data Import Handler

2016-06-09 Thread Jamal, Sarfaraz
I am on SOLR6 =) Thanks, Sas -Original Message- From: Mikhail Khludnev [mailto:mkhlud...@griddynamics.com] Sent: Thursday, June 9, 2016 12:42 PM To: solr-user Subject: [E] Re: Question about Data Import Handler which version of Solr do you run? On Thu, Jun 9, 2016 at 6:23 PM, Jamal,

Re: Question about Data Import Handler

2016-06-09 Thread Mikhail Khludnev
which version of Solr do you run? On Thu, Jun 9, 2016 at 6:23 PM, Jamal, Sarfaraz < sarfaraz.ja...@verizonwireless.com.invalid> wrote: > Hi Guys, > > I have a question about the data import handler and its configuration file > > This is what a part of my data-config looks like: > > >

Re: Questions regarding re-index when using Solr as a data source

2016-06-09 Thread Walter Underwood
First, using Solr as a repository is pretty risky. I would keep the official copy of the data in a database, not in Solr. Second, you can’t “migrate tables” because Solr doesn’t have tables. You need to turn the tables into documents, then index the documents. It can take a lot of joins to flat

Questions regarding re-index when using Solr as a data source

2016-06-09 Thread Hui Liu
Hi, We are porting an application currently hosted in Oracle 11g to Solr Cloud 6.x, i.e., we plan to migrate all tables in Oracle as collections in Solr, index them, and build search tools on top of this; the goal is that we won't be using Oracle at all after this has been implemented; e

Question about Data Import Handler

2016-06-09 Thread Jamal, Sarfaraz
Hi Guys, I have a question about the data import handler and its configuration file This is what a part of my data-config looks like: ==

Re: Solutions for Multi-word Synonyms

2016-06-09 Thread Doug Turnbull
Mary Jo, Honestly half the time I run into this problem, I end up creating a QParserPlugin because I need to do something specific. With a QParserPlugin I can run whatever analysis, slicing and dicing of the query string to manually construct whatever I need to http://www.supermind.org/blog/1134/

Re: Question about multiple fq parameters

2016-06-09 Thread Steven White
Erick, Mikhail, and Shawn, thank you all for your help. Just a quick re-cap of what I’m trying to achieve: my need is to combine 2 or more “fq” queries to be treated as OR. Erick, Mikhail, I have the syntax you provided but I cannot get them to work properly, in fact I’m seeing odd behavior t
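As a rough sketch of one way to get OR semantics: separate fq parameters are intersected, so the clauses can be joined with OR inside a single fq value instead. The helper names and the DateB field are hypothetical, and the range syntax assumes the fields accept ordinary Lucene-style range clauses.

```python
from urllib.parse import urlencode


def build_or_filter(clauses):
    """Join filter clauses into one fq value so Solr treats them as OR."""
    # Parenthesise each clause so its own operators stay grouped.
    return " OR ".join(f"({c})" for c in clauses)


def build_query_string(q, clauses):
    """Return the URL query string with a single combined fq parameter."""
    return urlencode({"q": q, "fq": build_or_filter(clauses)})


# Hypothetical date-range filters on two fields.
filters = [
    "DateA:[2020-01-01 TO 2030-01-01]",
    "DateB:[2020-01-01 TO 2030-01-01]",
]
print(build_query_string("*:*", filters))
```

One trade-off of this approach: the combined filter is cached as a single entry in the filter cache, rather than each clause being cached and reused separately.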

Re: Solr 4.3.1 - Does not load the new data using the Java application

2016-06-09 Thread Upayavira
Firstly, I'm not sure why you are using embeddedSolrServer. You would be much better off running a standalone Solr server, and connecting to it with a SolrClient, in Java. Then you can do client.commit(); to execute a commit. EmbeddedSolrServer behaves slightly differently from normal Solr, and wi
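For a standalone server, a commit can also be requested over plain HTTP by passing commit=true to the core's update handler. The sketch below only builds that URL; the host, the core name and the helper name are assumptions for illustration, and it does not cover the EmbeddedSolrServer case.

```python
from urllib.parse import quote, urlencode


def commit_url(base_url, core):
    """Build the URL that asks a standalone Solr core to commit."""
    # commit=true on the update handler makes pending documents searchable.
    params = urlencode({"commit": "true"})
    return f"{base_url.rstrip('/')}/{quote(core)}/update?{params}"


# Hypothetical host and core name; adjust to the actual deployment.
print(commit_url("http://localhost:8983/solr", "mycore"))
```

Requesting this URL after indexing has the same purpose as calling commit() on a SolrJ client.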

Re: Question about content indexing with Alfresco

2016-06-09 Thread Rick Leir
Is there some reason you are using version 1.4? In the Solr admin dashboard you can load your core and do queries against it. On June 9, 2016 5:06:33 AM EDT, OTEC Jordi Florit wrote: >Hi, > >I'm using Alfresco 4.2.6 and SOLR 1.4, and I want to verify if my >content is indexing on SOLR or not. I

Per-query boosts in MLT

2016-06-09 Thread Marc Burt
Hi, Is it possible to assign boosts to the MLT similarity fields instead of the defaults set in the config when making a MLT query? I'm currently using a query parser and attempting /select?q={!mlt qf=foo^10,bar^20,upc^50}/id /etc but it's taking the boost to be part of the field name. -- K

Re: Solr 4.3.1 - Does not load the new data using the Java application

2016-06-09 Thread SRINI SOLR
Hi Upayavira / Team - Can you please explain in detail how to do the commit? If we do the commit, will the new data be available to the Java application without calling *embeddedSolrServer.getCoreContainer().load()* again? Please help me here ... Thanks in advance. On Th

Re: Solr 4.3.1 - Does not load the new data using the Java application

2016-06-09 Thread Upayavira
Are you executing a commit? You must commit before your content becomes visible. Upayavira On Thu, 9 Jun 2016, at 11:13 AM, SRINI SOLR wrote: > Hi Team - > Can you please help me out on the below issue ... > > We are using the Solr 4.3.1 version. > > Integrated Solr 4.3.1 with Java application

Solr 4.3.1 - Does not load the new data using the Java application

2016-06-09 Thread SRINI SOLR
Hi Team - Can you please help me out on the below issue ... We are using the Solr 4.3.1 version. Integrated Solr 4.3.1 with Java application using EmbeddedSolrServer. Using this EmbeddedSolrServer in java - loading the core container as below ... *embeddedSolrServer.getCoreContainer().load();*

Question about content indexing with Alfresco

2016-06-09 Thread OTEC Jordi Florit
Hi, I'm using Alfresco 4.2.6 and SOLR 1.4, and I want to verify whether my content is being indexed in SOLR or not. I added alfresco.index.transformContent=false to my solrcore.properties, but I want to verify that everything is working correctly. Is there some place (on SOLR url https://localhost:8443/solr or someth

SolrInputDocument required id in solr5.4 but the same program run on solr5.0 without any id

2016-06-09 Thread pratika.sarda
Hi, SolrInputDocument requires id in solr5.4 but the same program runs on solr5.0 without any id when adding a doc. org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/CampaignCore: [doc=null] missing required field: campaignId at org.a

Re: Question about multiple fq parameters

2016-06-09 Thread Mikhail Khludnev
Shawn, I found "op" at org.apache.solr.schema.DateRangeField.parseSpatialArgs(QParser, String). On Thu, Jun 9, 2016 at 1:46 AM, Shawn Heisey wrote: > On 6/8/2016 2:28 PM, Steven White wrote: > > ?q=*&q.op=OR&fq={!field+f=DateA+op=Intersects}[2020-01-01+TO+2030-01-01] > > Looking at this and chec

Re: Question about CloudSolrServer

2016-06-09 Thread Naveen Pajjuri
Thanks *Shawn.* I was using an older version of solrj. Upgrading it to a newer version worked. Thank you. On Thu, Jun 9, 2016 at 11:41 AM, Shawn Heisey wrote: > On 6/8/2016 11:44 PM, Naveen Pajjuri wrote: > > Trying to migrate from HttpSolrServer to CloudSolrServer. getting the > > following excepti

Solr6 CDCR issue with a 3 cloud design

2016-06-09 Thread dmitry.medvedev
I've set up a 3-cloud CDCR environment (Source => Target1-Source2 => Target2), and the replication process works perfectly, but when I shut down the Target1-Source2 cloud (the mediator, to test resilience), index/push some docs to the Source1 cloud, and bring the Target1-Source2 cloud back online aft

Re: Solr 6.1.x Release Date ??

2016-06-09 Thread Ramesh Shankar
Hi, I found the [subquery] transformer working in the solr-6.1.0-79 nightly build. Regards Ramesh On Tue, Jun 7, 2016 at 11:08 AM, Ramesh Shankar wrote: > Hi, > > Any idea of Solr 6.1.X Release Date ?? > > I am interested in the [subquery] transformer and like to know the release > date since its

RE: Using Solr to index zip files

2016-06-09 Thread anupama . gangadhar
Hi, The nesting level is fixed. The outer zip has many inner zip files (i.e. 1.zip has many zip files). Currently the outer zip path and inner zip name are stored in a Hive table for reference. I use a Hive query to find the zip for me. I intend to index the outer zip file and store all the inner zips