Re: [MASSMAIL]Re: How to boost query based on result of subquery?

2016-02-20 Thread Jorge Luis Betancourt González
Hi Rajesh, Have you taken a look at Query Re-Ranking? The idea is a little different from what you want, but I think it should work: essentially you use your normal search query and then re-rank the top-n documents using a second query. This second query could use the position field to influence y
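
The re-ranking idea in this message can be sketched as a set of request parameters. This is only an illustration under stated assumptions: the main query "laptop", the range on the position field, and the reRankDocs/reRankWeight values are made up for the example; the rq/{!rerank} syntax is Solr's Query Re-Ranking feature.

```python
from urllib.parse import urlencode


def rerank_params(main_query, rerank_query, rerank_docs=100, rerank_weight=2.0):
    """Build Solr parameters that re-rank the top hits of main_query.

    Only the top `rerank_docs` documents of `q` are re-scored with
    `rerank_query`; its score is multiplied by `rerank_weight` and added
    to the original score.
    """
    return {
        "q": main_query,
        "rq": "{!rerank reRankQuery=$rqq reRankDocs=%d reRankWeight=%s}"
              % (rerank_docs, rerank_weight),
        "rqq": rerank_query,
    }


# Hypothetical values: "laptop" and the position range are assumptions.
params = rerank_params("laptop", "position:[1 TO 10]")
print("/select?" + urlencode(params))
```

Keeping the second query in its own parameter (rqq) and dereferencing it with $rqq avoids having to escape it inside the local-params block.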

Re: [MASSMAIL] Re: ConcurrentUpdateSolrClient ignoring the collection param in some methods

2015-06-28 Thread Jorge Luis Betancourt González
CloudSolrClient will route all the documents to the correct leader, leading to better performance. That class scales nearly linearly in terms of indexing throughput with the number of shards FWIW, Erick On Sat, Jun 27, 2015 at 2:32 AM, Jorge Luis Betancourt González wrote: > Thanks for

Re: [MASSMAIL] Re: ConcurrentUpdateSolrClient ignoring the collection param in some methods

2015-06-28 Thread Jorge Luis Betancourt González
route all the documents to the correct leader, leading to better performance. That class scales nearly linearly in terms of indexing throughput with the number of shards FWIW, Erick On Sat, Jun 27, 2015 at 2:32 AM, Jorge Luis Betancourt González wrote: > Thanks for the prompt reply Shawn

Re: ConcurrentUpdateSolrClient ignoring the collection param in some methods

2015-06-26 Thread Jorge Luis Betancourt González
some methods On 6/26/2015 2:27 PM, Jorge Luis Betancourt González wrote: > I'm trying to use the ConcurrentUpdateSolrClient class, which has some methods > that accept an additional parameter to indicate the collection; some of these > methods are add(String collection, SolrInputDoc

ConcurrentUpdateSolrClient ignoring the collection param in some methods

2015-06-26 Thread Jorge Luis Betancourt González
Hi all, I'm trying to use the ConcurrentUpdateSolrClient class, which has some methods that accept an additional parameter to indicate the collection; some of these methods are add(String collection, SolrInputDocument doc) and request(SolrRequest, String collection). With HttpSolrClient this works f

Re: [MASSMAIL]Re: High fieldNorm values causing really odd results

2015-05-18 Thread Jorge Luis Betancourt González
the doc/field level. Is this a desired behaviour? Regards, - Original Message ----- From: "Jorge Luis Betancourt González" To: solr-user@lucene.apache.org Sent: Thursday, May 14, 2015 11:49:18 PM Subject: Re: [MASSMAIL]Re: High fieldNorm values causing really odd results Regarding t

"boost" field in my schema.xml

2015-05-18 Thread Jorge Luis Betancourt González
Does a boost field in Solr have any use in the score calculation? From what I can see in [1], if a boost attribute is used at the doc/field level it may be encoded in the norm field and then used to boost the specific match in the doc/field. But I have a schema.xml with a boost field defined and using

Re: [MASSMAIL]Re: High fieldNorm values causing really odd results

2015-05-14 Thread Jorge Luis Betancourt González
Regarding the experiment, sorry if I explained myself badly: the indexed document doesn't have 119669 terms, it has a lot fewer (fewer than 1000 terms; I don't have the exact number here now). Instead, 119669 is the number of distinct terms reported by Luke (Top-terms total in the a

Re: [MASSMAIL]Re: High fieldNorm values causing really odd results

2015-05-14 Thread Jorge Luis Betancourt González
Hi Hoss, First of all, thank you for your reply. Sorry for leaving the Solr version out of my previous email. I'm using Solr 4.10.3 running on CentOS 7, with the following JRE: Oracle Corporation OpenJDK 64-Bit Server VM (1.7.0_75 24.75-b04). These are the relevant portions of my schema.xml

High fieldNorm values causing really odd results

2015-05-14 Thread Jorge Luis Betancourt González
Hi everyone: For the last couple of weeks I've been noticing some really odd results in my Solr server. Searching for the root cause, the one thing I can point out is a very high value of the fieldNorm parameter in the score calculation. A snippet of the debug info: { "match":true, "value":4

Re: [MASSMAIL]Total number of Facets

2015-02-09 Thread Jorge Luis Betancourt González
So, bottom line, you're trying to get the count of distinct values in the loginName field? At least based on your query "*:*". If this is what you're after, check out the Stats component, especially the calcDistinct parameter, although if you expect a really high cardinality in the field this could b
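
The Stats component suggestion in this message could look roughly like the request below. It is a sketch, not a tested setup: the field name loginName and the "*:*" query come from the message, while rows=0 is an assumption added to skip the document list.

```python
from urllib.parse import urlencode


def distinct_count_params(field, query="*:*"):
    """Build Solr parameters asking the Stats component for distinct values."""
    return {
        "q": query,
        "rows": 0,                     # assumption: only the stats are wanted
        "stats": "true",
        "stats.field": field,
        "stats.calcdistinct": "true",  # adds distinctValues and countDistinct
    }


print("/select?" + urlencode(distinct_count_params("loginName")))
```

With calcdistinct enabled, the stats section of the response should carry a countDistinct entry for the field. As the message warns, the cardinality of the field matters here.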

Re: [MASSMAIL]Re: Trending functionality in Solr

2015-02-08 Thread Jorge Luis Betancourt González
For a project I'm working on, what we do is store the user's query in a separate core that we also use to provide query autocomplete functionality. So far, the frontend app is responsible for sending the query to Solr, meaning: 1. execute the query against our search core and 2. send an updat

Re: [MASSMAIL]Re: "Contextual" sponsored results with Solr

2015-01-28 Thread Jorge Luis Betancourt González
sults. We excluded 'special N' with -id:(1 2 3 ... N) type query. all done on client side. Ahmet On Tuesday, January 27, 2015 8:28 PM, Jorge Luis Betancourt González wrote: Hi all, Recently I got an interesting use case that I'm not sure how to implement, the idea is that t
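
The client-side exclusion described in this reply, the -id:(1 2 3 ... N) clause, can be sketched as a small helper. The function name and the sample ids are hypothetical; only the shape of the clause comes from the message.

```python
def exclude_ids_clause(ids, field="id"):
    """Return a negative clause such as -id:(1 2 3), or "" for no ids."""
    ids = [str(i) for i in ids]
    if not ids:
        return ""
    return "-%s:(%s)" % (field, " ".join(ids))


# Hypothetical ids of the promoted documents already shown at the top.
print(exclude_ids_clause([1, 2, 3]))
```

The resulting clause would be sent with the regular query (for example as a filter query) so the promoted documents are not repeated in the organic results.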

"Contextual" sponsored results with Solr

2015-01-27 Thread Jorge Luis Betancourt González
Hi all, Recently I got an interesting use case that I'm not sure how to implement. The idea is that the client wants a fixed number of documents, let's call it N, to appear at the top of the results. Let me explain a little: we're working with web documents, so the idea is to promote the documen

Re: Indexed epoch time in Solr

2015-01-25 Thread Jorge Luis Betancourt González
Perhaps you could use a DocTransformer to convert the unix time field into any representation you want? You'll need to write a custom DocTransformer, but this is not a complex task. Regards, - Original Message - From: "Ahmed Adel" To: solr-user@lucene.apache.org Sent: Monday, January 26, 2

Re: [MASSMAIL]Weighting of prominent text in HTML

2015-01-25 Thread Jorge Luis Betancourt González
Hi Dan: Agreed, this question is more Nutch related than Solr ;) Nutch doesn't send any data to the /update/extract request handler; all the text and metadata extraction happens on the Nutch side rather than relying on the ExtractRequestHandler provided by Solr. Underneath, Nutch uses Tika, the same te

Boost documents based on criteria

2015-01-23 Thread Jorge Luis Betancourt González
Hi all, Recently I got an interesting use case that I'm not sure how to implement. The idea is that the client wants a fixed number of documents, let's call it N, to appear at the top of the results. Let me explain a little: we're working with web documents, so the idea is to promote the documen

Re: Avoiding wildcard queries using edismax query parser

2015-01-23 Thread Jorge Luis Betancourt González
, January 23, 2015 8:26:48 AM Subject: RE: Avoiding wildcard queries using edismax query parser Here's a Jira for this: https://issues.apache.org/jira/browse/SOLR-3031 I've attached a patch there that might be useful for you. -Michael -----Original Message----- From: Jorge Luis Betanco

Re: Avoiding wildcard queries using edismax query parser

2015-01-22 Thread Jorge Luis Betancourt González
uery parser The dismax query parser does not support wildcards. It is designed to be simpler. -- Jack Krupansky On Thu, Jan 22, 2015 at 5:57 PM, Jorge Luis Betancourt González < jlbetanco...@uci.cu> wrote: > I was also suspecting something like that, the odd thing was that the with > the

Re: Avoiding wildcard queries using edismax query parser

2015-01-22 Thread Jorge Luis Betancourt González
; To: "solr-user" Sent: Thursday, January 22, 2015 4:46:08 PM Subject: Re: Avoiding wildcard queries using edismax query parser I suspect the special characters get caught before the analyzer chains. But what about prepending a custom search component? Regards, Alex. Sign up

Avoiding wildcard queries using edismax query parser

2015-01-22 Thread Jorge Luis Betancourt González
Hello all, Currently we are using the edismax query parser in an internal application. We've detected that some wildcard queries including "*" are causing some performance issues, and for this particular case we're not interested in allowing any user to request all the indexed documents. This coul

Re: Distinct Results from Solr Query

2015-01-19 Thread Jorge Luis Betancourt González
I think this sounds like grouping results by field? You should enable groups by adding &group=true&group.field=YOURFIELD to test this feature. For each unique value of the field specified in group.field, Solr returns a docList with the *top scoring document*. In the docList you can see the total
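
The grouping parameters mentioned in this message can be assembled as shown below. This is a sketch: YOURFIELD is the placeholder from the message, and group.ngroups and group.limit are optional additions that are not part of the original suggestion.

```python
from urllib.parse import urlencode


def grouping_params(query, field, per_group=1):
    """Build Solr parameters that group results by the values of one field."""
    return {
        "q": query,
        "group": "true",
        "group.field": field,
        "group.limit": per_group,   # documents returned per group (Solr default: 1)
        "group.ngroups": "true",    # also report how many groups matched
    }


print("/select?" + urlencode(grouping_params("*:*", "YOURFIELD")))
```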

Re: Solr Compile error

2014-11-29 Thread Jorge Luis Betancourt González
Is this the full output? Try verbose. I had an issue with the same library; in my case I was downloading from a local Nexus mirror, but the problem was a bad checksum. I figured this out with the ant -verbose compile command. I disabled the checksum check for my local Nexus and got it working j

Re: How to dynamically create Solr cores with schema

2014-11-08 Thread Jorge Luis Betancourt González
I remember a talk by CareerBuilder where they wrote an API using the approach explained by Alexandre and they got really good results. - Original Message - From: "Anurag Sharma" To: solr-user@lucene.apache.org Sent: Saturday, November 8, 2014 7:58:48 AM Subject: Re: How to dynamically crea

Question about StandardTokenizer in Solr 4.9

2014-11-02 Thread Jorge Luis Betancourt González
Hi all: From the description of the StandardTokenizer, it should recognize Internet domain names and email addresses and preserve them as a single token, which works great, but I've detected that in cases like this: socks25.domain.com it outputs 2 tokens: socks25 | domain.com if the URL d

Re: Design optimal Solr Schema

2014-10-30 Thread Jorge Luis Betancourt González
Are you going to use the values stored in Solr to display the data in HTML? For searching purposes I suggest deleting all the HTML tags and storing the plain text. For this you could use the HTMLStripCharFilterFactory char filter; this will "clean" your content and only pass the actual text whic

Re: Clear Solr Admin Interface Logging page's logs

2014-10-29 Thread Jorge Luis Betancourt González
Although this looks like a nice & simple addition to the web interface. - Original Message - From: "Ramzi Alqrainy" To: solr-user@lucene.apache.org Sent: Wednesday, October 29, 2014 3:18:26 PM Subject: Re: Clear Solr Admin Interface Logging page's logs Yes sure, if you use jetty containe

Re: AW: (auto)suggestions, but ony from a "filtered" set of documents

2014-10-25 Thread Jorge Luis Betancourt González
Maybe you could use copyField to populate one field with the values (fields) that you'd like to use for suggestions. Then you could search only on this field, filter by any other field in your schema, and take advantage of the EdgeNGramFilter placed in the new search field. Regards, - Ori

Re: Analytics component

2014-10-23 Thread Jorge Luis Betancourt González
I believe some of the statistics functions that you're trying to use are present in facets. - Original Message - From: "nabil Kouici" To: solr-user@lucene.apache.org Sent: Thursday, October 23, 2014 5:57:27 AM Subject: Analytics component Hi All, I'm trying to use Solr to do some ana

Re: Solr relevancy tuning

2014-05-04 Thread Jorge Luis Betancourt González
One good thing about kelvin is that it's more of a programmatic task, so you could execute the scripts after a few changes/deployments and get a general idea of whether the new changes have impacted the search experience. Yeah, sure, the changing catalog is still a problem, but I kind of like to be able to execu

Re: Solr middle-ware?

2014-01-22 Thread Jorge Luis Betancourt González
I would love to see some proxy-like application implemented in Go (partly because of my desire to have time to check out Go). - Original Message - From: "Shawn Heisey" To: solr-user@lucene.apache.org Sent: Wednesday, January 22, 2014 10:38:34 AM Subject: Re: Solr middle-ware? On 1/22/2014 12

Re: Search Suggestion Filtering

2014-01-16 Thread Jorge Luis Betancourt González
In a custom application we have, we use a separate core (under Solr 3.6.1) to store the queries used by the users and then provide the autocomplete feature. In our case we need to filter out some phrases that we don't want suggested to the users. I built a custom UpdateRequestProcessor to i

Re: Trigger event on change of a field in a document

2014-01-05 Thread Jorge Luis Betancourt González
With a custom UpdateRequestProcessor this would be doable, but depending on when this event will be listened for, perhaps Otis is right. - Original Message - From: "Utkarsh Sengar" To: solr-user@lucene.apache.org Sent: Friday, December 27, 2013 7:29:40 PM Subject: Re: Trigger event on change o

Prevent indexing of several phrases

2013-12-21 Thread Jorge Luis Betancourt González
Right now we have a custom use case: basically we are using a separate Solr core to store/suggest queries made by our users in our frontend app (written in Symfony2+Solarium). So each time a user hits our search box the query goes into this particular core. The thing is that there are

Custom update handler with deduplication

2013-12-15 Thread Jorge Luis Betancourt González
Currently I have the following Update Request Processor chain to prevent indexing very similar text items into a core dedicated to storing queries that our users put into the web interface of our system. true false signature textsuggest,textng org.apache.solr.upd