Re: Re: solr 4.2.1 index gets slower over time

2014-04-01 Thread Dmitry Kan
Thanks, Markus, that is useful. I'm guessing the higher the weight, the longer the op takes? On Tue, Apr 1, 2014 at 10:39 PM, Markus Jelsma wrote: > You may want to increase reclaimdeletesweight for tieredmergepolicy from 2 > to 3 or 4. By default it may keep too much deleted or updated docs in

The word "no" in a query

2014-04-01 Thread Bob Laferriere
I have built an commerce search engine. I am struggling with the word “no” in queries. We have products that are “No Smoking Sign.” When the query is “Smoking AND Sign” the product is found. If I query as “No AND Sign” I get no results? I do not have no as a stop word. Any ideas why I would get

Re: eDismax parser and the mm parameter

2014-04-01 Thread William Bell
Fuzzy is provided use ~ On Mon, Mar 31, 2014 at 11:04 PM, S.L wrote: > Jack , > > Thanks a lot , I am now using the pf ,pf2 an pf3 and have gotten rid of > the mm parameter from my queries, however for the fuzzy phrase queries , I > am not sure how I would be able to leverage the Complex Query

Re: how do I get search for "fort st john" to match "ft saint john"

2014-04-01 Thread alxsss
It seems to me that, you are missing this line under Alex. -Original Message- From: solr-user To: solr-user Sent: Tue, Apr 1, 2014 5:01 pm Subject: Re: how do I get search for "fort st john" to match "ft saint john" Hi Eric. Sorry, been away. The city_index_synonyms

Re: how do I get search for "fort st john" to match "ft saint john"

2014-04-01 Thread solr-user
Hi Eric. Sorry, been away. The city_index_synonyms.txt file is pretty small as it contains just these two lines: saint,st,ste fort,ft There is nothing at all in the city_query_synonyms.txt file, and it isn't used either. My understanding is that solr would create the appropriate synonym entr

Re: More Robust Search Timeouts (to Kill Zombie Queries)?

2014-04-01 Thread Mikhail Khludnev
Hello Salman, Let's me drop few thoughts on http://mail-archives.apache.org/mod_mbox/lucene-solr-user/200903.mbox/%3c856ac15f0903272054q2dbdbd19kea3c5ba9e105b...@mail.gmail.com%3E There two aspects of this question: 1. dealing with long running processing (thread divergence actions http://docs.ora

Re: tf and very short text fields

2014-04-01 Thread Walter Underwood
Thanks! We'll try that out and report back. I keep forgetting that I want to try BM25, so this is a good excuse. wunder On Apr 1, 2014, at 12:30 PM, Markus Jelsma wrote: > Also, if i remember correctly, k1 set to zero for bm25 automatically omits > norms in the calculation. So thats easy to p

Re: Re: solr 4.2.1 index gets slower over time

2014-04-01 Thread Markus Jelsma
You may want to increase reclaimdeletesweight for tieredmergepolicy from 2 to 3 or 4. By default it may keep too much deleted or updated docs in the index. This can increase index size by 50%!! Dmitry Kan schreef:Elisabeth, Yes, I believe you are right in that the deletes are part of the optim

Re: Re: tf and very short text fields

2014-04-01 Thread Markus Jelsma
Also, if i remember correctly, k1 set to zero for bm25 automatically omits norms in the calculation. So thats easy to play with without reindexing. Markus Jelsma schreef:Yes, override tfidfsimilarity and emit 1f in tf(). You can also use bm25 with k1 set to zero in your schema. Walter Under

Re: Block until replication finishes

2014-04-01 Thread Mikhail Khludnev
On Tue, Apr 1, 2014 at 5:02 PM, Fermin Silva wrote: > Sorry but I have no clue about how to contribute with code. Will check that > but if someone can point me to the right direction it would be nice. > You are welcome http://wiki.apache.org/solr/HowToContribute Btw, cool finding re wait param!

Re: omitNorms and very short text fields

2014-04-01 Thread Markus Jelsma
Yes, that will work. And combined with your other question scores will always be equal even if cinderella or chuck occur more than once in one document. Walter Underwood schreef:Just double-checking my understanding of omitNorms. For very short text fields like personal names or titles, leng

Re: tf and very short text fields

2014-04-01 Thread Markus Jelsma
Yes, override tfidfsimilarity and emit 1f in tf(). You can also use bm25 with k1 set to zero in your schema. Walter Underwood schreef:And here is another peculiarity of short text fields. The movie "New York, New York" should not be twice as relevant for the query "new york". Is there a way

Re: Solr Seach proposal

2014-04-01 Thread ahmed shawki
Hi All,Hi Furkan and Ahmet, Thanks for your reply to my last email about "Solr Search" proposal (sent on last Sunday, 30-Mar-2014). This is just to announce "Solr Search", which is a simple HTML interface for searching documents which are indexed by Apache Solr (TM). Actually, it was developed

tf and very short text fields

2014-04-01 Thread Walter Underwood
And here is another peculiarity of short text fields. The movie "New York, New York" should not be twice as relevant for the query "new york". Is there a way to use a binary term frequency rather than a count? wunder -- Walter Underwood wun...@wunderwood.org

omitNorms and very short text fields

2014-04-01 Thread Walter Underwood
Just double-checking my understanding of omitNorms. For very short text fields like personal names or titles, length normalization can give odd results. For example, we might want these two to score the same for the query "Cinderella". * Cinderella * Cinderella (Diamond Edition) (Blu-ray + DVD

Re: zookeeper reconnect failure

2014-04-01 Thread Jessica Mallet
Filed: https://issues.apache.org/jira/browse/SOLR-5945 On Tue, Apr 1, 2014 at 11:10 AM, Jessica Mallet wrote: > Will do Mark. Thanks! > > > On Sun, Mar 30, 2014 at 1:29 PM, Mark Miller wrote: > >> We don't currently retry, but I don't think it would hurt much if we did >> - at least briefly. >>

Re: zookeeper reconnect failure

2014-04-01 Thread Jessica Mallet
Will do Mark. Thanks! On Sun, Mar 30, 2014 at 1:29 PM, Mark Miller wrote: > We don't currently retry, but I don't think it would hurt much if we did - > at least briefly. > > If you want to file a JIRA issue, that would be the best way to get it in > a future release. > > -- > Mark Miller > abo

Re: How to add a map of key/value pairs into a solr schema?

2014-04-01 Thread Jack Krupansky
Not directly. The various workarounds depend on how you intend to access and query the values. What are your use cases? -- Jack Krupansky -Original Message- From: Silvia Suárez Sent: Tuesday, April 1, 2014 12:29 PM To: solr-user@lucene.apache.org Subject: How to add a map of key/value

How to add a map of key/value pairs into a solr schema?

2014-04-01 Thread Silvia Suárez
Dear all, I'm trying to add a map of key/value pairs into the solr schema, and I just wordering if it is possible. For instance: This is my schema.xml : Is it possible to define a type= map (see the example above in the schema) into the solr xchema?, for example something like this:

Re: Please help: Problems adding a document to the solr collection

2014-04-01 Thread Silvia Suárez
Thanks for your answer Alexandre! S. Silvia Suárez Barón I+D+I 972 989 470 / s...@anpro21.com /

Re: High CPU usage after import

2014-04-01 Thread Jack Krupansky
Some document types can consume significant CPU resources, such as large PDF files. -- Jack Krupansky -Original Message- From: Александр Вандышев Sent: Tuesday, April 1, 2014 9:28 AM To: Solr User Subject: High CPU usage after import I use a update/extract handler for indexing a larg

Re: More Robust Search Timeouts (to Kill Zombie Queries)?

2014-04-01 Thread Luis Lebolo
I got responses, but no easy solution to allow me to directly cancel a request. The responses did point to: - timeAllowed query parameter that returns partial results - https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-ThetimeAllowedParameter

RE: Update single field through SolrJ

2014-04-01 Thread Jean-Sebastien Vachon
Hi, Thanks for pointing me in the proper direction. I managed to change my code to send atomic updates through SolrJ but this morning we experienced something weird. I sent a large batch of updates and deletes through SolrJ and our Cloud quickly became unusable and unresponsive (no leader for a

Re: solr 4.2.1 index gets slower over time

2014-04-01 Thread Dmitry Kan
Elisabeth, Yes, I believe you are right in that the deletes are part of the optimize process. If you delete often, you may consider (if not already) the TieredMergePolicy, which is suited for this scenario. Check out this relevant discussion I had with Lucene committers: https://twitter.com/Dmitry

Re: transaction log size

2014-04-01 Thread Shawn Heisey
On 4/1/2014 1:23 AM, Gurfan wrote: > Hi, > > As Transaction log(Tlog) play important role while restarting the SolrCloud > cluster, we are trying to decrease the size. Many of the posts on net which > we find describing that - > "decreasing the AutoCommit and increasing autoSoftCommit would genera

Re: Block until replication finishes

2014-04-01 Thread Fermin Silva
When trying to add the fix to the trunk version, I found that this was already implemented. There is a parameter '*wait*' that does exactly that. if (solrParams.getBool(WAIT, false)) { puller.join(); } So the only possible way to do this in SOLR 3.x is to create a plugin with a new replic

High CPU usage after import

2014-04-01 Thread Александр Вандышев
I use a update/extract handler for indexing a large number of files. If during indexing a CPU loads was not maximum at the end of import loading decreases. If CPU loading was max then loading remain high. Who can help me?

Re: Block until replication finishes

2014-04-01 Thread Fermin Silva
The ReplicationHandler class is not the most exemplar code to be looking at. I found however the line that could be changed: new Thread() { @Override public void run() { doFetch(paramsCopy); } }.start(); rsp.add(STATUS, OK_STATUS); It should be really simpl

Re: Asp.net MVC 4 and Solr Query Begining

2014-04-01 Thread danielkrudolf
Nazik thanks for the help, is there similar forums, this one seems that is not working, I can't post new subject or question. Thanks for help. -- View this message in context: http://lucene.472066.n3.nabble.com/Asp-net-MVC-4-and-Solr-Query-Begining-tp4128372p4128390.html Sent from the Solr -

sort by an attribute values sequence

2014-04-01 Thread santosh sidnal
Hi All, We have a specific requirement of sorting the products as per a specific attribute value sequence. Any pointer or source of info would help us. Example of the scenario; Let's say for search result i want to sort results based on a attribute producttype. Where producttype has following va

Re: Asp.net MVC 4 and Solr Query Begining

2014-04-01 Thread Nazik
Hi Daniel, I think you should post this to SOLR.NET google group: https://groups.google.com/forum/m/#!forum/solrnet That forum is more appropriate to address this type of problem. @Nazik_Huq On Apr 1, 2014, at 5:30 AM, danielkrudolf wrote: > Hello to all, I am new in Solr, but I see it si

Re: ranking retrieval measure

2014-04-01 Thread Rahul Singh
one of the measurement criteria is DCG. http://en.wikipedia.org/wiki/Discounted_cumulative_gain On Tue, Apr 1, 2014 at 11:44 AM, Floyd Wu wrote: > Usually IR system is measured using Precision & Recall. > But depends on what kind of system you are developing to fit what scenario. > > Take a lo

Asp.net MVC 4 and Solr Query Begining

2014-04-01 Thread danielkrudolf
Hello to all, I am new in Solr, but I see it si very usable. So I whant to build web application with asp.net MVC 4 that shows query from Solr. Ok, let's go so far i have done this: 1) Open new project in Visual Studio 2012 and create new MVC 4 project (Empty solution). 2) In Package Manager Co

Re: solr 4.2.1 index gets slower over time

2014-04-01 Thread elisabeth benoit
Thanks a lot for your answers! Shawn. Our GC configuration has far less parameters defined, so we'll check this out. Dimitry, about the expungeDeletes option, we'll add that in the delete process. But from what I read, this is done in the optimize process (cf. http://lucene.472066.n3.nabble.com/D

transaction log size

2014-04-01 Thread Gurfan
Hi, As Transaction log(Tlog) play important role while restarting the SolrCloud cluster, we are trying to decrease the size. Many of the posts on net which we find describing that - "decreasing the AutoCommit and increasing autoSoftCommit would generate the small size of transaction log". To test

Re: More Robust Search Timeouts (to Kill Zombie Queries)?

2014-04-01 Thread Salman Akram
So you too never got any response... On Mon, Mar 31, 2014 at 6:57 PM, Luis Lebolo wrote: > Hi Salman, > > I was interested in something similar, take a look at the following thread: > > http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201401.mbox/%3CCADSoL-i04aYrsOo2%3DGcaFqsQ3mViF%2Bh

Re: MergingSolrIndexes not supported by SolrCloud?why?

2014-04-01 Thread rulinma
I make it. I make a mistake. -- View this message in context: http://lucene.472066.n3.nabble.com/MergingSolrIndexes-not-supported-by-SolrCloud-why-tp4127111p4128351.html Sent from the Solr - User mailing list archive at Nabble.com.