Re: Indexing Multiple Languages with solr (Arabic & English)

2013-12-02 Thread Alexandre Rafalovitch
Have you tried doing copyField to replicate the content and have one field indexed as English text type and another with the same content as Arabic text type. Then, doing the search against both using edismax or similar. That's one approach to this. Just because it is in one field in the database,

Re: ANNOUNCE: Apache Solr Reference Guide 4.6

2013-12-02 Thread Bernd Fehling
But it still has the error about TrimFilterFactory in it, which I reported a couple of days back. http://www.mail-archive.com/solr-user@lucene.apache.org/msg92064.html So what it needs to correct the Reference Guide is to place a note like under StopFilter somewhere under TrimFilter: "As of Solr

Re: Best approach to multiple languages

2013-12-02 Thread aniljayanti
Hi thanks for you post. I am searching for this type of multiple language indexing and searching in solr. Below is my post in lecene. Can you please help me out of this. http://lucene.472066.n3.nabble.com/Indexing-Multiple-Languages-with-solr-Arabic-amp-English-td4104580.html thanks in advance,

post filtering for boolean filter queries

2013-12-02 Thread Dmitry Kan
Hello! We have been experimenting with post filtering lately. Our setup is a filter having long boolean query; drawing the example from the Dublin's Stump the Chump: fq=UserId:(user1 OR user2 OR...OR user1000) The underlining issue impacting performance is that the combination of user ids in the

Using the flexible query parser in Solr instead of classic

2013-12-02 Thread Karsten R.
Hi folks, last year we built a 3.X Solr-QueryParser based on org.apache.lucene.queryparser.flexible.standard.StandardQueryParser because we had some additions with SpanQueries and PhraseQueries. We think about to adapt this for 4.X At time the SolrQueryParser is based on org.apache.lucene.query

Indexing Multiple Languages with solr (Arabic & English)

2013-12-02 Thread aniljayanti
Hi, I am working on solr for using searching by indexing with "text_general" for "ENGLISH" language. Search is working fine. Now I have a Arabic text, which needs to indexing and searching. Below is my basic config for English.* Same field contains "ENGLISH" and "ARABIC" text in database*. Please

Re: Constantly increasing time of full data import

2013-12-02 Thread Ryan Cutter
Michal, I don't have much experience with DIH so I'll leave that to someone else but I would suggest you profile Solr during imports. That might show you where the bottleneck is. Generally, it's reasonable to think Solr updates will get slower the larger the indexes get and the more load you put

Re: Auto optimized of Solr indexing results

2013-12-02 Thread Bayu Widyasanyata
Thanks Erick for your advance and share. Regards, On Mon, Dec 2, 2013 at 11:06 PM, Erick Erickson wrote: > TieredMergePolicy is the default even though it's > commented out in solrconfig, it's still being used. > So there's nothing to do. > > Given the size of your index, you can actually do >

Re: Function query matching

2013-12-02 Thread Trey Grainger
We're working on the same problem with the combination of the scale(query(...)) combination, so I'd like to share a bit more information that may be useful. *On the scale function:* Even thought the scale query has to calculate the scores for all documents, it is actually doing this work twice for

Re: SolrCloud FunctionQuery inconsistency

2013-12-02 Thread sling
Thank for your reply, Chris. Yes, I am populating "ptime" using a default of "NOW". I only store the id, so I can't get ptime values. But from the perspective of business logic, ptime should not change. Strangely, the sort result is consistent now... :( I should do more test case... -- View t

Re: SolrCloud FunctionQuery inconsistency

2013-12-02 Thread sling
Thanks, Erick I mean the first id of the results is not consistent, and the maxScore is not too. When query, I do index docs at the same time, but they are not revelent to this query. The updated docs can not affect tf cals, and for idf, they should affect for all docs, so the results should co

Re: Error integrating opennlp in solr

2013-12-02 Thread Furkan KAMACI
Especially here: "Also, you may have to add the OpenNLP lib directory to your solr/lib or solr/cores/collection/lib directory. The text types assume that cores/collection/conf/opennlp contains the OpenNLP model files." 3 Aralık 2013 Salı tarihinde Furkan KAMACI adlı kullanıcı şöyle yazdı: > Did y

Re: Error integrating opennlp in solr

2013-12-02 Thread Furkan KAMACI
Did you check here: http://wiki.apache.org/solr/OpenNLP 30 Kasım 2013 Cumartesi tarihinde Arti adlı kullanıcı şöyle yazdı: > > > Hi Team , > > I am getting the stack of errors given below while integrating solr with OpenNLP. Please help. > > > > > > Caused by: org.apache.solr.common.SolrException

Proxy.php tutorials for AJAX Solr

2013-12-02 Thread Reyes, Mark
Are there any good tutorials that touch base on how to integrate the suggested PHP proxy for JavaScript framework AJAX Solr? Here is the proxy, https://gist.github.com/evolvingweb/298580 Also on Stackoverflow, http://stackoverflow.com/questions/20338073/proxy-php-tutorials-for-ajax-solr IMPORT

Re: solr as a service for multiple projects in the same environment

2013-12-02 Thread Ing. Jorge Luis Betancourt Gonzalez
I think that one experience in this area could by provided by Tray Grainger, author of Solr in Action, I believe that some of his work on careerbuilder involve the creation of something (somehow) similar to what you're trying to accomplish. I must say that I'm also interested in this topic, but

Re: SolrCloud FunctionQuery inconsistency

2013-12-02 Thread Chris Hostetter
: However, when sort by "ptime desc", the result is consistent. : The dateDeboost generate the time-weight from ptime, which is multiplied by : the score. As Erick mentioned, you haven't given us enough details to make any educated guesses as to what problem you are seeing. My wild, uneducated,

Re: Function query matching

2013-12-02 Thread Peter Keegan
I'm persuing this possible PostFilter solution, I can see how to collect all the hits and recompute the scores in a PostFilter, after all the hits have been collected (for scaling). Now, I can't see how to get the custom doc/score values back into the main query's HitQueue. Any advice? Thanks, Pet

ANNOUNCE: Apache Solr Reference Guide 4.6

2013-12-02 Thread Chris Hostetter
The Lucene PMC is pleased to announce the release of the Apache Solr Reference Guide for Solr 4.6. This 347 page PDF serves as the definitive users manual for Solr 4.6. The Solr Reference Guide is available for download from the Apache mirror network: https://www.apache.org/dyn/closer.cg

ShardSplit errors..

2013-12-02 Thread Annette Newton
Hi, I have been trying to split a shard with little success. I'm probably missing something obvious but would appreciate a little help. Solr version: 4.6.0 Number of documents in the Shard: 2,933,059 Index size: 6.52 I know I have some setting somewhere that I need to change but I believe I hav

Re: SolrCloud FunctionQuery inconsistency

2013-12-02 Thread Erick Erickson
I'm not quite sure what you're seeing as inconsistent, you didn't say. Is it the maxScore? Did you index any docs in the mean time? Even though both show 121 docs, if you updated some docs it might affect the score because the terms from the old docs still affect tf/idf calcs and thus the boosted s

Re: Auto optimized of Solr indexing results

2013-12-02 Thread Erick Erickson
TieredMergePolicy is the default even though it's commented out in solrconfig, it's still being used. So there's nothing to do. Given the size of your index, you can actually do whatever you please. Optimizing it will shrink its size, but frankly your index is so small I doubt you'll see any noti

Re: luke 4.5.0 released

2013-12-02 Thread Erick Erickson
Excellent! thanks! On Mon, Dec 2, 2013 at 3:27 AM, Dmitry Kan wrote: > Hello! > > I have just released luke 4.5.0 along with the binary. It's version is > reflecting the Lucene's version underneath. > > Feel free to test this and give feedback / submit bug fixes / patches. > > https://github.co

How Whatsapp applies search techniques for conversation?

2013-12-02 Thread Anurag
I was just wondering how Whatsapp uses to implement search in the conversations history. Is it the same thing used for all kinds of android app supporting search on chat/conversations? Has anyone implemented on similar lines? Thanks Kumar Anurag - Kumar Anurag -- View this message in co

Re: Constantly increasing time of full data import

2013-12-02 Thread michallos
Update: I can see that times increases when the search load is higher. During nights and weekends full load times doesn't increase. So it is not caused by the number of documents being loaded (during weekends we have the same number of new documents) but number of queries / minute. Anyone observe

Re: Solr query syntax.

2013-12-02 Thread Jack Krupansky
The edismax (ExtendedDisMax) query parser is the best, overall. There are other specialized query parsers with features that edismax does not have (e.g., surround for span queries, and complex phrase for wildcards in phrases.) -- Jack Krupansky -Original Message- From: elmerfudd Sen

Re: Solr query syntax.

2013-12-02 Thread Ahmet Arslan
Hi, Choice of query parser depends on your needs. I am just surprised that you used prefix notation in your example.  Default query parser syntax for and(blabla , name: "george")  is q=blabla AND name:george Term blabla (which does not consider field) parsed against default search field. Defaul

Re: Solr non-suuported languages

2013-12-02 Thread Ahmet Arslan
Hi Prasi, text_general thats ships with example schema.xml would suit. On Monday, December 2, 2013 12:35 PM, Prasi S wrote: hi , I have a requirement to index and search few languages that are not supported by solr. ( E.g countries like Slovenia, Moldova, Belarus etc.) If i need to do only

Solr non-suuported languages

2013-12-02 Thread Prasi S
hi , I have a requirement to index and search few languages that are not supported by solr. ( E.g countries like Slovenia, Moldova, Belarus etc.) If i need to do only exact match against these langauges, what sort of analyser, tokenizers would suit thanks. Thanks, Prasi

some cores goes down during indexing

2013-12-02 Thread Grzegorz Sobczyk
Hi I have strange situation. During indexing some of cores goes down: ZkController.publish(1017) | publishing core=shops5 state=down ZkController.register(785) | Register replica - core:shops5 address: http://host77:8280/solr collection:shops5 shard:shard1 ZkController.register(810) | We are http:

SolrCloud FunctionQuery inconsistency

2013-12-02 Thread sling
Hi, I have a solrcloud with 4 shards. They are running normally. How is possible that the same function query returns different results? And it happens even in the same shard? However, when sort by "ptime desc", the result is consistent. The dateDeboost generate the time-weight from ptime, which

luke 4.5.0 released

2013-12-02 Thread Dmitry Kan
Hello! I have just released luke 4.5.0 along with the binary. It's version is reflecting the Lucene's version underneath. Feel free to test this and give feedback / submit bug fixes / patches. https://github.com/DmitryKey/luke/releases/tag/4.5.0 Thanks. -- Dmitry Blog: http://dmitrykan.blogsp

Re: Solr query syntax.

2013-12-02 Thread elmerfudd
Im using the default qparser that come with solr 4.4 , Is there anything better? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-query-syntax-tp4103784p4104344.html Sent from the Solr - User mailing list archive at Nabble.com.