RE: Unable to perform search query after changing uniqueKey

2015-04-01 Thread steve
Gently walking into rough waters here, but if you use any API with GET, you're sending a URI which must be properly encoded. This has nothing to do with the programming language that generates key and store pairs on the browser or the one(s) used on the server. Lots and lots of good folks
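The encoding point can be illustrated with a short Python sketch. It only builds a percent-encoded GET URL; the host, core name (collection1) and field name (title) are assumptions made up for the example, not details from the thread.

```python
from urllib.parse import urlencode, quote


def build_select_url(base_url, params):
    """Return a GET URL whose query string is percent-encoded."""
    # quote_via=quote encodes a space as %20 rather than '+'.
    return base_url + "?" + urlencode(params, quote_via=quote)


# Hypothetical endpoint and query, for illustration only.
url = build_select_url(
    "http://localhost:8983/solr/collection1/select",
    {"q": 'title:"hello world"', "wt": "json"},
)
print(url)
```

Whatever language builds the request, the colon, quotes and space in the query value must end up escaped in the URI.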

RE: Spark-Solr in python

2015-04-01 Thread Chaushu, Shani
There is a Python package for SolrCloud (https://pypi.python.org/pypi/solrcloudpy), but I don't know whether it is possible to connect it to Spark. -----Original Message----- From: Timothy Potter [mailto:thelabd...@gmail.com] Sent: Tuesday, March 31, 2015 23:15 To:

Solr Cloud Security not working for internal authentication

2015-04-01 Thread Swaraj Kumar
I am trying to use Solr Security on Solr 5.0 Cloud. I used the following process: 1. Modifying web.xml: <security-constraint> <web-resource-collection> <web-resource-name>AdminAllowedQueries</web-resource-name> <url-pattern>/admin/*</url-pattern> </web-resource-collection> <auth-constraint>

Re: Unable to perform search query after changing uniqueKey

2015-04-01 Thread Zheng Lin Edwin Yeo
Thanks Erick. Yes, it works correctly if I do not use spaces in the field names, especially for the uniqueKey. Regards, Edwin On 31 March 2015 at 13:58, Erick Erickson erickerick...@gmail.com wrote: I would never put spaces in my field names! Frankly I have no clue what Solr does

Re: Collapse and Expand behaviour on result with 1 document.

2015-04-01 Thread Derek Poh
Hi Joel, Correct me if my understanding is wrong. Using supplier id as the field to collapse on. - If the collapse group heads in the main result set have only 1 document in each group, the expanded section will be empty since there are no documents to expand for each collapse group. - To render

solr 4.10.3 and index.xxxxxxxxxxx directory

2015-04-01 Thread Dominique Bejean
Hi, Is it normal with Solr 4.10.3 that the data directory of replicas still contains directories like index.3636365667474747 and index.999080980976, and files index.properties and replica.properties? If yes, why and in which circumstances? Regards, Dominique

Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread avinash09
<entity name="test1" processor="LineEntityProcessor" dataSource="fds" url="test.csv" rootEntity="true" transformer="RegexTransformer,TemplateTransformer"> <field column="rawLine"

Re: Collapse and Expand behaviour on result with 1 document.

2015-04-01 Thread Joel Bernstein
Exactly correct. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Apr 1, 2015 at 5:44 AM, Derek Poh d...@globalsources.com wrote: Hi Joel, Correct me if my understanding is wrong. Using supplier id as the field to collapse on. - If the collapse group heads in the main result set have only
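The behaviour confirmed here can be modelled with a toy Python sketch. This is not Solr's implementation: it takes the first document of each group as the head (Solr chooses the head by score or sort), and the field name supplier_id is a hypothetical stand-in for the supplier id field.

```python
from collections import OrderedDict


def collapse_and_expand(docs, field):
    """Toy model: one head per group, remaining members go to 'expanded'."""
    groups = OrderedDict()
    for doc in docs:
        groups.setdefault(doc[field], []).append(doc)
    # The head of each group is simply its first document in this sketch.
    heads = [members[0] for members in groups.values()]
    # Only groups with more than one member have anything to expand.
    expanded = {
        key: members[1:] for key, members in groups.items() if len(members) > 1
    }
    return heads, expanded


single_docs = [
    {"id": 1, "supplier_id": "A"},
    {"id": 2, "supplier_id": "B"},
]
heads, expanded = collapse_and_expand(single_docs, "supplier_id")
print(len(heads), expanded)
```

When every group holds a single document, the expanded mapping comes back empty, which matches the understanding described in the quoted message.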

Customzing Solr Dedupe

2015-04-01 Thread thakkar.aayush
I'm facing a challenge using de-duplication of Solr documents. De-duplication is done using TextProfileSignature with the following parameters: <str name="fields">field1, field2, field3</str> <str name="quantRate">0.5</str> <str name="minTokenLen">3</str> Here Field3 is normal text with a few lines of data. Field1 and

Re: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Bruno Mannina
Sorry to bring this up again, but does nobody use, or have problems with, multi-term queries and highlighting? Regards, On 29/03/2015 21:15, Bruno Mannina wrote: Dear Solr User, I am trying to work with highlighting; it works well, but only if I have a single keyword in my query. If my request is plastic

shard splitting (solr 4.4.0)

2015-04-01 Thread Ashwin Kumar
Hello Solr Community, Greetings ! This is my first post to this group. I am very new to solr, so please do not mind if some of my questions below sound dumb :) Let me explain my present setup: Solr version : Solr_4.4.0 Zookeeper version: zookeeper-3.4.5 -

Re: Customzing Solr Dedupe

2015-04-01 Thread Jack Krupansky
Solr dedupe is based on the concept of a signature - some fields and rules that reduce a document into a discrete signature, and then checking if that signature exists as a document key that can be looked up quickly in the index. That's the conceptual basis. It is not based on any kind of field by
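The signature idea can be sketched in a few lines of Python. This is a conceptual illustration only: it uses an exact MD5 hash over lightly normalised field values, not the fuzzy TextProfileSignature mentioned in the thread, and the function names are invented for the example.

```python
import hashlib


def signature(doc, fields):
    """Reduce the chosen fields of a document to one hex digest."""
    h = hashlib.md5()
    for name in fields:
        # Normalisation rule (trim + lowercase) is an assumption of this sketch.
        value = str(doc.get(name, "")).strip().lower()
        h.update(name.encode("utf-8"))
        h.update(b"\x00")  # separator so field boundaries stay unambiguous
        h.update(value.encode("utf-8"))
        h.update(b"\x00")
    return h.hexdigest()


def deduplicate(docs, fields):
    """Keep the first document seen for each signature."""
    seen = {}
    for doc in docs:
        seen.setdefault(signature(doc, fields), doc)
    return list(seen.values())
```

Two documents count as duplicates only when their whole signature matches; there is no per-field comparison, which is the point made above.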

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread Alexandre Rafalovitch
Solr actually has a CSV update handler. You could send the file to that directly. Have you tried that? Regards, Alex On 1 Apr 2015 11:56 pm, avinash09 avinash.i...@gmail.com wrote: <entity name="test1" processor="LineEntityProcessor" dataSource="fds"

Suspicious message with attachment

2015-04-01 Thread help
The following message addressed to you was quarantined because it likely contains a virus: Subject: Error while reading index From: Moshe Recanati mos...@kmslh.com However, if you know the sender and are expecting an attachment, please reply to this message, and we will forward the quarantined

RE: Error while reading index

2015-04-01 Thread Moshe Recanati
Hi, I uploaded the log to drive. https://drive.google.com/file/d/0B0GR0M-lL5QHX1B2a2NZZXh3a1E/view?usp=sharing Regards, Moshe Recanati SVP Engineering Office + 972-73-2617564 Mobile + 972-52-6194481 Skype: recanati

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread avinash09
No. Could you please share an example?

RE: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Reitzel, Charles
Haven't used Solr 3.x in a long time. But with 4.10.x, I haven't had any trouble with multiple terms. I'd look at a few things. 1. Do you have a typo in your query? Shouldn't it be q=aben:(plastic and bicycle)?

Re: shard splitting (solr 4.4.0)

2015-04-01 Thread Erick Erickson
Ashwin: First, if at all possible I would simply set up my new SolrCloud structure (2 shards, a leader and follower each) and re-index the entire corpus. 24M docs isn't really very many, and you'll have to have this capability sometime since someone, somewhere will want to change the schema in

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread avinash09
Sir, a silly question, as I am confused here: what is the difference between the Data Import Handler and update CSV?

Information regarding This conf directory is not valid SolrException.

2015-04-01 Thread Bar Weiner
Hi, I'm working on upgrading a project from solr-4.10.3 to solr-5.0.0. As part of our JUnit tests we have a few tests for deleting/creating collections. Each test creates/deletes a collection with a different name, but they all share the same config in ZK. When running these tests in Eclipse

Re: Unable to perform search query after changing uniqueKey

2015-04-01 Thread Erick Erickson
Steve: Totally agree. Even if you _do_ correctly escape the URL though, there's no guarantee that Solr will do the right thing with field names with spaces. Plus endless chances for you to get it wrong when constructing the URL. Best, Erick On Wed, Apr 1, 2015 at 1:01 AM, steve

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread Erick Erickson
Data Import Handler is a process in Solr that reaches out, grabs something external and indexes it. Something external can be a database, files on the server etc. Along the way, you can do many transformations of the data. The point is that the source can be anything. The update handler is an

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread Alexandre Rafalovitch
Well, I believe the tutorial has an example. Always a good thing - going through the tutorial. And the reference guide has the details: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-CSVFormattedIndexUpdates . Regards, Alex.

Re: solr 4.10.3 and index.xxxxxxxxxxx directory

2015-04-01 Thread Shawn Heisey
On 4/1/2015 6:35 AM, Dominique Bejean wrote: Is it normal with Solr 4.10.3 that the data directory of replicas still contains directories like index.3636365667474747 and index.999080980976, and files index.properties and replica.properties? If yes, why and in which circumstances? The

How to recover a Shard

2015-04-01 Thread Matt Kuiper
Hello, I have a SolrCloud (4.10.1) where for one of the shards, both replicas are in a Recovery Failed state per the Solr Admin Cloud page. The logs contain the following type of entries for the two Solr nodes involved, including statements that it will retry. Is there a way to recover from

RE: How to recover a Shard

2015-04-01 Thread Matt Kuiper
Maybe I have been working too many long hours as I missed the obvious solution of bringing down/up one of the Solr nodes backing one of the replicas, and then the same for the second node. This did the trick. Since I brought this topic up, I will narrow the question a bit: Would there be a

Re: Solr went on recovery multiple time.

2015-04-01 Thread William Bell
I would give it 32GB of RAM. And try to use SSD. On Tue, Mar 31, 2015 at 12:50 AM, sthita sthit...@gmail.com wrote: Hi Bill, My index size is around 48GB and contains around 8 million documents.

Re: SolrCloud 5.0 cluster RAM requirements

2015-04-01 Thread Shawn Heisey
On 4/1/2015 3:22 PM, Ryan Steele wrote: Does a SolrCloud 5.0 cluster need enough RAM across the cluster to load all the collections into RAM at all times? Need is too strong a word. If you want the best possible performance, then you would have enough RAM across the cluster to cache the

Re: Unable to perform search query after changing uniqueKey

2015-04-01 Thread Zheng Lin Edwin Yeo
Hi Steve, Thanks for the link and the information. Regards, Edwin On 1 April 2015 at 23:17, Erick Erickson erickerick...@gmail.com wrote: Steve: Totally agree. Even if you _do_ correctly escape the URL though, there's no guarantee that Solr will do the right thing with field names with

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread avinash09
Thanks Erick and Alexandre Rafalovitch. One more doubt: how do I pass Ctrl-A (^A) as the separator during CSV upload?

Re: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Bruno Mannina
Dear Charles, Thanks for your answer, please find below my answers. OK, it works if I use aben as the field in my query, as you say in Answer 1. It doesn't work if I use ab, maybe because the ab field is a copyField for abfr, aben, abit, abpt. Concerning 2., yes, you are right: it's not and but AND

Re: Customzing Solr Dedupe

2015-04-01 Thread Dan Davis
But you can potentially still use Solr dedupe if you do the upfront work (in RDBMS or NoSQL pre-index processing) to assign some sort of Group ID. See OCLC's FRBR Work-Set Algorithm, http://www.oclc.org/content/dam/research/activities/frbralgorithm/2009-08.pdf?urlm=161376 , for some details on

Re: solr 4.10.3 and index.xxxxxxxxxxx directory

2015-04-01 Thread Dominique Bejean
Hi Shawn, Thank you for your response. This is a SolrCloud installation on CentOS. There are 5 servers with 128 GB RAM each. The collection contains 650 million small documents. There are 3 shards with replicationFactor = 2 (so 9 cores). The JVM Xmx parameter was set to 96 GB. We changed it

Re: solr 4.10.3 and index.xxxxxxxxxxx directory

2015-04-01 Thread Erick Erickson
I _really_ suspect that with the huge JVM heaps you had, you were hitting long GC pauses that exceeded the Zookeeper timeout, causing ZK to believe the node had gone away thus throwing it into recovery mode. You can enable GC logging to see whether you see such long pauses, but with 96G it's

RE: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Reitzel, Charles
If you want to query on the field ab, you'll probably need to add it to the qf parameter. To control the highlighting markup, with the standard highlighter, use hl.simple.pre and hl.simple.post. https://cwiki.apache.org/confluence/display/solr/Standard+Highlighter -----Original Message-----
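A rough sketch of how these highlighting parameters might be assembled into a query string, in Python. The query and the field name aben come from the thread; the choice of <b>/</b> as markup and of hl.fl=aben is an assumption for illustration, and nothing here has been run against Solr 3.6.

```python
from urllib.parse import urlencode

# Highlighting parameters for the standard highlighter.
params = {
    "q": "aben:(plastic AND bicycle)",  # boolean operator must be upper case
    "hl": "true",
    "hl.fl": "aben",            # field to highlight (assumed)
    "hl.simple.pre": "<b>",     # markup placed before each match (assumed)
    "hl.simple.post": "</b>",   # markup placed after each match (assumed)
}
query_string = urlencode(params)
print(query_string)
```

Note that hl.simple.pre and hl.simple.post define a single pair of markers, so every highlighted term gets the same markup.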

Re: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Bruno Mannina
OK for qf (I can't test now), but concerning hl.simple.pre and hl.simple.post, I can define only one color, no? In the sample solrconfig.xml there are several colors: <!-- multi-colored tag FragmentsBuilder --> <fragmentsBuilder name="colored"

Re: Solr -indexing from csv file having 28 cols taking lot of time ..plz help i m new to solr

2015-04-01 Thread Alexandre Rafalovitch
That's an interesting question. The reference shows you how to set a separator, but ^A is a special case. You may need to pass it in as a URL escape character or similar. But I would first get a sample working with a more conventional separator and then worry about ^A. Just so you are not confusing
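As a sketch of the URL-escape idea, the following Python snippet shows what Ctrl-A looks like once percent-encoded in a separator parameter. The host, core name and /update/csv path are assumptions for illustration, and whether Solr accepts this value for the separator is exactly the open question in this message, so treat it as something to try rather than a confirmed answer.

```python
from urllib.parse import urlencode

CTRL_A = "\x01"  # ASCII code 1, commonly written ^A

# Hypothetical endpoint; adjust host, core and handler path to your setup.
base_url = "http://localhost:8983/solr/collection1/update/csv"
params = {"separator": CTRL_A, "commit": "true"}

url = base_url + "?" + urlencode(params)
print(url)
```

The control character becomes %01 in the query string, which is the form a URL-escaped ^A would take.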

Re: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Bruno Mannina
Of course, no problem Charles, you have already helped me! On 01/04/2015 21:54, Reitzel, Charles wrote: Sorry, I've never tried highlighting in multiple colors... -----Original Message----- From: Bruno Mannina [mailto:bmann...@free.fr] Sent: Wednesday, April 01, 2015 3:43 PM To:

SolrCloud 5.0 cluster RAM requirements

2015-04-01 Thread Ryan Steele
Does a SolrCloud 5.0 cluster need enough RAM across the cluster to load all the collections into RAM at all times? I'm building a SolrCloud cluster that may have approximately 1 TB of data spread across the collections. Thanks, Ryan

RE: Solr 3.6, Highlight and multi words?

2015-04-01 Thread Reitzel, Charles
Sorry, I've never tried highlighting in multiple colors... -Original Message- From: Bruno Mannina [mailto:bmann...@free.fr] Sent: Wednesday, April 01, 2015 3:43 PM To: solr-user@lucene.apache.org Subject: Re: Solr 3.6, Highlight and multi words? ok for qf (i can't test now) but