Gently walking into rough waters here, but if you use any API with GET, you're
sending a URI which must be properly encoded. This has nothing to do with with
the programming language that generates key and store pairs on the browser or
the one(s) used on the server. Lots and lots of good folks
There is a package of python with solr-cloud
https://pypi.python.org/pypi/solrcloudpy
but I don't know if there is possibility to connect it to spark
-Original Message-
From: Timothy Potter [mailto:thelabd...@gmail.com]
Sent: Tuesday, March 31, 2015 23:15
To:
I am trying to use Solr Security on Solr 5.0 Cloud. Following process I
have used :-
1. Modifying web.xml :-
security-constraintweb-resource-collection
web-resource-nameAdminAllowedQueries/web-resource-name
url-pattern/admin/*/url-pattern
/web-resource-collection
auth-constraint
Thanks Erick.
Yes, it is able to work correct if I do not use spaces for the field names,
especially for the uniqueKey.
Regards,
Edwin
On 31 March 2015 at 13:58, Erick Erickson erickerick...@gmail.com wrote:
I would never put spaces in my field names! Frankly I have no clue
what Solr does
Hi Joel
Correct me if my understanding is wrong.
Using supplier id as the field to collapse on.
- If thecollapse group heads inthe main result set has only 1document in
each group, the expanded section will be empty since there are no
documents to expandfor each collapse group.
- To render
Hi,
Is it normal with Solr 4.10.3 that the data directory of replicas still
contains directories like
index.3636365667474747
index.999080980976
and files
index.properties
replica.properties
If yes, why and in which circumstances ?
Regards
Dominique
entity name=test1
processor=LineEntityProcessor
dataSource=fds
url=test.csv
rootEntity=true
transformer=RegexTransformer,TemplateTransformer
field column=rawLine
Exactly correct.
Joel Bernstein
http://joelsolr.blogspot.com/
On Wed, Apr 1, 2015 at 5:44 AM, Derek Poh d...@globalsources.com wrote:
Hi Joel
Correct me if my understanding is wrong.
Using supplier id as the field to collapse on.
- If thecollapse group heads inthe main result set has only
I'm facing a challenges using de-dupliation of Solr documents.
De-duplicate is done using TextProfileSignature with following parameters:
str name=fieldsfield1, field2, field3/str
str name=quantRate0.5/str
str name=minTokenLen3/str
Here Field3 is normal text with few lines of data.
Field1 and
Sorry to disturb you with the renew but nobody use or have problem with
multi-terms and highlight ?
regards,
Le 29/03/2015 21:15, Bruno Mannina a écrit :
Dear Solr User,
I try to work with highlight, it works well but only if I have only
one keyword in my query?!
If my request is plastic
Hello Solr Community,
Greetings ! This is my first post to this group.
I am very new to solr, so please do not mind if some of my questions below
sound dumb :)
Let me explain my present setup:
Solr version : Solr_4.4.0
Zookeeper version: zookeeper-3.4.5
-
Solr dedupe is based on the concept of a signature - some fields and rules
that reduce a document into a discrete signature, and then checking if that
signature exists as a document key that can be looked up quickly in the
index. That's the conceptual basis. It is not based on any kind of field by
Solr actually has CSV update handler. You could send file to that directly.
Have you tried that?
Regards,
Alex
On 1 Apr 2015 11:56 pm, avinash09 avinash.i...@gmail.com wrote:
entity name=test1
processor=LineEntityProcessor
dataSource=fds
The following message addressed to you was quarantined because it likely
contains a virus:
Subject: Error while reading index
From: Moshe Recanati mos...@kmslh.com
However, if you know the sender and are expecting an attachment, please reply
to this message, and we will forward the quarantined
Hi,
I uploaded the log to drive.
https://drive.google.com/file/d/0B0GR0M-lL5QHX1B2a2NZZXh3a1E/view?usp=sharing
Regards,
Moshe Recanati
SVP Engineering
Office + 972-73-2617564
Mobile + 972-52-6194481
Skype: recanati
no could you please share an example
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-indexing-from-csv-file-having-28-cols-taking-lot-of-time-plz-help-i-m-new-to-solr-tp4196904p4196928.html
Sent from the Solr - User mailing list archive at Nabble.com.
Haven't used Solr 3.x in a long time. But with 4.10.x, I haven't had any
trouble with multiple terms. I'd look at a few things.
1. Do you have a typo in your query? Shouldn't it be q=aben:(plastic and
bicycle)?
Ashwin:
First, if at all possible I would simply set up my new SolrCloud
structure (2 shards, a leader and follower each) and re-index the
entire corpus. 24M docs isn't really very many, and you'll have to
have this capability sometime since somone, somewhere will want to
change the schema in
sir , a silly question m confuse here what is difference between data import
handler and update csv
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-indexing-from-csv-file-having-28-cols-taking-lot-of-time-plz-help-i-m-new-to-solr-tp4196904p4196940.html
Sent from the
Hi,
I'm working on upgrading a project from solr-4.10.3 to solr-5.0.0.
As part of our JUnit tests we have a few tests for deleting/creating
collections. Each test createdelete a collection with a different name,
but they all share the same config in ZK.
When running these tests in Eclipse
Steve:
Totally agree. Even if you _do_ correctly escape the URL though,
there's no guarantee that Solr will do the right thing with field
names with spaces. Plus endless chances for you to get it wrong when
constructing the URL
Best,
Erick
On Wed, Apr 1, 2015 at 1:01 AM, steve
Data Import Handler is a process in Solr that reaches out, grabs
something external and indexes it. Something external can be a
database, files on the server etc. Along the way, you can do many
transformations of the data. The point is that the source can be
anything.
The update handler is an
Well, I believe the tutorial has an example. Always a good thing -
going through the tutorial.
And the reference guide has the details:
https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-CSVFormattedIndexUpdates
.
Regards,
Alex.
On 4/1/2015 6:35 AM, Dominique Bejean wrote:
Is it normal with Solr 4.10.3 that the data directory of replicas still
contains directories like
index.3636365667474747
index.999080980976
and files
index.properties
replica.properties
If yes, why and in which circumstances ?
The
Hello,
I have a SolrCloud (4.10.1) where for one of the shards, both replicas are in a
Recovery Failed state per the Solr Admin Cloud page. The logs contains the
following type of entries for the two Solr nodes involved, including statements
that it will retry.
Is there a way to recover from
Maybe I have been working too many long hours as I missed the obvious solution
of bringing down/up one of the Solr nodes backing one of the replicas, and then
the same for the second node. This did the trick.
Since I brought this topic up, I will narrow the question a bit: Would there
be a
I would give it 32GB of RAM. And try to use SSD.
On Tue, Mar 31, 2015 at 12:50 AM, sthita sthit...@gmail.com wrote:
Hi Bill, My index size is around 48GB and contains around 8 million
documents.
--
View this message in context:
On 4/1/2015 3:22 PM, Ryan Steele wrote:
Does a SolrCloud 5.0 cluster need enough RAM across the cluster to
load all the collections into RAM at all times?
Need is too strong a word. If you want the best possible performance,
then you would have enough RAM across the cluster to cache the
Hi Steve,
Thanks for the link and the information.
Regards,
Edwin
On 1 April 2015 at 23:17, Erick Erickson erickerick...@gmail.com wrote:
Steve:
Totally agree. Even if you _do_ correctly escape the URL though,
there's no guarantee that Solr will do the right thing with field
names with
thanks Erick and Alexandre Rafalovitch R
one more doubt how to pass ctrl A(^A) seprator while csv upload
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-indexing-from-csv-file-having-28-cols-taking-lot-of-time-plz-help-i-m-new-to-solr-tp4196904p4196998.html
Sent
Dear Charles,
Thanks for your answer, please find below my answers.
ok it works if I use aben as field in my query as you say in Answer 1.
it doesn't work if I use ab may be because ab field is a copyField
for abfr, aben, abit, abpt
Concerning the 2., yes you have right it's not and but AND
But you can potentially still use Solr dedupe if you do the upfront work
(in RDMS or NoSQL pre-index processing) to assign some sort of Group ID.
See OCLC's FRBR Work-Set Algorithm,
http://www.oclc.org/content/dam/research/activities/frbralgorithm/2009-08.pdf?urlm=161376
, for some details on
Hi Shawn,
Thank you for your response.
This is a Solrcloud installation on Centos.
There are 5 servers with 128 Gb ram each.
The collection contains 650 millions of small documents.
There are 3 shards with replicationfactor = 2 (so 9 cores).
The JVM Xmx parameter was set to 96 Gb. We changed it
I _really_ suspect that with the huge JVM heaps you had, you were hitting long
GC pauses that exceeded the Zookeeper timeout, causing ZK to believe the
node had gone away thus throwing it into recovery mode.
You can enable GC logging to see whether you see such long pauses, but with 96G
it's
If you want to query on the field ab, you'll probably need to add it the qf
parameter.
To control the highlighting markup, with the standard highlighter, use
hl.simple.pre and hl.simple.post.
https://cwiki.apache.org/confluence/display/solr/Standard+Highlighter
-Original Message-
ok for qf (i can't test now)
but concerning hl.simple.pre hl.simple.post I can define only one color no ?
in the sample solrconfig.xml there are several color,
!-- multi-colored tag FragmentsBuilder --
fragmentsBuilder name=colored
That's an interesting question. The reference shows you how to set a
separator, but ^A is a special case. You may need to pass it in as a
URL escape character or similar.
But I would first get a sample working with more conventional
separator and then worry about ^A. Just so you are not confusing
of course no prb charles, you already help me !
Le 01/04/2015 21:54, Reitzel, Charles a écrit :
Sorry, I've never tried highlighting in multiple colors...
-Original Message-
From: Bruno Mannina [mailto:bmann...@free.fr]
Sent: Wednesday, April 01, 2015 3:43 PM
To:
Does a SolrCloud 5.0 cluster need enough RAM across the cluster to load
all the collections into RAM at all times?
I'm building a SolrCloud cluster that may have approximately 1 TB of
data spread across the collections.
Thanks,
Ryan
Sorry, I've never tried highlighting in multiple colors...
-Original Message-
From: Bruno Mannina [mailto:bmann...@free.fr]
Sent: Wednesday, April 01, 2015 3:43 PM
To: solr-user@lucene.apache.org
Subject: Re: Solr 3.6, Highlight and multi words?
ok for qf (i can't test now)
but
40 matches
Mail list logo