Re: Can solr index replacement character

2020-12-01 Thread Erick Erickson
Solr handles UTF-8, so it should be able to. The problem you’ll have is getting the UTF-8 characters to get through all the various transport encodings, i.e. if you try to search from a browser, you need to encode it so the browser passes it through. If you search through SolrJ, it needs to be

Can solr index replacement character

2020-11-30 Thread Eran Buchnick
Hi community, During integration tests with new data source I have noticed weird scenario where replacement character can't be searched, though, seems to be stored. I mean, honestly, I don't want that irrelevant data stored in my index but I wondered if solr can index replacement character (U+FFFD

Re: How to forcefully open new searcher, in case when there is no change in Solr index

2020-08-10 Thread Erick Erickson
ote: > > I have a use case where none of the document in my solr index is changing but > I still want to open a new searcher through the curl api. > > On executing the below curl command > curl > "XXX.XX.XX.XXX:9744/solr/mycollection/update?openSearcher=true=true"

How to forcefully open new searcher, in case when there is no change in Solr index

2020-08-10 Thread raj.yadav
I have a use case where none of the document in my solr index is changing but I still want to open a new searcher through the curl api. On executing the below curl command curl "XXX.XX.XX.XXX:9744/solr/mycollection/update?openSearcher=true=true" it doesn't open a new searcher. Below

Re: Solr index size has increased in solr 7.7.2

2020-04-15 Thread David Hastings
m solr 4.6 to solr 7.7.2. > In solr 4.6 the size was 2.5 gb but here in solr 7.7.2 the solr index size > is showing 6.8 gb with the same no of documents. Is it expected behavior or > any suggestions how to optimize the size. >

Solr index size has increased in solr 7.7.2

2020-04-15 Thread Rajdeep Sahoo
Hi all We are migrating from solr 4.6 to solr 7.7.2. In solr 4.6 the size was 2.5 gb but here in solr 7.7.2 the solr index size is showing 6.8 gb with the same no of documents. Is it expected behavior or any suggestions how to optimize the size.

Re: offline Solr index creation

2020-02-13 Thread Erick Erickson
there, > > We are using AWS EMR as our big data processing cluster. We have like 3TB > of text files where each line denotes a json record which I want to be > indexed into Solr. > > I have tried this by batching them and pushing to Solr index using > SolrJClient. But I feel t

offline Solr index creation

2020-02-13 Thread vivek chaurasiya
Hi there, We are using AWS EMR as our big data processing cluster. We have like 3TB of text files where each line denotes a json record which I want to be indexed into Solr. I have tried this by batching them and pushing to Solr index using SolrJClient. But I feel thats really slow. My doubt

Re: solr index data from hdfs with error

2019-12-20 Thread Erick Erickson
Morphlines support was removed from Solr in Solr 6.6, see: https://issues.apache.org/jira/browse/SOLR-9221 So I don’t think anyone here will be very conversant in the details. I vaguely recall that this process added an ID field by default, but it’s been a very long time since I looked. Do

solr index data from hdfs with error

2019-12-20 Thread bennis
Hello I am new in using Solr and I need your help. I have data on HDFS that I need to index with Solr. I) My data looks like that, it is saved on hdfs : ID_METIER_PCS_ESE,CD_PCS_ESE_1,LB_PCS_ESE_1,CD_PCS_ESE_2,LB_PCS_ESE_2,CD_PCS_ESE_3,LB_PCS_ESE_3,DT_DEB,DT_FIN,TS_TEC_INSERT,TS_TEC_UPDATE

Re: About Snapshot API and Backup for Solr Index

2019-11-24 Thread Paras Lehana
Hey Kaya, Are you not able to restore with the same restore backup command? http://localhost:8983/solr/gettingstarted/replication?command=restore=backup_name Replace backup_name with the snapshot name. On Thu, 21 Nov 2019 at 16:23, Kayak28 wrote: > I was not clear in the last email. > I

Re: About Snapshot API and Backup for Solr Index

2019-11-21 Thread Kayak28
I was not clear in the last email. I mean "For me, it is impossible to "backup" or "restore" Solr's index by taking a snapshot." If I make you confuse, I am sorry about that. Sincerely, Kaya Ota 2019年11月21日(木) 19:50 Kayak28 : > Hello, Community Members: > > I am using Solr 7.7.4 > I have a

About Snapshot API and Backup for Solr Index

2019-11-21 Thread Kayak28
Hello, Community Members: I am using Solr 7.7.4 I have a question about a Snapshot API. https://lucene.apache.org/solr/guide/7_4/making-and-restoring-backups.html#create-snapshot-api I have tested basic of snapshot APIs, create snapshot, list snapshot, delete snapshot. As far as I know, when I

Re: Delete documents from the Solr index using SolrJ

2019-11-05 Thread Erick Erickson
nto it as how could I achieve that with my application. Thanks ! > One thing, that I want to know is how to avoid full re-indexing, that is, > what I need is I don’t want that Solr index all the data every time some docs > are added, instead I want it to update it, that is index on

RE: Delete documents from the Solr index using SolrJ

2019-11-05 Thread Khare, Kushal (MIND)
Well, I cannot still completely relate to the solutions by you guys, am looking into it as how could I achieve that with my application. Thanks ! One thing, that I want to know is how to avoid full re-indexing, that is, what I need is I don’t want that Solr index all the data every time some

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Erick Erickson
gt; Sent: 04 November 2019 20:04 >> To: solr-user@lucene.apache.org >> Subject: Re: Delete documents from the Solr index using SolrJ >> >> when you add a new document using the same "id" value as another it just >> over writes it >> >> On Mon

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Walter Underwood
; To: solr-user@lucene.apache.org > Subject: Re: Delete documents from the Solr index using SolrJ > > when you add a new document using the same "id" value as another it just over > writes it > > On Mon, Nov 4, 2019 at 9:30 AM Khare, Kushal (MIND) < > kushal.kh.

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Peter Lancaster
) [mailto:kushal.kh...@mind-infotech.com] Sent: 04 November 2019 15:03 To: solr-user@lucene.apache.org Subject: [EXTERNAL] RE: Delete documents from the Solr index using SolrJ Thanks! Actually am working on a Java web application using SolrJ for Solr search. The users would actually be uploading/editing

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
to carry on with the solution that you proposed. Please guide ! -Original Message- From: David Hastings [mailto:hastings.recurs...@gmail.com] Sent: 04 November 2019 20:10 To: solr-user@lucene.apache.org Subject: Re: Delete documents from the Solr index using SolrJ delete them by query would

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread David Hastings
org > Subject: Re: Delete documents from the Solr index using SolrJ > > when you add a new document using the same "id" value as another it just > over writes it > > On Mon, Nov 4, 2019 at 9:30 AM Khare, Kushal (MIND) < > kushal.kh...@m

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
: Delete documents from the Solr index using SolrJ when you add a new document using the same "id" value as another it just over writes it On Mon, Nov 4, 2019 at 9:30 AM Khare, Kushal (MIND) < kushal.kh...@mind-infotech.com> wrote: > Could you please let me

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
more clear now. -Original Message- From: Khare, Kushal (MIND) [mailto:kushal.kh...@mind-infotech.com] Sent: 04 November 2019 20:00 To: solr-user@lucene.apache.org Subject: RE: Delete documents from the Solr index using SolrJ Could you please let me know how to achieve

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread David Hastings
age- > From: Jörn Franke [mailto:jornfra...@gmail.com] > Sent: 04 November 2019 19:59 > To: solr-user@lucene.apache.org > Subject: Re: Delete documents from the Solr index using SolrJ > > I don’t understand why it is not possible. > > However why don’t you simply overwrite th

RE: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
Could you please let me know how to achieve that ? -Original Message- From: Jörn Franke [mailto:jornfra...@gmail.com] Sent: 04 November 2019 19:59 To: solr-user@lucene.apache.org Subject: Re: Delete documents from the Solr index using SolrJ I don’t understand why it is not possible

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread Jörn Franke
I don’t understand why it is not possible. However why don’t you simply overwrite the existing document instead of add+delete > Am 04.11.2019 um 15:12 schrieb Khare, Kushal (MIND) > : > > Hello mates! > I want to know how we can delete the documents from the Solr index . Su

Delete documents from the Solr index using SolrJ

2019-11-04 Thread Khare, Kushal (MIND)
Hello mates! I want to know how we can delete the documents from the Solr index . Suppose for my system, I have a document that has been indexed, now its newer version is into use, so I want to use the latest one, for that I want the previous one to be deleted from the index. Kindly help me

Re: Solr index

2019-08-08 Thread Dario Rigolin
Do you know that your solr is open to the internet? It's better to filter the port or at least not put here the full address... Il giorno gio 8 ago 2019 alle ore 15:58 HTMLServices.it < i...@htmlservices.it> ha scritto: > Hi everyone > I installed Solr on a test server (centos 7) to get the

Solr index

2019-08-08 Thread HTMLServices.it
Hi everyone I installed Solr on a test server (centos 7) to get the fastest searches on dovecot, Solr and new for me and I think I didn't understand how it works perfectly. I installed following the official guide on the dovecot wiki: https://wiki2.dovecot.org/Plugins/FTS/Solr but I can't get

Re: Encrypting Solr Index

2019-06-25 Thread Jörn Franke
s FS encryption I didn't find a way to use different >> Username/Password. It by default takes window's username/password to >> encrypt and decrypt. >> >> I tried bitlocker too for creating encrypted virtual directory (Which >> allows me to use different credentials) and to kee

Re: Encrypting Solr Index

2019-06-25 Thread Tim Casey
akes window's username/password to > encrypt and decrypt. > > I tried bitlocker too for creating encrypted virtual directory (Which > allows me to use different credentials) and to keep Solr Index in that but > somehow Solr Admin was unable to access Index from that encrypted >

RE: Encrypting Solr Index

2019-06-25 Thread Ahuja, Sakshi
(Which allows me to use different credentials) and to keep Solr Index in that but somehow Solr Admin was unable to access Index from that encrypted directory. Not sure how that is working. If you have any idea on that- will wok for me. Thanks! -Original Message- From: Jörn Franke

Re: Encrypting Solr Index

2019-06-25 Thread Jörn Franke
hat works but want to know if solr has > some inbuilt feature to encrypt index or any good way to encrypt solr index? > > Thanks, > Sakshi

Re: Encrypting Solr Index

2019-06-25 Thread Erick Erickson
encrypt index for security reasons. I have >> tried Windows FS encryption option that works but want to know if solr has >> some inbuilt feature to encrypt index or any good way to encrypt solr index? >> >> Thanks, >> Sakshi >>

Re: Encrypting Solr Index

2019-06-25 Thread Alexandre Rafalovitch
ypt index for security reasons. I have > tried Windows FS encryption option that works but want to know if solr has > some inbuilt feature to encrypt index or any good way to encrypt solr index? > > Thanks, > Sakshi >

Encrypting Solr Index

2019-06-25 Thread Ahuja, Sakshi
Hi, I am using solr 6.6 and want to encrypt index for security reasons. I have tried Windows FS encryption option that works but want to know if solr has some inbuilt feature to encrypt index or any good way to encrypt solr index? Thanks, Sakshi

Re: Solr index slow response

2019-03-19 Thread Walter Underwood
t; Sent: Tuesday, March 19, 2019 3:29:17 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr index slow response > > Indexing is CPU bound. If you have enough RAM, SSD disks, and enough client > threads, you should be able to drive CPU to over 90%. > > Start with tw

Re: Solr index slow response

2019-03-19 Thread Aaron Yingcai Sun
time. I will try with Solr Could cluster, maybe get better speed there. //Aaron From: Walter Underwood Sent: Tuesday, March 19, 2019 3:29:17 PM To: solr-user@lucene.apache.org Subject: Re: Solr index slow response Indexing is CPU bound. If you have enough RAM

Re: Solr index slow response

2019-03-19 Thread Emir Arnautović
wiki.apache.org > Schema Design Considerations. indexed fields. The number of indexed fields > greatly increases the following: Memory usage during indexing ; Segment merge > time > > > > > > From: Emir Arnautović > Sent: Tuesday

Re: Solr index slow response

2019-03-19 Thread Michael Gibney
I'll second Emir's suggestion to try disabling swap. "I doubt swap would affect it since there is such huge free memory." -- sounds reasonable, but has not been my experience, and the stats you sent indicate that swap is in fact being used. Also, note that in many cases setting vm.swappiness=0 is

Re: Solr index slow response

2019-03-19 Thread Walter Underwood
Indexing is CPU bound. If you have enough RAM, SSD disks, and enough client threads, you should be able to drive CPU to over 90%. Start with two client threads per CPU. That allows one thread to be sending data over the network while another is waiting for Solr to process the batch. A couple

Re: Solr index slow response

2019-03-19 Thread Bernd Fehling
Isn't there somthing about largePageTables which must be enabled in JAVA and also supported by OS for such huge heaps? Just a guess. Am 19.03.19 um 15:01 schrieb Jörn Franke: It could be an issue with jdk 8 that may not be suitable for such large heaps. Have more nodes with smaller heaps (eg

Re: Solr index slow response

2019-03-19 Thread Jörn Franke
It could be an issue with jdk 8 that may not be suitable for such large heaps. Have more nodes with smaller heaps (eg 31 gb) > Am 18.03.2019 um 11:47 schrieb Aaron Yingcai Sun : > > Hello, Solr! > > > We are having some performance issue when try to send documents for solr to > index. The

Re: Solr index slow response

2019-03-19 Thread Chris Ulicny
> From: Emir Arnautović > Sent: Tuesday, March 19, 2019 1:00:19 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr index slow response > > If you start indexing with just a single thread/client, do you still see > slow bulks? > > Emir > -- > Moni

Re: Solr index slow response

2019-03-19 Thread Aaron Yingcai Sun
n Considerations. indexed fields. The number of indexed fields greatly increases the following: Memory usage during indexing ; Segment merge time From: Emir Arnautović Sent: Tuesday, March 19, 2019 1:00:19 PM To: solr-user@lucene.apache.org Subject: Re: Solr ind

Re: Solr index slow response

2019-03-19 Thread Emir Arnautović
m: Emir Arnautović > Sent: Tuesday, March 19, 2019 12:30:33 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr index slow response > > Just to add different perspective here: how do you send documents to Solr? > Are those log lines from your client? Maybe it is not Solr that is

Re: Solr index slow response

2019-03-19 Thread Aaron Yingcai Sun
:33 PM To: solr-user@lucene.apache.org Subject: Re: Solr index slow response Just to add different perspective here: how do you send documents to Solr? Are those log lines from your client? Maybe it is not Solr that is slow. Could it be network or client itself. If you have some dry run on client

Re: Solr index slow response

2019-03-19 Thread Emir Arnautović
her faster way to index such big amount of data? > > > BRs > > //Aaron > > > From: Walter Underwood > Sent: Monday, March 18, 2019 4:59:20 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr index slow response > > Solr is not designed to have cons

Re: Solr index slow response

2019-03-19 Thread Aaron Yingcai Sun
028-189979 DBG1:doc_count: 10 , doc_size: 584 KB, Res code: > 200, QTime: 22800 ms, Request time: 22802 ms. > 190318-162821.056-189948 DBG1:doc_count: 10 , doc_size: 670 KB, Res code: > 200, QTime: 34193 ms, Request time: 34195 ms. > 190318-162821.062-189983 DBG1:doc_count: 10 , doc_s

Re: Solr index slow response

2019-03-18 Thread Walter Underwood
028-189979 DBG1:doc_count: 10 , doc_size: 584 KB, Res code: > 200, QTime: 22800 ms, Request time: 22802 ms. > 190318-162821.056-189948 DBG1:doc_count: 10 , doc_size: 670 KB, Res code: > 200, QTime: 34193 ms, Request time: 34195 ms. > 190318-162821.062-189983 DBG1:doc_count: 10 , doc_size: 67

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
Sent: Monday, March 18, 2019 2:54:25 PM To: solr-user@lucene.apache.org Subject: Re: Solr index slow response One other thing to look at besides the heap is your commit settings. We've experienced something similar, and changing commit settings alleviated the issue. Are you opening a search on ever

Re: Solr index slow response

2019-03-18 Thread Chris Ulicny
: Emir Arnautović > Sent: Monday, March 18, 2019 2:19:19 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr index slow response > > Hi Aaron, > Without looking too much into numbers, my bet would be that it is large > heap that is causing issues. I would decrease is

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
___ > From: Emir Arnautović > Sent: Monday, March 18, 2019 2:19:19 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr index slow response > > Hi Aaron, > Without looking too much into numbers, my bet would be that it is large heap > that is causi

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
G1:doc_count: 10 , doc_size: 605 KB, Res code: > 200, QTime: 108 ms, Request time: 110 ms. > 190318-142655.304-160208 DBG1:doc_count: 10 , doc_size: 481 KB, Res code: > 200, QTime: 89 ms, Request time: 90 ms. > 190318-142655.410-160208 DBG1:doc_count: 10 , doc_size: 468 KB, Res code: &

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
time. BRs //Aaron From: Emir Arnautović Sent: Monday, March 18, 2019 2:19:19 PM To: solr-user@lucene.apache.org Subject: Re: Solr index slow response Hi Aaron, Without looking too much into numbers, my bet would be that it is large heap that is causing issues

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
he.org Subject: Re: Solr index slow response On Mon, 2019-03-18 at 10:47 +, Aaron Yingcai Sun wrote: > Solr server is running on a quit powerful server, 32 cpus, 400GB RAM, > while 300 GB is reserved for solr, [...] 300GB for Solr sounds excessive. > Our application send 100 docume

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
path":"...", >> "commandLineArgs":["-Xms100G", >> "-Xmx300G", >> "-DSTOP.PORT=8079", >> "-DSTOP.KEY=..", >> "-Dsolr.solr.home=..", >> "-Djett

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
-Djetty.port=8983"], > "startTime":"2019-03-18T09:35:27.892Z", > "upTimeMS":9258422}}, > "system":{ >"name":"Linux", >"arch":"amd64", >"availableProcessors":32, &

Re: Solr index slow response

2019-03-18 Thread Toke Eskildsen
On Mon, 2019-03-18 at 10:47 +, Aaron Yingcai Sun wrote: > Solr server is running on a quit powerful server, 32 cpus, 400GB RAM, > while 300 GB is reserved for solr, [...] 300GB for Solr sounds excessive. > Our application send 100 documents to solr per request, json encoded. > the size is

Re: Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
quot;:14.72, "version":"3.0.101-311.g08a8a9d-default", "committedVirtualMemorySize":2547960700928, "freePhysicalMemorySize":4530696192, "freeSwapSpaceSize":3486846976, "processCpuLoad":0.3257436126790475, "p

Re: Solr index slow response

2019-03-18 Thread Emir Arnautović
Hi Aaron, Which version of Solr? How did you configure your heap? Is it standalone Solr or SolrCloud? A single server? Do you use some monitoring tool? Do you see some spikes, pauses or CPU usage is constant? Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr &

Solr index slow response

2019-03-18 Thread Aaron Yingcai Sun
Hello, Solr! We are having some performance issue when try to send documents for solr to index. The repose time is very slow and unpredictable some time. Solr server is running on a quit powerful server, 32 cpus, 400GB RAM, while 300 GB is reserved for solr, while this happening, cpu usage

Re: Solr Index Size after reindex

2019-02-14 Thread David Hastings
t; > I don't see any transaction not completed that normaly means that the > indexation is completed. That's why I don't understand the difference. > > > > Kind Regards > > > > Matthieu > > > > Original Message- > > From: Andrea Gazzarini [mailto

Re: Solr Index Size after reindex

2019-02-14 Thread Erick Erickson
rely completed. > > I don't see any transaction not completed that normaly means that the > indexation is completed. That's why I don't understand the difference. > > Kind Regards > > Matthieu > > Original Message- > From: Andrea Gazzarini [mailto:a

RE: Solr Index Size after reindex

2019-02-13 Thread Mathieu Menard
9 février 2019 16:56 To: solr-user@lucene.apache.org Subject: Re: Solr Index Size after reindex Yes, those numbers are different and that should explain the different size. I think you should be able to find some information in the Alfresco or Solr log. There must be a reason about the missing

Re: Solr Index Size after reindex

2019-02-09 Thread Andrea Gazzarini
8 février 2019 14:54 *To:* solr-user@lucene.apache.org *Subject:* Re: Solr Index Size after reindex Hi Mathieu, what about the docs in the two infrastructures? Do they have the same numbers (numdocs / maxdocs)? Any meaningful message (error or not) in log files? Andrea On 08/02/2019 14:1

RE: Solr Index Size after reindex

2019-02-08 Thread Mathieu Menard
To: solr-user@lucene.apache.org Subject: Re: Solr Index Size after reindex Hi Mathieu, what about the docs in the two infrastructures? Do they have the same numbers (numdocs / maxdocs)? Any meaningful message (error or not) in log files? Andrea On 08/02/2019 14:19, Mathieu Menard wrote: Hello, I

Re: Solr Index Size after reindex

2019-02-08 Thread Andrea Gazzarini
Hi Mathieu, what about the docs in the two infrastructures? Do they have the same numbers (numdocs / maxdocs)? Any meaningful message (error or not) in log files? Andrea On 08/02/2019 14:19, Mathieu Menard wrote: Hello, I would like to have your point of view about an observation we have

Re: Solr index writing to s3

2019-01-17 Thread Mikhail Khludnev
There is some experience on backup to s3 https://issues.apache.org/jira/browse/SOLR-9952 iirc, it lacks performance. Jörn, it's not a point, but literally s3 consistency might be enough, since s3 provides read-after-write for PUT and Lucene index writer is append-only. On Thu, Jan 17, 2019 at

Re: Solr index writing to s3

2019-01-16 Thread Jörn Franke
This is not a requirement. This is a statement to a problem where there could be other solutions. s3 is only eventually consistent and I am not sure Solr works properly in this case. You may also need to check the S3 consistency to be applied. > Am 16.01.2019 um 19:39 schrieb Naveen M : > >

Re: Solr index writing to s3

2019-01-16 Thread Hendrik Haddorp
Theoretically you should be able to use the HDFS backend, which you can configure to use s3. Last time I tried that it did however not work for some reason. Here is an example for that, which also seems to have ultimately failed:

Solr index writing to s3

2019-01-16 Thread Naveen M
hi, My requirement is to write the index data into S3, we have solr installed on aws instances. Please let me know if there is any documentation on how to achieve writing the index data to s3. Thanks

[solr-index-update] solr update Is there a "literal.field_name" feature in json?

2018-12-16 Thread 유정인
Hello, The solr update's csv has a "literal.field_name" function. Does json have a similar function? No function found. Thank you.

Re: Solr Index Data will be delete if state.json did not exists

2018-12-14 Thread Jan Høydahl
I would use the Backup/Restore API https://lucene.apache.org/solr/guide/7_5/making-and-restoring-backups.html Alternatively, you could create collection B, using same configset as A, stop solr, copy the data folder and

Solr Index Data will be delete if state.json did not exists

2018-12-13 Thread Lei Wang
Hi guys, Currently I am running a 2 nodes cloud of Solr 7.5, I already have a collection named A and it worked fine with 20GB index Data, while I want to create a collection named B and want to copy index data from A. So in Solr5.5, I just copy index folder from A and renamed to B. restart solr

Re: [solr-index]Can I do a lot of analysis on one field at the time of indexing?

2018-12-13 Thread Walter Underwood
> > If you can afford the time, can you give us a specific sample of the proposed > method? > > Thank you. > > -Original Message- > From: Walter Underwood > Sent: Friday, December 14, 2018 12:11 PM > To: solr-user@lucene.apache.org > Subject: Re: [so

RE: [solr-index]Can I do a lot of analysis on one field at the time of indexing?

2018-12-13 Thread 유정인
WalterUnderwood, thank you for your reply. If you can afford the time, can you give us a specific sample of the proposed method? Thank you. -Original Message- From: Walter Underwood Sent: Friday, December 14, 2018 12:11 PM To: solr-user@lucene.apache.org Subject: Re: [solr-index]Can

Re: [solr-index]Can I do a lot of analysis on one field at the time of indexing?

2018-12-13 Thread Walter Underwood
Right, no feature that does that for you. You should be able to code that with an update request processor script. You can fetch an analyzer chain, run it, add the results to a field, then do that again. I have one that runs a chain with minhash then saves the hex values of the hashes to a

Re: [solr-index]Can I do a lot of analysis on one field at the time of indexing?

2018-12-13 Thread Erick Erickson
In a word, "no". A field can have exactly one tokenizer, and there are no conditional filters. You can copyField to multiple individual fields and treat each one of those differently, i.e. copy from title to title1, title2 etc. where each one has a different analysis chain. Best, Erick On Thu,

[solr-index]Can I do a lot of analysis on one field at the time of indexing?

2018-12-13 Thread 유정인
Hello I have a question about index schemas. 1) Can I do various analysis on one field? For example, you can analyze the 'title' field with multiple tokenizers, and merge the analysis into a single field. 2) You can collect multiple fields in one field using 'copyField' function. However,

Re: Moving Solr index from Staging to Production

2018-11-28 Thread Toke Eskildsen
Arunan Sugunakumar wrote: > https://lucene.apache.org/solr/guide/6_6/making-and-restoring-backups.html We (also?) prefer to keep our stage/build setup separate from production. Backup + restore works well for us. It is very fast, as it is basically just copying the segment files. - Toke

Re: Moving Solr index from Staging to Production

2018-11-28 Thread David Hastings
you just set up the solr install on the production server as a slave to your current install and hit the replicate button from the admin interface on the production server On Wed, Nov 28, 2018 at 1:34 PM Arunan Sugunakumar wrote: > Hi, > > I have deployed Solr 7.2 in a staging server in

Moving Solr index from Staging to Production

2018-11-28 Thread Arunan Sugunakumar
Hi, I have deployed Solr 7.2 in a staging server in standalone mode. I want to move it to the production server. I would like to know whether I need to run the indexing process again or is there any easier way to move the existing index? I went through this documentation but I couldn't figure

Solr index size affected by duplication

2018-11-18 Thread sagandhi
while searching the nested docs are filtered out for proper result count. This required duplicating the nested doc fields in the parent doc. This duplication of fields has resulted in huge Solr index size and I am planning to get rid of them and use blockjoin for nested doc fields. This has caused

Re: SOLR Index Time Running Optimization

2018-09-26 Thread Walter Underwood
How long does the query take when it is run directly, without Solr? For our DIH queries, Solr was not the slow part. It took 90 minutes directly or with DIH. With our big cluster, I’ve seen indexing rates of one million docs per minute. wunder Walter Underwood wun...@wunderwood.org

Re: SOLR Index Time Running Optimization

2018-09-26 Thread Jan Høydahl
With DIH you are doing indexing single-threaded. You should be able to configure multiple DIH's on the same collection and then partition the data between them, issuing slightly different SQL to each. But I don't exactly know what that would look like. -- Jan Høydahl, search solution architect

Re: SOLR Index Time Running Optimization

2018-09-26 Thread Susheel Kumar
Also are you using Solr data import? That will be much slower compare to if you write our own little indexer which does indexing in batches and with multiple threads. On Wed, Sep 26, 2018 at 8:00 AM Vincenzo D'Amore wrote: > Hi, I know this is the shortest way but, had you tried to add more

Re: SOLR Index Time Running Optimization

2018-09-26 Thread Vincenzo D'Amore
Hi, I know this is the shortest way but, had you tried to add more core or CPU to your solr instances? How big is you collection in terms of GB and number of documents? Ciao, Vincenzo > On 26 Sep 2018, at 08:36, Krizelle Mae Hernandez > wrote: > > Hi. > > Our SOLR currently is running

SOLR Index Time Running Optimization

2018-09-26 Thread Krizelle Mae Hernandez
Hi. Our SOLR currently is running approximately 39hours for Full and Delta Import. I would like to ask for your assistance on how can we shorten the 39hours run time in any possible solution? For SOLR version, we are using solr 5.3.1. Regards, Krizelle Mae M. Hernandez

Re: Solr index clearing

2018-09-25 Thread Jan Høydahl
Hi, Solr does not do anything automatically, so I think this is a question for the Nutch community - http://nutch.apache.org/mailing_lists.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 24. sep. 2018 kl. 20:06 skrev Bineesh : > > Team, > > We use solr 7.3.1

Solr index clearing

2018-09-25 Thread Bineesh
Team, We use solr 7.3.1 and Nucth 1.15. I created two collections in solr and data successfully indexed from Nutch after crawling. Up on the third collection index in solr, i see that first collecion indexed data automatically clears.Pls suggest -- Sent from:

Re: Solr Index Issues

2018-09-10 Thread Walter Underwood
Every time you see "Expected mime type application/octet-stream but got text/html” from SolrJ, it means that Solr returned an error. Look for an error in the Solr logs at the same time as the SolrJ message. It could be any error, which is why we can’t help more. After you know the Solr error,

Re: Solr Index Issues

2018-09-10 Thread Erick Erickson
It would be best to ask on the Nutch mailing list, this list doesn't have very many people who know _how_ Nutch uses Solr though. Best, Erick On Sun, Sep 9, 2018 at 11:47 PM Bineesh wrote: > > Hi Team, > > We are using Nutch 1.15 and Solr 6.6.3 > > We tried crawling one of the URL and and

Solr Index Issues

2018-09-10 Thread Bineesh
Hi Team, We are using Nutch 1.15 and Solr 6.6.3 We tried crawling one of the URL and and noticed issues while indexing data to solr.Below is the capture from logs Caused by: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at

Re: can you migrate solr index files from osx to linux

2018-02-07 Thread Jeff Dyke
I forgot to report back on this. For anyone that runs into it, you need the entire data directory not just the index directory, at least that's what made it work for me. On Thu, Feb 1, 2018 at 9:52 PM, Erick Erickson wrote: > I think SCP will be fine. Shawn's comment

Re: can you migrate solr index files from osx to linux

2018-02-01 Thread Erick Erickson
I think SCP will be fine. Shawn's comment is probably the issue. Best, Erick On Thu, Feb 1, 2018 at 4:34 PM, Shawn Heisey wrote: > On 2/1/2018 4:32 PM, Jeff Dyke wrote: >> I just created a tar file, actually a tar.gz file and scp'd to a server, at >> first i was worried

Re: can you migrate solr index files from osx to linux

2018-02-01 Thread Shawn Heisey
On 2/1/2018 4:32 PM, Jeff Dyke wrote: > I just created a tar file, actually a tar.gz file and scp'd to a server, at > first i was worried that the gzip caused issues, but as i mentioned no > errors on start up, and i thought i would see some. @Erick, how would you > recommend. This is going to

Re: can you migrate solr index files from osx to linux

2018-02-01 Thread Jeff Dyke
I just created a tar file, actually a tar.gz file and scp'd to a server, at first i was worried that the gzip caused issues, but as i mentioned no errors on start up, and i thought i would see some. @Erick, how would you recommend. This is going to be less of an issue b/c i need to build the

Re: can you migrate solr index files from osx to linux

2018-02-01 Thread Erick Erickson
One note, be _very_ sure you copy in binary mode.. On Thu, Feb 1, 2018 at 1:33 PM, Shawn Heisey wrote: > On 2/1/2018 12:56 PM, Jeff Dyke wrote: >> That's exactly what i thought as well. The only difference and i can try >> to downgrade OSX is 7.2, and i grabbed 7.2.1

Re: can you migrate solr index files from osx to linux

2018-02-01 Thread Shawn Heisey
On 2/1/2018 12:56 PM, Jeff Dyke wrote: > That's exactly what i thought as well. The only difference and i can try > to downgrade OSX is 7.2, and i grabbed 7.2.1 for install on Ubuntu. I > didn't think a point minor point release would matter. > > solr@stagingsolr01:~/data/issuers/data$ ls -1 >

Re: can you migrate solr index files from osx to linux

2018-02-01 Thread Jeff Dyke
That's exactly what i thought as well. The only difference and i can try to downgrade OSX is 7.2, and i grabbed 7.2.1 for install on Ubuntu. I didn't think a point minor point release would matter. solr@stagingsolr01:~/data/issuers/data$ ls -1 981552 index _mg8.dii _mg8.dim _mg8.fdt _mg8.fdx

  1   2   3   4   5   6   7   8   9   >