RE: Multi-lingual search

2016-02-08 Thread vidya
Hi Can i implement proximity search if i use >seperate core per language >field per language >multilingual field that supports all languages. And what does proximity search exactly mean? searching for walk word when walking is indexed,should fetch and display the record? It will be included i

Re: Solr 4.10 with Jetty 8.1.10 & Tomcat 7

2016-02-08 Thread Shahzad Masud
Thank you Shawn for your response. I would be running some performance tests lately on this structure (one JVM with multiple cores), and would share feedback on this thread. >There IS a way to specify the solr home for a specific context, but keep >in mind that I definitely DO NOT recommend doing

Leader election issues after upgrade from 4.10.4 to 5.4.1

2016-02-08 Thread Mike Thomsen
We get this error on one of our nodes: Caused by: org.apache.solr.common.SolrException: There is conflicting information about the leader of shard: shard2 our state says: http://server01:8983/solr/collection/ but zookeeper says: http://server02:8983/collection/ Then I noticed this in the log: ]

Re: Request for SOLR-wiki edit permissions

2016-02-08 Thread Jason Gerlowski
Thanks Anshum! On Mon, Feb 8, 2016 at 1:01 PM, Anshum Gupta wrote: > Done. > > On Mon, Feb 8, 2016 at 9:55 AM, Jason Gerlowski > wrote: > >> Hi all, >> >> Can someone please give me edit permissions for the Solr wiki. Is >> there anything I should or need to do to get these permissions? My >>

Re: SolrCloud behavior when a ZooKeeper node goes down

2016-02-08 Thread Erick Erickson
My first guess would be that all of the ZK are configured with each other's addresses available? Or perhaps AWS is messing with your machine addresses On Mon, Feb 8, 2016 at 12:09 PM, Kelly, Frank wrote: > We are running a small SolrCloud instance on AWS > > Solr : Version 5.3.1 > ZooKeep

Re: Solr architecture

2016-02-08 Thread Jack Krupansky
Oops... at 100 qps for a single node you would need 120 nodes to get to 12K qps and 800 nodes to get 80K qps, but that is just an extremely rough ballpark estimate, not some precise and firm number. And that's if all the queries can be evenly distributed throughout the cluster and don't require fan

Solr 5.3 SSL

2016-02-08 Thread Jian Zhang
Hi, Solr guru We have worked well with solr 5.3 non SSL. Now we are working on SSL by following https://cwiki.apache.org/confluence/display/solr/Enabling+SSL Cluster is up and and can be accessible via https://:18983/solr/# When w

SolrCloud behavior when a ZooKeeper node goes down

2016-02-08 Thread Kelly, Frank
We are running a small SolrCloud instance on AWS Solr : Version 5.3.1 ZooKeeper: Version 3.4.6 3 x ZooKeeper nodes (with higher limits and timeouts due to being on AWS) 3 x Solr Nodes (8 GB of memory each - 2 collections with 3 shards for each collection) Let's call the ZooKeeper nodes A, B and

online scoring explanation

2016-02-08 Thread Doug Turnbull
Splainer maybe ;) http://splainer.io Hope it's useful to you. Let us know if you have suggestions/ideas/bugs http://github.com/o19s/splainer On Monday, February 8, 2016, John Blythe > wrote: > hi all, > > last year i had gotten a site recommended to me on this forum. it helped > you break down

Re: online scoring explanation

2016-02-08 Thread Toke Eskildsen
John Blythe wrote: > last year i had gotten a site recommended to me on this forum. it helped > you break down the results/score you were getting from your queries. http://splainer.io/ perhaps? - Toke Eskildsen

online scoring explanation

2016-02-08 Thread John Blythe
hi all, last year i had gotten a site recommended to me on this forum. it helped you break down the results/score you were getting from your queries. it isn't explain.solr.pl, but another one that seemed a bit more robust if my memory serves me correctly. i want to say a member of the thread not o

Re: Request for SOLR-wiki edit permissions

2016-02-08 Thread Anshum Gupta
Done. On Mon, Feb 8, 2016 at 9:55 AM, Jason Gerlowski wrote: > Hi all, > > Can someone please give me edit permissions for the Solr wiki. Is > there anything I should or need to do to get these permissions? My > wiki username is "Jason.Gerlowski", and my wiki email is > "gerlowsk...@gmail.com"

Request for SOLR-wiki edit permissions

2016-02-08 Thread Jason Gerlowski
Hi all, Can someone please give me edit permissions for the Solr wiki. Is there anything I should or need to do to get these permissions? My wiki username is "Jason.Gerlowski", and my wiki email is "gerlowsk...@gmail.com". I spotted a few things that could use some clarification on the HowToCon

Re: Solr architecture

2016-02-08 Thread Jack Krupansky
So is there any aging or TTL (in database terminology) of older docs? And do all of your queries need to query all of the older documents all of the time or is there a clear hierarchy of querying for aged documents, like past 24-hours vs. past week vs. past year vs. older than a year? Sure, you ca

Re: Bulk delete of Solr documents

2016-02-08 Thread Susheel Kumar
Yes, use below url http://localhost:8983/solr//update?stream.body= *:*&commit=true On Mon, Feb 8, 2016 at 11:33 AM, Anil wrote: > Hi , > > Can we delete solr documents from a collection in a bulk ? > > Regards, > Anil >

Re: Bulk delete of Solr documents

2016-02-08 Thread Yago Riveiro
Yes. You can delete using a query http://blog.dileno.com/archive/201106/delete-documents-from-solr-index-by- query/ \-- /Yago Riveiro > On Feb 8 2016, at 4:35 pm, Anil wrote: > > Hi , > > Can we delete solr documents from a collection in a bulk ? > >

Bulk delete of Solr documents

2016-02-08 Thread Anil
Hi , Can we delete solr documents from a collection in a bulk ? Regards, Anil

Re: Tutorial or Code Samples to explain how to Write Solr Plugins

2016-02-08 Thread Binoy Dalal
I've compiled a sample search component example working on solr 5.4.1 The code is ready to run. Find it here: https://github.com/lttazz99/SolrPluginsExamples.git On Mon, Feb 8, 2016 at 1:51 PM Gian Maria Ricci - aka Alkampfer < alkamp...@nablasoft.com> wrote: > Perfect, Thanks again to everyone.

Re: Solr architecture

2016-02-08 Thread Erick Erickson
Short form: You really have to prototype. Here's the long form: https://lucidworks.com/blog/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/ I've seen between 20M and 200M docs fit on a single piece of hardware, so you'll absolutely have to shard. And the other th

Re: Solr architecture

2016-02-08 Thread Emir Arnautovic
Hi Mark, Can you give us bit more details: size of docs, query types, are docs grouped somehow, are they time sensitive, will they update or it is rebuild every time, etc. Thanks, Emir On 08.02.2016 16:56, Mark Robinson wrote: Hi, We have a requirement where we would need to index around 2 B

Re: Solr architecture

2016-02-08 Thread Susheel Kumar
Also if you are expecting indexing of 2 billion docs as NRT or if it will be offline (during off hours etc). For more accurate sizing you may also want to index say 10 million documents which may give you idea how much is your index size and then use that for extrapolation to come up with memory r

Re: Data Import Handler - autoSoftCommit and autoCommit

2016-02-08 Thread Susheel Kumar
You can start with one of the suggestions from this link based on your indexing and query load. https://lucidworks.com/blog/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/ Thanks, Susheel On Mon, Feb 8, 2016 at 10:15 AM, Troy Edwards wrote: > We are running the

Solr architecture

2016-02-08 Thread Mark Robinson
Hi, We have a requirement where we would need to index around 2 Billion docs in a day. The queries against this indexed data set can be around 80K queries per second during peak time and during non peak hours around 12K queries per second. Can Solr realize this huge volumes. If so, assuming we ha

Re: Data Import Handler - autoSoftCommit and autoCommit

2016-02-08 Thread Rajesh Hazari
we have this for a collection which updated every 3mins with min of 500 documents and once in a day of 10k documents in start of the day ${solr.autoCommit.maxTime:30} 1 true true ${solr.autoSoftCommit.maxTime:6000} As per solr documentation, If

Re: solr performance issue

2016-02-08 Thread Susheel Kumar
1 million document shouldn't have any issues at all. Something else is wrong with your hw/system configuration. Thanks, Susheel On Mon, Feb 8, 2016 at 6:45 AM, sara hajili wrote: > On Mon, Feb 8, 2016 at 3:04 AM, sara hajili wrote: > > > sorry i made a mistake i have a bout 1000 K doc. > > i

Re: Tesseract command-line OCR engine has stopped working

2016-02-08 Thread Zheng Lin Edwin Yeo
Has anyone experienced this before during indexing of EML files? Regards, Edwin On 5 February 2016 at 17:30, Zheng Lin Edwin Yeo wrote: > Hi, > > I am indexing EML files (emails) into Solr, and some of those emails has > attachment. > > During the indexing, I encountered this "*Tesseract comman

Data Import Handler - autoSoftCommit and autoCommit

2016-02-08 Thread Troy Edwards
We are running the data import handler to retrieve about 10 million records during work hours every day of the week. We are using Clean = true, Commit = true and Optimize = true. The entire process takes about 1 hour. What would be a good setting for autoCommit and autoSoftCommit? Thanks

Re: Solr 4.10 with Jetty 8.1.10 & Tomcat 7

2016-02-08 Thread Shawn Heisey
On 2/8/2016 1:14 AM, Shahzad Masud wrote: > Thank you Shawn for your reply. Here is my structure of cores and shards > > Shard 1 = localhost:8983/solr_2014 [3 Core - Employee, Service Tickets, > Departments] > Shard 2 = localhost:8983/solr_2015 [3 Core - Employee, Service Tickets, > Departments]

Sequential Documents Ids

2016-02-08 Thread Shai Rubin
Hi, Recently I've read Michael McCandless' article (http://blog.mikemccandless.com/2014/05/choosing-fast-unique-identifier-uuid.html) and made some changes to the id assigning mechanism on my project. I'm newbie to Solr / Lucene and I'm trying to figure out how do these changes affect the in

Re: solr performance issue

2016-02-08 Thread sara hajili
On Mon, Feb 8, 2016 at 3:04 AM, sara hajili wrote: > sorry i made a mistake i have a bout 1000 K doc. > i mean about 100 doc. > > On Mon, Feb 8, 2016 at 1:35 AM, Emir Arnautovic < > emir.arnauto...@sematext.com> wrote: > >> Hi Sara, >> Not sure if I am reading this right, but I read it as you

Re: solr performance issue

2016-02-08 Thread Emir Arnautovic
Hi Sara, It is still considered to be small index. Can you give us bit details about your setup? Thanks, Emir On 08.02.2016 12:04, sara hajili wrote: sorry i made a mistake i have a bout 1000 K doc. i mean about 100 doc. On Mon, Feb 8, 2016 at 1:35 AM, Emir Arnautovic < emir.arnauto...@s

Re: solr performance issue

2016-02-08 Thread sara hajili
sorry i made a mistake i have a bout 1000 K doc. i mean about 100 doc. On Mon, Feb 8, 2016 at 1:35 AM, Emir Arnautovic < emir.arnauto...@sematext.com> wrote: > Hi Sara, > Not sure if I am reading this right, but I read it as you have 1000 doc > index and issues? Can you tell us bit more about

RE: Multi-lingual search

2016-02-08 Thread vidya
Hi I need to search on these languages which includes proximity search also. 1.Malay 2.Tamil 3.Bahasa Indonesia 4.Vietnamese 5.Cantonese Will IndicNormalizationFilter work fine or any other filter? Help me if you have already worked on it or have any idea. Thanks in advance -- View this mess

Re: solr performance issue

2016-02-08 Thread Emir Arnautovic
Hi Sara, Not sure if I am reading this right, but I read it as you have 1000 doc index and issues? Can you tell us bit more about your setup: number of servers, hw, index size, number of shards, queries that you run, do you index at the same time... It seems to me that you are running Solr on

solr performance issue

2016-02-08 Thread sara hajili
hi all. i have a problem with my solr performance and usage hardware like a ram,cup... i have a lot of document and so indexed file about 1000 doc in solr that every doc has about 8 field in average. and each field has about 60 char. i set my field as a storedfield = "false" except of 1 field. //

RE: Tutorial or Code Samples to explain how to Write Solr Plugins

2016-02-08 Thread Gian Maria Ricci - aka Alkampfer
Perfect, Thanks again to everyone. -- Gian Maria Ricci Cell: +39 320 0136949 -Original Message- From: Binoy Dalal [mailto:binoydala...@gmail.com] Sent: giovedì 4 febbraio 2016 15:07 To: solr-user@lucene.apache.org Subject: Re: Tutorial or Code Samples to explain how to Write Solr Pl

Re: Solr 4.10 with Jetty 8.1.10 & Tomcat 7

2016-02-08 Thread Shahzad Masud
Thank you Shawn for your reply. Here is my structure of cores and shards Shard 1 = localhost:8983/solr_2014 [3 Core - Employee, Service Tickets, Departments] Shard 2 = localhost:8983/solr_2015 [3 Core - Employee, Service Tickets, Departments] Shard 3 = localhost:8983/solr_2016 [3 Core - Employe