RE: Backup a solr cloud collection - timeout in 180s?

2018-04-10 Thread Petersen, Robert (Contr)
to bewildering results. See: https://issues.apache.org/jira/browse/SOLR-11739. The operations would succeed, but you might not be getting the status of the task you think you are. Best, Erick On Tue, Apr 10, 2018 at 9:25 AM, Petersen, Robert (Contr) <robert.peters...@ftr.com> wrote: >

Re: Backup a solr cloud collection - timeout in 180s?

2018-04-10 Thread Petersen, Robert (Contr)
erty, see: https://lucene.apache.org/solr/guide/6_6/collections-api.html There's also a way to check the status of the backup running in the background. Best, Erick On Mon, Apr 9, 2018 at 11:05 AM, Petersen, Robert (Contr) <robert.peters...@ftr.com> wrote: > Shouldn't this just create the backup fi

Backup a solr cloud collection - timeout in 180s?

2018-04-09 Thread Petersen, Robert (Contr)
Shouldn't this just create the backup file(s) asynchronously? Can the timeout be adjusted? Solr 7.2.1 with five nodes and the addrsearch collection is five shards x five replicas and "numFound":38837970 docs Thx Robi

CDCR - cross data center replication

2018-01-25 Thread Petersen, Robert (Contr)
Hi all, So for an initial CDCR setup documentation says bulk load should be performed first otherwise CDCR won't keep up. By bulk load does that include an ETL process doing rapid atomic updates one doc at a time (with multiple threads) so like 4K docs per minute assuming bandwidth between

Re: solr 5.4.1 leader issue

2018-01-08 Thread Petersen, Robert (Contr)
OK just restarting all the solr nodes did fix it, since they are in production I was hesitant to do that From: Petersen, Robert (Contr) <robert.peters...@ftr.com> Sent: Monday, January 8, 2018 12:34:28 PM To: solr-user@lucene.apache.org Subject: solr 5.4.1

Re: solr 5.4.1 leader issue

2018-01-08 Thread Petersen, Robert (Contr)
the solr instances, the zookeeper instances, both, or is there another better way without restarting everything? Thx Robi From: Petersen, Robert (Contr) <robert.peters...@ftr.com> Sent: Monday, January 8, 2018 12:34:28 PM To: solr-user@lucene.apac

Re: solr 5.4.1 leader issue

2018-01-08 Thread Petersen, Robert (Contr)
I'm on zookeeper 3.4.8 From: Petersen, Robert (Contr) <robert.peters...@ftr.com> Sent: Monday, January 8, 2018 12:34:28 PM To: solr-user@lucene.apache.org Subject: solr 5.4.1 leader issue Hi got two out of my three servers think they are replicas on one

solr 5.4.1 leader issue

2018-01-08 Thread Petersen, Robert (Contr)
Hi got two out of my three servers think they are replicas on one shard getting exceptions wondering what is the easiest way to fix this? Can I just restart zookeeper across the servers? Here are the exceptions: TY Robi ERROR null RecoveryStrategy Error while trying to recover.

Re: Any Insights SOLR Rank tuning tool

2017-12-14 Thread Petersen, Robert (Contr)
I remember when FAST (when it was still FAST) came to our enterprise to pitch their search when we were looking to replace our alta vista search engine with *something* and they demonstrated that relevance tool for business side. While that thing was awesome, I've never seen anything close to

Re: SOLR Rest API for monitoring

2017-12-14 Thread Petersen, Robert (Contr)
you are using cloudera? sounds like a question for them... From: Abhi Basu <9000r...@gmail.com> Sent: Thursday, December 14, 2017 1:27:23 PM To: solr-user@lucene.apache.org Subject: SOLR Rest API for monitoring Hi All: I am using CDH 5.13 with Solr 4.10. Trying

Re: Solr upgrade from 4.x to 7.1

2017-12-14 Thread Petersen, Robert (Contr)
>From what I have read, you can only upgrade to the next major version number >without using a tool to convert the indexes to the newer version. But that is >still perilous due to deprications etc So I think best advice out there is to spin up a new farm on 7.1 (especially from 4.x), make a

Re: Can someone help? Two level nested doc... ChildDocTransformerFactory sytax...

2017-11-07 Thread Petersen, Robert (Contr)
From: Petersen, Robert (Contr) <robert.peters...@ftr.com> Sent: Monday, November 6, 2017 5:05:31 PM To: solr-user@lucene.apache.org Subject: Can someone help? Two level nested doc... ChildDocTransformerFactory sytax... OK no faceting, no filtering, I just want the hie

Can someone help? Two level nested doc... ChildDocTransformerFactory sytax...

2017-11-06 Thread Petersen, Robert (Contr)
OK no faceting, no filtering, I just want the hierarchy to come backin the results. Can't quite get it... googled all over the place too. Doc: { id : asdf, type_s:customer, firstName_s:Manny, lastName_s:Acevedo, address_s:"123 Fourth Street", city_s:Gotham, tn_s:1234561234,

Re: Java 9

2017-11-06 Thread Petersen, Robert (Contr)
Actually I can't believe they're depricating UseConcMarkSweepGC , That was the one that finally made solr 'sing' with no OOMs! I guess they must have found something better, have to look into that... Robi From: Chris Hostetter Sent:

Java 9

2017-11-06 Thread Petersen, Robert (Contr)
Hi Guys, Anyone else been noticing this this msg when starting up solr with java 9? (This is just an FYI and not a real question) Java HotSpot(TM) 64-Bit Server VM warning: Option UseConcMarkSweepGC was deprecated in version 9.0 and will likely be removed in a future release. Java

Re: Anyone have any comments on current solr monitoring favorites?

2017-11-06 Thread Petersen, Robert (Contr)
first service that is trying InfluxDB. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Nov 6, 2017, at 1:31 PM, Petersen, Robert (Contr) > <robert.peters...@ftr.com> wrote: > > Hi Walter, > > > Yes, now I see it. I'm wond

Re: Anyone have any comments on current solr monitoring favorites?

2017-11-06 Thread Petersen, Robert (Contr)
ve any comments on current solr monitoring favorites? Look back down the string to my post. We use Grafana. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Nov 6, 2017, at 11:23 AM, Petersen, Robert (Contr) > <robert.peters...@ftr.com> wrot

Re: Anyone have any comments on current solr monitoring favorites?

2017-11-06 Thread Petersen, Robert (Contr)
2017-11-06 20:13 GMT+01:00 Petersen, Robert (Contr) < robert.peters...@ftr.com>: > PS I knew sematext would be required to chime in here!  > > > Is there a non-expiring dev version I could experiment with? I think I did > sign up for a trial years ago from a different company...

Re: Anyone have any comments on current solr monitoring favorites?

2017-11-06 Thread Petersen, Robert (Contr)
boards that mix Cloudwatch (for > the load balancer) and InfluxDB. > > I’m still working out the kinks in some of the more complicated queries, but > the data is all there. I also want to expand the servlet filter to report > HTTP response codes. > > wunder > Walter Underwood

String payloads...

2017-11-06 Thread Petersen, Robert (Contr)
Hi Guys, I was playing with payloads example as I had a possible use case of alternate product titles for a product. https://lucidworks.com/2017/09/14/solr-payloads/ bin/solr start bin/solr create -c payloads bin/post -c payloads -type text/csv -out yes -d $'id,vals_dpf\n1,one|1.0 two|2.0

Anyone have any comments on current solr monitoring favorites?

2017-11-02 Thread Petersen, Robert (Contr)
OK I'm probably going to open a can of worms here... lol In the old old days I used PSI probe to monitor solr running on tomcat which worked ok on a machine by machine basis. Later I had a grafana dashboard on top of graphite monitoring which was really nice looking but kind of complicated

Re: Upgrade path from 5.4.1

2017-11-02 Thread Petersen, Robert (Contr)
Thanks guys! I kind of suspected this would be the best route and I'll move forward with a fresh start on 7.x as soon as I can get ops to give me the needed machines!  Best Robi From: Erick Erickson Sent: Thursday, November 2, 2017

Upgrade path from 5.4.1

2017-11-01 Thread Petersen, Robert (Contr)
Hi Guys, I just took over the care and feeding of three poor neglected solr 5.4.1 cloud clusters at my new position. While spinning up new collections and supporting other business initiatives I am pushing management to give me the green light on migrating to a newer version of solr. The last

RE: Do I really need copyField when my app can do the copy?

2015-07-08 Thread Petersen, Robert
Perhaps some people like maybe those using DIH to feed their index might not have that luxury and copyfield is the better way for them. If you have an application you can do it either way. I have done both ways in different situations. Robi -Original Message- From: Steven White

RE: Best practice to support multi-tenant with Solr

2014-03-15 Thread Petersen, Robert
Hi Overall I think you are mixing up your terminology. What used to be called a 'core' is now called a 'collection' in solr cloud. In the old master slave setup, you made separate cores and replicated them to all slaves. Now they want you to think of them as collections and let the cloud

RE: network slows when solr is running - help

2014-03-04 Thread Petersen, Robert
autoCommit maxDocs25/maxDocs maxTime90/maxTime /autoCommit -Original Message- From: Lan [mailto:dung@gmail.com] Sent: Monday, March 03, 2014 1:24 PM To: solr-user@lucene.apache.org Subject: Re: network slows when solr is running - help How frequently

network slows when solr is running - help

2014-02-28 Thread Petersen, Robert
Hi guys, Got an odd thing going on right now. Indexing into my master server (solr 3.6.1) has slowed and it is because when solr runs ping shows latency. When I stop solr though, ping returns to normal. This has been happening occasionally, rebooting didn't help. This is the first time I

RE: network slows when solr is running - help

2014-02-28 Thread Petersen, Robert
machines causing slow downs) Another possible option is the network card is offloading processing onto the CPU which is introducing latency when the CPU is under load. On Fri, Feb 28, 2014 at 4:11 PM, Petersen, Robert robert.peter...@mail.rakuten.com wrote: Hi guys, Got an odd thing going

RE: Searching with special chars

2014-02-27 Thread Petersen, Robert
I agree with Erick, but if you want the special characters to count in searches, you might consider not just stripping them out but replacing them with textual placeholders (which would also have to be done at indexing time). For instance, I replace C# with csharp and C++ with cplusplus during

RE: expungeDeletes vs optimize

2014-02-05 Thread Petersen, Robert
Hi Bryan, From what I've seen it will only get rid of the deletes in the segments that the commit merged and there will be some residual deleted docs still in the index. It doesn't do the full rewrite. Even if you play with merge factors etc, you'll still have lint. In your situation I'd

RE: Interesting search question! How to match documents based on the least number of fields that match all query terms?

2014-01-22 Thread Petersen, Robert
Hi Daniel, How about trying something like this (you'll have to play with the boosts to tune this), search all the fields with all the terms using edismax and use the minimum should match parameter, but require all terms to match in the allMetadata field.

solr as nosql - pulling all docs vs deep paging limitations

2013-12-17 Thread Petersen, Robert
Hi solr users, We have a new use case where need to make a pile of data available as XML to a client and I was thinking we could easily put all this data into a solr collection and the client could just do a star search and page through all the results to obtain the data we need to give them.

RE: solr as nosql - pulling all docs vs deep paging limitations

2013-12-17 Thread Petersen, Robert
My use case is basically to do a dump of all contents of the index with no ordering needed. It's actually to be a product data export for third parties. Unique key is product sku. I could take the min sku and range query up to the max sku but the skus are not contiguous because some get

RE: Best implementation for multi-price store?

2013-11-21 Thread Petersen, Robert
Hi, I'd go with (2) also but using dynamic fields so you don't have to define all the storeX_price fields in your schema but rather just one *_price field. Then when you filter on store:store1 you'd know to sort with store1_price and so forth for units. That should be pretty straightforward.

RE: Sorting memory-efficiently by any numeric field (dates too?)

2013-11-12 Thread Petersen, Robert
Hi Erick, I like your idea, FWIW please also leave room for boost by function query which takes many numeric fields as input but results in a single value. I don't know if this counts as a really clever function but here's one that I currently use: {!boost

RE: removing duplicates

2013-08-21 Thread Petersen, Robert
Hi Perhaps you could query for all documents asking for the id field to be returned and then facet on the field you say you can key off of for duplicates. Set the facet mincount to 2, then you would have to filter on each facet value and page through all doc IDs (except skip the first

RE: removing duplicates

2013-08-21 Thread Petersen, Robert
This would describe the facet parameters we're talking about: http://wiki.apache.org/solr/SimpleFacetParameters Query something like this: http://localhost:8983/solr/select?q=*:*fl=idrows=0facet=truefacet.limit=-1facet.field=your field namefacet.mincount=2 Then filter on each facet returned

RE: uniqueKey: string vs. long integer

2013-08-01 Thread Petersen, Robert
Hi guys, We have used an integer as our unique key since solr 1.3 with no problems at all. We never thought of using anything else because our solr unique key is based upon our product sku data base field which is defined as an integer also. We're on solr 3.6.1 currently. Thanks Robi

RE: replication getting stuck on a file

2013-08-01 Thread Petersen, Robert
I have seen this happen before in our 3.6.1 deployment. It seemed related to high JVM memory consumption on the server when our index got too big (ie we were close to getting OOMs). That is probably why restarting solr sort of fixes it, assuming the file it is stuck on is the final file and

RE: Alternative searches

2013-07-31 Thread Petersen, Robert
Hi Mark Yes, it is something we implemented also. We just try various subsets of the search terms when there are zero results. To increase performance for all these searches we return only the first three results and no facets so we can simply display the result counts for the various

RE: expunging deletes

2013-07-12 Thread Petersen, Robert
, 2013 5:34 PM To: solr-user@lucene.apache.org Subject: Re: expunging deletes On 7/10/2013 5:58 PM, Petersen, Robert wrote: Using solr 3.6.1 and the following settings, I am trying to run without optimizes. I used to optimize nightly, but sometimes the optimize took a very long time to complete

expunging deletes

2013-07-10 Thread Petersen, Robert
Hi guys, Using solr 3.6.1 and the following settings, I am trying to run without optimizes. I used to optimize nightly, but sometimes the optimize took a very long time to complete and slowed down our indexing. We are continuously indexing our new or changed data all day and night. After a

replication getting stuck on a file

2013-07-09 Thread Petersen, Robert
Hi My solr 3.6.1 slave farm is suddenly getting stuck during replication. It seems to stop on a random file on various slaves (not all) and not continue. I've tried stoping and restarting tomcat etc but some slaves just can't get the index pulled down. Note there is plenty of space on the

RE: replication getting stuck on a file

2013-07-09 Thread Petersen, Robert
Time Remaining: 88091277s, Speed: 281 bytes/s -Original Message- From: Petersen, Robert [mailto:robert.peter...@mail.rakuten.com] Sent: Tuesday, July 09, 2013 1:22 PM To: solr-user@lucene.apache.org Subject: replication getting stuck on a file Hi My solr 3.6.1 slave farm is suddenly

RE: Informal poll on running Solr 4 on Java 7 with G1GC

2013-06-20 Thread Petersen, Robert
I've been trying it out on solr 3.6.1 with a 32GB heap and G1GC seems to be more prone to OOMEs than CMS. I have been running it on one slave box in our farm and the rest of the slaves are still on CMS and three times now it has gone OOM on me whereas the rest of our slaves kept chugging along

RE: yet another optimize question

2013-06-19 Thread Petersen, Robert
document cache, too. I usually see about 0.75 or better on that. wunder On Jun 18, 2013, at 10:22 AM, Petersen, Robert wrote: Hi Otis, Yes the query results cache is just about worthless. I guess we have too diverse of a set of user queries. The business unit has decided to let bots

RE: TieredMergePolicy reclaimDeletesWeight

2013-06-19 Thread Petersen, Robert
Subject: Re: TieredMergePolicy reclaimDeletesWeight The default is 2.0, and higher values will more strongly favor merging segments with deletes. I think 20.0 is likely way too high ... maybe try 3-5? Mike McCandless http://blog.mikemccandless.com On Tue, Jun 18, 2013 at 6:46 PM, Petersen, Robert

RE: TieredMergePolicy reclaimDeletesWeight

2013-06-19 Thread Petersen, Robert
, Petersen, Robert robert.peter...@mail.rakuten.com wrote: OK thanks, will do. Just out of curiosity, what would having that set way too high do? Would the index become fragmented or what? -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Wednesday, June

RE: yet another optimize question

2013-06-19 Thread Petersen, Robert
a lot. That is OK, because users are getting faster responses than they would from Solr. A 5% hit rate may be OK since you have that front end HTTP cache. The Netflix index was updated daily. wunder On Jun 19, 2013, at 10:36 AM, Petersen, Robert wrote: Hi Walter, I used to have larger

RE: yet another optimize question

2013-06-18 Thread Petersen, Robert
ElasticSearch Support -- http://sematext.com/ On Mon, Jun 17, 2013 at 2:21 PM, Petersen, Robert robert.peter...@mail.rakuten.com wrote: Hi Otis, Right I didn't restart the JVMs except on the one slave where I was experimenting with using G1GC on the 1.7.0_21 JRE. Also some time ago I made all

RE: yet another optimize question

2013-06-18 Thread Petersen, Robert
. Upayavira On Mon, Jun 17, 2013, at 07:21 PM, Petersen, Robert wrote: Hi Otis, Right I didn't restart the JVMs except on the one slave where I was experimenting with using G1GC on the 1.7.0_21 JRE. Also some time ago I made all our caches small enough to keep us from getting OOMs while still having

RE: yet another optimize question

2013-06-18 Thread Petersen, Robert
to facet.method=fc, however the JVM heap usage went down from about 20GB to 4GB. André On 06/17/2013 08:21 PM, Petersen, Robert wrote: Also some time ago I made all our caches small enough to keep us from getting OOMs while still having a good hit rate.Our index has about 50 fields which

TieredMergePolicy reclaimDeletesWeight

2013-06-18 Thread Petersen, Robert
Hi In continuing a previous conversation, I am attempting to not have to do optimizes on our continuously updated index in solr3.6.1 and I came across the mention of the reclaimDeletesWeight setting in this blog: http://blog.mikemccandless.com/2011/02/visualizing-lucenes-segment-merges.html

RE: yet another optimize question

2013-06-17 Thread Petersen, Robert
on? Otis -- Performance Monitoring - http://sematext.com/spm/index.html Solr ElasticSearch Support -- http://sematext.com/ On Fri, Jun 14, 2013 at 8:04 PM, Petersen, Robert robert.peter...@mail.rakuten.com wrote: Hi guys, We're on solr 3.6.1 and I've read the discussions about whether

RE: yet another optimize question

2013-06-17 Thread Petersen, Robert
over time, keeping the old config. The consequence of this is that they don't get the right config for the TieredMergePolicy, and therefore don't get to use it, seeing the old behaviour which does require periodic optimise. Upayavira On Mon, Jun 17, 2013, at 07:21 PM, Petersen, Robert wrote: Hi

yet another optimize question

2013-06-14 Thread Petersen, Robert
Hi guys, We're on solr 3.6.1 and I've read the discussions about whether to optimize or not to optimize. I decided to try not optimizing our index as was recommended. We have a little over 15 million docs in our biggest index and a 32gb heap for our jvm. So without the optimizes the index

RE: Is payload the right solution for my problem?

2013-05-17 Thread Petersen, Robert
Hi It will not be double the disk space at all. You will not need to store the field you search, only the field being returned needs to be stored. Furthermore if you are not searching the XML field you will not need to index that field, only store it. Hope that helps, Robi -Original

RE: Solr 3.6.1: changing a field from stored to not stored

2013-04-23 Thread Petersen, Robert
Good info, Thanks Hoss! I was going to add a more specific fl= parameter to my queries at the same time. Currently I am doing fl=*,score so that will have to be changed. -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Tuesday, April 23, 2013 4:18 PM

RE: Solr 3.6.1: changing a field from stored to not stored

2013-04-23 Thread Petersen, Robert
Hey I just want to verify one thing before I start doing this: function queries only require fields to be indexed but don't require them to be stored right? -Original Message- From: Petersen, Robert [mailto:robert.peter...@mail.rakuten.com] Sent: Tuesday, April 23, 2013 4:39 PM

RE: Really bad query performance for date range queries

2013-02-05 Thread Petersen, Robert
Hi Shawn, I've looked at the xing JVM before but don't use it. jHiccup looks like a really useful tool. Can you tell us how you are starting it up? Do you start it wrapping the app container (ie tomcat / jetty)? Thanks Robi -Original Message- From: Shawn Heisey

RE: Really bad query performance for date range queries

2013-02-05 Thread Petersen, Robert
: Shawn Heisey [mailto:s...@elyograg.org] Sent: Tuesday, February 05, 2013 2:53 PM To: solr-user@lucene.apache.org Subject: Re: Really bad query performance for date range queries On 2/5/2013 3:19 PM, Petersen, Robert wrote: Hi Shawn, I've looked at the xing JVM before but don't use it. jHiccup

RE: field space consumption - stored vs not stored

2013-01-31 Thread Petersen, Robert
Thanks Shawn. Actually now that I think about it, Yonik also mentioned something about lucene number representation once in reply to one of my questions. Here it is: Could you also tell me what these `#8;#0;#0;#0;#1; strings represent in the debug output? That's internally how a number is

RE: Can I start solr with replication activated but disabled between master and slave

2013-01-30 Thread Petersen, Robert
Hi Jamel, You can start solr slaves with them pointed at a master and then turn off replication in the admin replication page. Hope that helps, -Robi Robert (Robi) Petersen Senior Software Engineer Search Department -Original Message- From: Jamel ESSOUSSI

RE: Solr Faceting with Name Values

2013-01-29 Thread Petersen, Robert
there be a problem putting these in a dynamic field name? 3. Is it possible to query for the possible list of dynamic fieldnames? I might need this when creating a list of attributes. Thanks again Robi. O. O. -- Petersen, Robert wrote Hi O.O

queryResultCache *very* low hit ratio

2013-01-29 Thread Petersen, Robert
Hi solr users, My queryResultCache hitratio has been trending down lately and is now at 0.01%, and also it's warmup time was almost a minute. I have lowered the autowarm count dramatically since there are no hits anyway. I also wanted to lower my autowarm counts across the board because I am

RE: queryResultCache *very* low hit ratio

2013-01-29 Thread Petersen, Robert
don't experience increased latency. If you remove all auto-warming of the query result cache, you may want to add static warming entries for these fields. -Yonik http://lucidworks.com On Tue, Jan 29, 2013 at 3:36 PM, Petersen, Robert rober...@buy.com wrote: Hi solr users, My queryResultCache

RE: queryResultCache *very* low hit ratio

2013-01-29 Thread Petersen, Robert
as we can. Thanks! Robi -Original Message- From: Shawn Heisey [mailto:s...@elyograg.org] Sent: Tuesday, January 29, 2013 2:24 PM To: solr-user@lucene.apache.org Subject: Re: queryResultCache *very* low hit ratio On 1/29/2013 1:36 PM, Petersen, Robert wrote: My queryResultCache hitratio has

RE: Solr Faceting with Name Values

2013-01-28 Thread Petersen, Robert
Hi O.O., You don't need to add them all into the schema. You can use the wildcard fields like dynamicField name=*_s type=string indexed=true stored=true / to hold them. You can then have the attribute name be the part of the wildcard and the attribute value be the field contents. So you

RE: firstSearcher and NewSearcher parameters

2013-01-23 Thread Petersen, Robert
ElasticSearch Support http://sematext.com/ On Jan 22, 2013 8:08 PM, Petersen, Robert rober...@buy.com wrote: Hi guys, I was wondering if there was a way to pass commonly used boost values in with commonly used filter queries in these solrConfig event handler sections. Could I just append

RE: firstSearcher and NewSearcher parameters

2013-01-23 Thread Petersen, Robert
Thanks Hoss, Good to know! I have that exact situation: a complex function based on multiple field values that I always run for particular types of searches including global star searches to aid in sorting the results appropriately. Robi -Original Message- From: Chris Hostetter

firstSearcher and NewSearcher parameters

2013-01-22 Thread Petersen, Robert
Hi guys, I was wondering if there was a way to pass commonly used boost values in with commonly used filter queries in these solrConfig event handler sections. Could I just append the ^1.5 at the end of the fq value? IE can I do this: str

parsing debug output for readability

2013-01-10 Thread Petersen, Robert
Hi Solr Users, Can someone give me some good parsing rules of thumb to make the debug explain output human readable? I found this cool site for visualizing the output but our queries are too complex and break their parser: http://explain.solr.pl I tried adding new lines plus indenting after

RE: parsing debug output for readability

2013-01-10 Thread Petersen, Robert
it should come out with whitespace and newlines in the actual XML source (browsers render it ugly though) Erik On Jan 10, 2013, at 15:35 , Petersen, Robert wrote: Hi Solr Users, Can someone give me some good parsing rules of thumb to make the debug explain output human readable? I

RE: parsing debug output for readability

2013-01-10 Thread Petersen, Robert
PS the wt=ruby param is even better! Great tips. -Original Message- From: Petersen, Robert [mailto:rober...@buy.com] Sent: Thursday, January 10, 2013 3:17 PM To: solr-user@lucene.apache.org Subject: RE: parsing debug output for readability Hi Erik, Thanks, debug.explain.structured

RE: theory of sets

2013-01-07 Thread Petersen, Robert
Hi Uwe, We have hundreds of dynamic fields but since most of our docs only use some of them it doesn't seem to be a performance drag. They can be viewed as a sparse matrix of fields in your indexed docs. Then if you make the sortinfo_for_groupx an int then that could be used in a function

RE: occasional GC crashes

2012-12-20 Thread Petersen, Robert
the latest Java 7 or if you have to remain on 6 then use the latest 6. Otis -- SOLR Performance Monitoring - http://sematext.com/spm On Dec 18, 2012 7:54 PM, Petersen, Robert rober...@buy.com wrote: Hi solr user group, ** ** Sorry if this isn't directly a Solr question. Seems like once

occasional GC crashes

2012-12-18 Thread Petersen, Robert
Hi solr user group, Sorry if this isn't directly a Solr question. Seems like once in a blue moon the GC crashes on a server in our Solr 3.6.1 slave farm. This seems to only happen on a couple of the twelve slaves we have deployed and only very rarely on those. It seems like this doesn't

Re: star searches with high page number requests taking long times

2012-12-08 Thread Petersen, Robert
-- SOLR Performance Monitoring - http://sematext.com/spm On Dec 7, 2012 8:10 PM, Petersen, Robert rober...@buy.com wrote: Hi guys, Sometimes we get a bot crawling our search function on our retail web site. The ebay crawler loves to do this (Request.UserAgent: Terapeakbot). They just

star searches with high page number requests taking long times

2012-12-07 Thread Petersen, Robert
Hi guys, Sometimes we get a bot crawling our search function on our retail web site. The ebay crawler loves to do this (Request.UserAgent: Terapeakbot). They just do a star search and then iterate through page after page. I've noticed that when they get to higher page numbers like page

RE: anyone have any clues about this exception

2012-10-12 Thread Petersen, Robert
, Petersen, Robert rober...@buy.com wrote: You could be right. Going back in the logs, I noticed it used to happen less frequently and always towards the end of an optimize operation. It is probably my indexer timing out waiting for updates to occur during optimizes. The errors grew

RE: anyone have any clues about this exception

2012-10-10 Thread Petersen, Robert
events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Wed, Oct 10, 2012 at 11:31 PM, Petersen, Robert rober...@buy.com wrote: Tomcat localhost log (not the catalina log) for my solr 3.6.1 (master) instance contains lots of these exceptions

RE: Faceted search question (Tokenizing)

2012-10-10 Thread Petersen, Robert
What do you want the results to be, persons? And the facets should be interests or subinterests? Why are there two layers of interests anyway? Can there my many subinterests under one interest? Is one of those two a name of the interest which would look nice as a facet? Anyway, have you

some general solr 4.0 questions

2012-09-20 Thread Petersen, Robert
Hello solr user group, I am evaluating the new Solr 4.0 beta with an eye to how to fit it into our current solr setup. Our current setup is running on solr 3.6.1 and uses 12 slaves behind a load balancer and a master which we index into, and they all have three cores (now referred to as

RE: some general solr 4.0 questions

2012-09-20 Thread Petersen, Robert
That is a great idea to run the updates thru the LB also! I like it! Thanks for the replies guys -Original Message- From: jimtronic [mailto:jimtro...@gmail.com] Sent: Thursday, September 20, 2012 1:46 PM To: solr-user@lucene.apache.org Subject: Re: some general solr 4.0 questions

RE: broken links in solr wiki

2012-09-18 Thread Petersen, Robert
. Regarding URLs http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/test-files/solr/collection1/conf/stemdict.txt http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/collection1/conf/protwords.txt --- On Tue, 9/18/12, Petersen, Robert rober...@buy.com wrote: From

broken links in solr wiki

2012-09-17 Thread Petersen, Robert
Hi group, On this wiki page these two links below are broken as they are also on lucidworks' version, can someone point me at the correct locations please? I googled around and came up with possible good links. Thanks Robi http://wiki.apache.org/solr/LanguageAnalysis#Other_Tips

RE: Solr grouping / facet query

2012-07-20 Thread Petersen, Robert
Why not just index one title per document, each having author and specialty fields included? Then you could search titles with a user query and also filter/facet on the author and specialties at the same time. The author bio and other data could be looked up on the fly from a DB if you

RE: how do I search the archives for solr-user

2012-07-03 Thread Petersen, Robert
This site is pretty cool also, just filter on solr-user like this: http://markmail.org/search/?q=list%3Aorg.apache.lucene.solr-user -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Monday, July 02, 2012 5:34 PM To: solr-user@lucene.apache.org Subject: Re:

RE: Broken pipe error

2012-07-03 Thread Petersen, Robert
I also had this problem on solr/tomcat and finally saw the errors were coming from my application side disconnecting from solr after a timeout. This was happening when solr was busy doing an optimize and thus not responding quickly enough. Initially when I saw this in the logs, I was quite