Re: collectionDistribution vs SolrReplication

2009-01-04 Thread Noble Paul നോബിള്‍ नोब्ळ्
* SolrReplication does not create snapshots . So you have less cleanup to do. The script based replication results is more disk space consumption (especially if you do frequent commits) * Performance is roughly same unless you are replicating across different LAN where SolrReplication can zip and t

Re: cannot allocate memory for snapshooter

2009-01-04 Thread Mark Miller
Hey Brian, I didn't catch what OS you are using on EC2 by the way. I thought most UNIX OS's were using memory overcommit - A quick search brings up Linux, AIX, and HP-UX, and maybe even OSX? What are you running over there? EC2, so Linux I assume? Yonik: I take it, now that Linux uses copy on

Re: cannot allocate memory for snapshooter

2009-01-04 Thread Mark Miller
Yonik Seeley wrote: On Sun, Jan 4, 2009 at 8:07 PM, Mark Miller wrote: Forking for a small script on something that can have such a large memory footprint is just a huge waste of resources. Ideally you might have a tiny program running, listening on a socket or something, and it can be alert

Re: cannot allocate memory for snapshooter

2009-01-04 Thread Yonik Seeley
On Sun, Jan 4, 2009 at 8:07 PM, Mark Miller wrote: > Forking for a small script on something that can have such a large memory > footprint is just a huge waste of resources. Ideally you might have a tiny > program running, listening on a socket or something, and it can be alerted > and do the actu

Re: cannot allocate memory for snapshooter

2009-01-04 Thread Mark Miller
Your right, its nasty, but its how Fork works. I would say its something that should be fixed, its so nasty, but with the new all Java replication, its probably a moot point. Forking for a small script on something that can have such a large memory footprint is just a huge waste of resources.

Re: Pgination in Solr

2009-01-04 Thread Chris Hostetter
: solr supports params start and rows : append &start=X&rows=Y to the url (assuming you are using standard : request handler) : : where X = page number : and Y = results per page. not quite ... "start" is the result number you wnat to start at (zero indexed), so if you want 10 results per page y

Re: How can i omit the illegal characters,when indexing the docs?

2009-01-04 Thread Peter Wolanin
For documents we are indexing via the PHP client, we are currently using the following regex to strip control characters from each field that might contain them: function apachesolr_strip_ctl_chars($text) { // See: http://w3.org/International/questions/qa-forms-utf-8.html // Printable utf-8 d