I think the problem with your reduce is that it looks like it's not actually reducing to a single value, but instead using reduce for grouping data. That will cause severe performance problems.

For reduce to work properly, you should end up with a fixed-size data structure regardless of the number of values being reduced (not strictly true, but that's the general rule).
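For example, a reduce that just counts its input stays the same size no matter how many rows feed into it. A rough sketch (the third argument, rereduce, tells you whether you're combining previously reduced results):

    function (keys, values, rereduce) {
      if (rereduce) {
        // values are partial counts from earlier reductions; add them up
        var total = 0;
        for (var i = 0; i < values.length; i++) total += values[i];
        return total;
      }
      // values are the mapped rows themselves; return a single number
      return values.length;
    }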
-Damien
On Aug 19, 2008, at 6:55 PM, Nicholas Retallack wrote:
Okay, I got it built on gentoo instead, but I'm still having performance issues with reduce.

Erlang (BEAM) emulator version 5.6.3 [source] [64-bit] [async-threads:0]
couchdb - Apache CouchDB 0.8.1-incubating
Here's a query I tried to do:
I freshly imported about 191MB of data in 155399 documents. 29090 are not discarded by map. Map produces one row with 5 fields for each of these documents. After grouping, each group should have four rows. Reduce is a simple function(keys,values){return values}.
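In rough outline the view looks like this (the field and key names below are just placeholders for the real ones):

    // map: keep the relevant documents and emit one row with five fields
    function (doc) {
      if (doc.type == 'offer') {            // placeholder test; ~29090 docs pass it
        emit([doc.offer_id, doc.step],      // array key so group_level=1 buckets by offer
             {f1: doc.f1, f2: doc.f2, f3: doc.f3, f4: doc.f4, f5: doc.f5});
      }
    }

    // reduce: just hand the grouped values straight back
    function (keys, values) {
      return values;
    }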
Here's the query call:
time curl -X GET 'http://localhost:5984/clickfund/_view/offers/index?count=1&group=true&group_level=1'
This is running on a 512MB slicehost account. http://www.slicehost.com/
I'd love to give you this command's execution time, since I ran it last night before I went to bed, but it must have taken over an hour because my laptop went to sleep and severed the connection. Trying it again.

Considering it's blazing fast without the reduce function, I can only assume what's taking all this time is overhead setting up and tearing down the simple function(keys,values){return values}.
I can give you guys the python source to set up this database so you can try it yourself if you like.