On Thu, Jun 07, 2007 at 05:19:26PM +0000, Daniel Einspanjer wrote:
> I am looking at trying to use ferret/aaf to supplement my querying against a
> medium and large table with lots of columns.  Some facts first:
> 
> Ferret 0.11.4
> AAF 0.4.0
> Ruby 1.8.6
> Rails 1.2.3
> 
> Medium table:
> 105,464 rows
> 168 columns (mostly varchar(20))
> 11 actual columns indexed in aaf plus
> 40 virtual columns indexed in aaf (virtual is concat of two physical columns.
> e.g. cast_first_name_1 + cast_last_name_1 through cast_first_name_20 +
> cast_last_name_20)
> 
> Large table:
> 1,244,716 rows
> same column/index structure
> 
> These tables are not updated via Ruby, only read.  I am trying to use
> rebuild_index to bootstrap the medium sized table and it is taking a very long
> time (running for about 4 hours, indicates 50% complete with 4 hours 
> remaining)
> and creating a massive number of files in the index directory (currently about
> 65k, was 90k earlier)

strange. Ferret is faster than that - I have a test script that builds
an index of 100000 documents with 50 fields each containing a single random 
word in under 10 Minutes here on standard hardware.

Maybe the problem is something else? For starters, change line 220
of local_index.rb from
index << rec.to_doc if rec.ferret_enabled?(true)
to
doc = rec.to_doc if rec.ferret_enabled?(true)

so nothing is added to the index. How long does that take? 

Jens

-- 
Jens Krämer
webit! Gesellschaft für neue Medien mbH
Schnorrstraße 76 | 01069 Dresden
Telefon +49 351 46766-0 | Telefax +49 351 46766-66
[EMAIL PROTECTED] | www.webit.de
 
Amtsgericht Dresden | HRB 15422
GF Sven Haubold, Hagen Malessa
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk

Reply via email to