Re: [Ferret-talk] Road map of ferret

Erik Hatcher Thu, 28 Aug 2008 08:21:59 -0700


On Aug 28, 2008, at 9:52 AM, Jens Kraemer wrote:

So here's my very own biased opinion just to complete the picture :)


Hey, software should be opinionated!   That's totally fair :)

(shameless plug: selfhtml.org search will be powered by Stellr[1] ;-).


Stellr - great name.  Interesting... that's pretty sweet.

Solr, while being an interesting project without doubt, won't everreach the level of Rails integration that's possible withacts_as_ferret, simply because it's server doesn't run in thecontext of the rails app with model classes and all that stuff.

What advantage does Ferret have in terms of ActiveRecord integrationthat Solr wouldn't have?

If you're talking about custom analyzers being in Ruby, more on thatbelow.

It's an independent server indexing whatever you throw over thefence via http+xml.

Solr can index CSV as well now a relational database directly (withthe new DataImportHandler).

It also responds with Ruby hash structure (just add &wt=ruby to theURLs, or use solr-ruby which does that automatically and hides allserver communication from you anyway).

How to use a custom analyzer with solr? You have to code it in Java(or you do your analysis before feeding the data into java land,which I wouldn't consider good app design).

Most users would not need to write a custom analyzer. Many of thebuilt-in ones are quite configurable. Yes, Solr does require schemaconfiguration via an XML file, but there have been acts_as_solrvariants (good and bad thing about this git craze) that generate thatfor you automatically from an AR model.

But even if you do that then you have
a) half a java project (I don't want that)

That's totally fair, and really the primary compelling reason for aFerret over Solr for pure Ruby/Rails projects. I dig that.


But isn't Ferret is like 60k lines of C code too?!

and b) no way to use your existing rails classes in that customanalyzer (I *have* analyzers using rails models to retrieve synonymsand narrower terms for thesaurus based query expansion)

You could leverage client-side query expansion with Solr... just takethe users query, massage it, and send whatever query you like toSolr. Solr also has synonym and stop word capability too.

However, there is also no reason (and I have this on my copious-free-time-TOOD-list) that JRuby couldn't be used behind the scenes of aSolr analyzer/tokenizer/filter or even request handler... and do allthe cool Ruby stuff you like right there. Heck, you could even sendthe Ruby code over to Solr to execute there if you like ;)

Here's what I would do *if* I experienced severe problems withFerret in any of my projects:
Take aaf, replace Ferret with Lucene or even make it modular todecide at run time which one to use, run the DRb server (or thewhole app, that depends) under JRuby and call it acts_as_lucene :-)Et voila - great Rails integration plus Lucene's maturity. But aslong as Ferret's working fine for me that's really unlikely tohappen... Unless somebody wants to sponsor that project, of course ;)

Just using Solr and fixing up acts_as_solr to meet your needs (if itdoesn't) would be even easier than all that :) Solr really is abetter starting point than Lucene directly, for caching, scalability,replication, faceting, etc.

I'd be curious to see scalability comparisons between Ferret and Solr- or perhaps more properly between Stellr and Solr - as it boils downto number of documents, queries per second, and faceting andhighlighting speed. I'm betting on Solr myself (by being so into itand basing my professional life on it).


        Erik

_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk

Re: [Ferret-talk] Road map of ferret

Reply via email to