The documents we want to index come in many formats, e.g., HTML, PDF, RTF, Word, and Excel. I've been searching for parsers that will translate each of these formats into indexable text, but have had little success. Any help would be appreciated.
--
Posted via http://www.ruby-forum.com/.
_______________________________________________
Ferret-talk mailing list
http://rubyforge.org/mailman/listinfo/ferret-talk

