Re: Reduction of open files

2008-10-16 Thread Paul deGrandis
My biggest concern is why do the remaining files stay open even if my mergeFactor is 2. I would expect to see one or two segment files and one or two sets of accompanying file (.nrm, .frq, etc), based on the documentation. Paul On Thu, Oct 16, 2008 at 4:23 PM, Paul deGrandis <[EMAIL PROTEC

Re: Reduction of open files

2008-10-16 Thread Paul deGrandis
, Oct 16, 2008 at 4:16 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Are you using the compound file format? > > -Grant > > On Oct 16, 2008, at 3:28 PM, Paul deGrandis wrote: > >> I have been working with SOLR for a few months now. According to some >> docu

Reduction of open files

2008-10-16 Thread Paul deGrandis
I have been working with SOLR for a few months now. According to some documentation I read, segment files only have one set of all the other lingustic module type of stuff (normalization, frequency), is there a way to remove/reduce the files not associated with a segment besides optimizing the ind

Re: Indexing content, storing html

2008-02-22 Thread Paul deGrandis
t; http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters > > -Reece > > > > On Fri, Feb 22, 2008 at 2:10 PM, Paul deGrandis > > <[EMAIL PROTECTED]> wrote: > > Thanks! > > > > Does Solr include an HTMLTokenFilterFactory? > > > > Paul > > &

Re: Indexing content, storing html

2008-02-22 Thread Paul deGrandis
> > > The CDATA thing will basically say anything between it's tag's will be > rendered as the field value. It only breaks if your html string has a > "]]>" in it to end the data tag. > > > -Reece > > > > > On F

Indexing content, storing html

2008-02-22 Thread Paul deGrandis
Hi all, I'm working on a solr app that pulls HTML from an embedded JavaScript WYSIWYG editor, and I need to index on the content, but store and reproduce the HTML. The problem I have is when I try to add and commit, the HTML gets interpreted as XML. Is the way to do this properly to create an HT