Re: Tokenize Sentence and Set Attribute

2013-05-08 Thread Edward Garrett
i find UpdateRequestProcessors ( http://wiki.apache.org/solr/UpdateRequestProcessor) a handy way to add and remove NLP-related fields to a document as it is processed by Solr. this is also how UIMA integrates with Solr (http://wiki.apache.org/solr/SolrUIMA). you might want to take a look at UIMA

Re: indexing Text file in solr

2013-01-29 Thread Edward Garrett
i don't have experience with this but it looks like you could use, from DIH: http://wiki.apache.org/solr/DataImportHandler#LineEntityProcessor On Sun, Jan 27, 2013 at 10:23 AM, hadyelsahar hadyelsa...@gmail.com wrote: i have a large Arabic Text File that contains Tweets each line contains one

Re: Calculate a sum.

2013-01-14 Thread Edward Garrett
i've had perfectly fine performance with StatsComponent, but have only tested with 50,000 documents. for example i have field syllables and numeric field syllables_count. then i sum the syllable count for any search query. how many documents are you working with? On Mon, Jan 14, 2013 at 10:54 AM,

get a list of terms sorted by total term frequency

2012-11-07 Thread Edward Garrett
hi, is there a simple way to get a list of all terms that occur in a field sorted by their total term frequency within that field? TermsComponent (http://wiki.apache.org/solr/TermsComponent) provides fast field faceting over the whole index, but as counts it gives the number of documents that

Re: get a list of terms sorted by total term frequency

2012-11-07 Thread Edward Garrett
module has HighFreqTerms tool. Mike McCandless http://blog.mikemccandless.com On Wed, Nov 7, 2012 at 1:15 PM, Edward Garrett heacu.mcint...@gmail.com wrote: hi, is there a simple way to get a list of all terms that occur in a field sorted by their total term frequency within that field

Re: How to tell the highlighter not to escape?

2007-01-04 Thread Edward Garrett
just to add a note on this, the whole idea of inserting pseudo-markup into XML text elements seems to be pretty much in disrepute, and certainly caused many complaints about RSS 1.0, see e.g. http://www.biglist.com/lists/xsl-list/archives/200505/msg00316.html in xsl, you **can** use

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Edward Garrett
/xsl:when xsl:otherwise xsl:value-of select=$val/ /xsl:otherwise /xsl:choose /xsl:template On 1/3/07, Thorsten Scherler [EMAIL PROTECTED] wrote: On Wed, 2007-01-03 at 02:16 +, Edward Garrett wrote: thorsten, see the following for discussion. your case is indeed

Re: How to tell the highlighter not to escape?

2007-01-02 Thread Edward Garrett
with cdata sections in xsl. I had a look in the lucene highlighter and it seem that it does not escape the tags. Can somebody point me to code which is responsible for escaping and maybe give me a tip how I can patch to make it configurable. TIA salu2 -- Edward Garrett Visiting