i find UpdateRequestProcessors (
http://wiki.apache.org/solr/UpdateRequestProcessor) a handy way to add and
remove NLP-related fields to a document as it is processed by Solr. this is
also how UIMA integrates with Solr (http://wiki.apache.org/solr/SolrUIMA).
you might want to take a look at UIMA
i don't have experience with this but it looks like you could use, from DIH:
http://wiki.apache.org/solr/DataImportHandler#LineEntityProcessor
On Sun, Jan 27, 2013 at 10:23 AM, hadyelsahar hadyelsa...@gmail.com wrote:
i have a large Arabic Text File that contains Tweets each line contains one
i've had perfectly fine performance with StatsComponent, but have only
tested with 50,000 documents. for example i have field syllables and
numeric field syllables_count. then i sum the syllable count for any
search query. how many documents are you working with?
On Mon, Jan 14, 2013 at 10:54 AM,
hi,
is there a simple way to get a list of all terms that occur in a field
sorted by their total term frequency within that field?
TermsComponent (http://wiki.apache.org/solr/TermsComponent) provides
fast field faceting over the whole index, but as counts it gives the
number of documents that
module has HighFreqTerms tool.
Mike McCandless
http://blog.mikemccandless.com
On Wed, Nov 7, 2012 at 1:15 PM, Edward Garrett heacu.mcint...@gmail.com
wrote:
hi,
is there a simple way to get a list of all terms that occur in a field
sorted by their total term frequency within that field
just to add a note on this, the whole idea of inserting pseudo-markup into
XML text elements seems to be pretty much in disrepute, and certainly caused
many complaints about RSS 1.0, see e.g.
http://www.biglist.com/lists/xsl-list/archives/200505/msg00316.html
in xsl, you **can** use
/xsl:when
xsl:otherwise
xsl:value-of select=$val/
/xsl:otherwise
/xsl:choose
/xsl:template
On 1/3/07, Thorsten Scherler [EMAIL PROTECTED] wrote:
On Wed, 2007-01-03 at 02:16 +, Edward Garrett wrote:
thorsten,
see the following for discussion. your case is indeed
with cdata sections in xsl.
I had a look in the lucene highlighter and it seem that it does not
escape the tags.
Can somebody point me to code which is responsible for escaping and
maybe give me a tip how I can patch to make it configurable.
TIA
salu2
--
Edward Garrett
Visiting