Text Summarizer
Hi! I'm looking for a good way to get a good text summarizer for my personal search engine based Solr. Actually, I'm using ots (Open Text Summurizer) but the result is far from perfection. Here's an example of usage: $ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \ -no-references 2/dev/null | ots -r 40 | less -S The result is OK for this site, but I would like to obtain something similar to google text snippet (a real excerpt). Advices are welcome? N.B: all the HTML pages I'm indexing are converted to text with elinks (the text browser) like in the previous example. Thanks in adavance. cheers Younès
Re: Text Summarizer
Hi Otis, Don't know really what's the name for that. cheers Y. Otis Gospodnetic a écrit : Sounds like you are looking for a highlighter/KWIC, not a summarizer? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ycrux [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, January 14, 2008 2:45:09 PM Subject: Text Summarizer Hi! I'm looking for a good way to get a good text summarizer for my personal search engine based Solr. Actually, I'm using ots (Open Text Summurizer) but the result is far from perfection. Here's an example of usage: $ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \ -no-references 2/dev/null | ots -r 40 | less -S The result is OK for this site, but I would like to obtain something similar to google text snippet (a real excerpt). Advices are welcome? N.B: all the HTML pages I'm indexing are converted to text with elinks (the text browser) like in the previous example. Thanks in adavance. cheers Younès
Re: Text Summarizer
Sounds like you are looking for a highlighter/KWIC, not a summarizer? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ycrux [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, January 14, 2008 2:45:09 PM Subject: Text Summarizer Hi! I'm looking for a good way to get a good text summarizer for my personal search engine based Solr. Actually, I'm using ots (Open Text Summurizer) but the result is far from perfection. Here's an example of usage: $ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \ -no-references 2/dev/null | ots -r 40 | less -S The result is OK for this site, but I would like to obtain something similar to google text snippet (a real excerpt). Advices are welcome? N.B: all the HTML pages I'm indexing are converted to text with elinks (the text browser) like in the previous example. Thanks in adavance. cheers Younès
Re: Text Summarizer
See http://wiki.apache.org/solr/HighlightingParameters . The default behaviour will provide snippets like google does. Note that you need to store the text of fields you want to highlight for this to work. cheers, -Mike On 14-Jan-08, at 2:17 PM, Ycrux wrote: Maybe the right name is Snippet. Like Google snippets. cheers Y. Otis Gospodnetic a écrit : Sounds like you are looking for a highlighter/KWIC, not a summarizer? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ycrux [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, January 14, 2008 2:45:09 PM Subject: Text Summarizer Hi! I'm looking for a good way to get a good text summarizer for my personal search engine based Solr. Actually, I'm using ots (Open Text Summurizer) but the result is far from perfection. Here's an example of usage: $ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \ -no-references 2/dev/null | ots -r 40 | less -S The result is OK for this site, but I would like to obtain something similar to google text snippet (a real excerpt). Advices are welcome? N.B: all the HTML pages I'm indexing are converted to text with elinks (the text browser) like in the previous example. Thanks in adavance. cheers Younès