Text Summarizer

2008-01-14 Thread Ycrux

Hi!

I'm looking for a good way to get a good text summarizer
for my personal search engine based Solr.

Actually, I'm using ots (Open Text Summurizer) but the result
is far from perfection.

Here's an example of usage:
$ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \
-no-references  2/dev/null | ots -r 40 | less -S

The result is OK for this site, but I would like to obtain something 
similar

to google text snippet (a real excerpt).

Advices are welcome?

N.B: all the HTML pages I'm indexing are converted to text with elinks 
(the text browser)

like in the previous example.

Thanks in adavance.

cheers
Younès


Re: Text Summarizer

2008-01-14 Thread Ycrux

Hi Otis,

Don't know really what's the name for that.

cheers
Y.

Otis Gospodnetic a écrit :

Sounds like you are looking for a highlighter/KWIC, not a summarizer?

Otis 


--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Ycrux [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent: Monday, January 14, 2008 2:45:09 PM
Subject: Text Summarizer 


Hi!

I'm looking for a good way to get a good text summarizer
for my personal search engine based Solr.

Actually, I'm using ots (Open Text Summurizer) but the result
is far from perfection.

Here's an example of usage:
$ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \
-no-references  2/dev/null | ots -r 40 | less -S

The result is OK for this site, but I would like to obtain something 
similar

to google text snippet (a real excerpt).

Advices are welcome?

N.B: all the HTML pages I'm indexing are converted to text with
 elinks 
(the text browser)

like in the previous example.

Thanks in adavance.

cheers
Younès



 

  




Re: Text Summarizer

2008-01-14 Thread Otis Gospodnetic
Sounds like you are looking for a highlighter/KWIC, not a summarizer?

Otis 

--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Ycrux [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent: Monday, January 14, 2008 2:45:09 PM
Subject: Text Summarizer 

Hi!

I'm looking for a good way to get a good text summarizer
for my personal search engine based Solr.

Actually, I'm using ots (Open Text Summurizer) but the result
is far from perfection.

Here's an example of usage:
$ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \
-no-references  2/dev/null | ots -r 40 | less -S

The result is OK for this site, but I would like to obtain something 
similar
to google text snippet (a real excerpt).

Advices are welcome?

N.B: all the HTML pages I'm indexing are converted to text with
 elinks 
(the text browser)
like in the previous example.

Thanks in adavance.

cheers
Younès





Re: Text Summarizer

2008-01-14 Thread Mike Klaas
See http://wiki.apache.org/solr/HighlightingParameters .  The default  
behaviour will provide snippets like google does.


Note that you need to store the text of fields you want to  
highlight for this to work.


cheers,
-Mike

On 14-Jan-08, at 2:17 PM, Ycrux wrote:


Maybe the right name is Snippet. Like Google snippets.

cheers
Y.

Otis Gospodnetic a écrit :

Sounds like you are looking for a highlighter/KWIC, not a summarizer?

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Ycrux [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent: Monday, January 14, 2008 2:45:09 PM
Subject: Text Summarizer
Hi!

I'm looking for a good way to get a good text summarizer
for my personal search engine based Solr.

Actually, I'm using ots (Open Text Summurizer) but the result
is far from perfection.

Here's an example of usage:
$ elinks http://lucene.apache.org/solr/; -force-html -no-numbering \
-no-references  2/dev/null | ots -r 40 | less -S

The result is OK for this site, but I would like to obtain  
something similar

to google text snippet (a real excerpt).

Advices are welcome?

N.B: all the HTML pages I'm indexing are converted to text with
 elinks (the text browser)
like in the previous example.

Thanks in adavance.

cheers
Younès