Re: Keyphrase Extraction (via Lingo)

2007-05-09 Thread Bill Janssen
Dawid Weiss wrote: You could also try splitting the document into paragraphs and use Carrot2's Lingo algorithm (www.carrot2.org) on a paragraph-level to extract clusters. Labelling routine in Lingo should extract 'key' phrases; this analysis is heavily frequency-based, but... you know,

Re: Keyphrase Extraction

2007-05-08 Thread Bill Janssen
Dawid Weiss wrote: You could also try splitting the document into paragraphs and use Carrot2's Lingo algorithm (www.carrot2.org) on a paragraph-level to extract clusters. Labelling routine in Lingo should extract 'key' phrases; this analysis is heavily frequency-based, but... you know, you

RE: Keyphrase Extraction

2007-05-08 Thread Vishal Shah
Hi Arsen, I've seen another commercial one from a company called Connexor (www.connexor.com) . It has a decent part-of-speech tagger that could be used in keyphrase extraction with some heuristics on top of it. -vishal. -Original Message- From: Mark Miller [mailto:[EMAIL PROTECTED

Re: Keyphrase Extraction

2007-05-08 Thread José Ramón Pérez Agüera
here you have a very good tool for Keyphrase Extraction. It is GNU and easy to integrate in Lucene. http://www.paynter.info/academia/Kea.php best jose On 5/8/07, Bill Janssen [EMAIL PROTECTED] wrote: Dawid Weiss wrote: You could also try splitting the document into paragraphs and use

Re: Keyphrase Extraction

2007-05-08 Thread Bob Carpenter
Mark Miller wrote: The only commercial options that I have seen do not have a web presence (that I know of or can find) and I don't recall the company names (only peripherally involved). Are we talking about Yahoo's buzz index and Amazon's SIPs or CAPs? I actually think the most interesting

Re: Keyphrase Extraction

2007-05-07 Thread Mark Miller
- Original Message From: Otis Gospodnetic [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something that does this in JIRA

Re: Keyphrase Extraction

2007-05-06 Thread Dawid Weiss
significance. Cheers Mark - Original Message From: Otis Gospodnetic [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something that does

Re: Keyphrase Extraction

2007-05-06 Thread [EMAIL PROTECTED]
Hi Mark, Do you know of a good paid product that does this? Thanks, Arsen - Original Message From: Mark Miller [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Wednesday, May 2, 2007 7:52:36 AM Subject: Re: Keyphrase Extraction From what I know you generally have to pay

Re: Keyphrase Extraction

2007-05-06 Thread Otis Gospodnetic
@lucene.apache.org Sent: Sunday, May 6, 2007 10:57:42 PM Subject: Re: Keyphrase Extraction Hi Mark, Do you know of a good paid product that does this? Thanks, Arsen - Original Message From: Mark Miller [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Wednesday, May 2, 2007 7:52:36 AM

Re: Keyphrase Extraction

2007-05-02 Thread Mark Miller
- Original Message From: Otis Gospodnetic [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something that does this in JIRA. If you

Re: Keyphrase Extraction

2007-04-30 Thread mark harwood
. Cheers Mark - Original Message From: Otis Gospodnetic [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something that does

Keyphrase Extraction

2007-04-29 Thread [EMAIL PROTECTED]
Hi, I tried using MoreLikeThis contrib feature to extract interesting terms from a document. This works very well - but only for SINGLE words. I am looking for a way to extra keyPHRASES from a document. Is there an easy way to achieve this using Lucene index? Thanks in advance! Av

Re: Keyphrase Extraction

2007-04-29 Thread Otis Gospodnetic
- http://lucene-consulting.com/ - Original Message From: [EMAIL PROTECTED] [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Sunday, April 29, 2007 5:24:17 PM Subject: Keyphrase Extraction Hi, I tried using MoreLikeThis contrib feature to extract interesting terms from a document