Use of Lucene to store data from RSS feeds

2010-10-14 Thread appy74
Hello I would like to store data retrieved hourly from RSS feeds in a database or in Lucene so that the text can be easily indexed for word frequencies. I need to get the text from the title and description elements of RSS items. Ideally, for each hourly retrieval from a given feed, I would add

Re: Use of Lucene to store data from RSS feeds

2010-10-14 Thread Grant Ingersoll
On Oct 14, 2010, at 10:17 AM, app...@dsl.pipex.com wrote: > Hello > > I would like to store data retrieved hourly from RSS feeds in a database or > in Lucene so that the text can be easily > indexed for word frequencies. > > I need to get the text from the title and description elements of RSS

Re: Use of Lucene to store data from RSS feeds

2010-10-15 Thread Pulkit Singhal
When you ask: a) will each feed would form a Lucene document, or b) will each database row would form a lucene document I'm inclined to say that really depends on what type of aggregation tool or logic you are using. I don't know if "Tika" does it but if there is a tool out there that can be point

RE: Use of Lucene to store data from RSS feeds

2010-10-15 Thread Martin O'Shea
Pulkit Singhal [mailto:pulkitsing...@gmail.com] Sent: 15 Oct 2010 13 36 To: java-user@lucene.apache.org Subject: Re: Use of Lucene to store data from RSS feeds When you ask: a) will each feed would form a Lucene document, or b) will each database row would form a lucene document I'm inclined to say th

Re: Use of Lucene to store data from RSS feeds

2010-10-15 Thread Erick Erickson
tion. > > -Original Message- > From: Pulkit Singhal [mailto:pulkitsing...@gmail.com] > Sent: 15 Oct 2010 13 36 > To: java-user@lucene.apache.org > Subject: Re: Use of Lucene to store data from RSS feeds > > When you ask: > a) will each feed would form a Lucene docum