Hi,
I have a research project where I want to implement LSI technique. The scenario
is something as follows.
Search
the news sites for the locally event based news. Cluster the similar
news items together. For example hurricane in New York city.
We want to apply basic LSI as follows
-Key word extraction
-Filter using stop list
-Stemming
-Option: Synonym detection
- Frequency Matrix
- SVD Decomposition
-Cluster related News items
The input data will be from the web based news sites such as Yahoo, google etc.
How can we use Lucene to achieve this. Please provide me the steps.
Thanks.
Faikeeyes
____________________________________________________________________________________
Now that's room service! Choose from over 150,000 hotels
in 45,000 destinations on Yahoo! Travel to find your fit.
http://farechase.yahoo.com/promo-generic-14795097