Re: Java web page key word search protocal

2009-08-01 Thread Bryan O'Neal
Thought of that, the overhead is worse then scraping, parsing, and searching. On Fri, Jul 31, 2009 at 7:51 AM, Lisa Kachold lisakach...@obnosis.comwrote: Try using google? On 7/31/09, Bryan O'Neal bon...@cornerstonehome.com wrote: Ok, so I want to, with utmost efficacy, go through a web

Re: Java web page key word search protocal

2009-08-01 Thread Lisa Kachold
Why java? Why not a simple javascript search script? http://stackoverflow.com/questions/141280/whats-the-best-way-to-count-keywords-in-javascript On 8/1/09, Bryan O'Neal bon...@cornerstonehome.com wrote: Thought of that, the overhead is worse then scraping, parsing, and searching. On Fri,

Re: Java web page key word search protocal

2009-08-01 Thread Stephen P Rufle
Would either of these fit the bill. I saw your comment about lucene, but I think Solr can be used as a tool to do what you want. http://lucene.apache.org/solr/ Bryan O'Neal wrote: Ok, so I want to, with utmost efficacy, go through a web pages and ask how many of a set of key words is in that

Re: Java web page key word search protocal

2009-08-01 Thread Bryan O'Neal
Solr looks interesting thank you. Why Java 1) I know Java the best, and I can make something work very very easy, I just want something I can run continuesly with low foot print on a comodity VM. 2) The rest of the backend is in Java and I like to have a single development languae for any given

Java web page key word search protocal

2009-07-31 Thread Bryan O'Neal
Ok, so I want to, with utmost efficacy, go through a web pages and ask how many of a set of key words is in that web page. Does any one know of a good open source tool for this? I have hundreds of web pages and a near equal number of key word sets so scraping each page, parsing to create a vector

Re: Java web page key word search protocal

2009-07-31 Thread Lisa Kachold
Try using google? On 7/31/09, Bryan O'Neal bon...@cornerstonehome.com wrote: Ok, so I want to, with utmost efficacy, go through a web pages and ask how many of a set of key words is in that web page. Does any one know of a good open source tool for this? I have hundreds of web pages and a