Knowledge about contents of a page

2010-01-28 Thread ram_sj

Hi,

My question is about crawling, I know this is not relevant here, but I asked
nutch people, didn't get any response, I just thought of posing here,

I'm trying to crawl reviews for business, 

a. is there any way to tell the content in a web pages are reviews or not?
Is it possible to do it in automated fashion?

b. How could be map a block of text to a particular business ? ex: like
google reviews

   

Thanks
Ram
-- 
View this message in context: 
http://old.nabble.com/Knowledge-about-contents-of-a-page-tp27358779p27358779.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Knowledge about contents of a page

2010-01-28 Thread Avlesh Singh
Classification? - http://en.wikipedia.org/wiki/Document_classification

Cheers
Avlesh

On Fri, Jan 29, 2010 at 1:18 AM, ram_sj rpachaiyap...@gmail.com wrote:


 Hi,

 My question is about crawling, I know this is not relevant here, but I
 asked
 nutch people, didn't get any response, I just thought of posing here,

 I'm trying to crawl reviews for business,

 a. is there any way to tell the content in a web pages are reviews or not?
 Is it possible to do it in automated fashion?

 b. How could be map a block of text to a particular business ? ex: like
 google reviews



 Thanks
 Ram
 --
 View this message in context:
 http://old.nabble.com/Knowledge-about-contents-of-a-page-tp27358779p27358779.html
 Sent from the Solr - User mailing list archive at Nabble.com.