Re: Crawl News Web

Sjaiful Bahri Wed, 28 Jan 2009 21:22:09 -0800

Hello Wildan,

This is the process to crawl news site:


1. Deep First search to identify news site 
2. Crawl process using regular expression 
3. Save result contents into database 
4. Users ready to find News through database

--- W <[email protected]> wrote:

> Can you share the architecture do you use ? are you
> using nutch also
> for the backend ?
> 
> 
> Regards,
> Wildan
> 
> On Tue, Jan 27, 2009 at 4:53 PM, Sjaiful Bahri
> <[email protected]> wrote:
> > FYI,
> > Zipclue is designed to crawl news information on
> the
> > web effectively and efficiently.
> >
> > http://zipclue.com
> >
> >
> >
> > Cheers
> > iful at http://zipclue.com
> >
> >
> >
> >
> 
> 
> 
> -- 
> ---
> tobeThink!
> www.tobethink.com
> 
> Aligning IT and Education
> 
> >> 021-99325243
> Y! : hawking_123
> Linkedln : http://www.linkedin.com/in/wildanmaulana
> 


Cheers 
iful at http://zipclue.com

Re: Crawl News Web

Reply via email to