Hello Wildan, This is the process to crawl news site:
1. Deep First search to identify news site 2. Crawl process using regular expression 3. Save result contents into database 4. Users ready to find News through database --- W <[email protected]> wrote: > Can you share the architecture do you use ? are you > using nutch also > for the backend ? > > > Regards, > Wildan > > On Tue, Jan 27, 2009 at 4:53 PM, Sjaiful Bahri > <[email protected]> wrote: > > FYI, > > Zipclue is designed to crawl news information on > the > > web effectively and efficiently. > > > > http://zipclue.com > > > > > > > > Cheers > > iful at http://zipclue.com > > > > > > > > > > > > -- > --- > tobeThink! > www.tobethink.com > > Aligning IT and Education > > >> 021-99325243 > Y! : hawking_123 > Linkedln : http://www.linkedin.com/in/wildanmaulana > Cheers iful at http://zipclue.com
