Re: [Apertium-stuff] monolingual corpus crawler

2012-07-17 Thread Miquel Esplà
Hello Hieu, have you tried HTTrack? It is the one used by Bitextor. It is very good, with a lot of options that allow you to filter the kinds of files to be downloaded. Also, the documentation is quite good: http://www.httrack.com/ Cheers! Miquel. 2012/7/17 Hieu Hoang > hello, > > me again. Can

[Apertium-stuff] monolingual corpus crawler

2012-07-17 Thread Hieu Hoang
hello, me again. Can any of you guys recommend a monolingual web crawling toolkit? Bitextor sounds like overkill for this particular job. thanks, hieu