Luca, I would like to know: how much language, your system could identify? In my view, this difficult part in your system is: how to collect so many languages/character in the world for *one person*...
Regards, Mead On Sun, Oct 23, 2011 at 1:27 AM, Petite Abeille <[email protected]>wrote: > > On Oct 22, 2011, at 2:49 AM, Luca Rondanini wrote: > > > I usually use Nutch for this but, just for fun, I tried to create a > language > > identifier based on Lucene only. > > Talking of which: > > Google's Compact Language Detector > > http://blog.mikemccandless.com/2011/10/language-detection-with-googles-compact.html > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > >
