Luca, I would like to know: how much language, your system could identify? In my view, this difficult part in your system is: how to collect so many languages/character in the world for *one person*...
Regards, Mead On Sun, Oct 23, 2011 at 1:27 AM, Petite Abeille <petite_abei...@me.com>wrote: > > On Oct 22, 2011, at 2:49 AM, Luca Rondanini wrote: > > > I usually use Nutch for this but, just for fun, I tried to create a > language > > identifier based on Lucene only. > > Talking of which: > > Google's Compact Language Detector > > http://blog.mikemccandless.com/2011/10/language-detection-with-googles-compact.html > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > >