Re: NASA's OCO-2 mission instrument processing system: OODT and Tika inside!

2014-07-03 Thread Jérôme Charron
Amazing ! Tika processing data from space ! The next step, Tika in space ! ;) Jérôme Envoyé de mon iPhone > Le 3 juil. 2014 à 17:25, "Mattmann, Chris A (3980)" > a écrit : > > Hey Guys, > > Just as an FYI: the NASA OCO-2 mission successfully launched yesterday > July 2, 2014, > and is now in

Google's Compact Language Detector

2011-10-24 Thread Jérôme Charron
Hi, I just find this blog post from Mike McCandless about Google's Compact Language Detection code used in Chrome : http://blog.mikemccandless.com/2011/10/language-detection-with-googles-compact.html There's probably some interesting things to explore in the Google Code in order to improve Tika's

Re: Google's Compact Language Detector

2011-10-25 Thread Jérôme Charron
Thanks Mike for sharing these tests. There is clearly a performance issue regarding Tika run time. As you noticed it, it will be interesting to see if the accuracy can be increased by mixing the languages profiles of many libraries. But not sure if the accuracy is depending only from the languages

Re: Multilingual Tika

2011-11-05 Thread Jérôme Charron
> > I totally am. I've got some PHP skillz and Python skillz > that I would be willing to throw into the mix here. > Yes, I have some basic skillz on Python, and some advanced skillz on PHP, so I can help you! > One other thing along these lines I've had in mind for a while: > how cool would it b