Apologies for hijacking this thread, but this is probably the most
relevant place to point you to a simple web-frontend to cld I cooked
up in a couple of hours:
http://detector-de-idioma.herokuapp.com/index.html
Additionally, you can call it as a service like
$ curl 'http://detector-de-idioma.he
> I'm more curious about why the output isn't even deterministic. The
> same input string produced three different results.
You can see in the FAQ
(http://code.google.com/p/language-detection/wiki/FrequentlyAskedQuestion)
that:
"Langdetect uses random sampling for avoiding local noises(person
nam
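
The FAQ excerpt above is cut off, but the effect it names, random sampling making repeated runs disagree, can be illustrated with a toy sketch. This is not langdetect's actual algorithm: `char-trigrams` and `sample-trigrams` are hypothetical helpers written for this note, and the only library used is `java.util.Random`. The sketch assumes a detector that scores a random subset of character trigrams rather than the whole text, which is enough to show why an unseeded generator gives varying results on short input while a fixed seed gives repeatable ones.

```clojure
;; Toy illustration only; these helpers are hypothetical and are not
;; part of cld or language-detect.

(defn char-trigrams
  "Returns a vector of all overlapping 3-character substrings of text."
  [text]
  (mapv #(apply str %) (partition 3 1 text)))

(defn sample-trigrams
  "Draws n trigrams (with replacement) from text, using rng to pick indices."
  [^java.util.Random rng text n]
  (let [grams (char-trigrams text)]
    (vec (repeatedly n #(nth grams (.nextInt rng (count grams)))))))

;; Unseeded generators: the two samples will usually differ, so anything
;; computed from them can differ between calls as well.
(def sample-a (sample-trigrams (java.util.Random.) "My name is joe" 5))
(def sample-b (sample-trigrams (java.util.Random.) "My name is joe" 5))

;; Fixed seed: both generators produce the same index sequence, so the
;; samples (and anything derived from them) are identical.
(def seeded-a (sample-trigrams (java.util.Random. 42) "My name is joe" 5))
(def seeded-b (sample-trigrams (java.util.Random. 42) "My name is joe" 5))
```

With a short string such as "My name is joe" there are only twelve trigrams to draw from, so each random draw carries a lot of weight; that is consistent with the guess elsewhere in this thread that short input makes the variation more visible.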
I'm more curious about why the output isn't even deterministic. The
same input string produced three different results.
--
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to clojure@googlegroups.com
Note that posts from
On 29 February 2012 17:48, Lee Hinman wrote:
> On Tuesday, February 28, 2012 6:03:26 PM UTC-7, Robin Kraft wrote:
>>
>> Awesome! I'm seeing some inconsistency though. Does anyone know why a
>> Bayesian classifier would produce such different results? Could it be
>> because of the short input text?
On Tuesday, February 28, 2012 6:03:26 PM UTC-7, Robin Kraft wrote:
>
> Awesome! I'm seeing some inconsistency though. Does anyone know why a
> Bayesian classifier would produce such different results? Could it be
> because of the short input text?
>
> (lang/detect "My name is joe")
> ["af" {"af
Awesome! I'm seeing some inconsistency though. Does anyone know why a
Bayesian classifier would produce such different results? Could it be
because of the short input text?
(lang/detect "My name is joe")
["af" {"af" "0.8571390166207665", "lt" "0.14285675907555712"}]
(lang/detect "My name is joe")
Tested with Korean and German, and works great!
user> (cld.core/detect "한국 음식중에 김치가 제일 맛있어요.")
["ko" {"ko" "0.9998"}]
cld.core=> (cld.core/detect "In München steht ein Hofbräuhaus.")
["de" {"de" "0.972552285171"}]
Similar functionality is also available in clj-tika
(https://github.com/alexott/clj-tika, also on Clojars): you can detect
the language and MIME type of data, and extract text.
On Tue, Feb 28, 2012 at 3:24 AM, Lee Hinman wrote:
> Hi all,
> I'm pleased to announce the initial 0.1.0 release of cld (Clojure
> Lan
Cool. Time to get my cores to work extracting from pastebins. :)
'(Devin Walters)
On Feb 27, 2012, at 8:24 PM, Lee Hinman wrote:
> Hi all,
> I'm pleased to announce the initial 0.1.0 release of cld (Clojure
> Language Detection). CLD is a tiny library wrapping language-detect[1]
> that can be used
Hi all,
I'm pleased to announce the initial 0.1.0 release of cld (Clojure
Language Detection). CLD is a tiny library wrapping language-detect[1]
that can be used to determine the language of a particular piece of
text very quickly. You should be able to use it from Clojars[2] with
the following:
[cld