On Sun, Jul 10, 2011 at 12:59 PM, ivan vadovic wrote:
> Hi,
>
> Also a library for string normalization in the sense of stripping diacritical
> marks would be handy too. Does anything in this respect exist that would be
> usable from haskell?
The closest thing I know of is this:
http://hackage.ha
Hi,
Also a library for string normalization in the sense of stripping diacritical
marks would be handy too. Does anything in this respect exist that would be
usable from haskell?
Thanks
On Fri, Jul 01, 2011 at 02:31:34PM +0400, Dmitri O.Kondratiev wrote:
> Hi,
> Please advise on NLP libraries si
Perhaps this is interesting? On the relationship between exploratory
(a.k.a. sloppy or theoretical) and rigorous math.
http://arxiv.org/pdf/math/9307227v1
-k
--
If I haven't seen further, it is by standing in the footprints of giants
___
Haskell-Caf
Heh, I just hit Reply All and I guess the address came in wrong. Ah, well.
I strongly agree with you on the state of linguistics, et al. Having done
little bits of work in a few of those fields (or at least work _with_ people in
them), your comments are spot on. Though perhaps I wouldn't say tha
(Psst, the nlp list is :)
On 7/9/11 3:10 AM, Jack Henahan wrote:
> On Jul 7, 2011, at 10:53 PM, wren ng thornton wrote:
>> I can't help but be a (meta)theorist. But then, I'm of the firm opinion
>> that theory must be grounded in actual practice, else it belongs more to
>> the realm of theology t
Oof, you're liable to wound my (pure) mathematician's pride with remarks like
that, wren. :P
Now go intone the Litany of Categories as penance. :D I'll start you off… Set,
Rel, Top, Ring, Grp, Cat, Hask…
On Jul 7, 2011, at 10:53 PM, wren ng thornton wrote:
> I can't help but be a (meta)theori
On 7/7/11 3:50 AM, Aleksandar Dimitrov wrote:
> It's actually a shame we're discussing this on -cafe and not on -nlp. Then
> again, maybe it's going to prompt somebody to join -nlp, and I'm gonna
CC it
> there, because some folks over there might be interested, but not read
-cafe.
Quite :)
> When
On 7/7/11 3:38 AM, Aleksandar Dimitrov wrote:
> On Wed, Jul 06, 2011 at 07:27:10PM -0700, wren ng thornton wrote:
>> I definitely agree with the iteratees comment, but I'm curious about the
>> leaks you mention. I haven't run into leakiness issues (that I'm aware of)
>> in my use of ByteStrings for
It's actually a shame we're discussing this on -cafe and not on -nlp. Then
again, maybe it's going to prompt somebody to join -nlp, and I'm gonna CC it
there, because some folks over there might be interested, but not read -cafe.
On Wed, Jul 06, 2011 at 07:22:41PM -0700, wren ng thornton wrote:
>
On 7/6/11 8:46 PM, Richard O'Keefe wrote:
>> I've been working over the last year+ on an optimized HMM-based POS
>> tagger/supertagger with online tagging and anytime n-best tagging. I'm
>> planning to release it this summer (i.e., by the end of August), though
>> there are a few things I'd like to
On 7/6/11 6:45 PM, Aleksandar Dimitrov wrote:
> One hint, if you ever find yourself reading in quantitative linguistic
data with
> Haskell: forget lazy IO. Forget strict IO, except your documents aren't
ever
> bigger than a few hundred megs. In case you're not keeping the whole
document in
> memory
On 7/6/11 5:58 PM, Aleksandar Dimitrov wrote:
> On Wed, Jul 06, 2011 at 09:32:27AM -0700, wren ng thornton wrote:
>> On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote:
>>> Hi,
>>> Continuing my search of Haskell NLP tools and libs, I wonder if the
>>> following Haskell libraries exist (googling them doe
On 7/07/2011, at 7:04 AM, Dmitri O.Kondratiev wrote:
> I am looking for Haskell implementation of sentence tokenizer such as
> described by Tibor Kiss and Jan Strunk’s in “Unsupervised Multilingual
> Sentence Boundary Detection”, which is implemented in NLTK:
That method is multilingual but re
On Wed, Jul 06, 2011 at 03:14:07PM -0700, Rogan Creswick wrote:
> Have you used that particular combination yet? I'd like to know the
> details of how you hooked everything together if that's something you
> can share. (We're working on a similar Frankenstein at the moment.)
These Frankensteins,
On Wed, Jul 6, 2011 at 3:03 PM, Aleksandar Dimitrov
wrote:
>
> So you'd use, say, UIMA+OpenNLP to do sentence boundaries, tokens, tags,
> named-entities whatnot, then spit out some annotated format, read it in with
> Haskell, and do the logic/magic there.
Have you used that particular combination
On Wed, Jul 06, 2011 at 11:04:30PM +0400, Dmitri O.Kondratiev wrote:
> On Wed, Jul 6, 2011 at 8:32 PM, wren ng thornton wrote:
>
> > On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote:
> > > Hi,
> > > Continuing my search of Haskell NLP tools and libs, I wonder if the
> > > following Haskell libraries
On Wed, Jul 06, 2011 at 09:32:27AM -0700, wren ng thornton wrote:
> On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote:
> > Hi,
> > Continuing my search of Haskell NLP tools and libs, I wonder if the
> > following Haskell libraries exist (googling them does not help):
> > 1) End of Sentence (EOS) Detecti
On Wed, Jul 6, 2011 at 8:32 PM, wren ng thornton wrote:
> On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote:
> > Hi,
> > Continuing my search of Haskell NLP tools and libs, I wonder if the
> > following Haskell libraries exist (googling them does not help):
> > 1) End of Sentence (EOS) Detection. Brea
On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote:
> Hi,
> Continuing my search of Haskell NLP tools and libs, I wonder if the
> following Haskell libraries exist (googling them does not help):
> 1) End of Sentence (EOS) Detection. Break text into a collection of
> meaningful sentences.
Depending on ho
Hi,
Continuing my search of Haskell NLP tools and libs, I wonder if the
following Haskell libraries exist (googling them does not help):
1) End of Sentence (EOS) Detection. Break text into a collection of
meaningful sentences.
2) Part-of-Speech (POS) Tagging. Assign part-of-speech information to ea
On Fri, Jul 1, 2011 at 2:52 PM, Dmitri O.Kondratiev wrote:
> Any other then 'toktok' Haskell word tokenizer that compiles and works?
> I need something like:
> http://nltk.googlecode.com/svn/trunk/doc/api/nltk.tokenize.regexp.WordPunctTokenizer-class.html
>
I don't think this exists out of the bo
On Fri, Jul 1, 2011 at 11:58 PM, Rogan Creswick wrote:
> On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev
> wrote:
> > On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick
> wrote:
> >>
> >> On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev
> >> wrote:> First of all I need:
> >
> > Unfortunate
On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev wrote:
> On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick wrote:
>>
>> On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev
>> wrote:> First of all I need:
>
> Unfortunately 'cabal install' fails with toktok:
>
> tools/ExtractLexicon.hs:5:35:
>
On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick wrote:
> On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev
> wrote:> First of all I need:
>
...
> > - tools to construct 'bag of words'
> > (http://en.wikipedia.org/wiki/Bag_of_words_model), which is a list of
> words
> > in the
> > article.
>
>
On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev wrote:
> Hi,
> Please advise on NLP libraries similar to Natural Language Toolkit
There is a (slowly?) growing NLP community for haskell over at:
http://projects.haskell.org/nlp/
The nlp mailing list may be a better place to ask for details.
Hi,
Please advise on NLP libraries similar to Natural Language Toolkit (
www.nltk.org)
First of all I need:
- tools to construct 'bag of words' (
http://en.wikipedia.org/wiki/Bag_of_words_model), which is a list of words
in the
article.
- tools to prune common words, such as prepositions and conjun
26 matches
Mail list logo