Re: [Haskell-cafe] NLP libraries and tools?

2011-07-10 Thread Jason Dagit
On Sun, Jul 10, 2011 at 12:59 PM, ivan vadovic wrote: > Hi, > > Also a library for string normalization in the sense of stripping diacritical > marks would be handy too. Does anything in this respect exist that would be > usable from haskell? The closest thing I know of is this: http://hackage.ha

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-10 Thread ivan vadovic
Hi, Also a library for string normalization in the sense of stripping diacritical marks would be handy too. Does anything in this respect exist that would be usable from haskell? Thanks On Fri, Jul 01, 2011 at 02:31:34PM +0400, Dmitri O.Kondratiev wrote: > Hi, > Please advise on NLP libraries si

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-10 Thread Ketil Malde
Perhaps this is interesting? On the relationship between exploratory (a.k.a. sloppy or theoretical) and rigorous math. http://arxiv.org/pdf/math/9307227v1 -k -- If I haven't seen further, it is by standing in the footprints of giants ___ Haskell-Caf

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-09 Thread Jack Henahan
Heh, I just hit Reply All and I guess the address came in wrong. Ah, well. I strongly agree with you on the state of linguistics, et al. Having done little bits of work in a few of those fields (or at least work _with_ people in them), your comments are spot on. Though perhaps I wouldn't say tha

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-09 Thread wren ng thornton
(Psst, the nlp list is :) On 7/9/11 3:10 AM, Jack Henahan wrote: > On Jul 7, 2011, at 10:53 PM, wren ng thornton wrote: >> I can't help but be a (meta)theorist. But then, I'm of the firm opinion >> that theory must be grounded in actual practice, else it belongs more to >> the realm of theology t

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-09 Thread Jack Henahan
Oof, you're liable to wound my (pure) mathematician's pride with remarks like that, wren. :P Now go intone the Litany of Categories as penance. :D I'll start you off… Set, Rel, Top, Ring, Grp, Cat, Hask… On Jul 7, 2011, at 10:53 PM, wren ng thornton wrote: > I can't help but be a (meta)theori

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-07 Thread wren ng thornton
On 7/7/11 3:50 AM, Aleksandar Dimitrov wrote: > It's actually a shame we're discussing this on -cafe and not on -nlp. Then > again, maybe it's going to prompt somebody to join -nlp, and I'm gonna CC it > there, because some folks over there might be interested, but not read -cafe. Quite :) > When

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-07 Thread wren ng thornton
On 7/7/11 3:38 AM, Aleksandar Dimitrov wrote: > On Wed, Jul 06, 2011 at 07:27:10PM -0700, wren ng thornton wrote: >> I definitely agree with the iteratees comment, but I'm curious about the >> leaks you mention. I haven't run into leakiness issues (that I'm aware of) >> in my use of ByteStrings for

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-07 Thread Aleksandar Dimitrov
It's actually a shame we're discussing this on -cafe and not on -nlp. Then again, maybe it's going to prompt somebody to join -nlp, and I'm gonna CC it there, because some folks over there might be interested, but not read -cafe. On Wed, Jul 06, 2011 at 07:22:41PM -0700, wren ng thornton wrote: >

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 8:46 PM, Richard O'Keefe wrote: >> I've been working over the last year+ on an optimized HMM-based POS >> tagger/supertagger with online tagging and anytime n-best tagging. I'm >> planning to release it this summer (i.e., by the end of August), though >> there are a few things I'd like to

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 6:45 PM, Aleksandar Dimitrov wrote: > One hint, if you ever find yourself reading in quantitative linguistic data with > Haskell: forget lazy IO. Forget strict IO, except your documents aren't ever > bigger than a few hundred megs. In case you're not keeping the whole document in > memory

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 5:58 PM, Aleksandar Dimitrov wrote: > On Wed, Jul 06, 2011 at 09:32:27AM -0700, wren ng thornton wrote: >> On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: >>> Hi, >>> Continuing my search of Haskell NLP tools and libs, I wonder if the >>> following Haskell libraries exist (googling them doe

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Richard O'Keefe
On 7/07/2011, at 7:04 AM, Dmitri O.Kondratiev wrote: > I am looking for Haskell implementation of sentence tokenizer such as > described by Tibor Kiss and Jan Strunk’s in “Unsupervised Multilingual > Sentence Boundary Detection”, which is implemented in NLTK: That method is multilingual but re

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Aleksandar Dimitrov
On Wed, Jul 06, 2011 at 03:14:07PM -0700, Rogan Creswick wrote: > Have you used that particular combination yet? I'd like to know the > details of how you hooked everything together if that's something you > can share. (We're working on a similar Frankenstein at the moment.) These Frankensteins,

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Rogan Creswick
On Wed, Jul 6, 2011 at 3:03 PM, Aleksandar Dimitrov wrote: > > So you'd use, say, UIMA+OpenNLP to do sentence boundaries, tokens, tags, > named-entities whatnot, then spit out some annotated format, read it in with > Haskell, and do the logic/magic there. Have you used that particular combination

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Aleksandar Dimitrov
On Wed, Jul 06, 2011 at 11:04:30PM +0400, Dmitri O.Kondratiev wrote: > On Wed, Jul 6, 2011 at 8:32 PM, wren ng thornton wrote: > > > On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: > > > Hi, > > > Continuing my search of Haskell NLP tools and libs, I wonder if the > > > following Haskell libraries

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Aleksandar Dimitrov
On Wed, Jul 06, 2011 at 09:32:27AM -0700, wren ng thornton wrote: > On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: > > Hi, > > Continuing my search of Haskell NLP tools and libs, I wonder if the > > following Haskell libraries exist (googling them does not help): > > 1) End of Sentence (EOS) Detecti

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Dmitri O.Kondratiev
On Wed, Jul 6, 2011 at 8:32 PM, wren ng thornton wrote: > On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: > > Hi, > > Continuing my search of Haskell NLP tools and libs, I wonder if the > > following Haskell libraries exist (googling them does not help): > > 1) End of Sentence (EOS) Detection. Brea

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: > Hi, > Continuing my search of Haskell NLP tools and libs, I wonder if the > following Haskell libraries exist (googling them does not help): > 1) End of Sentence (EOS) Detection. Break text into a collection of > meaningful sentences. Depending on ho

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Dmitri O.Kondratiev
Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell libraries exist (googling them does not help): 1) End of Sentence (EOS) Detection. Break text into a collection of meaningful sentences. 2) Part-of-Speech (POS) Tagging. Assign part-of-speech information to ea

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Rogan Creswick
On Fri, Jul 1, 2011 at 2:52 PM, Dmitri O.Kondratiev wrote: > Any other then 'toktok' Haskell word tokenizer that compiles and works? > I need something like: > http://nltk.googlecode.com/svn/trunk/doc/api/nltk.tokenize.regexp.WordPunctTokenizer-class.html > I don't think this exists out of the bo

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Dmitri O.Kondratiev
On Fri, Jul 1, 2011 at 11:58 PM, Rogan Creswick wrote: > On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev > wrote: > > On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick > wrote: > >> > >> On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev > >> wrote:> First of all I need: > > > > Unfortunate

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Rogan Creswick
On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev wrote: > On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick wrote: >> >> On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev >> wrote:> First of all I need: > > Unfortunately 'cabal install' fails with toktok: > > tools/ExtractLexicon.hs:5:35: >  

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Dmitri O.Kondratiev
On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick wrote: > On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev > wrote:> First of all I need: > ... > > - tools to construct 'bag of words' > > (http://en.wikipedia.org/wiki/Bag_of_words_model), which is a list of > words > > in the > > article. > >

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Rogan Creswick
On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev wrote: > Hi, > Please advise on NLP libraries similar to Natural Language Toolkit There is a (slowly?) growing NLP community for haskell over at: http://projects.haskell.org/nlp/ The nlp mailing list may be a better place to ask for details.

[Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Dmitri O.Kondratiev
Hi, Please advise on NLP libraries similar to Natural Language Toolkit ( www.nltk.org) First of all I need: - tools to construct 'bag of words' ( http://en.wikipedia.org/wiki/Bag_of_words_model), which is a list of words in the article. - tools to prune common words, such as prepositions and conjun