Re: [Haskell-cafe] A cabal odyssey

2010-08-17 Thread Andrew Coppin
Ben Millwood wrote: It sounds to me like your life would be a lot easier if you knew about cabal-install's root-cmd configuration parameter! Open your .cabal/config file and uncomment and set: root-cmd: sudo Now cabal-install will take up root permissions when and only when necessary :) I only

[Haskell-cafe] Re: Handling absent maintainers

2010-08-17 Thread Ben Millwood
2010/8/17 Geoffrey Mainland : > On 08/17/2010 12:28, Ben Millwood wrote: >> 2010/8/17 Jonas Almström Duregård : >>> Hi, >>> >>> Has there been any progress with this package? Like you I have also >>> tried to contact Matt and like you I have ended up making my own >>> version of src-meta :). When s

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Jinjing Wang
> John Millikin wrote: >> >> The reason many Japanese and Chinese users reject UTF-8 isn't due to >> space constraints (UTF-8 and UTF-16 are roughly equal), it's because >> they reject Unicode itself. > > +1. > > This is the thing Unicode advocates don't want to admit. Until Unicode has > code poin

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Kevin Jardine
Mikhail and Don, All I needed to do was to type: cabal install regex-posix and everything now works as expected. As Mikhail also mentioned this problem is already in the Haskell Platform bug tracker. Kevin On Aug 18, 5:17 am, Don Stewart wrote: > kevinjardine: > > > I'm running Haskel Platfo

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Michael Snoyman
Well, I'm not certain if it counts as a typical Chinese website, but here are the stats; UTF8: 64,198 UTF16: 113,160 And just for fun, after gziping: UTF8: 17,708 UTF16: 19,367 On Wed, Aug 18, 2010 at 2:59 AM, anderson leo wrote: > Hi michael, here is a web site http://zh.wikipedia.org/zh-cn/

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread wren ng thornton
Ivan Lazar Miljenovic wrote: On 18 August 2010 12:12, wren ng thornton wrote: Johan Tibell wrote: To my knowledge the data we have about prevalence of encoding on the web is accurate. We crawl all pages we can get our hands on, by starting at some set of seeds and then following all the links.

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Richard O'Keefe
On Aug 17, 2010, at 11:51 PM, Ketil Malde wrote: > Yitzchak Gale writes: > >> I don't think the genome is typical text. > > I think the typical *large* collection of text is text-encoded data, and > not, for lack of a better word, literature. Genomics data is just an > example. I have a coll

Re: [Haskell-cafe] regex-compat on latest Haskell Platform

2010-08-17 Thread Don Stewart
kevinjardine: > I'm running Haskel Platform 2010.2.0.0 and attempting to compile with > regex-compat under Windows XP. > > The link stage fails with a number of lines of the form: > > D:\Program Files\Haskell Platform\2010.2.0.0\lib\extralibs\regex- > posix-0.94.2\ghc-6.12.3/libHSregex-posix-0.94

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread Stephen Sinclair
On Tue, Aug 17, 2010 at 6:05 AM, Hemanth Kapila wrote: > Hi, > Can some one please give me a suggestion on the best choice for an embedded > scripting Language for a haskell application? > I mean, something like guile/lua for c/c++ and groovy/jruby for java. > For quite some time, I've been using

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ivan Lazar Miljenovic
On 18 August 2010 12:12, wren ng thornton wrote: > Johan Tibell wrote: >> >> To my knowledge the data we have about prevalence of encoding on the web >> is >> accurate. We crawl all pages we can get our hands on, by starting at some >> set of seeds and then following all the links. You cannot be s

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread wren ng thornton
Johan Tibell wrote: To my knowledge the data we have about prevalence of encoding on the web is accurate. We crawl all pages we can get our hands on, by starting at some set of seeds and then following all the links. You cannot be sure that you've reached all web sites as there might be cliques i

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread wren ng thornton
John Millikin wrote: The reason many Japanese and Chinese users reject UTF-8 isn't due to space constraints (UTF-8 and UTF-16 are roughly equal), it's because they reject Unicode itself. +1. This is the thing Unicode advocates don't want to admit. Until Unicode has code points for _all_ Chine

Re: [Haskell-cafe] Re: Sifflet dependencies

2010-08-17 Thread Ivan Lazar Miljenovic
On 18 August 2010 05:04, Anthony Cowley wrote: > On Tue, Aug 17, 2010 at 2:54 PM, Andrew Coppin > wrote: >> While we're on the subject... Suppose I have a package, which I know works >> with foo-8.7.2. What should the Cabal dependents field say? We have a choice >> of >> >>  foo == 8.7.2 >>  foo

Re: [Haskell-cafe] Which Mail editor or mail chain do you use ?

2010-08-17 Thread Ivan Lazar Miljenovic
On 17 August 2010 23:17, Luc TAESCH wrote: > Is this some kind of idiom that developped over time and habit ( of people > reading papers) , particularly educated, and in need for others recognition, > or born out of using an (arche)typical environment/ toolchain Markdown - > latex like ? The actu

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread wren ng thornton
Michael Snoyman wrote: On Tue, Aug 17, 2010 at 2:20 PM, Ivan Lazar Miljenovic < ivan.miljeno...@gmail.com> wrote: Michael Snoyman writes: I don't think *anyone* is asserting that UTF-16 is a common encoding for files anywhere, *ahem* http://en.wikipedia.org/wiki/UTF-16/UCS-2#Use_in_major_op

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread John Millikin
On Tue, Aug 17, 2010 at 12:30, Donn Cave wrote: > If Haskell had the development resources to make something like this > work, would it actually take the form of a Haskell-level type like > that - data Text = (Encoding, ByteString)?  I mean, I know that's > just a very clear and convenient way to

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread wren ng thornton
Bulat Ziganshin wrote: Johan wrote: So it's not clear to me that using UTF-16 makes the program noticeably slower or use more memory on a real program. it's clear misunderstanding. of course, not every program holds much text data in memory. but some does, and here you will double memory usage

[Haskell-cafe] Re: Slightly humorous: Headhunters toolbox (example for Germany)

2010-08-17 Thread Ertugrul Soeylemez
Lutz Donnerhacke wrote: > * Ertugrul Soeylemez wrote: > >> Good to know that Saxony-Anhalt is the state in Germany with > >> leading interest in Haskell. :-) I would like to know, whether this > >> is due to Magdeburg or Halle. > > > > Saxony-Anhalt is the state in Germany with leading number of

Re: [Haskell-cafe] A cabal odyssey

2010-08-17 Thread Ben Millwood
On Mon, Aug 16, 2010 at 8:27 PM, Andrew Coppin wrote: > The amusing part is, if you "sudo cabal > install" so it has permission to put the installed files into place, it then > uses root's configuration file instead. *sigh* Well anyway, I managed to > work around that. But... Cabal *still* fails t

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread Felipe Lessa
On Tue, Aug 17, 2010 at 9:12 PM, Hemanth Kapila wrote: >   I was about to toss a coin to decide which one to pickup. Perhaps I should > worry about the size. You should think about what kind of code you want to support in your scripts. I mean, if you start binding every Haskell library into Lua,

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread Hemanth Kapila
thanks to all for the responses. I tried hint and hslua and haskell and both are very nice. My sincere thanks and kudos to the developers and maintainers of the packages (and to Bulat for the tutorial on hslua). FWIW, a couple of observations: 1. haskell indeed makes a great scripting langua

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread John Meacham
On Tue, Aug 17, 2010 at 03:21:32PM +0200, Daniel Peebles wrote: > Sounds to me like we need a lazy Data.Text variation that allows UTF-8 and > UTF-16 "segments" in it list of strict text elements :) Then big chunks of > western text will be encoded efficiently, and same with CJK! Not sure what > to

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread anderson leo
Hi michael, here is a web site http://zh.wikipedia.org/zh-cn/. It is the wikipedia for Chinese. -Andrew On Tue, Aug 17, 2010 at 7:00 PM, Michael Snoyman wrote: > > > On Tue, Aug 17, 2010 at 1:50 PM, Yitzchak Gale wrote: > >> Ketil Malde wrote: >> > I haven't benchmarked it, but I'm fairly sure

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread Felipe Lessa
On Tue, Aug 17, 2010 at 7:07 PM, Christopher Done wrote: > Sadly this is true. I went ahead and tested this to confirm; compiled > mueval (which uses hint), copied the executable to a virtual machine > and it required the GHC package repo among other GHC-related > libraries. > > The size is indeed

Re: [Haskell-cafe] Maintainer wanted for pappy

2010-08-17 Thread John Meacham
On Tue, Aug 17, 2010 at 10:38:53PM +0200, Christopher Done wrote: > 2. Not really interested in maintaining, but in a good state and > probably worth maintaining: > pappy (Bryan Ford gave me permission to upload and for someone to maintain > this) I actually have been actively developing and main

Re: [Haskell-cafe] Question about memory usage

2010-08-17 Thread Richard O'Keefe
On Aug 17, 2010, at 6:39 PM, Roel van Dijk wrote: >> Using x**n = exp(n*log(x)) and floating point arithmetic, >> the whole thing can be done in O(1) time, and the price of >> some inaccuracy. > It will calculate a subset of the Fibonacci numbers in O(1) time. > Thinking about it I think you can e

[Haskell-cafe] Re: Handling absent maintainers

2010-08-17 Thread Geoffrey Mainland
On 08/17/2010 12:28, Ben Millwood wrote: > 2010/8/17 Jonas Almström Duregård : >> Hi, >> >> Has there been any progress with this package? Like you I have also >> tried to contact Matt and like you I have ended up making my own >> version of src-meta :). When someone assumes maintainership of this

Re: [Haskell-cafe] Maintainer wanted for pappy

2010-08-17 Thread Christopher Done
I've never used the cairo library as it was part of gtk2hs and thus not on Hackage. Looks like it's on Hackage now as of last may. I'll take a look next time I want to do graphics. I imagine it's faster. On 17 August 2010 23:48, Felipe Lessa wrote: > On Tue, Aug 17, 2010 at 5:38 PM, Christopher D

Re: [Haskell-cafe] Maintainer wanted for pappy

2010-08-17 Thread Felipe Lessa
On Tue, Aug 17, 2010 at 5:38 PM, Christopher Done wrote: > 1. Interested in and will continue maintaining: > gd, higherorder, cgi-utils, fastcgi, ircbouncer Just out of curiosity, why do you use gd instead of cairo? Cheers! =) -- Felipe. ___ Haskell-

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread Felipe Lessa
On Tue, Aug 17, 2010 at 8:50 AM, Wouter Swierstra wrote: >>> Can some one please give me a suggestion on the best choice for an >>> embedded >>> scripting Language for a haskell application? > > Why not use Haskell itself? I agree that C and Java aren't perhaps the > best choice for application sc

[Haskell-cafe] Maintainer wanted for pappy

2010-08-17 Thread Christopher Done
Hi, I thought I'd go through my uploaded Hackage packages and decide which ones I am going to maintain, which are worth others maintaining, and which are probably not worth maintaining (spoiler, most aren't). 1. Interested in and will continue maintaining: gd, higherorder, cgi-utils, fastcgi, irc

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Mikhail Glushenkov
Hi Kevin, Kevin Jardine gmail.com> writes: > > I'm running Haskel Platform 2010.2.0.0 and attempting to compile with > regex-compat under Windows XP. > [...] Apparently, you've been bitten by a bug in the installer: http://trac.haskell.org/haskell-platform/ticket/137 I haven't updated the in

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
On Tue, Aug 17, 2010 at 9:30 PM, Donn Cave wrote: > Quoth John Millikin , > > > Ruby, which has an enormous Japanese userbase, solved the problem by > > essentially defining Text = (Encoding, ByteString), and then > > re-implementing text logic for each encoding. This allows very > > efficient op

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Donn Cave
Quoth John Millikin , > Ruby, which has an enormous Japanese userbase, solved the problem by > essentially defining Text = (Encoding, ByteString), and then > re-implementing text logic for each encoding. This allows very > efficient operation with every possible encoding, at the cost of > increase

Re: [Haskell-cafe] Re: Sifflet dependencies

2010-08-17 Thread Anthony Cowley
On Tue, Aug 17, 2010 at 2:54 PM, Andrew Coppin wrote: > While we're on the subject... Suppose I have a package, which I know works > with foo-8.7.2. What should the Cabal dependents field say? We have a choice > of > >  foo == 8.7.2 >  foo >= 8.7.2 >  foo >= 8.7 >  foo >= 8.7 && < 8.8 >  foo == 8.

[Haskell-cafe] Re: Sifflet dependencies

2010-08-17 Thread Andrew Coppin
Don Stewart wrote: Those specific versions of packages are overly constrained. They should follow the PVP. While we're on the subject... Suppose I have a package, which I know works with foo-8.7.2. What should the Cabal dependents field say? We have a choice of foo == 8.7.2 foo >= 8.7.2

Re: [Haskell-cafe] Re: Slightly humorous: Headhunters toolbox (example for Germany)

2010-08-17 Thread Lutz Donnerhacke
* Ertugrul Soeylemez wrote: >> Good to know that Saxony-Anhalt is the state in Germany with leading >> interest in Haskell. :-) I would like to know, whether this is due to >> Magdeburg or Halle. > > Saxony-Anhalt is the state in Germany with leading number of Google > searches regarding Haskell.

Re: [Haskell-cafe] Which Mail editor or mail chain do you use ?

2010-08-17 Thread Dominique Devriese
2010/8/17 Luc TAESCH : > May I ask you how you redact your answers and which toolchain you are using? You can use footnote-mode [1] for easily producing the footnotes/references if you're an emacs user. Dominique Footnotes: [1] http://www.emacswiki.org/emacs/FootnoteMode ___

Re: [Haskell-cafe] Re: Slightly humorous: Headhunters toolbox (example for Germany)

2010-08-17 Thread Henning Thielemann
On Tue, 17 Aug 2010, Ertugrul Soeylemez wrote: Saxony-Anhalt is the state in Germany with leading number of Google searches regarding Haskell. This tells nothing about interest. It could just as well mean that people in other states understand Haskell better or go to the proper website (Hacka

[Haskell-cafe] Re: Slightly humorous: Headhunters toolbox (example for Germany)

2010-08-17 Thread Ertugrul Soeylemez
Henning Thielemann wrote: > Daniel Kahlenberg schrieb: > > Hi list, > > > > stumbled across that: > > http://www.google.com/insights/search/?hl=de#q=haskell&geo=DE&cmpt=q > > Good to know that Saxony-Anhalt is the state in Germany with leading > interest in Haskell. :-) I would like to know, whet

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
On Tue, Aug 17, 2010 at 6:19 PM, John Millikin wrote: > Ruby, which has an enormous Japanese userbase, solved the problem by > essentially defining Text = (Encoding, ByteString), and then > re-implementing text logic for each encoding. This allows very > efficient operation with every possible en

[Haskell-cafe] Re: Embedded scripting Language for haskell app

2010-08-17 Thread Gracjan Polak
Bulat Ziganshin gmail.com> writes: > > Hello Hemanth, > > Tuesday, August 17, 2010, 2:05:44 PM, you wrote: > > btw, i've written unfinished hslua tutorial: > http://haskell.org/haskellwiki/HsLua > And in related news embedded Lua interpreter recently got upgraded to version 5.1.4. -- Gracj

Re: [Haskell-cafe] Re: Handling absent maintainers

2010-08-17 Thread Ben Millwood
2010/8/17 Jonas Almström Duregård : > Hi, > > Has there been any progress with this package? Like you I have also > tried to contact Matt and like you I have ended up making my own > version of src-meta :). When someone assumes maintainership of this > package I would like to discuss integrating so

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread John Millikin
On Tue, Aug 17, 2010 at 06:12, Michael Snoyman wrote: > I'm not talking about API changes here; the topic at hand is the internal > representation of the stream of characters used by the text package. That is > currently UTF-16; I would argue switching to UTF8. The Data.Text.Foreign module is par

Re: [Haskell-cafe] Heavy lift-ing

2010-08-17 Thread Tilo Wiklund
On 24/07/2010, aditya siram wrote: > Perhaps I'm being unclear again. All I was trying to say was that: > liftM2 (-) [0,1] [2,3] /= liftM2 (-) [2,3] [0,1] > > -deech I'm sorry if I'm bumping an old thread, but why should "liftM2 f" be commutative when "f" isn't? (I hope I'm not responding incorr

Re: [Haskell-cafe] Question about memory usage

2010-08-17 Thread Jacques Carette
Daniel Fischer wrote: On Aug 16, 2010, at 6:03 PM, Jacques Carette wrote: Any sequence of numbers given by a linear recurrence equation with constant coefficients can be computed quickly using asymptotically efficient matrix operations. In fact, the code to do this can be derived automatic

Re: [Haskell-cafe] Question about memory usage

2010-08-17 Thread Sebastian Fischer
BTW, what sort of memory usage are we talking about here? I was referring to the memory usage of this program import System.Environment import Data.Numbers.Fibonacci main :: IO () main = do n <- (read . head) `fmap` getArgs (fib n :: Integer) `seq` return () comp

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Gábor Lehel
(Actually, this seems more like a job for a type class.) 2010/8/17 Gábor Lehel : > Someone mentioned earlier that IHHO all of this messing around with > encodings and conversions should be handled transparently, and I guess > you could do something like have the internal representation be along >

Re: [Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Stephen Tetley
On 17 August 2010 14:15, Kevin Jardine wrote: [snip] > So sorry for the waste of bandwidth. > No - its good to have a solution in public. The issue tracker seemed a bit inconclusive, but anyone reading an archive of the mailing list will now be able to pick up what to do.

[Haskell-cafe] Re: Support for lock-free/wait-free programming?

2010-08-17 Thread Gregory Collins
Simon Marlow writes: > On 17/08/2010 06:09, Gregory Collins wrote: > > We could provide a compare-and-swap on IORefs, indeed I've been > planning to try that so we could move the implementation of > atomicModifyIORef into user-space, so to speak. For hash tables it would be nice to have a CAS in

Re: [Haskell-cafe] Question about memory usage

2010-08-17 Thread Daniel Fischer
On Tuesday 17 August 2010 08:59:29, Sebastian Fischer wrote: > > I wonder whether the numbers in a single step of the computation > occupy all the memory or whether old numbers are retained although > they shouldn't be. BTW, what sort of memory usage are we talking about here? I have now tried you

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Gábor Lehel
Someone mentioned earlier that IHHO all of this messing around with encodings and conversions should be handled transparently, and I guess you could do something like have the internal representation be along the lines of Either UTF8 UTF16 (or perhaps even more encodings), and then implement every

Re: [Haskell-cafe] Which Mail editor or mail chain do you use ?

2010-08-17 Thread Magnus Therning
On Tue, Aug 17, 2010 at 14:17, Luc TAESCH wrote: > Gentlemen, Slightly off topic, but when reading this list , I was ( happily > ) surprised when reading this list, about the quality of typing, including > redaction,paragraph, greeks and footnote ref like , in just a (pure) text > list > ... > bas

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ketil Malde
Felipe Lessa writes: [-snip- I've already spent too much time on the other stuff :-] > And what do you think about creating a real SeqData data type > with two bases per byte? In terms of processing speed I guess > there will be a small penalty, but if you need to have large > quantities of bas

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Daniel Peebles
Sounds to me like we need a lazy Data.Text variation that allows UTF-8 and UTF-16 "segments" in it list of strict text elements :) Then big chunks of western text will be encoded efficiently, and same with CJK! Not sure what to do about strict Data.Text though :) On Tue, Aug 17, 2010 at 1:40 PM, K

[Haskell-cafe] Which Mail editor or mail chain do you use ?

2010-08-17 Thread Luc TAESCH
Gentlemen, Slightly off topic, but when reading this list , I was ( happily ) surprised when reading this list, about the quality of typing, including redaction,paragraph, greeks and footnote ref like , in just a (pure) text list ... baskell[1] seems interesting. And there's hslua[2]. Can one use h

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Kevin Jardine
The bug tracker recommended just installing from Hackage so with some trepidation I typed: cabal install regex-posix and everything now works as expected! So sorry for the waste of bandwidth. At least this thread is now in Google. Kevin On Aug 17, 3:06 pm, Kevin Jardine wrote: > yep > > now

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Michael Snoyman
On Tue, Aug 17, 2010 at 3:23 PM, Yitzchak Gale wrote: > Michael Snoyman wrote: > > Regarding the data: you haven't actually quoted any > > statistics about the prevalence of CJK data > > True, I haven't seen any - except for Google, which > I don't believe is accurate. I would like to see some >

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Kevin Jardine
yep now I see it: http://trac.haskell.org/haskell-platform/ticket/137 Next time I'll check the bug tracker as well as Google before posting here! Thanks. Kevin On Aug 17, 2:37 pm, Stephen Tetley wrote: > Hi Kevin > > I've just installed Platform-2010.2.0.0 and I'm getting the symbol > error

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
On Tue, Aug 17, 2010 at 2:23 PM, Yitzchak Gale wrote: > Michael Snoyman wrote: > > Regarding the data: you haven't actually quoted any > > statistics about the prevalence of CJK data > > True, I haven't seen any - except for Google, which > I don't believe is accurate. I would like to see some >

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Colin Paul Adams
> "Johan" == Johan Tibell writes: Johan> On Tue, Aug 17, 2010 at 1:36 PM, Tako Schotanus wrote: Johan> Yeah, I tried looking it up but I could find the Johan> technical definition for Char, but in the end I found that Johan> "maxBound" was "0x10" making it basically

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Felipe Lessa
Hello, Ketil Malde! On Tue, Aug 17, 2010 at 8:02 AM, Ketil Malde wrote: > Ivan Lazar Miljenovic writes: > >> Seeing as how the genome just uses 4 base "letters", > > Yes, the bulk of the data is not really "text" at all, but each sequence > (it's fragmented due to the molecular division into chr

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
On Tue, Aug 17, 2010 at 1:36 PM, Tako Schotanus wrote: > Yeah, I tried looking it up but I could find the technical definition for > Char, but in the end I found that "maxBound" was "0x10" making it > basically 24 bits :) > I think that's enough to represent all the assigned Unicode code poi

Re: [Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Stephen Tetley
Hi Kevin I've just installed Platform-2010.2.0.0 and I'm getting the symbol error on the file that I could compile with Platform2009.2.0.2. So I'd say it is a bug with the Platform, also it looks like the error was acknowledged on the Platform mailing list last week. $ /c/ghc/HP/2010.2.0.0/bin/

Re: [Haskell-cafe] Slightly humorous: Headhunters toolbox (example for Germany)

2010-08-17 Thread Henning Thielemann
Daniel Kahlenberg schrieb: > Hi list, > > stumbled across that: > http://www.google.com/insights/search/?hl=de#q=haskell&geo=DE&cmpt=q Good to know that Saxony-Anhalt is the state in Germany with leading interest in Haskell. :-) I would like to know, whether this is due to Magdeburg or Halle. __

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Yitzchak Gale
Ketil Malde wrote: > I'd point out that it seems at least as unfair to optimize for CJK at > the cost of Western languages. Quite true. > [...speculative calculation from which we conclude that] > a given document translated > between Chinese and English should occupy roughly the same space in >

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ketil Malde
Colin Paul Adams writes: >> Char is not an encoding, right? > > Ivan> No, but in GHC at least it corresponds to a Unicode codepoint. > > I don't think this is right, or shouldn't be right, anyway.. Surely it > stands for a character. Unicode codepoints include non-characters such > as the sur

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Yitzchak Gale
Michael Snoyman wrote: > Regarding the data: you haven't actually quoted any > statistics about the prevalence of CJK data True, I haven't seen any - except for Google, which I don't believe is accurate. I would like to see some good unbiased data. Right now we just have our intuitions based on a

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Kevin Jardine
No, it's my own code. But in ghci even the simplest =~ fails on my machine. Here's a complete transcript: >ghci GHCi, version 6.12.3: http://www.haskell.org/ghc/ :? for help Loading package ghc-prim ... linking ... done. Loading package integer-gmp ... linking ... done. Loading package base ...

[Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Kevin Jardine
On Aug 17, 1:55 pm, Tako Schotanus wrote: > I'll repeat here that in my opinion a Text package should be good at > handling text, human text, from whatever country. If I need to handle large > streams of ASCII I'll use something else. I would mostly agree. However, a key use case for Text in

Re: [Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Stephen Tetley
Hi Kevin Is it a library on Hackage? I'll take a look if the dependencies aren't too deep... ___ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Kevin Jardine
I'm running cabal install for the program that uses regex-compat. runhaskell doesn't work because compiling the code requires library information in the cabal file. I've also tried to run something similar using ghci on the same machine and it also fails with a similar error. Kevin On Aug 17,

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Tako Schotanus
On Tue, Aug 17, 2010 at 13:40, Ketil Malde wrote: > Michael Snoyman writes: > > > As far as space usage, you are correct that CJK data will take up more > > memory in UTF-8 than UTF-16. > > With the danger of sounding ... alphabetist? as well as belaboring a > point I agree is irrelevant (the st

Re: [Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Stephen Tetley
On 17 August 2010 12:45, Kevin Jardine wrote: > I was assuming that regex would work properly with the latest Haskell > Platform so I haven't attempted to re-install it. Yes - its wise not to upgrade platform components, especially on Windows. > However as I mentioned above, attempting to compil

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ketil Malde
Yitzchak Gale writes: > I don't think the genome is typical text. I think the typical *large* collection of text is text-encoded data, and not, for lack of a better word, literature. Genomics data is just an example. -k -- If I haven't seen further, it is by standing in the footprints of gian

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Colin Paul Adams
> "Ivan" == Ivan Lazar Miljenovic writes: > Char is not an encoding, right? Ivan> No, but in GHC at least it corresponds to a Unicode codepoint. I don't think this is right, or shouldn't be right, anyway.. Surely it stands for a character. Unicode codepoints include non-characters such

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread Wouter Swierstra
>> Can some one please give me a suggestion on the best choice for an >> embedded >> scripting Language for a haskell application? Why not use Haskell itself? I agree that C and Java aren't perhaps the best choice for application scripting – but both Xmonad and Yi have had quite some success using

[Haskell-cafe] Re: regex-compat on latest Haskell Platform

2010-08-17 Thread Kevin Jardine
I was assuming that regex would work properly with the latest Haskell Platform so I haven't attempted to re-install it. However as I mentioned above, attempting to compile a program that uses regex fails with linker errors on my Windows XP machine. Any ideas on how I can fix this problem? Kevin

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ketil Malde
Michael Snoyman writes: > As far as space usage, you are correct that CJK data will take up more > memory in UTF-8 than UTF-16. With the danger of sounding ... alphabetist? as well as belaboring a point I agree is irrelevant (the storage format): I'd point out that it seems at least as unfair

Re: [Haskell-cafe] Embedded scripting Language for haskell app

2010-08-17 Thread John Lato
> > From: Hemanth Kapila > Subject: [Haskell-cafe] Embedded scripting Language for haskell app > > Hi, > > Can some one please give me a suggestion on the best choice for an embedded > scripting Language for a haskell application? > I mean, something like guile/lua for c/c++ and groovy/jruby for j

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Tako Schotanus
On Tue, Aug 17, 2010 at 13:29, Ketil Malde wrote: > Tako Schotanus writes: > > >> Just like Char is capable of encoding any valid Unicode codepoint. > > > Unless a Char in Haskell is 32 bits (or at least more than 16 bits) it > con > > NOT encode all Unicode points. > > And since it can encode (

Re: [Haskell-cafe] regex-compat on latest Haskell Platform

2010-08-17 Thread Stephen Tetley
Hi Kevin Are you trying to compile a program using regex-posix or compiling and installing an update of regex-posix (which is already in the Platform)? If you are compiling a program that uses regex-posix, you shouldn't need to install other DLLs as the relevant libraries (.a and .o files) are di

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Michael Snoyman
On Tue, Aug 17, 2010 at 2:20 PM, Ivan Lazar Miljenovic < ivan.miljeno...@gmail.com> wrote: > Michael Snoyman writes: > > > I don't think *anyone* is asserting that UTF-16 is a common encoding > > for files anywhere, > > *ahem* > http://en.wikipedia.org/wiki/UTF-16/UCS-2#Use_in_major_operating_sys

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Tako Schotanus
On Tue, Aug 17, 2010 at 13:00, Michael Snoyman wrote: > > > On Tue, Aug 17, 2010 at 1:50 PM, Yitzchak Gale wrote: > >> Ketil Malde wrote: >> > I haven't benchmarked it, but I'm fairly sure that, if you try to fit a >> > 3Gbyte file (the Human genome, say¹), into a computer with 4Gbytes of >> > R

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ivan Lazar Miljenovic
Michael Snoyman writes: > I don't think *anyone* is asserting that UTF-16 is a common encoding > for files anywhere, *ahem* http://en.wikipedia.org/wiki/UTF-16/UCS-2#Use_in_major_operating_systems_and_environments -- Ivan Lazar Miljenovic ivan.miljeno...@gmail.com IvanMiljenovic.wordpress.com

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ivan Lazar Miljenovic
Tako Schotanus writes: > On Tue, Aug 17, 2010 at 12:54, Ivan Lazar Miljenovic < > ivan.miljeno...@gmail.com> wrote: > >> Tom Harper writes: >> >> > 2010/8/17 Bulat Ziganshin : >> >> Hello Tom, >> > >> > >> > >> >> i don't understand what you mean. are you support all 2^20 codepoints >> >> in Da

[Haskell-cafe] Re: gtk2hs with old gtk2

2010-08-17 Thread Johannes Waldmann
Brandon S Allbery KF8NH ece.cmu.edu> writes: > I ran up against this here; I can't mandate that our older SuSE systems be > upgraded to a newer distribution (our faculty don't like change, in > general), so if 2.18 is really needed then it's pretty much the end of the > line for gtk2hs as far as

Re: Re[2]: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
On Tue, Aug 17, 2010 at 1:05 PM, Bulat Ziganshin wrote: > Hello Tako, > > Tuesday, August 17, 2010, 3:03:20 PM, you wrote: > > > Unless a Char in Haskell is 32 bits (or at least more than 16 bits) > > it con NOT encode all Unicode points. > > it's 32 bit > Like Bulat said it's 32 bit. It's *defin

Re: Re[2]: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
On Tue, Aug 17, 2010 at 12:39 PM, Bulat Ziganshin wrote: > Hello Tom, > > Tuesday, August 17, 2010, 2:09:09 PM, you wrote: > > > In the first iteration of the Text package, UTF-16 was chosen because > > it had a nice balance of arithmetic overhead and space. The > > arithmetic for UTF-8 started

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Johan Tibell
Hi Ketil, On Tue, Aug 17, 2010 at 12:09 PM, Ketil Malde wrote: > Johan Tibell writes: > > > It's not clear to me that using UTF-16 internally does make Data.Text > > noticeably slower. > > I haven't benchmarked it, but I'm fairly sure that, if you try to fit a > 3Gbyte file (the Human genome, s

Re[2]: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Bulat Ziganshin
Hello Tako, Tuesday, August 17, 2010, 3:03:20 PM, you wrote: > Unless a Char in Haskell is 32 bits (or at least more than 16 bits) > it con NOT encode all Unicode points. it's 32 bit -- Best regards, Bulatmailto:bulat.zigans...@gmail.com _

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ketil Malde
Ivan Lazar Miljenovic writes: > Seeing as how the genome just uses 4 base "letters", Yes, the bulk of the data is not really "text" at all, but each sequence (it's fragmented due to the molecular division into chromosomes, and due to incompleteness) also has a textual header. Generally, the

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Tako Schotanus
On Tue, Aug 17, 2010 at 12:54, Ivan Lazar Miljenovic < ivan.miljeno...@gmail.com> wrote: > Tom Harper writes: > > > 2010/8/17 Bulat Ziganshin : > >> Hello Tom, > > > > > > > >> i don't understand what you mean. are you support all 2^20 codepoints > >> in Data.Text package? > > > > Bulat, > > > >

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Michael Snoyman
On Tue, Aug 17, 2010 at 1:50 PM, Yitzchak Gale wrote: > Ketil Malde wrote: > > I haven't benchmarked it, but I'm fairly sure that, if you try to fit a > > 3Gbyte file (the Human genome, say¹), into a computer with 4Gbytes of > > RAM, UTF-16 will be slower than UTF-8... > > I don't think the genom

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ivan Lazar Miljenovic
Miguel Mitrofanov writes: > Ivan Lazar Miljenovic wrote: >> Tom Harper writes: >> >>> 2010/8/17 Bulat Ziganshin : Hello Tom, >>> >>> i don't understand what you mean. are you support all 2^20 codepoints in Data.Text package? >>> Bulat, >>> >>> Yes, its internal representation is

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Miguel Mitrofanov
Ivan Lazar Miljenovic wrote: Tom Harper writes: 2010/8/17 Bulat Ziganshin : Hello Tom, i don't understand what you mean. are you support all 2^20 codepoints in Data.Text package? Bulat, Yes, its internal representation is UTF-16, which is capable of encoding *any* valid Unicode codepo

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Ivan Lazar Miljenovic
Tom Harper writes: > 2010/8/17 Bulat Ziganshin : >> Hello Tom, > > > >> i don't understand what you mean. are you support all 2^20 codepoints >> in Data.Text package? > > Bulat, > > Yes, its internal representation is UTF-16, which is capable of > encoding *any* valid Unicode codepoint. Just li

Re: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Yitzchak Gale
Ketil Malde wrote: > I haven't benchmarked it, but I'm fairly sure that, if you try to fit a > 3Gbyte file (the Human genome, say¹), into a computer with 4Gbytes of > RAM, UTF-16 will be slower than UTF-8... I don't think the genome is typical text. And I doubt that is true if that text is in a CJ

Re: Re[2]: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Tom Harper
2010/8/17 Bulat Ziganshin : > Hello Tom, > i don't understand what you mean. are you support all 2^20 codepoints > in Data.Text package? Bulat, Yes, its internal representation is UTF-16, which is capable of encoding *any* valid Unicode codepoint. -- Tom __

Re[2]: [Haskell-cafe] Re: String vs ByteString

2010-08-17 Thread Bulat Ziganshin
Hello Tom, Tuesday, August 17, 2010, 2:09:09 PM, you wrote: > In the first iteration of the Text package, UTF-16 was chosen because > it had a nice balance of arithmetic overhead and space. The > arithmetic for UTF-8 started to have serious performance impacts in > situations where the entire do

  1   2   >