Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Paul Davis
On Mon, Jun 22, 2009 at 9:52 PM, Noah Slater wrote: > On Mon, Jun 22, 2009 at 04:22:10PM -0400, Paul Davis wrote: >> On Mon, Jun 22, 2009 at 3:32 PM, Noah Slater wrote: >> > >> >  * calculate the document hash from a canonical binary serialisation >> > >> >  * calculate the document hash from the b

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Noah Slater
On Mon, Jun 22, 2009 at 04:22:10PM -0400, Paul Davis wrote: > On Mon, Jun 22, 2009 at 3:32 PM, Noah Slater wrote: > > > >  * calculate the document hash from a canonical binary serialisation > > > >  * calculate the document hash from the binary serialisation [...] > > * calculate the document has

Re: View Filtering (was Re: The 1.0 Thread)

2009-06-22 Thread Barry Wark
On Fri, Jun 19, 2009 at 1:53 PM, Chris Anderson wrote: > On Fri, Jun 19, 2009 at 1:42 PM, Barry Wark wrote: >> >> If I may, I would like to put a +1 on ability to intersect (or union) >> multiple view results. This is the feature that's preventing >> whole-hearted adoption of CouchDB for several ap

[jira] Commented: (COUCHDB-257) HTTP caching headers don't provide expected behaviour

2009-06-22 Thread Mark Hammond (JIRA)
[ https://issues.apache.org/jira/browse/COUCHDB-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722880#action_12722880 ] Mark Hammond commented on COUCHDB-257: -- My 2c - IE is wrong in not interpreting a lac

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Chris Anderson
On Mon, Jun 22, 2009 at 1:22 PM, Paul Davis wrote: > On Mon, Jun 22, 2009 at 3:32 PM, Noah Slater wrote: >> On Mon, Jun 22, 2009 at 03:15:24PM -0400, Paul Davis wrote: > > Exactly, though I would add a third choice that is > >  * calculate the document hash from the deterministic binary serializat

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Paul Davis
On Mon, Jun 22, 2009 at 3:32 PM, Noah Slater wrote: > On Mon, Jun 22, 2009 at 03:15:24PM -0400, Paul Davis wrote: >> I think he means optimization in so much as that the deterministic >> revision algorithm still works regardless. > > For what definition of "works" though? > > Should two versions of

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Noah Slater
On Mon, Jun 22, 2009 at 03:15:24PM -0400, Paul Davis wrote: > I think he means optimization in so much as that the deterministic > revision algorithm still works regardless. For what definition of "works" though? Should two versions of the same document, one using combining code points and the ot

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Paul Davis
On Mon, Jun 22, 2009 at 2:46 PM, Noah Slater wrote: > On Mon, Jun 22, 2009 at 09:21:44AM -0700, Chris Anderson wrote: >> My larger point is that normalization is basically an optimization. > > Optimisation of what? Unicode normalisation should be considered absolutely > critical to any canonical fo

London CouchDB meetup Thursday

2009-06-22 Thread Chris Anderson
Couchers, Jan and I are in London for the Erlang Factory conference, along with a lot of other longtime (and new) Erlangers. We'll be having a Erlounge this Thursday evening at St. James Crypt at Clerkenwell Green. You are invited for drinks and conversation. The event starts at 18:00 (((look at m

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Noah Slater
On Mon, Jun 22, 2009 at 07:46:40PM +0100, Noah Slater wrote: > Consider the following byte sequences: > > U+006B U+014D U+0061 U+006E > > U+006B U+006F U+0304 U+0061 U+006E > > Both of these look like "kōan" yet the byte sequence depends on my input > method. Just to be pedantic, these are co

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Noah Slater
On Mon, Jun 22, 2009 at 09:21:44AM -0700, Chris Anderson wrote: > My larger point is that normalization is basically an optimization. Optimisation of what? Unicode normalisation should be considered absolutely critical to any canonical form. If we want to use some proprietary algorithm for determi

[jira] Commented: (COUCHDB-377) allow native view servers

2009-06-22 Thread Damien Katz (JIRA)
[ https://issues.apache.org/jira/browse/COUCHDB-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722757#action_12722757 ] Damien Katz commented on COUCHDB-377: - I think we should not try to avoid the JSON ter

Re: The 1.0 Thread

2009-06-22 Thread Noah Slater
On Mon, Jun 22, 2009 at 10:51:27AM -0700, Jim McCoy wrote: > p.s. I am not necessarily suggesting that this task be the focus of > anyone on the dev team, but it would be nice to know that such an > effort would be acceptable to the powers that be before someone spends > time working on it... You

Re: The 1.0 Thread

2009-06-22 Thread Paul Davis
On Mon, Jun 22, 2009 at 1:51 PM, Jim McCoy wrote: > Something that would be nice to have for 1.0 or at least somewhere on > the roadmap is full support for Erlang in views and in other bits of > the codebase that accept functions as input.  I know that Javascript > is quite popular with most of the

Re: The 1.0 Thread

2009-06-22 Thread Jim McCoy
Something that would be nice to have for 1.0 or at least somewhere on the roadmap is full support for Erlang in views and in other bits of the codebase that accept functions as input. I know that Javascript is quite popular with most of the Couch team, but it would be nice to know that Erlang coul

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Chris Anderson
On Mon, Jun 22, 2009 at 7:36 AM, Noah Slater wrote: > On Sun, Jun 21, 2009 at 11:21:00PM -0700, Chris Anderson wrote: >> My gut reaction is that normalizing strings using NFC [1] is not appropriate >> for a database. Here's why we should treat strings as binary and not worry >> about unicode normal

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Paul Davis
Couple things over thinking this over for a night. Firstly, I was about to write an email almost exactly like Chris' last night, but during the time of drafting it I started looking into unicode normalization and what effects it might have. As it turns out, we're already messing around with string

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Antony Blakey
On 23/06/2009, at 12:06 AM, Noah Slater wrote: On Sun, Jun 21, 2009 at 11:21:00PM -0700, Chris Anderson wrote: My gut reaction is that normalizing strings using NFC [1] is not appropriate for a database. Here's why we should treat strings as binary and not worry about unicode normalization

Re: Unicode normalization (was Re: The 1.0 Thread)

2009-06-22 Thread Noah Slater
On Sun, Jun 21, 2009 at 11:21:00PM -0700, Chris Anderson wrote: > My gut reaction is that normalizing strings using NFC [1] is not appropriate > for a database. Here's why we should treat strings as binary and not worry > about unicode normalization at all: [...] > First of all, I'm certain we can'

[jira] Commented: (COUCHDB-383) HTTP get parameter strictness/looseness

2009-06-22 Thread Curt Arnold (JIRA)
[ https://issues.apache.org/jira/browse/COUCHDB-383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722582#action_12722582 ] Curt Arnold commented on COUCHDB-383: - The typical work-around used with other service

[jira] Commented: (COUCHDB-257) HTTP caching headers don't provide expected behaviour

2009-06-22 Thread Vinay Sajip (JIRA)
[ https://issues.apache.org/jira/browse/COUCHDB-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722506#action_12722506 ] Vinay Sajip commented on COUCHDB-257: - I don't see that this is particularly related t

[jira] Commented: (COUCHDB-383) HTTP get parameter strictness/looseness

2009-06-22 Thread Vinay Sajip (JIRA)
[ https://issues.apache.org/jira/browse/COUCHDB-383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722505#action_12722505 ] Vinay Sajip commented on COUCHDB-383: - I don't see that this is particularly related t

Re: patches that need reviewing

2009-06-22 Thread Dirkjan Ochtman
On Sun, Jun 21, 2009 at 23:53, Damien Katz wrote: > I haven't been keeping very good track lately of patches. Sorry if you have > a patch we haven't reviewed yet. > > If their are any patches waiting review, respond here and I'll try to get to > it soon or reassign to someone else to review. I'd l