Re: [regex] case-splitting strings in unicode

2005-10-09 Thread Martin v. Löwis
John Perks and Sarah Mount wrote:
> I have to split some identifiers that are casedLikeThis into their
> component words. In this instance I can safely use [A-Z] to represent
> uppercase, but what pattern should I use if I wanted it to work more
> generally? I can envisage walking the string testin

Re: [regex] case-splitting strings in unicode

2005-10-08 Thread Micah Elliott
On Oct 09, John Perks and Sarah Mount wrote:
> I have to split some identifiers that are casedLikeThis into their
> component words. In this instance I can safely use [A-Z] to represent
> uppercase, but what pattern should I use if I wanted it to work more
> generally? I can envisage walking the st

[regex] case-splitting strings in unicode

2005-10-08 Thread John Perks and Sarah Mount
I have to split some identifiers that are casedLikeThis into their component words. In this instance I can safely use [A-Z] to represent uppercase, but what pattern should I use if I wanted it to work more generally? I can envisage walking the string testing the unicodedata.category of each char, b
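
The approach the question mentions, walking the string and testing the unicodedata.category of each character, might be sketched roughly as below. The function name split_cased is hypothetical, and treating only category "Lu" (uppercase letter) as the start of a new word is an assumption; this is not necessarily what anyone in the thread proposed.

```python
import unicodedata


def split_cased(identifier):
    """Split an identifier such as 'casedLikeThis' before each uppercase letter.

    Hypothetical helper: "uppercase" here means Unicode general category 'Lu',
    which is an assumption about what the poster wants.
    """
    words = []
    current = ""
    for ch in identifier:
        # Start a new word when an uppercase letter follows existing text.
        if unicodedata.category(ch) == "Lu" and current:
            words.append(current)
            current = ch
        else:
            current += ch
    if current:
        words.append(current)
    return words


print(split_cased("casedLikeThis"))
print(split_cased("\u00e9lan\u00c9cole"))  # non-ASCII uppercase letter
```

Note that a run of capitals such as "HTTPServer" would be split into single letters by this sketch; handling acronyms would need an extra rule.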