Re: grapheme cluster library

2017-10-23 Thread Thomas Jollans
On 23/10/17 16:25, Rustom Mody wrote: > On Monday, October 23, 2017 at 1:15:35 PM UTC+5:30, Steve D'Aprano wrote: >> >> and more. Many linguists also include digraphs (pairs of letters) like the >> English "th", "sh", "qu", or "gh" as graphemes. >> >> >>

Re: grapheme cluster library

2017-10-23 Thread Rustom Mody
On Monday, October 23, 2017 at 1:15:35 PM UTC+5:30, Steve D'Aprano wrote: > On Mon, 23 Oct 2017 05:47 pm, Rustom Mody wrote: > > > On Monday, October 23, 2017 at 8:06:03 AM UTC+5:30, Lawrence D’Oliveiro > > wrote: > [...] > >> Bear in mind that the logical representation of the text is as code

Re: grapheme cluster library

2017-10-23 Thread Steve D'Aprano
On Mon, 23 Oct 2017 05:47 pm, Rustom Mody wrote: > On Monday, October 23, 2017 at 8:06:03 AM UTC+5:30, Lawrence D’Oliveiro > wrote: [...] >> Bear in mind that the logical representation of the text is as code points, >> graphemes would have more to do with rendering. > > Heh! Speak of

Re: grapheme cluster library (Posting On Python-List Prohibited)

2017-10-23 Thread Rustom Mody
On Monday, October 23, 2017 at 8:06:03 AM UTC+5:30, Lawrence D’Oliveiro wrote: > On Saturday, October 21, 2017 at 5:11:13 PM UTC+13, Rustom Mody wrote: > > Is there a recommended library for manipulating grapheme clusters? > > Is this any

Re: grapheme cluster library

2017-10-21 Thread Steven D'Aprano
On Fri, 20 Oct 2017 21:11:02 -0700, Rustom Mody wrote: > Is there a recommended library for manipulating grapheme clusters? Back in July, I asked for anyone interested in grapheme clusters to consider checking out this issue on the bug tracker: http://bugs.python.org/issue30717 My post

Re: grapheme cluster library

2017-10-21 Thread Rustom Mody
On Saturday, October 21, 2017 at 9:22:24 PM UTC+5:30, MRAB wrote: > On 2017-10-21 05:11, Rustom Mody wrote: > > Is there a recommended library for manipulating grapheme clusters? > > > > In particular, in devanagari > > क् + ि = कि > > in (pseudo)unicode names > > KA-letter + I-sign =

Re: grapheme cluster library

2017-10-21 Thread MRAB
On 2017-10-21 05:11, Rustom Mody wrote: Is there a recommended library for manipulating grapheme clusters? In particular, in devanagari क् + ि = कि in (pseudo)unicode names KA-letter + I-sign = KI-composite-letter I would like to be able to handle KI as a letter rather than two code-points.

Re: grapheme cluster library

2017-10-21 Thread Rustom Mody
On Saturday, October 21, 2017 at 11:51:57 AM UTC+5:30, Chris Angelico wrote: > On Sat, Oct 21, 2017 at 3:25 PM, Stefan Ram wrote: > > Rustom Mody writes: > >>Is there a recommended library for manipulating grapheme clusters? > > > > The Python Library has a module "unicodedata", with functions

Re: grapheme cluster library

2017-10-21 Thread Chris Angelico
On Sat, Oct 21, 2017 at 3:25 PM, Stefan Ram wrote: > Rustom Mody writes: >>Is there a recommended library for manipulating grapheme clusters? > > The Python Library has a module "unicodedata", with functions like: > > |unicodedata.normalize(

grapheme cluster library

2017-10-20 Thread Rustom Mody
Is there a recommended library for manipulating grapheme clusters? In particular, in devanagari क् + ि = कि in (pseudo)unicode names KA-letter + I-sign = KI-composite-letter I would like to be able to handle KI as a letter rather than two code-points. Can of course write an automaton to group