Feature Requests item #1503411, was opened at 2006-06-09 12:47
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=352439&aid=1503411&group_id=2439
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.

Category: None
Group: None
Status: Open
Priority: 5
Submitted By: DV__ (dv__)
Assigned to: Nobody/Anonymous (nobody)
Summary: UTF-8 case-insensitive search for non-latin

Initial Comment:
Please help me implement the feature named in the summary.

Problem (inside Scintilla):
In UTF-8, every non-Latin character occupies two or more bytes. The
functions MakeUpperCase(char ch) and MakeLowerCase(char ch), even when
patched for non-Latin text, therefore do not work on a UTF-8 buffer,
because they operate on one byte at a time.

Possible solution:
MakeUpperCase and MakeLowerCase must behave differently for regular
(non-Unicode) text and for UTF-8 text. However, I don't know how to
implement this in Scintilla :(

Another problem (outside Scintilla's sources):
In UTF-8 non-Latin text, I want to read the current character together with
the characters before and after it. I can calculate the number of bytes in
the current character, so I know where the first byte of the next character
is. But how can I locate the previous character? I don't know how many
bytes it occupies, so I don't know where it starts.

Thank you in advance.

----------------------------------------------------------------------

_______________________________________________
Scintilla-interest mailing list
http://mailman.lyra.org/mailman/listinfo/scintilla-interest
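On the second question, one common approach relies on how UTF-8 encodes its bytes: every continuation byte has the bit pattern 10xxxxxx, so the start of the previous character can be found by stepping backwards until a byte that is not a continuation byte is reached. The following is only a minimal sketch of that idea. It assumes the buffer is a std::string holding well-formed UTF-8, and the helper names (IsContinuationByte, Utf8CharLength, NextCharStart, PrevCharStart) are hypothetical, not taken from Scintilla's sources.

```cpp
#include <cstddef>
#include <string>

// UTF-8 continuation bytes always have the bit pattern 10xxxxxx.
inline bool IsContinuationByte(unsigned char b) {
    return (b & 0xC0) == 0x80;
}

// Hypothetical helper: number of bytes in the character that starts with
// the given lead byte. Returns 1 for ASCII and for invalid lead bytes so
// that callers always make progress.
inline std::size_t Utf8CharLength(unsigned char lead) {
    if ((lead & 0x80) == 0x00) return 1;  // 0xxxxxxx
    if ((lead & 0xE0) == 0xC0) return 2;  // 110xxxxx
    if ((lead & 0xF0) == 0xE0) return 3;  // 1110xxxx
    if ((lead & 0xF8) == 0xF0) return 4;  // 11110xxx
    return 1;                             // stray continuation or invalid byte
}

// Byte offset of the character after the one starting at pos,
// clamped to the end of the buffer.
inline std::size_t NextCharStart(const std::string &text, std::size_t pos) {
    if (pos >= text.size()) return text.size();
    const std::size_t next =
        pos + Utf8CharLength(static_cast<unsigned char>(text[pos]));
    return next < text.size() ? next : text.size();
}

// Byte offset of the character before the one starting at pos.
// Steps back over at most three continuation bytes, since a UTF-8
// character is never longer than four bytes.
inline std::size_t PrevCharStart(const std::string &text, std::size_t pos) {
    if (pos == 0) return 0;
    if (pos > text.size()) pos = text.size();
    --pos;
    int steps = 0;
    while (pos > 0 && steps < 3 &&
           IsContinuationByte(static_cast<unsigned char>(text[pos]))) {
        --pos;
        ++steps;
    }
    return pos;
}
```

For example, in a buffer holding a one-byte, a two-byte and a three-byte character in that order, calling PrevCharStart with the offset of the three-byte character gives the offset of the two-byte one. Scintilla also provides the SCI_POSITIONBEFORE and SCI_POSITIONAFTER messages, which may cover the same need when working through the editor rather than on a raw buffer.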
