In perl.git, the branch smoke-me/khw-locale has been created <https://perl5.git.perl.org/perl.git/commitdiff/a2041504c9d11f77b1ed843d1487dee399daa056?hp=0000000000000000000000000000000000000000>
at a2041504c9d11f77b1ed843d1487dee399daa056 (commit) - Log ----------------------------------------------------------------- commit a2041504c9d11f77b1ed843d1487dee399daa056 Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 17:34:25 2017 -0700 XXX unfinished langinfo commit 81b942c4cbcf8e86b839671949174751a37a11dd Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 16:12:18 2017 -0700 XXX msg: Use is_utf8_invariant_string() more Now that this function was changed to do word-at-a time searching in commit XXX, we can more quickly find the first variant byte in a string, if any. Given that a lot of usage of Perl is on ASCII data, it makes sense to try this first before any byte-at-a-time processing. Since Perl can be used on things that are mostly non-ASCII, we give up at the first such one, and process the rest of the string byte-by-byte. Otherwise we could have a pipeline of finding the next variant quickly, but this would only be faster if variants were rare, which I don't feel we can be confident about, after finding at least one. commit 669e5422e4ef36dec21ee02348839de39695aa2a Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 14:21:49 2017 -0700 POSIX.xs: Convert to use is_utf8_non_invariant_string() In localeconv() commit 40ca74cfafbeeba20d4afc8f08bc20e1b7385e04 Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 14:17:30 2017 -0700 toke.c: Convert to use is_utf8_non_invariant_string commit d1307be672ad1e0855fbb00acbad4759cfee014c Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 14:16:26 2017 -0700 mg.c: Convert to use is_utf8_non_invariant_string() commit 02a7e4d7737631fed2f7cbf5c50d50bd83aab47a Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 14:05:07 2017 -0700 Add is_utf8_non_invariant_string() This function tells whether or not its argument is legal UTF-8 that isn't invariant under UTF-8 or not. This paradigm is used in several places in the perl core to decide whether to turn on an SV's utf8 flag. None of those places realized that there was a simple way to avoid rescanning the string (though perhaps an C optimizer would). This commit uses that method; the next commits will convert to use this function. commit d1a04727f87bf77b9094ffdbf082480d5f901f5a Author: Karl Williamson <k...@cpan.org> Date: Sat Nov 18 13:52:44 2017 -0700 Change 3 functions to be #defines of others I made these separate functions because I thought would make faster code, but I realized that modern compilers should be able to optimize the more general functions into the same code as the ones removed, given that the parameters are known to be 0 at compile time and can easily determined to always be false. It's easier to maintain one version of a function than two, so this commit favors that. commit 631f0c783e32aac30ee2f07a5aa3fea8e486034f Author: Karl Williamson <k...@cpan.org> Date: Fri Nov 17 21:59:59 2017 -0700 inline.h: Avoid some extra strlen() The API of these functions says that if the length is 0, strlen() is called to compute it. In several cases, control is handed off to a function using 0, throwing away the already-computed length. Change to use the computed length when calling the functions, avoiding the issue. commit 44b6de549b591291ec993bd0d24d4d99107568ab Author: Karl Williamson <k...@cpan.org> Date: Tue Nov 14 22:27:06 2017 -0700 mg.c: Fix typo in #if This typo only affected platforms without the locale category LC_MESSAGES. Principally this is Windows. commit 09af73c9294581bbe1a5a2acd2ea27159ea3e4a0 Author: Karl Williamson <k...@cpan.org> Date: Mon Aug 28 18:01:43 2017 -0600 XXX may include other things after final edits: ExtUtils::ParseXS/lib/perlxs.pod: Nits This removes extra blanks following colons that don't mean the normal thing for colons that traditionally have two spaces after them, and capitalizes Perl. commit 632bd619d918ff824322c55ff749ff6e22176872 Author: Karl Williamson <k...@cpan.org> Date: Wed Jul 26 08:59:33 2017 -0600 Teach perl about more locale categories glibc has various other categories than the ones perl handles, for example LC_PAPER. This commit adds knowledge of these to perl, so that one can set them, interrogate them, and have libraries work on them, even though perl itself does not. This is in preparation for future commits, where it becomes more important than currently for perl to know about all the locale categories on the system. I looked through various other systems to try to find other categories, but did not see any. If a system does have such a category, it is pretty easy to tell perl about it, and recompile. Use the changes in this commit as a template, and send an email to perl...@perl.org, so that the next Perl release will have it. ----------------------------------------------------------------------- -- Perl5 Master Repository