In perl.git, the branch smoke-me/khw-5.21 has been created

<http://perl5.git.perl.org/perl.git/commitdiff/9907a7508ed6300fbefbe4f0b86713821725dcc1?hp=0000000000000000000000000000000000000000>

        at  9907a7508ed6300fbefbe4f0b86713821725dcc1 (commit)

- Log -----------------------------------------------------------------
commit 9907a7508ed6300fbefbe4f0b86713821725dcc1
Author: Karl Williamson <k...@cpan.org>
Date:   Mon May 19 16:36:33 2014 -0600

    perlre: Clarify /x eol can't be escaped

M       pod/perlre.pod

commit 208a479babe720baf4616a0311aa39d48aed01e2
Author: Karl Williamson <k...@cpan.org>
Date:   Sat May 17 19:37:06 2014 -0600

    XXX Don't push. attempt to tell tries everything at compile time

    But, it appears this is thrown away, have to consult with Yves to
    see if it is worth pursuing

M       embed.fnc
M       embed.h
M       perl.h
M       proto.h
M       regcomp.c
M       regcomp.h
M       regcomp.sym

commit 13c31b5ff0bf63d2140e0e709aad94fa558da97a
Author: Karl Williamson <k...@cpan.org>
Date:   Sat May 17 19:18:25 2014 -0600

    XXX isSOME_FOLD_A

M       handy.h

commit 1ebe7aaef7ec71d61b445ec900d0617e91fc3f7d
Author: Karl Williamson <k...@cpan.org>
Date:   Tue May 13 17:31:39 2014 -0600

    t/run/locale.t: Skip some tests for some shells

    Some sh programs don't accept changes of environment variables like
    LC_ALL to invalid values. For these the tests aren't testing what
    they think they are, so change them to just return the expected
    value, so they will pass. This effectively skips the tests when run
    on these shells.

    See http://nntp.perl.org/group/perl.perl5.porters/214503

M       t/run/locale.t

commit aad760d7f0776fcdea1545c121e326386d8df73f
Author: Karl Williamson <k...@cpan.org>
Date:   Tue May 13 17:08:24 2014 -0600

    t/run/locale.t: Rmv obsolete complicated idiom

    See http://nntp.perl.org/group/perl.perl5.porters/213369

M       t/run/locale.t

commit f175961ff68f1769efd95d5869ac739dafc1ee15
Author: Karl Williamson <k...@cpan.org>
Date:   Mon May 12 14:49:48 2014 -0600

    XXX See current commit msg

    Reverts xyz and abc to deprecate unescaped {
    Needs more pod places, delta.
    Fails because of conflict with DaveM code in ??. I think the latter
    also would fail with madskills true.
    One possibility is to only do the keep delim (or whatever it is)
    within the ?? portion of the string.

M       dquote_static.c
M       embed.fnc
M       embed.h
M       handy.h
M       l1_char_class_tab.h
M       lib/B/Deparse-core.t
M       pod/perldiag.pod
M       pod/perlre.pod
M       proto.h
M       regcomp.c
M       regen/mk_PL_charclass.pl
M       t/lib/warnings/toke
M       t/op/kvhslice.t
M       t/re/pat_advanced.t
M       t/re/reg_mesg.t
M       toke.c

commit 0fbf4e7604c85bd33343991763dfcb2b76c02407
Author: Karl Williamson <k...@cpan.org>
Date:   Sun May 11 18:20:54 2014 -0600

    regcomp.c: Update comment to new reality

M       regcomp.c

commit 183596d017ec0413a9e32c538176dbfc80d88b34
Author: Karl Williamson <k...@cpan.org>
Date:   Sun May 11 18:19:40 2014 -0600

    regcomp.c: Add assertion, remove case: statements

    This routine should never be called unless the new assertion is
    true, which removes the need for the removed cases in the switch.

M       regcomp.c

commit ef7a7b6868458a42d66ba2f8cdec484c2bb47200
Author: Karl Williamson <k...@cpan.org>
Date:   Sun May 11 17:41:48 2014 -0600

    regcomp.c: Skip work that is a no-op

    There are a few characters in the Latin1 range that can be folded
    to by above-Latin1 characters. Some of these are folded to as part
    of a single character fold, like KELVIN SIGN folds to 'k'. More are
    folded to as part of a multi-character fold. Until this commit,
    there wasn't a quick way to distinguish between the two classes. A
    couple of places only want the single-character ones. It is more
    efficient to look for just those than to include the multi-char
    ones which end up not doing anything. This uses a bit in
    l1_char_class_tab.h to indicate those characters that are in the
    desired class.

M       handy.h
M       l1_char_class_tab.h
M       regcomp.c
M       regen/mk_PL_charclass.pl

commit fa01072f478b0949f125f801f15fe0ce5e28f9f2
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Sat Mar 16 22:48:22 2013 -0600

    regen/mk_PL_charclass.pl: Rmv hard-coded char names

    Since this program was written, the abbreviated names of the
    control characters have become available from
    charnames::viacode(). We change to use these instead of hard-coding
    them in. At the same time, this shortens the names for some of the
    other characters in cases where it is easy to read the short ones.

    It also changes to use mnemonics instead of hard-coded ordinals,
    like using ASCII instead of x < 128. This allows it to be run on an
    EBCDIC platform.

M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl

commit 1ac371e9ae8e321e8b0a56faa40c0d270352270c
Author: Karl Williamson <k...@cpan.org>
Date:   Wed May 7 17:36:40 2014 -0600

    XXX resolve this ticket when push PATCH: [perl #121777] User-defined prop and packages

    The solution to this is to add the package name to the property
    name for run-time lookup.

M       regcomp.c
M       t/re/pat_advanced.t

commit 39296c9a791aaa40dc30db581089e239c8de184b
Author: Karl Williamson <k...@cpan.org>
Date:   Wed May 7 16:41:51 2014 -0600

    regcomp.c: Rmv temporary variable

    This variable is unused after its first use, so can just be
    eliminated.

M       regcomp.c

commit 1dc10df0dde72171cb31264e6761b9803263f208
Author: Karl Williamson <k...@cpan.org>
Date:   Wed May 7 16:40:48 2014 -0600

    lib/utf8_heavy.pl: Stack '?' and ':' ternaries

    This is to make the code easier to read.

M       lib/utf8_heavy.pl

commit cb298cb3a8e5ec99ff25788265f761c69b2e950c
Author: Karl Williamson <k...@cpan.org>
Date:   Tue May 6 16:26:05 2014 -0600

    t/test.pl: Use existing variable instead of recalculating

    Whether the current platform is ASCII is already available in a
    variable. Use that instead of inventing a different paradigm.

M       t/test.pl

commit 9bc14f765e1a8da36e4687d93eed9dfb64e3bd1e
Author: Karl Williamson <k...@cpan.org>
Date:   Sun Apr 20 20:06:04 2014 -0600

    regexec.c: Eliminate a malloc/free

    This uses a C automatic variable instead of a malloc and free.

M       regexec.c

commit 167cf19ff6037a06057b054865d9c6a790df67b5
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Fri Mar 15 11:50:35 2013 -0600

    XS-APItest/t/handy.t: Change output message

    On EBCDIC platforms, the output is not in terms of \N{U+}; change
    text to \x{ }

M       ext/XS-APItest/t/handy.t

commit 612543bb15dbc0495067d6b0909472a72bd8c808
Author: Karl Williamson <k...@cpan.org>
Date:   Mon May 5 09:56:43 2014 -0600

    utf8.c: Move documentation next to its function

    Somehow this pod stuff was orphaned from the function it describes.

M       utf8.c

commit 4ef1e198ef5dce772cb637de35a7ce0afff45425
Author: Karl Williamson <k...@cpan.org>
Date:   Sun May 4 19:00:40 2014 -0600

    sv.c: Clarify comment

    I find reordering the text makes it easier to understand

M       sv.c

commit c8393ad3c879b4dcfd30cf81193df6f4716a4213
Author: Karl Williamson <k...@cpan.org>
Date:   Sun May 4 18:58:51 2014 -0600

    perlapi: Clarify some instances where NUL is or isn't permitted

    Some functions that take a string/length pair can have embedded
    NULs and don't have to be NUL terminated; others are the opposite.
    This adds text to clarify the issue.

M       handy.h
M       sv.c
M       sv.h
M       util.c

commit 1347bad150bad8ec2d1039fa366f84fdea0c7da8
Author: Karl Williamson <k...@cpan.org>
Date:   Sun May 4 17:09:22 2014 -0600

    Wrap various pod uses of NUL with C<>

    This makes the uses of this consistent in our pods. Also changed
    one use of the word 'buffer' into 'string', the latter being more
    appropriate.

M       inline.h
M       pod/perlguts.pod
M       pod/perlvms.pod
M       sv.c
M       sv.h
M       toke.c
M       util.c

commit 62aa7bf989bfa1fb50813f73431bad6f54e46bc1
Author: Karl Williamson <k...@cpan.org>
Date:   Sat May 3 15:38:27 2014 -0600

    regen/regcharclass.pl: Improve the generated code

    This is a small improvement when a consecutive group of U8 code
    points begins at 0 or ends at 255. These end points cannot
    physically be exceeded, so there is no need to test for that end of
    the range. In several places this causes a mask operation to not be
    generated.

M       regcharclass.h
M       regen/regcharclass.pl

commit e0852f3512f003385dc2044cd72801d19a5d09a1
Author: Karl Williamson <k...@cpan.org>
Date:   Tue Apr 29 20:18:42 2014 -0600

    Revert "mktables: Remove obsolete sort constraint"

    This reverts commit cf13ddc5efb78ce6d588441978947303e5baf1d8, and
    adds a comment as to why.

M       lib/unicore/mktables

commit 47d55402463ad8a415e6df4ac876ebbd3d1dcf7d
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Sat Feb 22 16:36:28 2014 -0700

    mktables: Add comment

M       lib/unicore/mktables

commit 4f5ae060a1721838673a84f426434a5f8723c95f
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 21:50:52 2014 -0600

    regcomp.c: Fix comment

M       regcomp.c

commit 7804cc706d851e27349e6a1d786842edad090645
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 21:43:39 2014 -0600

    regcomp.c: Tighten up synthetic start class for Unicode characters

    When creating a synthetic start class (SSC) for EXACTFish nodes, in
    many instances prior to this patch we punted and assumed any code
    point could be matched. This patch actually goes to the trouble of
    figuring out what Unicode code points could possibly match, and
    uses those. The only time it punts now is when the node starts with
    a string of multiple characters that together could be the fold of
    a single one. It's too complicated getting all the possibilities
    for these, as there could be overlapping sequences that work.

M       regcomp.c

commit 149b961019cb5c6576d0e762812e11b1fe85f3fb
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 21:40:47 2014 -0600

    regcomp.c: Fix typo in comment

M       regcomp.c

commit f5c4d962cdbee7767dcc0fbbcc9169b1d25665a2
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 21:10:48 2014 -0600

    regcomp.c: Add assertion

    This is because coverity thinks this could happen, and jhi and khw
    don't

M       regcomp.c

commit 93af0eb983ee7bf00e61e820ca681ca1959f46b8
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 21:08:19 2014 -0600

    regcomp.c: Change a panic into an assert

    This really should never happen, so might as well be an assert.

M       regcomp.c

commit aba356c7fceb3499b7d3d69c9787ec6cbc4547bd
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 20:55:50 2014 -0600

    regcomp.c: Move code into a function

    This is in preparation for it to be called from another place

M       embed.fnc
M       embed.h
M       lib/diagnostics.t
M       pod/perldiag.pod
M       proto.h
M       regcomp.c

commit 9dc9d2924c4bae6b648946e643132cb28e98d495
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 19:53:39 2014 -0600

    XXX need commit # Remove obsolete code

    Commit XXX changed things so that this code (as its comments
    suggest) is now obsolete.

M       embed.h
M       proto.h
M       regcomp.c

commit 416927700cd9ef91d61a458919bef1446331d295
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 19:48:40 2014 -0600

    regcomp.c, regexec.c: Move common code to a function

    There are other cases where this functionality will be needed as
    well.

M       embed.fnc
M       regcomp.c
M       regexec.c

commit 03905cb4482f82f881ff67e97fd6af01aaf94d93
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 19:44:28 2014 -0600

    regen/regcharclass_multi_char_folds.pl: Add some comments

M       regen/regcharclass_multi_char_folds.pl

commit 03316451d50ea41d791ca96949609384743645b3
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 19:42:32 2014 -0600

    regen/regcharclass.pl: Don't generate macro twice

    Until this patch, this could happen if both 'safe' and 'fast' are
    specified with a cp macro.

M       regen/regcharclass.pl

commit 85ee1aebaef32d62c4d0ed00da3d920a27b89b53
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 28 14:10:37 2014 -0600

    PATCH: [perl #121630]: perlpod clarification

M       pod/perlpod.pod

commit b9e41ba075599baca9359d41a5ca22ad9e120e74
Author: Karl Williamson <k...@cpan.org>
Date:   Sun Apr 27 10:53:00 2014 -0600

    Optimize /[a-z]/ and /[A-Z]/

    These bracketed character classes can easily be optimized into
    smaller probably faster regops that match identically.

M       regcomp.c

commit 082e91e4476b9861362f296bb114f85087e42f31
Author: Karl Williamson <k...@cpan.org>
Date:   Sun Apr 27 10:26:58 2014 -0600

    /x in patterns now includes all \p{PatWS}

    This brings Perl regular expressions more into conformance with
    Unicode. /x now accepts 5 additional characters as white space. Use
    of these characters as literals under /x has been deprecated since
    5.18, so now we are free to change what they mean.

    This commit eliminates the static function that processes the old
    whitespace definition (and a generated macro that was used only for
    this), using the already existing one for the new definition. It
    refactors slightly the static function that skips comments to mesh
    better with the needs of its callers, and calls it in one place
    where before the code was essentially duplicated.

    p5p discussion starting in
    http://nntp.perl.org/group/perl.perl5.porters/214726 convinced me
    that the (?[ ]) comments should be terminated the same way as
    regular /x comments, and this was also done in this commit. No
    prior notice is necessary as this is an experimental feature.

M       embed.fnc
M       embed.h
M       pod/perldelta.pod
M       pod/perldiag.pod
M       pod/perlre.pod
M       proto.h
M       regcharclass.h
M       regcomp.c
M       regen/regcharclass.pl
M       t/re/pat.t
M       t/re/reg_mesg.t

commit 5e345404d3e93fae332270c917c757dd511ec5f0
Author: Karl Williamson <k...@cpan.org>
Date:   Sat Apr 26 11:27:52 2014 -0600

    Deprecate NBSP in \N{...} names

    This is currently allowed, but is non-graphic, and is
    indistinguishable from a regular space. I was the one who initially
    allowed it, and did so out of ignorance of the negative
    consequences of doing so. There is no other precedent for including
    it.

M       lib/_charnames.pm
M       lib/charnames.pm
M       pod/perldelta.pod
M       pod/perldiag.pod
M       regen/unicode_constants.pl
M       t/lib/charnames/alias
M       t/re/pat_advanced.t
M       toke.c
M       unicode_constants.h

commit fb4265afda221d8229835330db225ade439aeddf
Author: Karl Williamson <k...@cpan.org>
Date:   Sat Apr 26 11:07:11 2014 -0600

    Fix bug where charnames xlator doesn't return utf8

    A user-supplied charnames translator for \N{} processing doesn't
    have to return in UTF-8. Prior to this patch, the code in toke.c
    assumed it did.

M       t/re/pat_advanced.t
M       toke.c

commit ff007f7ad3f6c4d2f61077514e53f4e8246aa275
Author: Karl Williamson <k...@cpan.org>
Date:   Thu Apr 24 09:30:07 2014 -0600

    t/lib/charnames/alias: Add comment

M       t/lib/charnames/alias

commit c138dedf09fb4a646bdb2841524be5b04d1caf77
Author: Karl Williamson <k...@cpan.org>
Date:   Thu Apr 24 08:35:08 2014 -0600

    Fatalize deprecated \N{} definitions

    Having a sequence of multiple spaces in a charnames alias name
    definition or having trailing spaces in it have been deprecated
    since 5.18, and it is now time to make them fatal.

M       lib/_charnames.pm
M       pod/perldiag.pod
M       t/lib/charnames/alias
M       t/re/pat_advanced.t
M       toke.c

commit 285f87355b31cad20658c6f0940b2c033c9c1de0
Author: Karl Williamson <k...@cpan.org>
Date:   Wed Apr 23 20:33:12 2014 -0600

    charnames: Eliminate need to sync code in two places

    This refactors the code so that it doesn't have to be kept in sync
    with other code.

M       lib/_charnames.pm
M       lib/charnames.pm
M       toke.c

commit 3e7ab5a3b8bbcd9e0ad08446ebef0c6741f2b738
Author: Karl Williamson <k...@cpan.org>
Date:   Wed Apr 23 12:40:30 2014 -0600

    Fatalize splitting '(?' and '(*' in regexes

    Inserting spaces or comments between the '(' and '?' or '*' has
    been deprecated since v5.18.0.

M       pod/perldelta.pod
M       pod/perldiag.pod
M       regcomp.c
M       t/re/pat.t
M       t/re/reg_mesg.t

commit e4e7dea0f238b7f0a2350cdb28e8f2a77f934b6c
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 21 21:02:44 2014 -0600

    utfebcdic.h: Comment changes only

    Clarifications and typo fix.

M       utfebcdic.h

commit 82cc0786f55e6789defcff9ce82eed68be3aaee2
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 21 21:27:37 2014 -0600

    t/re/charset.t: Fix comment, test names

    Some of the test names have hard-coded values that are wrong. This
    uses a calculated value.

M       t/re/charset.t

commit 73186f6edea11572f229fb20e6577d722d7c92d0
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 21 20:05:57 2014 -0600

    handy.h: Comments, white-space only

M       handy.h

commit 439941618c4538e83296436c70743650cc4adaaf
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 21 20:00:16 2014 -0600

    handy.h: Use some common macros for ASCII/EBCDIC

    It turns out that the EBCDIC definitions can be made the same as
    the ASCII ones, so this moves the ASCII definitions to the spot
    where other ones common to the 2 platforms reside, and removes the
    EBCDIC ones. In other words it combines separate definitions into
    common ones.

M       handy.h

commit f4a4655ec554798425ad07fb14fcdeb814b7b5ff
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Mon Jun 24 21:14:37 2013 -0600

    regexec.c: Fix some EBCDIC problems

    We were testing for UTF-8 invariant, when we should have been
    testing for ASCII. This is a problem only on EBCDIC platforms,
    where they mean two different sets of code points.

M       regexec.c
M       t/re/re_tests

commit c73ec41c7e8a5e42724933c69b6dc41cec000827
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Sat Mar 30 21:13:38 2013 -0600

    Unicode::UCD.pm: Pod clarifications and nits

M       lib/Unicode/UCD.pm

commit 507412be075dee8c932136ba122d09f5507731e9
Author: Karl Williamson <k...@cpan.org>
Date:   Mon Apr 21 12:11:40 2014 -0600

    dquote_static.c: Clarify code

    We output the message when there is a printable ASCII character, so
    use that as the test instead of complementing the test for a
    control. This makes the code somewhat clearer.

M       dquote_static.c

commit 67d2b994585d46a9a65ef27a3c19e90072743298
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Tue Apr 8 19:27:59 2014 -0600

    "\cX" where X is non-printable is now fatal

    It had been deprecated, scheduled to be removed in 5.22.

M       dquote_static.c
M       pod/perldelta.pod
M       pod/perldiag.pod

commit d9b5342522e7af53dc4d3b80a47045934fb53ce7
Author: Karl Williamson <pub...@khwilliamson.com>
Date:   Tue Apr 8 18:46:25 2014 -0600

    Remove deprecated 'PL_sv_objcount'

    This was scheduled to be removed in 5.20, but was forgotten.

M       embedvar.h
M       intrpvar.h
M       pod/perldelta.pod

commit a3e3c0b0bff9929ba8f879eee5578638b394e310
Author: ni...@thykier.net <ni...@thykier.net>
Date:   Sat Mar 22 08:51:26 2014 -0700

    PATCH: [perl #121489] Optimization in pp_fttext

    Optimize out a call to PerlIO_get_cnt in pp_fttext

M       pp_sys.c

-----------------------------------------------------------------------

--
Perl5 Master Repository