In perl.git, the branch khw/ebcdic has been created

<http://perl5.git.perl.org/perl.git/commitdiff/ea5293216892b52c7c96aa4fd99a06d256e794a9?hp=0000000000000000000000000000000000000000>

        at  ea5293216892b52c7c96aa4fd99a06d256e794a9 (commit)

- Log -----------------------------------------------------------------
commit ea5293216892b52c7c96aa4fd99a06d256e794a9
Author: Karl Williamson <[email protected]>
Date:   Thu Aug 13 20:15:53 2015 -0600

    Revert "XXX Run Unicode's official normalization tests"

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
D       lib/Unicode/testnorm.t
D       lib/unicore/NormTest.txt
M       regcharclass.h

commit f8160831380b6b9e164fe8d7f14c7040863ae131
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 22:00:53 2015 -0600

    XXX experimental: op/tr.t

M       t/op/tr.t

commit b20a1b6b29ce53e5c5b2f062032fe173143a1799
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:35:26 2015 -0600

    XXX temporary

M       cpan/Encode/Encode.xs

commit 4ed49db456730b392b0f4773b736073e58529055
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:17:08 2015 -0600

    XXX op/sort.t: Add more tests
    
    probably comment.  These should fail currently on ebcdic

M       t/op/sort.t

commit 98496d36fd496af1fee1c3426ec91bdcb69dee5a
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 22:18:10 2015 -0600

    XXX Test Unicode::Collate

M       t/TEST

commit d12b331835940e9a10b500177799ca44bea408bc
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:20:44 2015 -0600

    offuni

M       toke.c
M       utf8.c
M       utf8.h
M       utfebcdic.h

commit 1607e6685bd8fc234d19aa38f1a8e4ec15c44dad
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:21:25 2015 -0600

    invariant

M       utf8.h

commit e0aaeaa9ac31005d2bce9dd0ba9a6b2109b01161
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 22:15:18 2015 -0600

    Change EBCDIC macro definition
    
    This changes the definition of isUTF8_POSSIBLY_PROBLEMATIC() on EBCDIC
    platforms to use PL_charclass[] instead of PL_e2a[].  The new array is
    more likely to be in the memory cache.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utf8.h
M       utfebcdic.h

commit de51dd695c4ed379dd1214ef8bc044c31a9fd433
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 09:02:51 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_SKIP() was defined the same in both ASCII and
    EBCDIC, but they expanded to different things.  Now, they are defined
    separately  -- to what they expand to, and the EBCDIC version is changed
    when all expanded out to use PL_charclass[] instead of PL_e2a[].  The
    new array is more likely to be in the memory cache.

M       utf8.h
M       utfebcdic.h

commit dd6a4d0f2c2ab65956abfa82a0034ad215ac1e33
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:43:40 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_IS_INVARIANT() was defined the same in both
    ASCII and EBCDIC, but they expanded to different things.  Now, they are
    defined separately to what they expand to, and the EBCDIC version is
    changed when all expanded out to use PL_charclass[] instead of PL_e2a[].
    The new array is more likely to be in the memory cache.

M       utf8.h
M       utfebcdic.h

commit fda82a60edf5c4540b9176d12a3c25271978effd
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:31:19 2015 -0600

    utf8.h: Change defn of UNI_IS_INVARIANT
    
    This changes it to be isASCII(), instead of repeating the "special"
    number 0x80.

M       utf8.h

commit db90d0568eae7b735ab18a5267d20a4ce751c54f
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:49:21 2015 -0600

    Remove no longer used #define
    
    The previous commit removed all uses of this non-public #define.

M       regen/unicode_constants.pl
M       unicode_constants.h

commit ef7b8a7b925179e2c8e6a6ba525e8740bca4e04b
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:48:23 2015 -0600

    Change filter of problematic code points for EBCDIC
    
    There are three classes of problematic Unicode code points that may
    require special handling.  Which code points are problematic is fairly
    complicated, requiring lots of branches.  However, the smallest of them
    is 0xD800, which means that most code points in modern use are below
    them all, and a single test can be used to exclude just about everything
    likely to be encountered.  The problem was that the way this test was
    done on EBCDIC caused way too many things to pass and have to be checked
    with the more complicated branches.  The digits 0-9 and some capital
    letters were not filtered out.  This commit changes the EBCDIC test to
    transform into I8 (an array lookup), and this fixes it to exclude things
    that shouldn't have passed before.

M       utf8.c
M       utf8.h

commit b9fb28c3cd6da4af660316a12a2672e4f882a569
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:35:45 2015 -0600

    Change some UTF-EBCDIC macro handling defns
    
    This commit changes the definitions of some macros for UTF-8 handling on
    EBCDIC platforms.  The previous definitions transformed the bytes into
    I8 and did tests on the transformed values.  The change is to use
    previously unused bits in l1_char_class_tab.h so the transform isn't
    needed, and generally only one branch is.  These macros are called from
    the inner loops of, for example, regex backtracking.

M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utfebcdic.h

commit 99258827e8d07cab58870e33bef71793154b816c
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:23:12 2015 -0600

    l1_char_class_tab.h: Add bits for UTF-EBCDIC
    
    This is for the next commit.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl

commit 29f7798250bd8a08c2797050c0c844fbd6bc906e
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:21:25 2015 -0600

    regen/mk_PL_charclass.pl: Refactor a print
    
    This is in preparation for the next commits.

M       regen/mk_PL_charclass.pl

commit 01dc574d5b52dcea4dec02a6e54fe0dc87dab19d
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:59:54 2015 -0600

    Add macro for converting Latin1 to UTF-8, and use it
    
    This adds a macro that converts a code point in the ASCII 128-255 range
    to UTF-8, and changes existing code to use it when the range is known to
    be restricted to this one, rather than the previous macro which accepted
    a wider range (any code point representable by 2 bytes), but had an
    extra test on EBCDIC platforms, hence was larger than necessary and
    slightly slower.

M       handy.h
M       hv.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c
M       utf8.h

commit d7b0d7350194383f808b7a7c75796a752f5df6a6
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:55:30 2015 -0600

    utf8.h: Add assertions to macro

M       utf8.h

commit ef4f9e3cbe8080cc01806f9ff386cd6c5053b552
Author: Karl Williamson <[email protected]>
Date:   Wed May 13 17:38:08 2015 -0600

    Change to use UVCHR_SKIP over UNI_SKIP
    
    UNI_SKIP is somewhat ambiguous.  Perl has long used 'uvchr' as part of a
    name to mean the unsigned values using the native character set plus
    Unicode values for those above 255.
    
    This also changes two calls (one in dquote_static.c and one in
    dquote_inline.h) to use UVCHR_SKIP; they should not have been OFFUNI, as
    they are dealing with native values.

M       dquote.c
M       dquote_inline.h
M       op.c
M       perl.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c

commit c6aafac7b4ae267752bd58c7f644dc93c2de16c7
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 08:52:52 2015 -0600

    XXX Run Unicode's official normalization tests

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
A       lib/Unicode/testnorm.t
A       lib/unicore/NormTest.txt
M       regcharclass.h

commit e42af1e40810f65c887fe21faa1c3fe9c6c81721
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 11:16:18 2015 -0600

    lib/utf8.t: EBCDIC fixes
    
    Some of the test chose code points that did not match its assumptions as
    to their classifications.
    
    And some of the tests were extended to work on 1047 EBCDIC

M       lib/utf8.t

commit f8f7c410fe54a5f8cd174049b5ced16591b1dc70
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:47:50 2015 -0600

    lib/utf8.t: Add some tests to stress EBCDIC
    
    The tests for the Latin1 \xFF aren't a fair test of UTF-8 on EBCDIC
    platforms, because it is generally a UTF-8 invariant character, so is
    the same regardless of being in UTF-8 or not.  This adds some tests
    where the UTF-EBCDIC version is 2 bytes (as well as the UTF-8 version).

M       lib/utf8.t

commit e0088aa4f4106894d3f3ab5a00d58026785455d5
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:45:10 2015 -0600

    XXX t/uni/lex_utf8.t: Do some of the tests on EBCDIC
    
    XXX prob. the \xA2 and \377 will fail

M       t/uni/lex_utf8.t

commit 27ea727d9acc78d65fd1df321abee02a01689035
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:30:57 2015 -0600

    t/op/utf8decode.t: Better SKIP message

M       t/op/utf8decode.t

commit 42155b8c05fc6b8b9fce1fa7aef6fe1b9eed0a7a
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:24:11 2015 -0600

    XXX experimental t/op/tr.t

M       t/op/tr.t

commit 9830a50bc57dae127c9d3d5aa16d4c348e0f9338
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:18:51 2015 -0600

    XXX experimental t/op/split.t

M       t/op/split.t

commit bf0c8131384d48b164aade2d26922cc83a0a13dd
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:12:14 2015 -0600

    t/op/chr.t: Better SKIP message

M       t/op/chr.t

commit cabd4e0c1e7131f314785eb39b0f1d3fef685708
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:02:08 2015 -0600

    t/base/lex.t: Use more standard test for EBCDIC
    
    This makes it easier to grep for these things.  The typical test is for
    the ord("A"), not some other character.  Since this is in t/base, it
    doesn't use helper scripts.

M       t/base/lex.t

commit ea6ba319e0338fdb5d03f38faaad6bc9460b4709
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 09:52:59 2015 -0600

    XXX t/io/utf8.t: Experimental

M       t/io/utf8.t

commit 1cbfa5744ce6d8304aeac502a679dd1a3824e216
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 08:49:37 2015 -0600

    XXX japh/abigail.t
    
    Experiment with running on EBCDIC, and using test.pl's skip()

M       t/japh/abigail.t

commit ae4a07d83866e333014dfd694f25d5484ac83f78
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 13:34:07 2015 -0600

    t/op/tr.t: Clarify skip tests message

M       t/op/tr.t

commit a98e59b11be9d40e564fd3db47ff52dbd35de6d8
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:35:12 2015 -0600

    perlapi: Nits

M       sv.c
M       util.c

commit 9b7c129fde2e354f5092745a2ff30ac0e0b5b456
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:25:33 2015 -0600

    XXX look for more has X bit set

M       pad.c
M       sv.c

commit 609d4d061deb8b7dcd8a6ea927cbdbd503b9c74f
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:22:32 2015 -0600

    XXX look for more perlapi: Add L<>

M       op.h

commit 0fad8596fa4896ae85c4b6e2aa1bdbc86ac97e1a
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:51 2015 -0600

    perlapi: Add link

M       hv.c

commit e07d30e8e78cfa64a58dfa63d394b091cf10510d
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:17 2015 -0600

    XXX look for more perlapi UTF-8

M       gv.c
M       hv.c
M       sv.h

commit d4469457c31cec7e497041cdbfe55c6b8921348a
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:19:13 2015 -0600

    XXX look for more eg to e.g.

M       cv.h
M       mg.c

commit 8bedd4ab210b951d159adb976a89ff7dd38a44ef
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:17:48 2015 -0600

    XXX check and look more to come Add S<>

M       av.c
M       mg.c
M       op.c
M       pad.c
M       sv.c
M       utf8.h
M       util.h

commit 21c9744e052e05fdf83474da62b5bcaec72d20dc
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:12:55 2015 -0600

    vutil.c Nits, C<> L<>, XXX cpan upstream

M       vutil.c

commit 614e111097fe807923e70af492620fb9b66545b8
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:10:16 2015 -0600

    XXX C<> for mro.xs

M       ext/mro/mro.xs

commit 6135e3641842d845ace174db5f01b5e3daf0f8ea
Author: Karl Williamson <[email protected]>
Date:   Thu May 7 10:58:54 2015 -0600

    XXX perlapi: Add C<> around
    
    Look through the code again, like for NUL(L)?
    
    Removes 'the' in front of parameter in some instances.

M       XSUB.h
M       av.c
M       dump.c
M       gv.c
M       handy.h
M       hv.c
M       hv.h
M       inline.h
M       intrpvar.h
M       mathoms.c
M       mg.c
M       mro_core.c
M       numeric.c
M       op.c
M       op.h
M       pad.c
M       pad.h
M       perl.c
M       pp_ctl.c
M       pp_pack.c
M       pp_sort.c
M       pp_sys.c
M       regexp.h
M       sv.c
M       sv.h
M       utf8.c
M       util.c

commit 8141c89fa3917dbbd16dbe0c29edad0f6554a68a
Author: Karl Williamson <[email protected]>
Date:   Fri Apr 3 12:06:39 2015 -0600

    Various .t files: Use globals to see if on EBCDIC
    
    These globals are already available; by using them instead of rolling
    our own, it makes it easer to grep for these kinds of instances.

M       lib/bytes.t
M       lib/utf8.t
M       t/op/chr.t
M       t/op/lex.t
M       t/op/oct.t
M       t/op/split.t
M       t/op/sub_lval.t
M       t/op/tr.t
M       t/op/vec.t
M       t/op/ver.t

commit 16773f95a9690f22abd20c28bc6f65abd0d5db27
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 10:02:19 2015 -0600

    Revert "XXX Temp Configure to point to bash"
    
    This reverts commit

M       Configure

commit 6177ad5d0db94f0dd41087e8d1370cdef3a477a2
Author: Karl Williamson <[email protected]>
Date:   Wed Dec 17 13:28:56 2014 -0700

    XXX Temp Configure to point to bash

M       Configure
-----------------------------------------------------------------------

--
Perl5 Master Repository

Reply via email to