In perl.git, the branch khw/ebcdic has been created

<http://perl5.git.perl.org/perl.git/commitdiff/09c649c845044175cb0100c80b58b26213a56abd?hp=0000000000000000000000000000000000000000>

        at  09c649c845044175cb0100c80b58b26213a56abd (commit)

- Log -----------------------------------------------------------------
commit 09c649c845044175cb0100c80b58b26213a56abd
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 15 12:53:57 2015 -0600

    XXX Make EBCDIC cmp work when both operands are UTF-8
    
    XXX fix indent

M       sv.c

commit 088bad0b1b9a1373721fcc65e2edeebb21ac532c
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 15 12:53:17 2015 -0600

    utf8.h: Add comment; white space changes

M       utf8.h

commit 5baf44b3f6628132152e3c2cff07630ad530ad93
Author: Karl Williamson <[email protected]>
Date:   Thu Aug 13 20:15:53 2015 -0600

    Revert "XXX Run Unicode's official normalization tests"

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
D       lib/Unicode/testnorm.t
D       lib/unicore/NormTest.txt
M       regcharclass.h

commit c1e2d13f27e0e772e3a647bc2cebb6b18ccf0c2c
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 22:00:53 2015 -0600

    XXX experimental: op/tr.t

M       t/op/tr.t

commit c6adb917d8716b59cfedca4860d4bd849f6f039d
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:35:26 2015 -0600

    XXX temporary

M       cpan/Encode/Encode.xs

commit 2c3e3e72aa3fbdc5c29cca193a099bb8f19274d3
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:17:08 2015 -0600

    XXX op/sort.t: Add more tests
    
    probably comment.  These should fail currently on ebcdic

M       t/op/sort.t

commit a611796b006b5debf3c638fe71699e53ff8db600
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 22:18:10 2015 -0600

    XXX Test Unicode::Collate and Unicode::Normalize

M       t/TEST

commit b59481fe8fe1b666630e4ccda5a1b0b45bae2238
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:20:44 2015 -0600

    offuni

M       toke.c
M       utf8.c
M       utf8.h
M       utfebcdic.h

commit 07da4f57f849e3ffddb12d766fc97cf77ff62405
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:21:25 2015 -0600

    invariant

M       utf8.h

commit 4f8ed8ae5fc9711eb17ce185fd07c111347a347d
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 22:15:18 2015 -0600

    Change EBCDIC macro definition
    
    This changes the definition of isUTF8_POSSIBLY_PROBLEMATIC() on EBCDIC
    platforms to use PL_charclass[] instead of PL_e2a[].  The new array is
    more likely to be in the memory cache.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utf8.h
M       utfebcdic.h

commit dfcd70157c07b34a701774143a9302eddc021fe3
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 09:02:51 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_SKIP() was defined the same in both ASCII and
    EBCDIC, but they expanded to different things.  Now, they are defined
    separately  -- to what they expand to, and the EBCDIC version is changed
    when all expanded out to use PL_charclass[] instead of PL_e2a[].  The
    new array is more likely to be in the memory cache.

M       utf8.h
M       utfebcdic.h

commit dcf8bf74b12943cc4f6423562505a87b08e164ec
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:43:40 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_IS_INVARIANT() was defined the same in both
    ASCII and EBCDIC, but they expanded to different things.  Now, they are
    defined separately to what they expand to, and the EBCDIC version is
    changed when all expanded out to use PL_charclass[] instead of PL_e2a[].
    The new array is more likely to be in the memory cache.

M       utf8.h
M       utfebcdic.h

commit 8506807c4c766fcc2eb9d8ada991a09b61fb79d0
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:31:19 2015 -0600

    utf8.h: Change defn of UNI_IS_INVARIANT
    
    This changes it to be isASCII(), instead of repeating the "special"
    number 0x80.

M       utf8.h

commit b8153934111bc16488214e9414fb14da25fc4164
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:49:21 2015 -0600

    Remove no longer used #define
    
    The previous commit removed all uses of this non-public #define.

M       regen/unicode_constants.pl
M       unicode_constants.h

commit 7fb502c30ef3cd57513cdd207a237d59e1b823ab
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:48:23 2015 -0600

    Change filter of problematic code points for EBCDIC
    
    There are three classes of problematic Unicode code points that may
    require special handling.  Which code points are problematic is fairly
    complicated, requiring lots of branches.  However, the smallest of them
    is 0xD800, which means that most code points in modern use are below
    them all, and a single test can be used to exclude just about everything
    likely to be encountered.  The problem was that the way this test was
    done on EBCDIC caused way too many things to pass and have to be checked
    with the more complicated branches.  The digits 0-9 and some capital
    letters were not filtered out.  This commit changes the EBCDIC test to
    transform into I8 (an array lookup), and this fixes it to exclude things
    that shouldn't have passed before.

M       utf8.c
M       utf8.h

commit 427f148f0dd00fbe14d22fcb4afbeaa61fdfecf1
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:35:45 2015 -0600

    Change some UTF-EBCDIC macro handling defns
    
    This commit changes the definitions of some macros for UTF-8 handling on
    EBCDIC platforms.  The previous definitions transformed the bytes into
    I8 and did tests on the transformed values.  The change is to use
    previously unused bits in l1_char_class_tab.h so the transform isn't
    needed, and generally only one branch is.  These macros are called from
    the inner loops of, for example, regex backtracking.

M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utfebcdic.h

commit 9bfcb1d3fb7d8c42a81344b84ba384bc93304390
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:23:12 2015 -0600

    l1_char_class_tab.h: Add bits for UTF-EBCDIC
    
    This is for the next commit.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl

commit 14d14d9c55956808c1ebe51c3f17f72740a4f223
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:21:25 2015 -0600

    regen/mk_PL_charclass.pl: Refactor a print
    
    This is in preparation for the next commits.

M       regen/mk_PL_charclass.pl

commit a4bb21e8456bceaee0923299d83320e47f5fb9c9
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:59:54 2015 -0600

    Add macro for converting Latin1 to UTF-8, and use it
    
    This adds a macro that converts a code point in the ASCII 128-255 range
    to UTF-8, and changes existing code to use it when the range is known to
    be restricted to this one, rather than the previous macro which accepted
    a wider range (any code point representable by 2 bytes), but had an
    extra test on EBCDIC platforms, hence was larger than necessary and
    slightly slower.

M       handy.h
M       hv.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c
M       utf8.h

commit d54bb224c7f6d07df39980bec20554bb3915aebd
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:55:30 2015 -0600

    utf8.h: Add assertions to macro

M       utf8.h

commit f60196f8ad22d3da6eb7dc12e1bc69826446a18c
Author: Karl Williamson <[email protected]>
Date:   Wed May 13 17:38:08 2015 -0600

    Change to use UVCHR_SKIP over UNI_SKIP
    
    UNI_SKIP is somewhat ambiguous.  Perl has long used 'uvchr' as part of a
    name to mean the unsigned values using the native character set plus
    Unicode values for those above 255.
    
    This also changes two calls (one in dquote_static.c and one in
    dquote_inline.h) to use UVCHR_SKIP; they should not have been OFFUNI, as
    they are dealing with native values.

M       dquote.c
M       dquote_inline.h
M       op.c
M       perl.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c

commit f7d0575ba50f51279a022feea53344b3966892f2
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 08:52:52 2015 -0600

    XXX Run Unicode's official normalization tests

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
A       lib/Unicode/testnorm.t
A       lib/unicore/NormTest.txt
M       regcharclass.h

commit fbac77a6091f4778dda112e6468bbeee51b8f368
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 11:16:18 2015 -0600

    lib/utf8.t: EBCDIC fixes
    
    Some of the test chose code points that did not match its assumptions as
    to their classifications.
    
    And some of the tests were extended to work on 1047 EBCDIC

M       lib/utf8.t

commit 884c333a041ce21f20396045b8edf437a98e8bd8
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:47:50 2015 -0600

    lib/utf8.t: Add some tests to stress EBCDIC
    
    The tests for the Latin1 \xFF aren't a fair test of UTF-8 on EBCDIC
    platforms, because it is generally a UTF-8 invariant character, so is
    the same regardless of being in UTF-8 or not.  This adds some tests
    where the UTF-EBCDIC version is 2 bytes (as well as the UTF-8 version).

M       lib/utf8.t

commit df67424686b4b0a6ff51f2aaefc644a69de4a43f
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:45:10 2015 -0600

    XXX t/uni/lex_utf8.t: Do some of the tests on EBCDIC
    
    XXX prob. the \xA2 and \377 will fail

M       t/uni/lex_utf8.t

commit bba4d8de19190925e48b6ef79dd0f115eb8d17dc
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:30:57 2015 -0600

    t/op/utf8decode.t: Better SKIP message

M       t/op/utf8decode.t

commit 681d3eeabf9302480f192c0aa4fc7d40f790059a
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:24:11 2015 -0600

    XXX experimental t/op/tr.t

M       t/op/tr.t

commit b7ad5869e894944178edf6b9a1c9cc5903fd8e22
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:18:51 2015 -0600

    XXX experimental t/op/split.t

M       t/op/split.t

commit e378da3a1a25304b959a2577996b343ac1509b38
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:12:14 2015 -0600

    t/op/chr.t: Better SKIP message

M       t/op/chr.t

commit 90a777e767cd3f6dfa83b0eca988f626b6d2bec4
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:02:08 2015 -0600

    t/base/lex.t: Use more standard test for EBCDIC
    
    This makes it easier to grep for these things.  The typical test is for
    the ord("A"), not some other character.  Since this is in t/base, it
    doesn't use helper scripts.

M       t/base/lex.t

commit ec4695d0c160a680f823aea6c9d714b1bf7e68d3
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 09:52:59 2015 -0600

    XXX t/io/utf8.t: Experimental

M       t/io/utf8.t

commit 8f6232ea3bac01595fc35ae1dee7f21c9ded1c22
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 08:49:37 2015 -0600

    XXX japh/abigail.t
    
    Experiment with running on EBCDIC, and using test.pl's skip()

M       t/japh/abigail.t

commit 350f9134c6753ceb6bf92190d128b5ce17bb8431
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 13:34:07 2015 -0600

    t/op/tr.t: Clarify skip tests message

M       t/op/tr.t

commit 0024e616c7c4e022668df61117c1292f1410897c
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:35:12 2015 -0600

    perlapi: Nits

M       sv.c
M       util.c

commit 78d96f29e3d741928d591be4559ee08bb22b50ac
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:25:33 2015 -0600

    XXX look for more has X bit set

M       pad.c
M       sv.c

commit 7b3836d7250fd314b48ffe02f18e1b0b6dc41cfe
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:22:32 2015 -0600

    XXX look for more perlapi: Add L<>

M       op.h

commit 4433b4a3111920db26f9d21748893093f3219be8
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:51 2015 -0600

    perlapi: Add link

M       hv.c

commit bff7a460553a192756c770c4db4641a96154f501
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:17 2015 -0600

    XXX look for more perlapi UTF-8

M       gv.c
M       hv.c
M       sv.h

commit 24cd42bb4277fe280a75f5b3465cc1df1dfe6123
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:19:13 2015 -0600

    XXX look for more eg to e.g.

M       cv.h
M       mg.c

commit 2d0924b375d17ee718c4d3f3f227ef2c1efe9c9b
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:17:48 2015 -0600

    XXX check and look more to come Add S<>

M       av.c
M       mg.c
M       op.c
M       pad.c
M       sv.c
M       utf8.h
M       util.h

commit 2c1037cded427ba336db2592c02f8a3cc1c2a7ad
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:12:55 2015 -0600

    vutil.c Nits, C<> L<>, XXX cpan upstream

M       vutil.c

commit 141478f098ad9f15865f58c5a74d49384be271bf
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:10:16 2015 -0600

    XXX C<> for mro.xs

M       ext/mro/mro.xs

commit 17e613b539a805b8644fef339a0a13a3d95d8db0
Author: Karl Williamson <[email protected]>
Date:   Thu May 7 10:58:54 2015 -0600

    XXX perlapi: Add C<> around
    
    Look through the code again, like for NUL(L)?
    
    Removes 'the' in front of parameter in some instances.

M       XSUB.h
M       av.c
M       dump.c
M       gv.c
M       handy.h
M       hv.c
M       hv.h
M       inline.h
M       intrpvar.h
M       mathoms.c
M       mg.c
M       mro_core.c
M       numeric.c
M       op.c
M       op.h
M       pad.c
M       pad.h
M       perl.c
M       pp_ctl.c
M       pp_pack.c
M       pp_sort.c
M       pp_sys.c
M       regexp.h
M       sv.c
M       sv.h
M       utf8.c
M       util.c

commit 2f63e9297cfd14a34d11e0e529eee54f6ab3fd6c
Author: Karl Williamson <[email protected]>
Date:   Fri Apr 3 12:06:39 2015 -0600

    Various .t files: Use globals to see if on EBCDIC
    
    These globals are already available; by using them instead of rolling
    our own, it makes it easer to grep for these kinds of instances.

M       lib/bytes.t
M       lib/utf8.t
M       t/op/chr.t
M       t/op/lex.t
M       t/op/oct.t
M       t/op/split.t
M       t/op/sub_lval.t
M       t/op/tr.t
M       t/op/vec.t
M       t/op/ver.t

commit 62c32849a20408a6c6d3b4e9cd3bf093bc6121cb
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 10:02:19 2015 -0600

    Revert "XXX Temp Configure to point to bash"
    
    This reverts commit

M       Configure

commit 1d5f3a2b220c194501b7f79e49246ae9b44fda32
Author: Karl Williamson <[email protected]>
Date:   Wed Dec 17 13:28:56 2014 -0700

    XXX Temp Configure to point to bash

M       Configure
-----------------------------------------------------------------------

--
Perl5 Master Repository

Reply via email to