Re: Lost information converting decoded value to in32_t

Riccardo Mottola Mon, 25 May 2020 14:56:48 -0700

Hi,

after hacking an evening with this with Fred, I have an update.


I hope Richard, Wolfgang, David... can chime in.

I opened a PR with a proposed solution which "works"

On 2020-05-25 07:50:26 +0000 Fred Kiefer <fredkie...@gmx.de> wrote:

Looks like your archive has a value that gets encodes as an NSIntegerandthis is 32 bit on your current machine but the original encoded valuewasactually 64 bit and uses more than the 32 bit you are able to read.The bestthing to do is to find out which value it is that causes thebehaviour andencode (and decode) it as a long.
And to answer your question, the message itself is surely not a bug,it justshows how clever base tries to deal with different encoded values.The actualencoding may be a bug. There it would really help to know where it iscomingfrom.


The issue is in NSUnarchiver itself, around line 1401.
The unarchived NSInteger coming from the gorm file is just "2":

However the tests fail, even with simulations:
(gdb) p big
$1 = 2
(gdb) p big > 2147483647
$2 = 0
(gdb) p big < -2147483648
$3 = 1

What is interesting, if gdb follows the same literal rules andpromotions of GCC:


(gdb) p  (int64_t)-2147483648
$8 = 2147483648
(gdb) p  (int64_t)-2147483648l
$9 = 2147483648
(gdb) p  (int64_t)-2147483648L
$10 = 2147483648

(gdb) p  (int64_t)-2147483647
$15 = -2147483647

Whatever I do as a literal suffix, it comes out positive. One valueless and it works.

Here I am at loss with types and constants.

Looking up on the internet, I found the explanation is that theliteral is only the number part without signed, regarless if it is asigned or unsigned decimal. Then the - operator is performed. For thisreason it is usually written as (-2147483647 -1) in the limits headerfiles.With that explanation, it is promoted to the "unsinged" type and thenthe "-" operation fails and underflows.

For that reason I propose to change the lower bounds check to anequivalent easy to read

            if (big > 2147483647 || big + 2147483648 < 0

instead of the "minus 1" trick.

I wanted to open a PR on an "fix_32bit_decode" branch, butapparently... everything got pushed to master without the branch.Sorry.

So please if you disagree, revert and fix with the style you prefer.Also, perhaps, technically, we should apply the same style for int8-tand int16_t although we perhaps will never encounter that becauseminimum literal type is "int".


Riccardo

Re: Lost information converting decoded value to in32_t

Reply via email to