On Tue, 15 Feb 2011 14:15:06 -0500, Rainer Schuetze <r.sagita...@gmx.de> wrote:


I think David has raised a good point here that seems to have been lost in the discussion about naming.

Please note that the C type for the machine-word integer was usually "int". The C standard only specifies a minimum bit-size for the different types (see for example http://www.ericgiguere.com/articles/ansi-c-summary.html). Most current C and C++ implementations have identical "int" sizes, but "long" now differs between platforms. This approach has failed and has caused many headaches when porting software from one platform to another. D has recognized this and has explicitly defined the bit-size of the various integer types. That's good!

Now, with size_t, the distinction between platforms creeps back into the language. It is everywhere across Phobos, be it as the length of ranges or the size of containers. This can become viral, as everything that comes into contact with these values might have to stick to size_t. Is this really desired?

Do you really want portable code? The thing is, size_t is specifically defined to be *the word size*, whereas C defines int only fuzzily ("should be at least 16 bits, and is recommended to be equivalent to the natural size of the machine"). size_t is *guaranteed* to be the same size on a given platform, even across different compilers.
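To put numbers on that, these asserts should hold on any D platform (a quick illustrative check, not from the spec):

// D pins these widths everywhere, unlike C's int/long:
static assert(short.sizeof == 2);
static assert(int.sizeof   == 4);
static assert(long.sizeof  == 8);
// Only the pointer-sized size_t tracks the architecture:
static assert(size_t.sizeof == (void*).sizeof);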

In addition, size_t isn't actually defined by the compiler; it's declared in the runtime library. So the library controls the size of size_t, not the compiler. This should make it extremely portable.
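It's just an alias in druntime's object.d, roughly along these lines (sketched from memory; the exact declarations may differ between releases):

// The compiler only supplies typeof(int.sizeof); the runtime library
// names it, so its width automatically follows the target's pointer size:
alias typeof(int.sizeof) size_t;
alias typeof(cast(void*)0 - cast(void*)0) ptrdiff_t;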

Consider saving an array to disk and then reading it back on another platform. How many bits should be written for the size of that array?

It depends on the protocol or file format definition. It should be irrelevant what platform/architecture you are on. Any format or protocol worth its salt will define what size integers you should store.

Then you need a protocol implementation that converts between the native size and the stored size.

This is just like network endianness vs. host endianness. You always use htonl and ntohl even if your platform has the same endianness as the network, because you want your code to be portable. Not using them is a no-no even if it works fine on your big-endian system.
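Something along these lines is all the format layer needs (an untested sketch; the helper names and the choice of a 32-bit big-endian length field are just assumptions for illustration):

import std.exception : enforce;

// Store the length in the format's fixed representation: 32-bit,
// big-endian, regardless of the platform's size_t width or endianness
// (the same idea as htonl).
ubyte[4] encodeLength(size_t len)
{
    enforce(len <= uint.max, "array too large for this format");
    immutable n = cast(uint) len;
    ubyte[4] b;
    b[0] = cast(ubyte)(n >> 24);
    b[1] = cast(ubyte)(n >> 16);
    b[2] = cast(ubyte)(n >> 8);
    b[3] = cast(ubyte) n;
    return b;
}

// And convert back to the native size when loading (like ntohl):
size_t decodeLength(const ubyte[4] b)
{
    return ((cast(uint) b[0]) << 24) | ((cast(uint) b[1]) << 16)
         | ((cast(uint) b[2]) << 8)  |  (cast(uint) b[3]);
}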

I don't have a perfect solution, but maybe built-in arrays could be limited to 2^^32-1 elements (or maybe 2^^31-1 to get rid of endless signed/unsigned conversions), so that the normal type to use is still "int". Ranges should adopt the type sizes of the underlying objects.

No, this is too limiting. If I have 64GB of memory (not out of the question), and I want to have a 5GB array, I think I should be allowed to. This is one of the main reasons to go to 64-bit in the first place.
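For scale: a 5GB byte array already has more elements than a 32-bit length can represent (5 * 2^^30 = 5,368,709,120, while uint.max is 4,294,967,295), so its length only fits in a 64-bit size_t:

// 5 GiB worth of bytes overflows any 32-bit length field:
static assert(5UL * 2^^30 > uint.max);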

-Steve
