Back to General and Gameplay Programming

Casting between different sized integer references

General and Gameplay Programming Programming

Started by Servant of the Lord July 16, 2013 08:58 PM

10 comments, last by EarthBanana 10 years, 9 months ago

Servant of the Lord

33,739

Author

July 16, 2013 08:58 PM

If you cast references between two integers of different sizes, say using reinterpret_cast, you start sinking into dangerous undefined behavior, right?


uint16_t originalInt = 5000;

uint8_t &smallerThanOriginalRef = reinterpret_cast<uint8_t&>(originalInt);
uint32_t &largerThanOriginalRef = reinterpret_cast<uint32_t&>(originalInt);

Would assigning to 'smallerThanOriginalRef' only set the first byte of 'originalInt'?

So this:


smallerThanOriginalRef = 123;

Wouldn't first clear the higher bytes of 'originalInt', and so 'originalInt' wouldn't be guaranteed to be 123, right?

And assigning to 'largerThanOriginalRef' would accidentally write on bytes outside of 'originalInt', which might write on memory that is used by other variables?

(in practice, those ints might be internally represented as a 32 bit or 64 bit integer anyway... but that's not guaranteed by the standard)

SiCrane

11,840

July 16, 2013 09:05 PM

Yup, undefined behavior territory. In practice what would happen depends on what endianness the platform is. Assigning to smallerThanOriginalRef might end up clearing either the higher or lower byte of originalInt.

LancerSolurus

630

July 16, 2013 09:34 PM

If you are sizing it up use the & to clean out the upper bits. Also you don't have to use reinterpret_cast, takes longer to type out than simply using v=(uint32_t)variable, which will do the exact same thing.

uint32_t largerThanOriginalRef = ((uint32_t)originalInt)&0xFFFF;

******************************************************************************************
Youtube Channel

ApochPiQ

23,138

July 16, 2013 09:43 PM

Also you don't have to use reinterpret_cast, takes longer to type out than simply using v=(uint32_t)variable, which will do the exact same thing.

And also has the wonderful benefit of making your evil cast impossible to find in a simple text search!

We should always favor saving a couple of keystrokes to make our lives more miserable down the road.

Wielder of the Sacred Wands
[Work - ArenaNet] [Epoch Language] [Scribblings]

LancerSolurus

630

July 16, 2013 09:47 PM

How so? I would simply search for (uint32_t) since I always put it in braces.

******************************************************************************************
Youtube Channel

Servant of the Lord

33,739

Author

July 16, 2013 10:13 PM

Also you don't have to use reinterpret_cast, takes longer to type out than simply using v=(uint32_t)variable, which will do the exact same thing.

My original use had to do with passing a uint32_t to a function wanting a uint8_t&, but then I realized the all problems involved with that.

LancerSolurus

630

July 17, 2013 09:01 AM

You would lose a huge amount of precision converting it to an eight bit value. If you really needed it 8 bit then extract it out in the routine itself, c++ normally passes everything as 32 bits anyways. Anything larger (such as strings) I always pass as a pointer reference using &... The only case I avoid doing this is for logging where I might send in a full string such as Log("Feature xxxx failed to work"); otherwise it is always a variable.

******************************************************************************************
Youtube Channel

samoth

9,833

July 17, 2013 11:06 AM

Interestingly, GCC 4.8 does not even warn about that code of yours although it's arguably in violation of the standard which says "A reference shall be initialized to refer to a valid object or function" (8.3.2) with "valid" being the important bit.

Since originalInt is not of a type that the new reference type can accomodate, it isn't a valid object (well, originalInt itself is a valid object, but result of the cast which the reference is initialized with isn't). You would think that this is obvious to the compiler, too. But maybe it's because of the cast operation. Probably the compiler assumes "programmer said cast, so he knows what he's doing".

King Mir

2,506

July 17, 2013 10:30 PM

It would be the same as if pointers were used.

uint16_t originalInt = 5000;
uint8_t &smallerThanOriginalRef = reinterpret_cast<uint8_t&>(originalInt);
uint32_t &largerThanOriginalRef = reinterpret_cast<uint32_t&>(originalInt);
 
smallerThanOriginalRef = 123;
largerThanOriginalRef = 1000000;

Should behave the same as:

uint16_t originalInt = 5000;
uint8_t *smallerThanOriginalPtr = reinterpret_cast<uint8_t*>(&originalInt);
uint32_t *largerThanOriginalPtr = reinterpret_cast<uint32_t*>(&originalInt);
 
*smallerThanOriginalPtr = 123;
*largerThanOriginalPtr = 1000000;

The smaller write would right to the beginning bits of the original int, and whether that's the low or high bits depends on endienness. The larger would write beyond the memory of the original int, possibly clobbering another stack variable.

Shannon Barber

1,684

July 19, 2013 05:53 AM

Unspecified behavior not undefined behavior.

Undefined behavior is always wrong code.

Unspecified behavior means it is defined by the implementation.

- The trade-off between price and quality does not exist in Japan. Rather, the idea that high quality brings on cost reduction is widely accepted.-- Tajima & Matsubara

Casting between different sized integer references

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Casting between different sized integer references

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Reticulating splines