Best way to check endianness at run-time?

17 comments, last by wintertime 11 years, 1 month ago

The same code can generally compile and run without problems on systems with diverse endianness (provided it's compiled for the appropriate targets). It's usually only something you need to worry about when exchanging data between machines, and in that case you're better off using a compile-time macro that ensures the data is in the correct format, or converts it if it isn't.

In order to encounter endianness problems in the code itself, you'd have to resort to the same flavor of hackery that's going on in your test function there, which probably isn't a good idea, and at the least should be a rare occurrence.

Long story short, if it's not causing errors then don't mess with it.
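For reference, the kind of run-time probe the thread title is asking about usually looks something like this (a sketch only, using memcpy rather than the pointer-cast hackery mentioned above; the function name is just illustrative):

```c
#include <stdint.h>
#include <string.h>

/* Run-time endianness probe: store a known 32-bit value and
   inspect the first byte of its in-memory representation.
   memcpy avoids the aliasing problems of a raw pointer cast. */
static int is_little_endian(void)
{
    uint32_t probe = 1u;
    unsigned char first;
    memcpy(&first, &probe, 1);
    return first == 1u;  /* low-order byte stored first => little-endian */
}
```

As the posters below argue, though, code that needs to ask this question at run time is usually better restructured so the question never comes up.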

void hurrrrrrrr() {__asm sub [ebp+4],5;}

There are ten kinds of people in this world: those who understand binary and those who don't.

As has also been mentioned above, there is a risk that you are falling into a common trap here. Data (or protocols) can be little-endian or big-endian, but you should never need to worry about what the current machine/system uses.

Please read The byte order fallacy, which I think summarizes it all perfectly!
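The article's core trick can be sketched in a few lines: decode the stream's byte order with shifts, and the host's endianness never enters the picture. For a little-endian stream (function name is just illustrative):

```c
#include <stdint.h>

/* Decode a 32-bit value from a little-endian byte stream.
   The shifts express the stream's layout, not the machine's,
   so this compiles and behaves identically on any host. */
static uint32_t read_u32_le(const unsigned char *p)
{
    return  (uint32_t)p[0]
          | ((uint32_t)p[1] << 8)
          | ((uint32_t)p[2] << 16)
          | ((uint32_t)p[3] << 24);
}
```

Note the casts to uint32_t: they keep the shifts well defined even where plain int promotion would be too narrow or signed.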

Current project: Ephenation.
Sharing OpenGL experiences: http://ephenationopengl.blogspot.com/

Nice link. That sums it up quite definitively.


Or just use htonl/ntohl etc. to write the values in a standardised order. That code should work on all machines.
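A minimal sketch of that approach (POSIX htonl/ntohl from arpa/inet.h; on Windows the same functions live in winsock2.h):

```c
#include <stdint.h>
#include <arpa/inet.h>  /* htonl / ntohl */

/* Round-trip a value through network (big-endian) byte order.
   Whatever the host's endianness, ntohl(htonl(x)) == x, and the
   on-the-wire representation is the same everywhere. */
static uint32_t to_wire(uint32_t host)   { return htonl(host); }
static uint32_t from_wire(uint32_t wire) { return ntohl(wire); }
```

On a big-endian host both calls compile down to no-ops; on a little-endian host they byte-swap, but the calling code never needs to know which.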

I would be wary of pretending to support platforms unless you build and test on them. I don't mean to say that you should write unportable code, but don't think that a patchwork of untested preprocessor branches is writing portable code.

Thanks for all the info, you've all given me different aspects to consider when I decide to switch my map format over to binary. Think I'm gonna save that task for after mid-terms though.

As has also been mentioned above, there is a risk that you are falling into a common trap here. Data (or protocols) can be little-endian or big-endian, but you should never need to worry about what the current machine/system uses.

Please read The byte order fallacy, which I think summarizes it all perfectly!

I was also going to link to that article, but you beat me to it. For the use case being described here, I think it's pretty accurate. However, as much respect as I have for all the Bell Labs folks, there is more to the story. In an ideal world where computers are infinitely fast, reading and writing every multi-byte value one byte at a time may be great. For those of us making games that need to load hundreds of MBs with minimal delay -- not so much. I'd love to ignore the endianness of each platform we support and serialize all data in one consistent format. But then we'd be iterating over all of that data at load time, instead of loading up a block of memory and doing pointer fixup.

All of that said, our solution is exactly as frob has indicated: we just #define it at compile time. We need to declare lots of things about each platform we support; we just add endianness to the list. It's really not hard. The #if/#elif chain for each of those decisions always looks like:


#if PLATFORM_WIN64
#define LITTLE_ENDIAN
#elif PLATFORM_PS3
#define BIG_ENDIAN
#else
#error Need to specify endianness for new platform!
#endif
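A sketch of how such a macro might then get used when loading little-endian file data (the macro and function names here are illustrative, not the poster's actual codebase; the swap is skipped entirely on matching targets):

```c
#include <stdint.h>

/* Would normally come from an #if/#elif chain like the one above. */
#define PLATFORM_IS_LITTLE_ENDIAN 1

/* Reverse the bytes of a 32-bit value. */
static uint32_t swap32(uint32_t v)
{
    return  (v >> 24)
         | ((v >> 8) & 0x0000FF00u)
         | ((v << 8) & 0x00FF0000u)
         |  (v << 24);
}

/* Convert a value read from a little-endian file to host order:
   a no-op on little-endian targets, a byte swap on big-endian ones. */
static uint32_t file_le_to_host(uint32_t v)
{
#if PLATFORM_IS_LITTLE_ENDIAN
    return v;
#else
    return swap32(v);
#endif
}
```

This is what makes the "load a block of memory and do pointer fixup" approach above workable: on the common target the conversion costs nothing.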

I have some issues with that "The byte order fallacy" post, and since I can't post on his blog, I'm going to leave my comments here:

1. He claims his code "computes a 32-bit integer value regardless of the local size of integers" which is not quite correct. If an int is not 32-bits (particularly, if it's less than 25 bits), his code is invoking undefined behavior (shifting an N-bit number by N or more bits is undefined behavior).

2. He only talks about the simplest case: reading/writing ints. Other data types, like float or double, can't be magically written or read like he shows.

I can't think of other nitpicks. In general, I think he makes a good point, but I think he's being overly critical of the endian-specific method while not being critical enough of his own version. If he's going to be critical, he should be critical of both.

[ I was ninja'd 71 times before I stopped counting a long time ago ] [ f.k.a. MikeTacular ] [ My Blog ] [ SWFer: Gaplessly looped MP3s in your Flash games ]

Both of your problems apply not only to the cleaner always-shifting code, but also to the #ifdef BIGENDIAN twiddling, and there the other branch is maybe not even tested.

1. That can easily be fixed in both by explicitly casting each byte to uint32_t before shifting (unsigned, so shifting into the top byte is well defined).

2. For those you would just reinterpret the bits, ideally with memcpy into an integer of the same size rather than a raw pointer cast (which violates strict aliasing). Floats and doubles are ultimately just some data bytes you read in, too.
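That bit-reinterpretation can be sketched like this, assuming IEEE-754 floats (function names are illustrative): copy the float's bits into a uint32_t with memcpy, then serialize that integer like any other.

```c
#include <stdint.h>
#include <string.h>

/* Serialize a float as 4 little-endian bytes by copying its bits
   into a uint32_t first. memcpy sidesteps the strict-aliasing
   problems a raw pointer cast would raise, and compilers optimize
   it away. */
static void write_float_le(float f, unsigned char out[4])
{
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);
    out[0] = (unsigned char)(bits);
    out[1] = (unsigned char)(bits >> 8);
    out[2] = (unsigned char)(bits >> 16);
    out[3] = (unsigned char)(bits >> 24);
}

/* Reverse the steps: reassemble the bits, then copy them back. */
static float read_float_le(const unsigned char in[4])
{
    uint32_t bits =  (uint32_t)in[0]
                  | ((uint32_t)in[1] << 8)
                  | ((uint32_t)in[2] << 16)
                  | ((uint32_t)in[3] << 24);
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}
```

The same pattern works for doubles via uint64_t.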

This topic is closed to new replies.
