Why is math transformation taxing to most CPUs?

12 comments, last by tanzanite7 9 years, 4 months ago


Done poorly, a game can still overload the CPU with badly-written math operations.

Badly-written math operations, as in "poorly optimized math code"? Could you write an example showcasing a badly-written math operation and a well-written one? I'd definitely like to learn more.

I think what he is getting at is that if you don't take advantage of parallel operations, and if you take the naïve approach (i.e. math ops without any refactoring), then you get terrible performance.
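To make that contrast concrete, here is a minimal sketch, assuming column-major 4x4 matrices and SSE intrinsics (the function names are made up for this example, not taken from anyone's actual code):

#include <xmmintrin.h>  // SSE intrinsics (SSE1 is enough here)

// Naive version: sixteen scalar multiply-adds per vertex, nothing kept
// in registers between iterations.
void transform_naive(const float m[16], const float* in, float* out, int count)
{
    for (int i = 0; i < count; ++i) {
        const float* v = in + i * 4;
        for (int r = 0; r < 4; ++r)
            out[i * 4 + r] = m[r] * v[0] + m[4 + r] * v[1]
                           + m[8 + r] * v[2] + m[12 + r] * v[3];
    }
}

// Refactored version: the four matrix columns stay in SSE registers, and
// each vertex costs four multiply-adds over whole 4-float lanes.
void transform_sse(const float m[16], const float* in, float* out, int count)
{
    const __m128 c0 = _mm_loadu_ps(m + 0);
    const __m128 c1 = _mm_loadu_ps(m + 4);
    const __m128 c2 = _mm_loadu_ps(m + 8);
    const __m128 c3 = _mm_loadu_ps(m + 12);
    for (int i = 0; i < count; ++i) {
        const float* v = in + i * 4;
        __m128 r = _mm_mul_ps(c0, _mm_set1_ps(v[0]));
        r = _mm_add_ps(r, _mm_mul_ps(c1, _mm_set1_ps(v[1])));
        r = _mm_add_ps(r, _mm_mul_ps(c2, _mm_set1_ps(v[2])));
        r = _mm_add_ps(r, _mm_mul_ps(c3, _mm_set1_ps(v[3])));
        _mm_storeu_ps(out + i * 4, r);
    }
}

The refactored loop is the kind of thing meant by "taking advantage of parallel operations": the matrix lives in registers and every instruction does four lanes of work at once.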

However, I would like to take a different stance than the others on this topic. I think the terms provided by the author are probably appropriate - taxing to a CPU can mean a lot of different things, and just like all things in computer graphics, it depends on the scene you are processing. If you have a simple scene, a CPU rasterizer can easily keep a high frame rate on modern CPUs. If you want to push the limits of current technology, then CPUs are not the processor of choice for graphics - you would obviously go for GPUs.

So the author's blanket statement (that the CPU is always heavily taxed by transformations) is incorrect, because it depends on the scene being rendered and the ops being executed. If you take a look at the latest WARP devices in D3D11, you can find some screaming-fast software-based rasterizers that work just fine for many situations.
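For reference, opting into WARP is a one-argument change at device creation, using the documented D3D11CreateDevice call. A minimal sketch (the helper name is made up):

#include <d3d11.h>
#pragma comment(lib, "d3d11.lib")

// Create a WARP (software rasterizer) device instead of a hardware one.
HRESULT create_warp_device(ID3D11Device** device, ID3D11DeviceContext** context)
{
    D3D_FEATURE_LEVEL level;
    return D3D11CreateDevice(
        nullptr,               // default adapter
        D3D_DRIVER_TYPE_WARP,  // software rasterizer instead of the GPU
        nullptr, 0,            // no software DLL, no creation flags
        nullptr, 0,            // accept the default feature-level list
        D3D11_SDK_VERSION,
        device, &level, context);
}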


[...] and threads (2-8× as many operations per clock, if being unrealistically ideal) [...]

I nearly blew a fuse reading that. Please tell me that's a typo, and not how you think threading improves performance.

RIP GameDev.net: launched 2 unusably-broken forum engines in as many years, and now has ceased operating as a forum at all, happy to remain naught but an advertising platform with an attached social media presence, headed by a staff who by their own admission have no idea what their userbase wants or expects. Here's to the good times; shame they exist in the past.

[...] and threads (2-8× as many operations per clock, if being unrealistically ideal) [...]

I nearly blew a fuse reading that. Please tell me that's a typo, and not how you think threading improves performance.

In general no, but if all we're talking about is vertex transforms or another "embarrassingly parallel" problem, then yes. You could very well write a software T&L engine and simply replicate it across any and all cores not already consumed with other duties, and achieve essentially linear speedup on vertex transformations, limited only by available memory bandwidth. The same properties that make this problem suitable for the massive parallelism of GPUs make this equally possible on CPUs. This is more or less what GPUs do, except that they're massively scaled up (and of course they have other optimizations appropriate for their problem domain).
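A minimal sketch of that replication, assuming the transform_sse routine from the earlier example and splitting the vertex range evenly across hardware threads (the helper name is hypothetical):

#include <algorithm>
#include <thread>
#include <vector>

// Replicate the single-threaded transform across all available hardware
// threads. Each worker owns a disjoint slice of the vertex arrays, so no
// synchronization is needed - the "embarrassingly parallel" shape.
// transform_sse is the routine sketched earlier in the thread.
void transform_parallel(const float m[16], const float* in, float* out, int count)
{
    const unsigned workers = std::max(1u, std::thread::hardware_concurrency());
    const int chunk = (count + (int)workers - 1) / (int)workers;
    std::vector<std::thread> pool;
    for (unsigned t = 0; t < workers; ++t) {
        const int begin = (int)t * chunk;
        const int end   = std::min(count, begin + chunk);
        if (begin >= end) break;
        pool.emplace_back([=] {
            transform_sse(m, in + begin * 4, out + begin * 4, end - begin);
        });
    }
    for (std::thread& th : pool)
        th.join();  // scales near-linearly until memory bandwidth saturates
}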

throw table_exception("(╯°□°)╯︵ ┻━┻");

[...] and threads (2-8× as many operations per clock, if being unrealistically ideal) [...]

I nearly blew a fuse reading that. Please tell me that's a typo, and not how you think threading improves performance.

"if being unrealistically ideal".

His example was describing the upper bound, and is correct as such. Whether it is realistic is irrelevant in that context.

edit: or did you get the impression he is not talking about hardware threads (CPU cores, plus HT where available ... typically 2-8)?
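For what it's worth, that hardware-thread count is exactly what the standard library reports; a trivial check:

#include <iostream>
#include <thread>

int main()
{
    // Logical hardware threads: physical cores x HT, i.e. the
    // "typically 2-8" figure being discussed.
    std::cout << std::thread::hardware_concurrency() << " hardware threads\n";
}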

This topic is closed to new replies.
