Math Performance

posted in Excursions into the Unknown

Published August 06, 2009

Ever since things settled down in SlimDX, we've been turning our attention outward towards new projects that we've been naming by humorously tacking on the "Slim" moniker. Josh has been working on both an SVN client and a home-made CLR, which right now is targeting the PSP. Promit has his SlimTune profiler project that you no doubt have seen, and I have my mini SlimLine line counter project.

Washu and I originally started a separate SlimMath project back in March, when the new XNAMath library was release that made heavy use of SIMD for blazing fast performance. We wanted to bring these performance gains to SlimDX, but it was obvious that the normal method of wrapping wouldn't suffice for these small and quick bits of code; the managed/native barrier was too great to cross. I came up with the idea to batch math calls and send them all across the marshaling barrier at once, therefore only paying the cost once in each direction. We started work on that, but even with this case, the costs paid were so large that you had to do a monstrous amount of work for it to be worthwhile.

We abandoned this idea, and SlimMath was left to die. Just a few weeks ago though, Washu came up with the idea of modifying the .NET assembly after it had been compiled with NGEN and injecting our own implementations of certain functions that used hand-written ASM. The idea immediately appealed to me, and we set off on such a project, dubbing it SlimGen. While Washu is no doubt preparing a series of blog posts on the subject (he's quite excited about this, something that should amaze and scare you at the same time), I just wanted to talk a little bit about how this could be used.

The way we see it, our SlimGen client program runs on the end user's computer, at install time. After the .NET target assembly (for example, SlimDX.dll) has been NGEN-ed and installed into the GAC, we run our program that takes compiled object files and inserts the raw code into the native assembly image. This has a few major benefits for us. First, you can avoid all interop costs by injecting the code directly. Second, you can make use of any instructions you wish in the raw code, which means that math methods can now take advantage of SIMD extensions. The third benefit, which is mostly derived from running the program at install time, is that it can customize the assembly based upon the extensions supported by the target processor. This means that if a chip is detected as having SSE4 support, we can inject our SSE4 version of a function directly, without any runtime checks.

This scales well too, since for functions where we don't care to provide a particular SSE4 version, we can simply downgrade to the next available version, or if we don't provide any optimized ASM for a method, simple go on using the one provided in the assembly. This new tool, once finished and properly tested, should allow .NET library and application developers to squeeze every possible performance benefit out of using native code, while still retaining many of the niceties and usability benefits of managed development.

Previous Entry SlimLine

Next Entry GDNet+

0 likes 2 comments

Comments

Prinz Eugn

No SlimMMORPGMaker '09?
I'm not going to lie, you're Journal goes over my head to the point where by back kind of hurts now. I had only the vaguest notion GDNet moderators worked on anything until I read this, I was under the impression that they were more like indentured servants dutifully reading every post and formulating appropriate sarcastic responses.

August 10, 2009 01:47 PM

Mike.Popoloski

Making sarcastic comments is a requirement for becoming a moderator, but every once in a while you can poke one into doing something useful for you.

Also, I see you were true to your word [grin]

August 10, 2009 04:46 PM

You must log in to join the conversation.

Don't have a GameDev.net account? Sign up!

Mike.Popoloski

Author

Math Performance

Comments

Mike.Popoloski

Latest Entries

New Blog

Exodus of the Faithful

Demystifying SSE Move Instructions

Progress Update

Start of Project

New Job

Language Builder IDE

The Downward Spiral

Job Interviews

HLSL Language Service

Math Performance

Comments

Mike.Popoloski

Latest Entries

New Blog

Exodus of the Faithful

Demystifying SSE Move Instructions

Progress Update

Start of Project

New Job

Language Builder IDE

The Downward Spiral

Job Interviews

HLSL Language Service

Reticulating splines