Jump to content
  • Advertisement
Sign in to follow this  
  • entries
  • comments
  • views

More on SSE

Sign in to follow this  


Well, even though I said I was not going to convert my CVector3 class to hold a float[3] array instead of separate x,y,z variables, I did. It wasnt so much of an issue in the end, but unlike what I though, you DO need to use float[4] even if you're not using the whole 4 variables, plus, you should use movups instead of movaps when moving data from memory to a MMX register, yes, even if you declared the data as ALIGN16.

For some reason using movaps randomly threw segfault exceptions on me, even though the adress%16=0;

You learn something new every day [smile]

also, although it is valid to use the memory as one of the parameters for the parallel instructions, you're better off movups'ing the values to a MMX register instead of using the memory location directly because of the same reason to favor movups over movaps.

Sign in to follow this  


Recommended Comments

There are no comments to display.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Advertisement

Important Information

By using GameDev.net, you agree to our community Guidelines, Terms of Use, and Privacy Policy.

We are the game development community.

Whether you are an indie, hobbyist, AAA developer, or just trying to learn, GameDev.net is the place for you to learn, share, and connect with the games industry. Learn more About Us or sign up!

Sign me up!