Sign in to follow this  
HaywireGuy

Unity PUSH ECX faster? Or MOV Count, ECX?

Recommended Posts

Unless you're programming a 486 or earlier, it won't make a difference.
I'd probably use the stack directly with push since its more 'assembly-like' imo. Using variables is for sissies =-P

Share this post


Link to post
Share on other sites
Given that (non-register passing) function calls have their parameters passed with PUSHes for every compiler that I've ever seen, rather than a SUB and a bunch of MOVs, I'd say jsut go with PUSH.

Share this post


Link to post
Share on other sites
Thanks guys, that's helpful. I'm goin' with BUSH, I mean, PUSH.

Anyway my friend told me that PUSH is built in, so it only
occupies a single code byte (0x51), whereas "MOV Count, ECX"
occupies three code bytes and it's slower. Yeah and I saw that
in my dissasembly too :)

Share this post


Link to post
Share on other sites
Not necessarily. It may be that he could go forth and do a bunch of benchmarks, and infer, from the data gained, which approach is faster. But instead of doing all that stuffing around, he pops the question on GameDev.Net, on the assumption that with so many asm programmers around, someone's probably already done the groundwork and he'd be wasting his time to repeat it.

Linkay. Which may or may not be useful.

Share this post


Link to post
Share on other sites
Quote:
Original post by HaywireGuy
Thanks guys, that's helpful. I'm goin' with BUSH, I mean, PUSH.

Anyway my friend told me that PUSH is built in, so it only
occupies a single code byte (0x51), whereas "MOV Count, ECX"
occupies three code bytes and it's slower. Yeah and I saw that
in my dissasembly too :)


These days pretty much all of the ALU operations (including MOV and PUSH/POP) are hardwired into the chip already. Yes the alternate encoding for PUSH register operand onto the stack is only one byte, and I think a MOV mem/reg operation is two bytes followed by up to a four byte memory value.

The Intel Optimization manual gives PUSH with latency of 1.5 and throughput of 1, while MOV has a latency of 0.5 and throughput of 0.5. Remember that these are just estimates, and what usually matters more on modern processors is memory and cache performance. Either way I doubt you'll get much speed difference between the two.

Share this post


Link to post
Share on other sites
Quote:
Original post by Zahlman
If you have to ask (i.e. can't figure out how to test it for yourself), then you are pretty much automatically not qualified to make use of the answer.
Ain't that the absolute truth!

I was going to somehow say the same thing, but you couldn't have said it better.

Share this post


Link to post
Share on other sites
Zahlman, I was waitin' for a reply like that given by
DukeAtreides076 and the rest, which my test application won't
tell me. So, yah, I'm here.

Cheers fractoid, I'm sure that's gonna be useful :)

Duke, I'm sure that's true, maybe I'm just being a lit'
paranoid when I have these MOVs in a very tight loop that
executes millions of times. But anyway, thanks!

Share this post


Link to post
Share on other sites
FYI: The point I think we were both making is that if it was in an innermost loop then timing it would easily tell you which is faster. If it isn't in an innermoost loop then it doesn't really matter as it isn't being executed very much and will barely make any difference. So there is no need to ask the question either way, at least not without explaining why you needed to ask the question, such as 'I can't get my profiling working'.
On the other hand if you were just too lazy to profile then...

Excuse my beverity btw.

Share this post


Link to post
Share on other sites
Quote:
Original post by iMalc
FYI: The point I think we were both making is that if it was in an innermost loop then timing it would easily tell you which is faster. If it isn't in an innermoost loop then it doesn't really matter as it isn't being executed very much and will barely make any difference. So there is no need to ask the question either way, at least not without explaining why you needed to ask the question, such as 'I can't get my profiling working'.
On the other hand if you were just too lazy to profile then...

Excuse my beverity btw.
Even if it was in an inner loop, it'd have to be executing (MANY * MANY * MANY * MANY)999999 times to be noticable. If the times given by DukeAtreides076 are accurate, it might take ~3 cycles. On a 500MHZ machine, that means it would take 6*10-9 seconds, or 6 nanoseconds. On a modern machine (3.5GHz say) it would take a single nanosecond. You could do that operation a US Billion times per second. Unless you're writing for a REALLY slow machine, the difference is practically 0 and the time you wasted on the 'optimization' could be better spent optimizing the algorithms or implementations of something else..

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

Sign in to follow this  

  • Announcements

  • Forum Statistics

    • Total Topics
      628333
    • Total Posts
      2982127
  • Similar Content

    • By BAG Labs
      Mobile SoS

      Platform: Android
      Genre: Board
      Link: Google Play
       
      This games sharpen memory and test your strategies to place S-O-S pattern within time limit and serve 3 difficulties as Easy, Normal, and Hard.

      Goals of the game is to put S-O-S words in patterns (Horizontal, Vertical, and Diagonal) alternately with enemy.
       
      Features:
      Single Player Multiplayer Achievements Leaderboards  
       
      Screenshot:

       

       

       
      Link: Google Play
       
      Please help us improve this game with review
    • By abarnes
      Hello!
      I am a game development student in my second year of a three year program and I would like to start building my portfolio. I was thinking of creating some games to show what I can do to potential employers since I wont have any work related experience when I graduate. But as I'm sure you all know there are tons of ways to approach developing/designing a game and I'm curious if anyone had any insight as to any "standards" that come with this? Is it okay to use game engines like Unity, Unreal, Game Maker etc? Or would it be better to make a game from scratch to better show case your skills? Any and all advice will be greatly appreciated!
    • By Hilster
      Hello 2D Artists,
      I've started making a 2D Puzzle Adventure game for mobile and I'm looking for someone who would want in on creating assets for the game. The core of the programming is pretty much complete, you can walk within the grid laid out and push boxes, when there is an object on top of a pressure pad it will activate the linked objects or if there is one object with multiple linked pressure pads it requires you to activate all points for the object to become active. 

      The level iteration for the game is quick and simple, a Photoshop file that is made of individual pixels that represents objects is put into the game and it creates the level out of those pixels with the assigned objects.
      The objects that need sprites created so far is the character, box, pressure pad, door, trap door, the walls, the stairs and the tiled background.
      I intend to add more objects so the amount I'd like to add will be extended.
      My motivations for posting here is to have something that looks nice to be able to display on my portfolio, so if you're looking for a working game that you can place your art into and improve the look of your portfolio then we're in business.
      Please reply with a few past examples of your art below and I'll be in touch!
    • By thefollower
      Hi
      I have set up my TcpClient to connect to my server and that works fine. But i am a bit confused how i read messages from the network stream with it continuously running via async, without ever closing the connection ?
      My TcpClient Manager class has:
       
      public async Task<bool> Connect(string address, int port) { try { await _tcpClient.ConnectAsync(address, port); IsConnected = true; return true; } catch(Exception e) { Debug.Log(e); return false; } } public async Task<int> Read(byte[] readBuffer) { if (!IsConnected) return -1; using (var networkStream = _tcpClient.GetStream()) { try { var bytesRead = await networkStream.ReadAsync(readBuffer, 0, readBuffer.Length); return bytesRead; } catch (Exception e) { Debug.Log(e); IsConnected = false; return -1; } } }  
      So i thought to just run a co-routine and call Read constantly to get the most recent message, but that doesn't make much sense to me since a co-routine would be blocked with the await. How is this actually done? The MS Docs don't have very good Async examples with the TcpClient class so i don't know fully get how to keep calling Read correctly.
  • Popular Now