Custom pseudo-instancing faster than real instancing on Nvidia cards?

Started by
10 comments, last by theagentd 10 years, 10 months ago

So... How do I report this so Nvidia actually listens?

I'm not quite sure, but making an account at developer.nvidia.com would probably be a good start, and then reporting it either on their forums or via the "contact" form.
The small, self-contained reproduction test is great for them to be able to see exactly what you're on about.

I haven't looked at your code, but I wouldn't expect simply adding an index buffer to double your CPU-side frame times...
I ran your test on my PC (Q6600 CPU, GTX460, driver 9.18.13.2018 12/05/2013) and got, for the buttons from left to right: 77, 43, 31, 83, 42

Thank you very much! That means that the performance quirks exist (at least) on the following cards:

GTX 295

GTX 460m (laptop)

GTX 460

GT 630

GTX 680

The only difference between the indexed renderer and the array renderer is that the indexed one uses 6 indices per quad to form 2 triangles (GL_TRIANGLES), while the array one effectively does the same thing internally using GL_QUADS, but without an index buffer. Dropping the index buffer works when just rendering quads, but everything I'm rendering in my engine uses indexed triangles, so I can't simply switch to the array path.
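To make the "6 indices to form 2 triangles" part concrete, here is a rough sketch of how such an index list could be built. This is not the code from the test program: the function name `quad_indices`, the corner order, and the choice of diagonal (0-1-2 and 2-3-0) are assumptions made for illustration only.

```python
def quad_indices(quad_count):
    """Build a triangle index list that draws each 4-vertex quad as 2 triangles.

    Assumes (hypothetically) that quad q owns vertices 4*q .. 4*q + 3,
    listed in the same winding order a GL_QUADS draw would use.
    """
    indices = []
    for q in range(quad_count):
        base = 4 * q  # first vertex of this quad
        # Triangle 1 uses corners 0, 1, 2; triangle 2 uses corners 2, 3, 0.
        indices += [base, base + 1, base + 2,
                    base + 2, base + 3, base]
    return indices


if __name__ == "__main__":
    # Two quads: 8 vertices, 12 indices.
    print(quad_indices(2))
```

The indexed path would upload a list like this as the element buffer and draw it as GL_TRIANGLES, while the array path needs no such list and just submits the 4 vertices per quad as GL_QUADS. Both describe the same geometry, which is why a large CPU-side difference between them is surprising.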

EDIT: I've posted this on the Nvidia developer forums: https://devtalk.nvidia.com/default/topic/548150/opengl/performance-bug-in-gldrawelementsinstanced/

