Creating new triangles using GL_TRIANGLE_STRIPS

Started by
7 comments, last by L. Spiro 11 years, 2 months ago

Hello all,


Trying to render a model using GL_TRIANGLE_STRIPS and its working fine, however, at certain arbitrary points I want to break off the the triangle strips and essentially start a new set of triangle strips (that are completely separate from the original strips). How should I go about doing this? Multiple VBOs? I currently have only one VBO that holds all my vertices.

(OpenGL ES2.0 in particular but surely applies to OpenGL as a whole)


Thanks!

Advertisement

You can do it with multiple draw calls just starting each at a different offset into your index buffer, no need to use multiple VBOs at all, but it's usually much more efficient to use degenerate triangles to 'stitch' strips together. For illustrative purposes (your strips are probably longer) imagine two quads and the following triangle strip indices to render them: 0, 1, 2, 3, 3, 4, 4, 5, 6, 7.

That's 10 indices, so 8 triangles. But triangle 2, 3, 3, triangle 3, 3, 4, triangle 3, 4, 4 and triangle 4, 4, 5 are degenerate, that is they have zero area, so they do not contribute to the scene at all. So only 4 triangles get rendered, just the two quads we wanted.

You may want to also test if triangle strips are really best for you in the first place.

In my experience, starting with triangle lists and then adding triangle stripping later the performance actually decreased. At my office we got the same results (our triangle-strippers are not related and use different algorithms).

Neither of us did a thorough investigation into why our triangle strips were slower than triangle lists, but the candidates are obvious: poor caching and extra triangles.

Degenerate triangles get culled early, but not early enough. You take a hit for them.

And while there is less bandwidth due to smaller index buffers, it is not enough to make up for the better cache performance offered by triangle lists.

As a general rule of thumb, you should only use triangle strips when the index buffer’s size is below 60% of its triangle-list form. When the index buffer is able to decrease by this much it means fewer degenerate triangles were generated and bandwidth is so significantly reduced that it can make up for the poorer cache.

Also, your mobile device uses a unified memory model, which means bandwidth is irrelevant, which means that by using triangle strips you only increase the triangle count and decrease the cache performance. The only way in which triangle strips help you is in decreasing bandwidth, but on mobile devices there is no bandwidth issue in the first place, which means triangle strips are really just a way of shooting yourself in the foot. You get all of the bad and none of the good.

Never blindly go with what is rumored to be the better way. Always do your own testing. You will likely find that you will have better performance with triangle lists than with triangle strips.

L. Spiro

I restore Nintendo 64 video-game OST’s into HD! https://www.youtube.com/channel/UCCtX_wedtZ5BoyQBXEhnVZw/playlists?view=1&sort=lad&flow=grid

Depending on the hardware being targetted, strips may yet be the best choice; mobile hardware (this is GL ES after all) is particularly known for flying in the face of what works better elsewhere. If you do want to retain strip order, then using indexing will let you do that and will nicely cover the case where you need to join two strips; this will be cheaper than adding extra verts to make degenerates as indices are smaller than vertices. Indexing can also cover cases where you need to add free triangles into your mesh, or even add in some fans, and all without any messing and with just one draw call per mesh.

Primitive restart can also do that, but since you're on ES2 you don't have primitive restart available, so do at least try a benchmark with just indexed triangles as in general terms (and as L Spiro says) they are the general-case fastest path nowadays; if you run into performance problems with those (or if you are on a class of mobile hardware where you know for absolute certain that strips are preferred) then is the time to start considering strips, not before.

Essential reading here: http://hacksoflife.blogspot.ie/2010/01/to-strip-or-not-to-strip.html

Direct3D has need of instancing, but we do not. We have plenty of glVertexAttrib calls.

You may want to also test if triangle strips are really best for you in the first place.

Also, your mobile device uses a unified memory model, which means bandwidth is irrelevant, which means that by using triangle strips you only increase the triangle count and decrease the cache performance. The only way in which triangle strips help you is in decreasing bandwidth, but on mobile devices there is no bandwidth issue in the first place, which means triangle strips are really just a way of shooting yourself in the foot. You get all of the bad and none of the good.

Whoa that is incredible. Thank you so much. I will first construct it using triangles. I was just getting such terrible performance using strips and this could be way. I will reconstruct my data using just standard GL_TRIANGLES.

Do you have any more information about this bandwidth issue?

Thank you!

-jmfurlott

Depending on the hardware being targetted, strips may yet be the best choice; mobile hardware (this is GL ES after all) is particularly known for flying in the face of what works better elsewhere. If you do want to retain strip order, then using indexing will let you do that and will nicely cover the case where you need to join two strips; this will be cheaper than adding extra verts to make degenerates as indices are smaller than vertices. Indexing can also cover cases where you need to add free triangles into your mesh, or even add in some fans, and all without any messing and with just one draw call per mesh.

Primitive restart can also do that, but since you're on ES2 you don't have primitive restart available, so do at least try a benchmark with just indexed triangles as in general terms (and as L Spiro says) they are the general-case fastest path nowadays; if you run into performance problems with those (or if you are on a class of mobile hardware where you know for absolute certain that strips are preferred) then is the time to start considering strips, not before.

Essential reading here: http://hacksoflife.blogspot.ie/2010/01/to-strip-or-not-to-strip.html

Depending on the hardware being targetted, strips may yet be the best choice; mobile hardware (this is GL ES after all) is particularly known for flying in the face of what works better elsewhere. If you do want to retain strip order, then using indexing will let you do that and will nicely cover the case where you need to join two strips; this will be cheaper than adding extra verts to make degenerates as indices are smaller than vertices. Indexing can also cover cases where you need to add free triangles into your mesh, or even add in some fans, and all without any messing and with just one draw call per mesh.

Primitive restart can also do that, but since you're on ES2 you don't have primitive restart available, so do at least try a benchmark with just indexed triangles as in general terms (and as L Spiro says) they are the general-case fastest path nowadays; if you run into performance problems with those (or if you are on a class of mobile hardware where you know for absolute certain that strips are preferred) then is the time to start considering strips, not before.

Essential reading here: http://hacksoflife.blogspot.ie/2010/01/to-strip-or-not-to-strip.html

Ah my question was already answered (didn't see this before replying). Thank you guys...seems like normal triangles are the way to go for now. I don't indices are good for my model because (based on input) they are continually changing.

Do you have any more information about this bandwidth issue?

Nothing I could cite but you can read about unified memory models (UMM) in general online.
What it means for mobile devices is that the GPU and CPU share the same memory, unlike in desktops where they each have their own memory.
When a GPU has its own memory it can only access that memory, so whatever you want to draw has to, at some point, be transferred across the bus to the GPU RAM from the CPU RAM. How much and how fast you can transfer is “bandwidth”.

So for desktops your index and vertex buffers have to be copied, thus smaller is better.

For UMM, no copy has to take place since the GPU can access the vertex/index buffers directly wherever they are in “normal” RAM.
Smaller is still better, but not as significantly.
And there are still things that can cause a copy to take place by the driver (though it is just “normal” RAM to “normal” RAM, literally via memcpy()).
If you are not using a VBO, the entire vertex buffer will be copied.
If you are not using an IBO, the entire index buffer will be copied.
If your vertex-buffer elements are poorly aligned (for example using 6-bytes for positions) the entire vertex buffer will be copied, and slowly since it also realigns the vertex data.


L. Spiro

I restore Nintendo 64 video-game OST’s into HD! https://www.youtube.com/channel/UCCtX_wedtZ5BoyQBXEhnVZw/playlists?view=1&sort=lad&flow=grid

Hello all,


Trying to render a model using GL_TRIANGLE_STRIPS and its working fine, however, at certain arbitrary points I want to break off the the triangle strips and essentially start a new set of triangle strips (that are completely separate from the original strips). How should I go about doing this? Multiple VBOs? I currently have only one VBO that holds all my vertices.

(OpenGL ES2.0 in particular but surely applies to OpenGL as a whole)


Thanks!

Under some circumstances, you can do the trick where you alpha out at the end of one strip and alpha back in at the start of the next.

(Still in the same draw call, -it's the same GL_TRIANGLE_STRIP, basically.)


I realize this seems very dirty, but I made it work, -it's easy and performs well when used with caution.

Make sure the triangles fading to transparent and the one connecting strips have zero area too, -and preferably hide them by applying high (enough) depth values, there's no need to process more fragments than necessary.

I used it for patches in a typical heightmap terrain, and there are probably many places where it won't perform well.

I don't indices are good for my model because (based on input) they are continually changing.

I didn’t see this before but now that I have I also recall you saying your performance was lower than you expected.

If you don’t update your VBO’s properly it will have a very huge impact on your performance (literally halving it).
Be sure you are updating your VBO’s properly either by using ring buffers or by orphaning your buffers before changing them.


L. Spiro

I restore Nintendo 64 video-game OST’s into HD! https://www.youtube.com/channel/UCCtX_wedtZ5BoyQBXEhnVZw/playlists?view=1&sort=lad&flow=grid

This topic is closed to new replies.

Advertisement