• Create Account

## Normalmapping Performance

Old topic!

Guest, the last post of this topic is over 60 days old and at this point you may not reply in this topic. If you wish to continue this conversation start a new topic.

7 replies to this topic

### #13TATUK2  Members

714
Like
0Likes
Like

Posted 06 September 2013 - 11:04 PM

Is there any solution to the performance slowdown issue with normalmapping?

It's not even related to the specular calculation but just the texture fill itself.

Meaning if I have something like this:

vec4 colour = texture2D( F_color_texture, f_TexCoord );

vec3 normal = texture2D( F_bumpmap_texture, f_TexCoord );

final = colour * normal;

Then I experience the slowdown issue. Drops from 260 to 130

### #2Hodgman  Moderators

49430
Like
1Likes
Like

Posted 06 September 2013 - 11:59 PM

What units are those numbers in? I'm guessing they're frames per second?
To get some more intuitive numbers, you can calculate milliseconds per frame with ms = 1000/fps.
Which gives: 3.85ms per frame, increased up to 7.69ms per frame, or an increase of 3.85 (a doubling in cost).

If you look at two different shaders:
vec4 colour = texture2D( F_color_texture, f_TexCoord );
final = colour;
vec4 colour = texture2D( F_color_texture, f_TexCoord );
vec3 normal = texture2D( F_bumpmap_texture, f_TexCoord );
final = colour * normal;
Then it's obvious that the second one does twice as much work as the first one... so you should expect this pixel shader to be double the cost of the first one.

If that's hypothesis is backed up by actual measurements, then it means that the pixel shader is the bottleneck in your program.

What kind of GPU do you have? ~3ms to fill the screen with a basic shader (one texture fetch) is quite slow.
What kind of objects are you drawing? How many vertices/triangles are being drawn? How many pixels are covered? What format are the textures? Do the textures have mipmaps, and do you have mipmapping sampling states enabled?

### #33TATUK2  Members

714
Like
0Likes
Like

Posted 07 September 2013 - 03:33 AM

Yes, units in frames per second..

Well... here's the thing though..

Yeah it may seem like it does "twice as much work" - but this isn't exactly true.

For example, I also have a texture2D() call to look up my lightmap - *but it uses different texcoords* - so - you don't have the entire lightmap texture repeated many times everywhere (like the bumpmap), instead you just have small portions of the lightmap applied everywhere, so performance is actually negligent. (Basically the same as without a lightmap lookup)

But, like I just mentioned, the bumpmap is applied in full everywhere the regular diffuse color texturing is, since they share the same texcoords.

Not that it's extremely important, but I do have somewhat of a slower card.. Radeon HD 5450. Drawing the environment (3D FPS map). The map is only 2158/4336 vertices/triangles, and I don't know how many are on screen when I drop to 130 FPS, but it does depend on what's on screen. For example if I just face a wall I get 300+ FPS.. But if I go to one wide open section with a lot of long-distance wall exposed, then I get the drop to 130 (from 260 in the same section without the bumpmap). Resolution is a low 800x600. Textures are GL_UNSIGNED_BYTE GL_RGBA. No mipmaps, MIN/MAG == GL_LINEAR.

### #4Kaptein  Prime Members

2216
Like
0Likes
Like

Posted 07 September 2013 - 04:37 AM

I think maybe your GPU is stalling on things in other places in the rendering

There's no way for 2200 vertices to do that unless your vertex program is gigantic

I really don't think your GPU is that slow, simply because it's not an onboard GPU...

Check your code for glGet*, occlusion result queries that aren't completed yet or in the same frame etc.

Anything that might be forcing GPU to flush

Keep in mind that mipmaps may actually increase fragment shader throughput

Edited by Kaptein, 07 September 2013 - 04:38 AM.

### #53TATUK2  Members

714
Like
0Likes
Like

Posted 07 September 2013 - 04:58 AM

It's definitely the bumpmapping... If I simply comment out "final*=normal;" then I get the 260 FPS :x

### #6Ohforf sake  Members

2048
Like
2Likes
Like

Posted 08 September 2013 - 12:02 PM

Textures are GL_UNSIGNED_BYTE GL_RGBA. No mipmaps, MIN/MAG == GL_LINEAR.

This is your problem, just as Hodgman and Kaptein already said. Without mipmapping you are sampling texels from all over the place, whenever the texture is "scaled down" as in the far reaches of your "long-distance wall". This renders your texture caches ineffective and increases bandwidth.

This is not a bumpmapping problem. You have the same problem with your color texture and you will also experience it when you try to render particles.

### #73TATUK2  Members

714
Like
0Likes
Like

Posted 08 September 2013 - 06:31 PM

Perfect soltuion! thanks!

Edited by 3TATUK2, 08 September 2013 - 06:47 PM.

### #83TATUK2  Members

714
Like
0Likes
Like

Posted 08 September 2013 - 06:36 PM

[deleted]

Edited by 3TATUK2, 08 September 2013 - 06:47 PM.

Old topic!

Guest, the last post of this topic is over 60 days old and at this point you may not reply in this topic. If you wish to continue this conversation start a new topic.