Oogst

DX11 Shader debugger in DX9


I would like to do real shader-debugging on my Windows 7 PC: being able to capture a frame, select a pixel and step through its pixel shader. The real deal, just like I can on my Xbox 360 devkit. Which tools can I use for that on Windows 7?

Supposedly Visual Studio 2012 has a really good PIX inside, but this is only DX10 and DX11, while my game is still DX9. So I cannot use that.

Nvidia's Perfhud used to also be a nice graphics debugger (I don't know if it had stepping through pixel shaders, though), but I cannot get Perfhud to work on my Windows 7 computer: it keeps giving the error that the drivers are not instrumented. Also, Perfhud doesn't seem to have been updated in six years, suggesting it is simply dead.

So, what to do? Are there other debugging/profiling tools I could use for graphics?

My game is made in DX9 (with the Ogre engine), shader model 3.0, and I can switch between HLSL and Cg, so either would be good. I use Windows 7 and have an Nvidia GTX480.

Thanks in advance!


Like Yourself mentioned, Nsight is the replacement for PerfHUD. Nsight is the only debugger available for PC that actually debugs shaders on the hardware; all other debuggers work by emulating the shader on the host PC.

The old PIX for Windows will still work for debugging shaders (via emulation, as mentioned above). However, you can't have the Windows 7 hotfix that brought DX11.1 to Win7 installed: with that hotfix, PIX will crash. It is possible to uninstall the hotfix and use PIX again.

I am okay with software emulation, just as long as the results are correct, since I need it for debugging, not so much for profiling.

I had a look at Nsight before, but I got the impression it only does shader debugging in DX10/11, not in DX9. Am I misinterpreting the description here, then? https://developer.nvidia.com/nsight-visual-studio-edition-features

I had also found Intel GPA, but I had automatically assumed it would only support Intel video cards. Does it also fully function on Nvidia cards, then? I also couldn't find any description of full shader debugging in the GPA description. The Product Brief only mentions "Shader experiments", which suggests things like swapping shaders, not fully stepping through a shader for an individual pixel.

Since you are mentioning three different alternatives (Nsight, GPA and PIX), which would be my best choice?


It does look like you're correct regarding Nsight and DX9. Honestly, it's been so long since I used DX9 that I hadn't checked which tools support it. I think PIX might be your best bet here: it's the only tool that I can confirm works for debugging DX9 shaders.


The Microsoft DirectX SDK (June 2010) has a 64-bit PIX in the package. I have it on my Win7 machine and I successfully debug 32-bit DX9 applications with it. Please post your result, whether you got it working or not. Thanks.

I tried PIX now and it indeed does exactly what I was looking for!

I did have trouble with PIX in the DirectX June 2010 SDK, which gave some weird error when I tried to debug a pixel. But it turns out that the 32-bit PIX in the August 2007 SDK does work. Great, on to some good debugging fun!

Thanks for the advice, folks!


Weird error? You could have been more specific. Wasn't it alerting "Multisampled surfaces cannot be pixel shader debugged."?

Since PIX 2007 works fine, I wasn't particularly looking for a solution, but I guess newer PIX would of course be better, so if you know how to solve it, I'd love to hear the solution!

When I try to debug a shader for a specific pixel, I first get a popup that tells me PIX is going to "Enable Shader Debugging" in the DirectX Control Panel. I click Yes, I allow it to make changes through a Windows popup, and then I get the message "Shader debugging could not be enabled", followed by "An error occurred while preparing to debug the shader".

I tried enabling shader debugging myself in all the DirectX control panels I could find (2007/2010 32/64bit), and also setting them to Debug instead of Release DirectX, but neither helped. PIX 2007 works fine right away.


I have that same exact problem with PIX while trying to debug vertex or pixel shaders in my XNA project...  I guess I'll go see if the old version works for me.


If you're using XNA, make sure you check "Disable D3DX Analysis" in the Target Program tab of "More Options" in PIX.
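
A side note on the pixel-debugging workflow discussed in this thread: as far as I know, PIX only shows HLSL source while stepping if the shader was compiled with debug information; otherwise you step through the disassembly. If your shaders are compiled offline with fxc (an assumption; Ogre normally compiles HLSL at runtime, in which case this does not apply directly), a rough sketch of building such a command line could look like this. The helper name, file names and entry point are made up for illustration; `/T`, `/E`, `/Zi`, `/Od` and `/Fo` are standard fxc switches.

```python
import os


def fxc_debug_command(source, entry_point="main", profile="ps_3_0", output=None):
    """Build an fxc command line that keeps debug info and disables optimization.

    Hypothetical helper: the defaults (entry point "main", profile ps_3_0,
    ".pso" output extension) are illustrative assumptions, not from the thread.
    """
    if output is None:
        # Put the compiled shader next to the source, e.g. water.hlsl -> water.pso
        output = os.path.splitext(source)[0] + ".pso"
    return [
        "fxc",
        "/T", profile,      # target profile (shader model 3.0 pixel shader by default)
        "/E", entry_point,  # entry-point function name
        "/Zi",              # include debug information
        "/Od",              # disable optimizations so stepping follows the source
        "/Fo", output,      # output object file
        source,
    ]


if __name__ == "__main__":
    # Print the command; pass the list to subprocess.run() on a machine with fxc installed.
    print(" ".join(fxc_debug_command("water.hlsl", entry_point="main_ps")))
```

This does not address the "Shader debugging could not be enabled" error above; it only affects what you see once the debugger opens.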
