#ActualCryZe

Posted 21 December 2012 - 03:53 AM

I would only use the apron solution for non-separable kernels or for kernels that are too large to fit into groupshared memory. With aprons, the apron pixels get sampled multiple times, once by each thread group whose tile overlaps them, so you're losing performance there. That is not the case in the solution where every thread simply calculates multiple pixels.
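
To put a rough number on that overhead, here is a minimal sketch that counts texture fetches for one row of a 1D blur pass. It is only a counting model, not shader code, and it assumes that in the multiple-pixels-per-thread variant a single thread group loads the whole row into groupshared memory exactly once. The function names and the values for `width`, `tile`, `radius` and `threads` are made up for illustration.

```python
import math

def apron_fetches(width, tile, radius):
    # Each group outputs `tile` pixels but also loads `radius` apron
    # pixels on each side, so neighbouring groups fetch overlapping pixels.
    groups = math.ceil(width / tile)
    return groups * (tile + 2 * radius)

def whole_row_fetches(width):
    # Assumption: one group loads the entire row once, no apron needed.
    return width

def pixels_per_thread(width, threads):
    # How many pixels each thread has to handle in the whole-row variant.
    return math.ceil(width / threads)

# Hypothetical numbers: 1920 pixel row, 128 output pixels per group,
# blur radius 8, 128 threads per group.
width, tile, radius, threads = 1920, 128, 8, 128

print(apron_fetches(width, tile, radius))   # 15 groups * 144 fetches = 2160
print(whole_row_fetches(width))             # 1920
print(pixels_per_thread(width, threads))    # 15
```

With these made-up numbers the apron variant does 240 extra fetches per row (12.5% more), and the gap grows with the radius. The whole-row variant only works as long as the row actually fits into groupshared memory.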

