Some time ago, maybe last year, I read a post here at gamedev containing a very cool matrix implemenation in C++ (or a link to that implementation). The cool stuff was accessing both row and column vectors without changing the "simple" memory layout (e.g. a 4x4 matrix needed only 16 * sizeof(float)). IIRC it used function pointers or functors internally.
There was a lengthy discussion with some guys trying to be smarter... but their proposals required additional data members.
My problem: I can't find that post or thread anymore. If your memory or searching skills are better than mine... can you give me a hint?