Question about VSM (variance shadow mapping)

Hello, in the variance shadow mapping technique, the depth map stores depth in one channel and depth*depth in another. Why do I need to store depth*depth? Can't I just compute it when I fetch the depth from the first channel? It is just depth*depth, after all. Also, the variance itself is computed as v^2 = moment.y - moment.x^2, but this seems to always give 0, since moment.x is depth and moment.y is depth*depth. Could somebody enlighten me about this?
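
To make the question concrete, here is a small sketch of the computation in Python. It assumes the two channels are read back after the shadow map has been filtered (blurred, mipmapped, or bilinearly sampled), so each channel holds an average over several texels; the helper names `moments` and `variance` are made up for illustration. With a single unfiltered texel the formula does give 0, but once two different depths are averaged, the mean of depth*depth no longer equals the square of the mean depth.

```python
def moments(depths):
    """Average depth and depth*depth over a filter footprint.

    Hypothetical stand-in for what texture filtering does to the two
    channels of the shadow map: each channel is averaged independently.
    """
    n = len(depths)
    m1 = sum(depths) / n                 # moment.x: mean of depth
    m2 = sum(d * d for d in depths) / n  # moment.y: mean of depth*depth
    return m1, m2


def variance(m1, m2):
    """Variance as in the question: moment.y - moment.x^2."""
    return m2 - m1 * m1


# One unfiltered texel: moment.y is exactly moment.x squared.
single = moments([0.5])

# Footprint covering two texels at different depths (e.g. a shadow edge).
mixed = moments([0.25, 0.75])

print(variance(*single))  # 0.0
print(variance(*mixed))   # 0.0625
```

In the second case, squaring the fetched mean depth (0.5) would give 0.25, while the stored and filtered second channel gives 0.3125, which is why recomputing depth*depth at fetch time is not equivalent under this assumption.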

