I didn't mean that a programmer could achieve better results through hand coded assembly, I think the point that I and those commenters were making is that unless you post the output assembly for the benchmarks, there's no way of knowing if the computer is really doing the same thing in both cases. If I understood correctly what people were saying was because the benchmark test was just buffering a string and never actually doing anything with it, that the JIT compiler was likely optimizing out the entire benchmark test. However, without the raw assembly output, we'll never know. That's all I was saying.
That still proves the point even if that's the case... the fact that a JIT compiler could make such a marked optimization that a C compiler cannot. Imagine the compounded result of tons of major optimizations, just from the compiler, in the context of a huge program.