According to the hardware orders page[1], a major bottleneck is page rendering.
Does any phase of this stand out? DB fetch? Text parse? Request/Response bandwidth?
I assume profiling on this has been done quite a lot...
[1] http://meta.wikimedia.org/wiki/Hardware_ordered_August_30%2C_2005