1. Nope, the bandwidth you're talking about is between the Graphics card's GPU and its RAM.
(A simple analogy you can think of is the graphics card being a motherboard by itself, with the GPU acting as a CPU and the VRAM acting as regular RAM.)
A graphics card communicates with the CPU via the PCI-Express/AGP bus and transfers only a modest amount of data to and from the CPU. A PCI-E 16x channel (which most Gfx cards use) offers a max bandwidth of 4GB/sec (8GB/sec bidirectional) and that at the moment doesnt seem to be choking any of the graphics cards.
2. A wider HTT obviously offers more bandwidth
. 4/4 8/8 16/16 refers to how wide the HTT bus is in bits. However i recall in earlier times where people would go as low as 400Mhz from the default 1Ghz on the HTT and see a very negligible performance loss.
3. Theoretically yes. Its like how a dual-core CPU with 2 cores running at 1.8 Ghz can be compared to a single-core CPU running at 3.6 Ghz.
Of course in reality, efficiencies are lower than a two-fold increase and largely depends on how good the memory controller is at utilising dual-channels.
*A point to note, while DDR2 @ 800Mhz would be equal to 2 x DDR1 sticks @ 400Mhz , the DDR2 stick achieves such high speeds at the cost of additional latencies. If you had a single DDR1 stick capable of 800Mhz speeds, it would be faster than the DDR2 stick - of course since thats not possible, the DDR2 stick outclasses the DDR1 sticks by its higher clocks and offsets the latency disadvantage.
4. Yep, the bandwidth of the HTT applies to the entire southbridge (when communicating with the CPU - though i have no idea if devices on the southbridge would communicate between themselves :ashamed: ).