> In my non-DMA UMA hardware (sis), upload with sse peaks at about > 540MB/sec, while download peaks at about 50MB/sec (naturally regardless > whether SSE or MMX or whatever). What occurs if you mark the frame buffer memory cachable and then flush the data off the CPU by hand ? That *seemed* to be working on the VIA when I tried abusing it