Annotated i965_prepare_composite ready for optimization

Michel Dänzer michel at tungstengraphics.com
Fri Jun 22 03:35:28 PDT 2007


On Thu, 2007-06-21 at 15:47 -0700, Carl Worth wrote:
> As feedback to a recent blog post[*] I made showing
> i965_prepare_composite to be a bottleneck in a Mozilla rendering
> benchmark with an i965 and with EXA, I received a kind message cluing
> me into opannotate, (thanks Roland!).
> 
> Here's a new entry explaining what I learned:
> 
> 	http://www.cworth.org/exa/opannotate_i965_prepare_composite/
> 
> and here are the annotated source and assembly versions of the
> functions of interest:
> 
> 	http://www.cworth.org/exa/i965_prepare_composite.source_annotate
> 
> 	http://www.cworth.org/exa/i965_prepare_composite.assembly_annotate
> 
> I'd greatly appreciate any ideas for improving the performance of this
> function.

The state memory is probably in the AGP aperture, so uncacheable. It
should also be write-combined though, so an easy improvement could be to
initialize a malloced copy and then memcpy it to the final destination
in one go.


-- 
Earthling Michel Dänzer           |          http://tungstengraphics.com
Libre software enthusiast         |          Debian, X and DRI developer




More information about the xorg mailing list