Annotated i965_prepare_composite ready for optimization
Michel Dänzer
michel at tungstengraphics.com
Fri Jun 22 03:35:28 PDT 2007
On Thu, 2007-06-21 at 15:47 -0700, Carl Worth wrote:
> As feedback to a recent blog post[*] I made showing
> i965_prepare_composite to be a bottleneck in a Mozilla rendering
> benchmark with an i965 and with EXA, I received a kind message cluing
> me into opannotate, (thanks Roland!).
>
> Here's a new entry explaining what I learned:
>
> http://www.cworth.org/exa/opannotate_i965_prepare_composite/
>
> and here are the annotated source and assembly versions of the
> functions of interest:
>
> http://www.cworth.org/exa/i965_prepare_composite.source_annotate
>
> http://www.cworth.org/exa/i965_prepare_composite.assembly_annotate
>
> I'd greatly appreciate any ideas for improving the performance of this
> function.
The state memory is probably in the AGP aperture, and therefore
uncacheable, which makes scattered field-by-field accesses slow. It
should also be write-combined though, so an easy improvement could be
to initialize a malloced copy and then memcpy it to the final
destination in one go.
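
A rough sketch of that idea, not taken from the driver: struct
fake_state and write_state_staged() are made-up names, the fields are
placeholders rather than the real i965 state layout, and a local
variable stands in for the malloced copy. All field writes go to
ordinary cached memory, and the destination only sees one sequential
memcpy.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a block of hardware state.
 * The real i965 state structures look different. */
struct fake_state {
    uint32_t surface_base;
    uint32_t width;
    uint32_t height;
    uint32_t pitch;
};

/* Fill in the state in normal (cached) memory first, then copy the
 * finished block to its final destination in a single memcpy.
 * 'dest' would point into the uncached, write-combined mapping;
 * here it is just a plain pointer. */
static void
write_state_staged(void *dest, uint32_t base, uint32_t width,
                   uint32_t height, uint32_t pitch)
{
    struct fake_state staged;

    assert(dest != NULL);

    /* All of these writes hit cached memory. */
    memset(&staged, 0, sizeof(staged));
    staged.surface_base = base;
    staged.width = width;
    staged.height = height;
    staged.pitch = pitch;

    /* One sequential burst to the destination. */
    memcpy(dest, &staged, sizeof(staged));
}
```

Whether this actually helps would have to be measured with the same
profile; the sketch only shows the shape of the change.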
--
Earthling Michel Dänzer | http://tungstengraphics.com
Libre software enthusiast | Debian, X and DRI developer