How to calculate Gflops of a kernel

First, some general remarks. What you are doing is mostly an exercise in futility and is the reverse of how most people would probably go about performance analysis. The first point to make is that the peak value you are quoting is strictly for floating point multiply-add instructions (FMAD), which count as … Read more

Alternatives to gprof [closed]

gprof (read the paper) exists for historical reasons. If you think it will help you find performance problems, it was never advertised as such. Here’s what the paper says: The profile can be used to compare and assess the costs of various implementations. It does not say it can be used to identify the various … Read more