clang 14.0.0 floating point optimizations
This is a pretty deep rabbit hole, and I don’t know if I’ve explored all of its twists and turns yet. But here’s a first draft of an answer; suggestions for improvement are welcome. At its core, the culprit is the so-called “fused multiply-add” (or, in this case, a fused multiply-subtract). Fused multiply-add is a … Read more