I’ve added another document to the technical articles I’m writing on my wiki.
This one is about why I modified the SSE fast sine approximation, in order to squeeze a tiny bit more out of it.
<TL;DR> Rather than a general purpose sine function which takes an input in radians and returns a value in the range ±1. By modifying the coefficients in the approximation, I get a function which takes an input in cycles and returns a value in the range ±5. Which saves me two multiplications.