* removed fma dispatch * added comments to explain why remove fma * fix lint * use fmuladd intrin for fma dispatch