User:Everton

From Eigen
Revision as of 18:24, 17 August 2021 by Everton (Talk | contribs)

  • New Power 10 MMA

Initial support for the Power 10 Matrix-Multiply Assist (MMA) instructions, covering real and complex matrices in float32 and float64.

  • Altivec/Power improvements

General performance improvements and bug fixes, with better vectorization. The gebp_kernel changes are specific to Altivec: a VSX implementation of the MMA functions yields speedups of up to 4x.
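
As a rough illustration of the kind of operation these kernels are built around, the sketch below accumulates a 4x4 float32 tile of a matrix product as a sequence of rank-1 (outer-product) updates, which is the pattern the MMA instructions accelerate in hardware. It is plain scalar C++ and not Eigen's gebp_kernel; the names `Tile`, `rank1_update` and `tile_product`, the 4x4 tile size, and the packed panel layout are all assumptions made for this example.

```cpp
#include <array>
#include <cstddef>

// Hypothetical names and layout; a scalar sketch, not Eigen's actual kernel.
constexpr std::size_t kTile = 4;
using Vec4 = std::array<float, kTile>;
using Tile = std::array<std::array<float, kTile>, kTile>;

// acc += a * b^T: one rank-1 (outer-product) update of the 4x4 accumulator.
inline void rank1_update(Tile& acc, const Vec4& a, const Vec4& b) {
  for (std::size_t i = 0; i < kTile; ++i)
    for (std::size_t j = 0; j < kTile; ++j)
      acc[i][j] += a[i] * b[j];
}

// Computes one 4x4 tile of C = A * B from packed panels (assumed layout):
//   a_panel[k * 4 + i] holds A(i, k), b_panel[k * 4 + j] holds B(k, j).
inline Tile tile_product(const float* a_panel, const float* b_panel,
                         std::size_t depth) {
  Tile acc{};  // value-initialized, so the accumulator starts at zero
  for (std::size_t k = 0; k < depth; ++k) {
    Vec4 a{}, b{};
    for (std::size_t i = 0; i < kTile; ++i) {
      a[i] = a_panel[k * kTile + i];
      b[i] = b_panel[k * kTile + i];
    }
    rank1_update(acc, a, b);
  }
  return acc;
}
```

Calling `tile_product(a, b, depth)` with two packed panels of `4 * depth` floats each returns the accumulated tile. A vectorized kernel would keep the accumulator in registers across the whole loop over `depth`, which is why the operation is organized as repeated rank-1 updates rather than row-by-column dot products.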