User:Everton

From Eigen
Revision as of 18:39, 17 August 2021 by Everton (Talk | contribs)

Jump to: navigation, search

- New Power 10 MMA

  • Initial support for Power 10 matrix multiplication assist instructions for float32, float64 real and complex.

- Altivec/Power improvements

  • General performance improvement and bugfixes.
  • Enhanced vectorization of current real and complex scalars.
  • Changes to the gebp_kernel specific to Altivec, using VSX implementation of the MMA functions to gain speed improvements up to 4x.
  • Dynamic disspatch for GCC greater than 10 enabling selection of MMA or VSX instructions based on __builtin_cpu_supports.