This page compares the best-performing BLAS/LAPACK implementations with Eigen2 on an Intel(R) Pentium(R) D CPU. By "best performing" I mean Intel MKL, GotoBLAS, ATLAS, and ACML (by AMD). Other BLAS/LAPACK implementations are notably slower (see benchmark) and are therefore not worth testing or using. ACML was not included in the original benchmark.

13 February 2009


  • CPU: Intel(R) Pentium(R) D CPU 3.00GHz (model name: Prescott)
  • Compiler: gcc version 4.3.3 [gcc-4_3-branch revision 143673] (OpenSUSE 11.1 64-bit)


  • Eigen2: revision 939062, with SSE2 and SSE3 enabled
  • ACML: version 4.2.0 compiled with gfortran
  • MKL: version 10.1
  • GotoBLAS: version 1.26
  • ATLAS: version 3.8.3

Benchmark plots:

  • Axpy intel.png
  • Axpby intel.png
  • Atv intel.png
  • Matrix vector intel.png
  • Matrix matrix intel.png
  • Symv intel.png
  • Syr2 intel.png
  • Aat intel.png
  • Ata intel.png
  • Trisolve intel.png
  • Cholesky intel.png
  • Hessenberg intel.png
  • Tridiagonalization intel.png
  • Lu decomp intel.png
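
The plot names above are terse, so here is a minimal sketch of what a few of the benchmarked kernels compute. It assumes the names follow the usual BLAS conventions (axpy, syr2, and so on); the function names and the list-of-lists matrix layout are illustrative only and are not taken from the benchmark code. These are unoptimized reference definitions, nothing like what Eigen2 or the BLAS libraries actually execute.

```python
# Reference definitions of some benchmarked kernels in plain Python.
# Assumption: plot names map onto the usual BLAS-style operations.

def axpy(a, x, y):
    """Return a*x + y for scalar a and vectors x, y."""
    return [a * xi + yi for xi, yi in zip(x, y)]

def axpby(a, x, b, y):
    """Return a*x + b*y for scalars a, b and vectors x, y."""
    return [a * xi + b * yi for xi, yi in zip(x, y)]

def matrix_vector(A, v):
    """Return A*v, with A stored as a list of rows."""
    return [sum(aij * vj for aij, vj in zip(row, v)) for row in A]

def atv(A, v):
    """Return transpose(A)*v."""
    A_t = [list(col) for col in zip(*A)]  # explicit transpose
    return matrix_vector(A_t, v)

def syr2(alpha, x, y, A):
    """Return A + alpha*(x*y^T + y*x^T), a symmetric rank-2 update."""
    n = len(x)
    return [[A[i][j] + alpha * (x[i] * y[j] + y[i] * x[j])
             for j in range(n)]
            for i in range(n)]
```

For example, `axpy(2.0, [1.0, 2.0], [10.0, 20.0])` evaluates 2*x + y element by element. The level-1 kernels (axpy, axpby) do O(n) work on O(n) data, whereas matrix-matrix products do O(n^3) work on O(n^2) data, which is one reason the plots can differ so much between operations.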