What's in AOCL 4.0:

1. amdblis
   - LPGEMM variants with post-ops support
   - AMD "Zen4" support for BLIS
2. amdlibflame
   - Upgrade to LAPACK 3.10.1 specification
   - Improvements in a few more variants of SVD and Eigen Value routines
   - Multithread support enabled for selected APIs
3. amdfftw
   - AVX-512 enablement of DFT kernels
   - AVX-512 optimization of copy and transpose routines
5. amdlibm
   - Black & Scholes support (logf, expf, erff, both scalar and vector)
   - AVX-512 variants of vector functions
6. aocl-sparse
   - New Iterative Solver APIs
   - AVX-512 support for SPMV API
7. amdscalapack
   - Upgrade to Netlib ScaLAPACK 2.2.0

Co-authored-by: Massimiliano Culpo <massimiliano.culpo@gmail.com>