Releases
v6.1.135
Algorithms
New features
Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetConvolution32fBf16NhwcGemm.
AMX-BF16 optimizations of function Float32ToBFloat16.
Support of SimdSynetUnaryOperation32fCos in function SynetUnaryOperation32f.
Support of SimdSynetUnaryOperation32fSin in function SynetUnaryOperation32f.
Bug fixing
Error in function SimdCpuInfo (wrong AMX-BF16 detection).
Error in AVX-512BF16 optimization of function Float32ToBFloat16.
Error in AMX initialization in function AmxBf16::SupportedByOS.
Crash in function AmxBf16::ConvolutionBf16NhwcConv_2.
Error in Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cdc.
Error in Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cd.
Error in Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Dc.
Removing
AVX-512BF16 optimizations of function Float32ToBFloat16.
AVX-512BF16 optimizations of class SynetConvolution32fBf16Nhwc.
AVX-512BF16 optimizations of class SynetMergedConvolution32fBf16Cdc.
AVX-512BF16 optimizations of class SynetMergedConvolution32fBf16Cd.
AVX-512BF16 optimizations of class SynetMergedConvolution32fBf16Dc.
Separate support of AVX-512BF16 extension (it is now supported only together with AMX-BF16).
Test framework
Bug fixing
Error in test of SynetMergedConvolution32f framework.
Infrastructure
Removing
Avx512Bf16 project for MSVS-2022.
Avx512Bf16 project for MSVS-2019.
Avx512Bf16 project for MSVS-2015.
Avx512Bf16 project for MSVS-2017.
Avx512Bf16 project for CMake.