16 Mar 01:31

tprimak

7de7e5d

v0.18.1

This is a patch release containing following changes to Intel MKL-DNN v0.18.0:

Fix bug in build system to do not break transitive linking when being used as a subproject (245b331)
Fix fix bias conversion in int8 gemm-based convolution (9670998)

Assets 2

09 Mar 00:19

tprimak

v1.0-pc

002d23b

v1.0-pc Pre-release

Pre-release

This is preview candidate for MKL-DNN v1.0.

The preview candidate implements changes announced in v1.0 RFC. Please provide feedback and report bugs in Github issues.

Assets 2

02 Mar 00:07

vpirogov

v0.18

863ff6e

v0.18

Performance optimizations

Improved RNN functionality performance.
Improved performance of GEMM-based convolutions
Improved performance of backpropagation for stided convolutions on processors with Intel® AVX2 support.
Improved performance of the gemm_s8u8s32 and gemm_s8s8s32 functions on processors with Intel® AVX512 and Intel® AVX512-DL Boost instruction sets.
Improved inner product performance on processors with Intel AVX512 and Intel AVX512-DL Boost instruction sets.
Improved performance of int8 convolutions and deconvolutions on processors with Intel AVX512 and Intel AVX512-DL Boost instruction sets.

New functionality

Convolutions support arbitrary elementwise operations in postops.
Introduced support of signed int8 data for the inner product primitive.
Introduced int8 LSTM cell support.
Introduced automatic dispatching between the direct and Winograd convolution algorithms.

API deprecations and breaking changes

Previously deprecated APIs were removed:
- relu function
- convolution_relu function
- double precision scales support in sum
- negative_slope parameter in eltwise
- omit_stats flag in batch normalization

Usability improvements

Added library version information to verbose output and to headers.
Added information about detected instruction set to verbose output.
Introduced mkldnn_version function.
Added APIs to override behaviors controlled via environment variables, including verbose mode and JIT dump.

Thanks to the contributors

This release contains contributions from many Intel Performance Libraries developers as well as Ruslan Baratov @ruslo, Konstantin Basargin @basargin, Jacek Czaja @jczaja, Eugene Zhulenev @ezhulenev, Haitao Feng @fenghaitao, Yinghai Liu @yinghai, Masahiro Sakai @msakai, and Alexander Grund @Flamefire. We would also like to thank everyone who asked questions and reported issues.

Assets 5

12 Feb 21:55

tprimak

v0.17.4

722901c

v0.17.4

This is a patch release containing following changes to Intel MKL-DNN v0.17.3:

Fix bug in build system for old versions of CMake (61f953e)

Assets 5

08 Feb 03:01

tprimak

v0.18-rc

08bd90c

v0.18-rc Pre-release

Pre-release

This is a release candidate package for MKL-DNN v0.18. Please provide feedback and report bugs in Github issues.

Assets 5

01 Feb 00:58

tprimak

v0.17.3

0c3cb94

v0.17.3

This is a patch release containing following changes to MKL-DNN v0.17.2:

Fix integer overflow in GEMM (059b5fd)
Update Xbyak* to 5.751 (4f809d0)

Assets 5

20 Dec 02:47

tprimak

v0.17.2

b9ce57a

v0.17.2

This is a patch release containing following changes to MKL-DNN v0.17.1:

Fix data race during initialization in the GEMM-based convolution (763513e)
Fix number of dimensions of a tensor in the backward deconvolution primitive descriptor (5a0a50c)
Fix Valgrind* complaints (ed4b08c)

Assets 8

29 Nov 00:32

tprimak

v0.17.1

a7c5f53

v0.17.1

This is a patch release containing following change to MKL-DNN v0.17:

Tentatively turn on reference direct copy reorder for GNU* Compiler Collection (567dfb5)

Assets 5

19 Nov 20:09

tprimak

v0.17

830a100

v0.17

Performance optimizations

Improved int8 convolutions performance on processors with Intel® AVX512-DL Boost instruction set support.
Improved performance of fp32 convolutions with number of input and output channels not divisible by the SIMD width for processors with Intel® AVX2 instruction set support.
Improved performance of Recurrent Neural Networks (RNNs) functionality.
Improved performance of int8 deconvolution.
Added optimizations for fp32 inference and training for processors with Intel® AVX instruction set support.
Added optimizations for convolutions and auxiliary primitives with 3D spatial data for processors with Intel® AVX2 instruction set support.
Improved int8 Winograd convolution performance for real-time inference use cases.

New functionality

Introduced int8 data-type support for inner-product primitive.
Introduced support for int8 convolutions with signed input and signed weights.
Introduced 1D spatial data support in convolution and auxiliary primitives. This functionality is optimized for processors with Intel® AVX512 instruction set support.
Introduced the Shuffle primitive.
Introduced a general-purpose matrix-matrix multiplication function for int8 data (gemm_s8u8s32 and gemm_s8s8s32).
Feature preview: Threading Building Blocks (TBB) support.

API deprecations and breaking changes

Order of the gates for LSTM cells was changed to input, forget, candidate, output. This might produce incorrect results.
Backward RNN primitive creation without the hint in C++ is deprecated.
Int8 Winograd convolution behavior with respect to scales is aligned with the direct convolution algorithm.

Usability improvements

Primitives now accept tensors with 0 for the dimension and do nothing in that case.
Added support for clang sanitizers.
Build system extended with the following capabilities:
- Allow building with static Intel MKL by passing -DMKLDNN_USE_MKL=FULL:STATIC to cmake
- Allow specifying the Intel MKL to use by passing -DMKLDNN_USE_MKL={DEF,NONE,ML,FULL} to cmake for that
- Allow using the compiler's OpenMP RT by passing -DMKLDNN_THREADING=OMP:COMP to cmake for that
- Allow building a static library by passing -DMKLDNN_LIBRARY_TYPE=STATIC to cmake

Thanks to the contributors

This release contains contributions from many Intel Performance Libraries developers as well as Dmitry Baksheev @dbakshee, Yuta Okamoto @okapies, and Eduardo Gonzalez @wmeddie. We would also like to thank everyone who asked questions and reported issues.

*Other names and brands may be claimed as the property of others.

Assets 5

02 Nov 04:33

tprimak

v0.17-rc

21fb5f2

v0.17-rc Pre-release

Pre-release

This is a release candidate package for MKL-DNN v0.17. It is made available for testing by the community. Please provide feedback and report bugs in Github issues.

Assets 5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This is preview candidate for MKL-DNN v1.0.

Performance optimizations

New functionality

API deprecations and breaking changes

Usability improvements

Thanks to the contributors

Performance optimizations

New functionality

API deprecations and breaking changes

Usability improvements

Thanks to the contributors

Releases: oneapi-src/oneDNN

v0.18.1

v1.0-pc

This is preview candidate for MKL-DNN v1.0.

v0.18

Performance optimizations

New functionality

API deprecations and breaking changes

Usability improvements

Thanks to the contributors

v0.17.4

v0.18-rc

v0.17.3

v0.17.2

v0.17.1

v0.17

Performance optimizations

New functionality

API deprecations and breaking changes

Usability improvements

Thanks to the contributors

v0.17-rc