diff --git a/CHANGES b/CHANGELOG.md similarity index 87% rename from CHANGES rename to CHANGELOG.md index ceb754700..310397fe1 100644 --- a/CHANGES +++ b/CHANGELOG.md @@ -1,11 +1,20 @@ -Version 2.0.1 (03 Jun 2024) +# Changelog for Omniperf - * new option to force hardware target via OMNIPERF_ARCH_OVERRIDE global (#370) +Full documentation for Omniperf is available at [https://rocm.docs.amd.com/projects/omniperf/en/latest/](https://rocm.docs.amd.com/projects/omniperf/en/latest/). + +## Omniperf 2.0.1 for ROCm 6.2.0 + +### Changes + + * new option to force hardware target via `OMNIPERF_ARCH_OVERRIDE` global (#370) * CI/CD support for MI300 hardware (#373) - * cmake build improvements (#374) * support for MI308X hardware (#375) -Version 2.0.0 (17 May 2024) +### Optimizations + + * cmake build improvements (#374) + +## Omniperf 2.0.0 (17 May 2024) * improved logging than spans all modes (#177) (#317) (#335) (#341) * overhauled CI/CD that spans all modes (#179) @@ -20,7 +29,7 @@ Version 2.0.0 (17 May 2024) * new --quiet option will suppress verbose output and show a progress bar (#308) * roofline support for Ubuntu 22.04 (#319) -Version 1.1.0-PR1 (13 Oct 2023) +## Omniperf 1.1.0-PR1 (13 Oct 2023) * standardize headers to use 'avg' instead of 'mean' * add color code thresholds to standalone gui to match grafana @@ -29,11 +38,11 @@ Version 1.1.0-PR1 (13 Oct 2023) * patch grafana plugin module to address a known issue in the latest version (#186) * enhanced communication between analyze mode kernel flags (#187) -Version 1.0.10 (22 Aug 2023) +## Omniperf 1.0.10 (22 Aug 2023) * critical patch for detection of llvm in rocm installs on SLURM systems -Version 1.0.9 (17 Aug 2023) +## Omniperf 1.0.9 (17 Aug 2023) * add units to L2 per-channel panel (#133) * new quickstart guide for Grafana setup in docs (#135) @@ -45,7 +54,7 @@ Version 1.0.9 (17 Aug 2023) * general metric updates and enhancements (#144, #155, #159) * add min/max/avg breakdown to instruction mix panel (#154) -Version 1.0.8 (30 May 2023) +## Omniperf 1.0.8 (30 May 2023) * add `--kernel-names` option to toggle kernelName overlay in standalone roofline plot (#93) * remove unused python modules (#96) @@ -66,7 +75,7 @@ Version 1.0.8 (30 May 2023) * manually merge rocprof runs, overriding default rocprofiler implementation (#125) * fixed compatibility issues with Python 3.11 (#131) -Version 1.0.8-PR2 (17 Apr 2023) +## Omniperf 1.0.8-PR2 (17 Apr 2023) * ux improvements in standalone GUI (#101) * enhanced readability for filtering dropdowns in standalone GUI (#102) @@ -75,14 +84,14 @@ Version 1.0.8-PR2 (17 Apr 2023) * adding dockerfiles for all supported Linux distros * new examples for `--roof-only` and `--kernel` options added to documentation -Version 1.0.8-PR1 (13 Mar 2023) +## Omniperf 1.0.8-PR1 (13 Mar 2023) * add `--kernel-names` option to toggle kernelName overlay in standalone roofline plot (#93) * remove unused python modules (#96) * fix empirical roofline calculation for single dispatch workloads (#97) * match color of arithmetic intensity points to corresponding bw lines -Version 1.0.7 (21 Feb 2023) +## Omniperf 1.0.7 (21 Feb 2023) * update documentation (#52, #64) * improved detection of invalid command line arguments (#58, #76) @@ -96,12 +105,12 @@ Version 1.0.7 (21 Feb 2023) * improve error message when no GPU(s) detected (#85) * separate CI tests by Linux distro and add status badges -Version 1.0.6 (21 Dec 2022) +## Omniperf 1.0.6 (21 Dec 2022) * CI update: documentation now published via github action (#22) * better error detection for incomplete ROCm installs (#56) -Version 1.0.5 (13 Dec 2022) +## Omniperf 1.0.5 (13 Dec 2022) * store application command-line parameters in profiling output (#27) * enable additional normalizations in CLI mode (#30) @@ -112,7 +121,7 @@ Version 1.0.5 (13 Dec 2022) * enhanced subprocess logging to identify critical errors in rocprofiler (#50) * maintain git sha in production installs from tarball (#53) -Version 1.0.4 (11 Nov 2022) +## Omniperf 1.0.4 (11 Nov 2022) * update python requirements.txt with minimum versions for numpy and pandas * addition of progress bar indicator in web-based GUI (#8) @@ -121,6 +130,6 @@ Version 1.0.4 (11 Nov 2022) * variety of documentation updates * added an optional argument to vcopy.cpp workload example to specify device id -Version 1.0.3 (07 Nov 2022) +## Omniperf 1.0.3 (07 Nov 2022) * initial Omniperf release