Releases: ROCm/rocprofiler-compute
v2.0.0 Tech Preview #1 (03 March 2024)
This is a tech preview release for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.
Tech Preview documentation available at: https://rocm.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-Tech-Preview1.tar.gz
v1.1.0-PR1 (13 October 2023)
Updates
- standardize headers to use 'avg' instead of 'mean'
- add color code thresholds to standalone gui to match grafana
- modify kernel name shortener to use cpp_filt (#168)
- enable stochastic kernel dispatch selection (#183)
- patch grafana plugin module to address a known issue in the latest version (#186)
- enhanced communication between analyze mode kernel flags (#187)
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.1.0-PR1.tar.gz
v1.0.10 (22 August 2023)
Updates
- critical patch for detection of llvm in rocm installs on SLURM systems
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.10.tar.gz
v1.0.9 (17 August 2023)
Updates
- add units to L2 per-channel panel (#133)
- new quickstart guide for Grafana setup in docs (#135)
- more detail on kernel and dispatch filtering in docs (#136, #137)
- patch manual join utility for ROCm >5.2.x (#139)
- add % of peak values to low level speed-of-light panels (#140)
- patch critical bug in Grafana by removing a deprecated plugin (#141)
- enhancements to KernelName demangeler (#142)
- general metric updates and enhancements (#144, #155, #159)
- add min/max/avg breakdown to instruction mix panel (#154)
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.9.tar.gz
v1.0.8 (30 May 2023)
PR1 Updates
- add
--kernel-names
option to toggle kernelName overlay in standalone roofline plot (#93) - remove unused python modules (#96)
- fix empirical roofline calculation for single dispatch workloads (#97)
- match color of arithmetic intensity points to corresponding bw lines
PR2 Updates
- ux improvements in standalone GUI (#101)
- enhanced readability for filtering dropdowns in standalone GUI (#102)
- new logfile to capture rocprofiler output (#106)
- roofline support for sles15 sp4 and future service packs (#109)
- adding dockerfiles for all supported Linux distros
- new examples for
--roof-only
and--kernel
options added to documentation
Additional Updates
- enable cli analysis in Windows (#110)
- optional random port number in standalone GUI (#111)
- limit length of visible kernelName in
--kernel-names
option (#115) - adjust metric definitions (#117, #130)
- manually merge rocprof runs, overriding default rocprofiler implementation (#125)
- fixed compatibility issues with Python 3.11 (#131)
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.8.tar.gz
v1.0.8-PR2 (17 April 2023)
Updates
- ux improvements in standalone GUI (#101)
- enhanced readability for filtering dropdowns in standalone GUI (#102)
- new logfile to capture rocprofiler output (#106)
- roofline support for sles15 sp4 and future service packs (#109)
- adding dockerfiles for all supported Linux distos
- new examples for
--roof-only
and--kernel
options added to documentation
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.8-PR2.tar.gz
v1.0.8-PR1 (13 March 2023)
Updates
- add
--kernel-names
option to toggle kernelName overlay in standalone roofline plot (#93) - remove unused python modules (#96)
- fix empirical roofline calculation for single dispatch workloads (#97)
- match color of arithmetic intensity points to corresponding bw lines
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.8-PR1.tar.gz
v1.0.7 (22 February 2023)
Updates
- update documentation (#52, #64)
- improved detection of invalid command line arguments (#58, #76)
- enhancements to standalone roofline (#61)
- enable Omniperf on systems with X-server (#62)
- raise minimum version requirement for rocm (#64)
- enable baseline comparison in CLI analysis (#65)
- add multi-normalization to new metrics (#68, #81)
- support alternative profilers (#70)
- add MI100 configs to override rocprofiler's incomplete default (#75)
- improve error message when no GPU(s) detected (#85)
- separate CI tests by Linux distro and add status badges
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.7.tar.gz
v1.0.6 (21 December 2022)
Updates
- CI update: documentation now published via github action (#22)
- better error detection for incomplete ROCm installs (#56)
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.6.tar.gz
v1.0.5 (13 December 2022)
Updates
- store application command-line parameters in profiling output (#27)
- enable additional normalizations in CLI mode (#30)
- add missing ubuntu 20.04 roofline binary to packaging (#34)
- update L1 bandwidth metric calculations (#36)
- add L1 <-> L2 bandwidth calculation (#37)
- documentation updates (#38, #41)
- enhanced subprocess logging to identify critical errors in rocprofiler (#50)
- maintain git sha in production installs from tarball (#53)
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.5.tar.gz