Harmonize

Harmonize is an execution framework for GPU with the aims of increasing performance and decreasing development burden for applications requiring complex processing patterns. Harmonize is currently available both as a headers-only CUDA/HIP C++ library and as a Python package. In both forms, functionality is exposed as an asynchronous processing framework.

Fundamentals

The system of asynchronous functions that are executed within a given runtime are defined in advance through a program specification. A program specification only represents these systems abstractly, but they may be transformed into an implementation by declaring a program specialization.

Program specifications are defined by application developers, and represent the business logic of what needs to be accomplished in the GPU, whereas the templates (program types) used to transform them into specializations are defined by framework developers. This structure is used because it separates much of the underlying implementation details away from application developers, delegating those concerns to the framework developers. Ideally, with the addition of a program type, transitioning a codebase to this other program could be as little as a one-line refactor:

< using MyProgram = EventProgram<MyProgramSpec>;
> using MyProgram = AsyncProgram<MyProgramSpec>;

Likewise, as long as the interface between the specification and the program type does not change, program types may be updated without requiring refactors from the application developers.

Why Async?

Asynchronous programming represents a looser contract between program and execution environment. Once a function is called, there is no guarantee of where or when that call is actually evaluated. If the call is lazy, then the evaluation may not happen at all.

This looser contract is useful, because it allows an interface to represent a wide variety of execution strategies without breaking any promises. By representing all program types as different asynchronous runtimes, they can be used interchangeably as long as they fulfill the few promises made by the runtime's interface.

Dependencies

CUDA C++ Dependencies

The C++ framework currently requires the following for CUDA:

a CUDA compiler (e.g. nvcc)
the CUDA runtime (host and device)

HIP C++ Dependencies

The C++ framework currently requires the following for ROCM:

a HIP compiler (e.g. hipcc)
the ROCM runtime (host and device)

Python Dependencies

Python bindings require:

Non-package Dependencies
- CUDA execution requires the CUDA toolkit, including:
  - nvcc
  - CUDA runtime (host and device)
- ROCM execution requires the ROCM 6.0.0 toolkit, including its version of:
  - hipcc
  - clang-offload-bundler
  - llvm-link
  - llvm-as
  - HIP runtime (host and device)
Python Package Dependencies
- numpy
- llvmlite
- For CUDA:
  - numba
- For ROCM:
  - the AMD fork of numba

Name		Name	Last commit message	Last commit date
Latest commit History 97 Commits
examples		examples
harmonize		harmonize
.gitignore		.gitignore
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Harmonize

Fundamentals

Why Async?

Dependencies

CUDA C++ Dependencies

HIP C++ Dependencies

Python Dependencies

About

Releases 1

Packages

Contributors 2

Languages

License

CEMeNT-PSAAP/harmonize

Folders and files

Latest commit

History

Repository files navigation

Harmonize

Fundamentals

Why Async?

Dependencies

CUDA C++ Dependencies

HIP C++ Dependencies

Python Dependencies

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages