Balar mmio with Vanadis #2428

William-An · 2024-12-12T19:36:04Z

Balar mmio with Vanadis

Add support for using Balar as an MMAP device for Vanadis to access
Create a custom CUDA runtime lib to run CUDA programs with Vanadis
Add more CUDA runtime API to support rodinia-2.0 benchmark
Add more unit test test cases

* Add a new CUDA API id "GPU_PARAM_CONFIG" to support querying kernel function argument size and alignment information from GPGPU-Sim. * Add param "cuda_executable" to BalarMMIO so that it can know the CUDA binary path when running LLVM CUDA code (Vanadis cannot know the host file structure). * Add all the CUDA API implementations needed to link the test program inside tests/vanadisLLVMRISCV. * Minor formatting changes.

…hake to riscv gcc

sst-autotester · 2024-12-12T19:48:27Z

Status Flag 'Pre-Test Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
NO INSPECTION HAS BEEN PERFORMED ON THIS PULL REQUEST! - This PR must be inspected by setting label 'AT: PRE-TEST INSPECTED'.

hughes-c · 2024-12-18T00:50:54Z

src/sst/elements/balar/README.md

@gvoskuilen @feldergast Do we want to keep the prerequisites in this readme or remove them in favor of the list that we test against? Already discussed what testing we want in the nightlies versus weeklies.

hughes-c · 2024-12-18T00:52:45Z

src/sst/elements/balar/README.md


- Tested on commit `0f358dda178f96db3b0da88b2b965492c4be187d`
 - Use `./configure --prefix=$SST_CORE_HOME --disable-mpi --disable-mem-pools` for sst-core config


@William-An Did you test at all with mem pools enabled?

hughes-c · 2024-12-18T01:13:55Z

src/sst/elements/balar/balarMMIO.cc

+    balar->cuda_ret.is_cuda_call_done = false;
+
+    // Create a DMA request to read the cuda call packet from cache to balar
+    DMAEngine::DMAEngineControlRegisters dma_registers;


@William-An Did we discuss putting this in memH or vanadis?
@gvoskuilen

hughes-c · 2024-12-18T01:21:15Z

src/sst/elements/balar/balarMMIO.cc

                            gridDim, 
                            blockDim, 
                            packet->configure_call.sharedMem, 
-                            packet->configure_call.stream
+                            (cudaStream_t) packet->configure_call.stream


Do CUDA streams work in this framework?

hughes-c · 2024-12-18T01:33:21Z

src/sst/elements/balar/balar_packet.h

+        GPU_MALLOC_HOST_RET,
+    };
+
+    // Future: Make this into a class with additional serialization methods?


@gvoskuilen @feldergast Is this going to be necessary for checkpointing/debug?

hughes-c · 2024-12-18T01:42:33Z

src/sst/elements/balar/tests/balarBlock.py

+# Constans shared across components
+network_bw = "25GB/s"
+clock = "2GHz"
+balar_mmio_testcpu_addr = 4096


@William-An How configurable are the mmio addresses?

hughes-c · 2024-12-18T01:43:24Z

src/sst/elements/balar/tests/balarBlock.py

+clock = "2GHz"
+balar_mmio_testcpu_addr = 4096
+balar_mmio_vanadis_addr = 0x80100000
+balar_mmio_size = 1024


What about the mmio sizes?

hughes-c · 2024-12-18T01:50:20Z

src/sst/elements/balar/tests/vanadisHandshake/cuda_runtime_api.h

            uint64_t size;
            uint64_t offset;
+            uint8_t value[200];


@William-An If this is related to the array from above, we should find a way to ensure that this is propagated everywhere that relies on it.

hughes-c · 2024-12-18T01:52:46Z

src/sst/elements/balar/tests/vanadisHandshake/vanadisHandshake.c

@@ -43,7 +48,8 @@ int main( int argc, char* argv[] ) {

    // Preparing the data


Why only five updates? And why is n = 10k?

hughes-c · 2024-12-18T01:57:42Z

src/sst/elements/balar/tests/vanadisLLVMRISCV/balar_vanadis.h

+
+/**
+ * @file cuda_runtime_api.h
+ * @author Weili An (an107@purdue.edu)


@William-An You should probably remove your email address from these unless you want users bugging you directly. ^-^

William-An added 30 commits December 12, 2024 12:42

balar: update readme links to official repos

b7f5568

balar-mmio: add balar to vanadis device list

89b8255

balar-mmio: update custom cuda lib to map balar to vanadis's VM

7ec7c66

balar-mmio: separate data and command interfaces for balar

a492c36

balar-mmio: finish config script for balar+vanadis via mmap

2bc0f37

balar-mmio: refactor config script to use builder class

a977443

balar-mmio: add vanadis test to testsuite

7e956ff

balar-mmio: update testsuite ref files

0384dc8

balar-mmio: update dist files

e9e6cce

balar-mmio: add cuda files for llvm

5243525

balar-mmio: make scratch mem aligned to cache block, temp fix

9b23271

balar-mmio: make a real vecadd summing sin^2 and cos^2

89d5af3

balar-mmio: encode CUDA version information in vanadis binary

7f582a7

balar-mmio: use DMA engine for read/write cuda packets

9cbea2f

balar-mmio: modify launch scripts for use of DMA engine

8c933dc

balar-mmio: update refFile for testcpu

87b6b7a

balar-mmio: update riscv-cuda cxxflags to avoid macro conflict

ba247be

balar-mmio: add support to append app args

9768256

balar-mmio: add support for unaligned cudamemcpy

7cb8eeb

balar-mmio: update unittest to run rodinia benchmark

84854b0

balar-mmio: update readme to cover balar+vanadis and new unittest

b0c6346

balar-mmio: adding placeholder apis

9ba38f9

balar-mmio: move CUDA packet definition to a single file

e678f13

balar-mmio: update readme

7ca59d3

balar-mmio: add additional CUDA APIs for rodinia benchmark

db6b4f0

balar-mmio: fix some formatting warnings

94862fd

balar-mmio: update README on GPU_ARCH env

f8d4955

balar-mmio: remove include to sst/core/simulation.h

9792086

balar-mmio: add mprotect for balar's addr

5c78d6a

William-An added 22 commits December 12, 2024 12:42

balar-mmio: change default isa to riscv and change compiler for hands…

2c29339

…hake to riscv gcc

balar-mmio: update reffile since we are using riscv64

98a41ac

balar-mmio: add more rodinia benchmarks

61c20d2

balar-mmio: add rodinia hotspot reffile

b1e0b15

balar-mmio: update test time limits

c71ee0a

balar-mmio: fix not returning value for cudaMallocHost

4b12eab

balar-mmio: update time limit and reffile for lud 256

58f9098

balar-mmio: add rodinia pathfinder and srad reffiles

94f4262

balar-mmio: split tests into different testsuites

f3fcc21

balar-mmio: fix args passing in testcases

3342466

balar-mmio: increase test run time limit

d725066

balar-mmio: limit nproc when making rodinia

0d0554d

balar-mmio: restructure testsuite

6fbe7a8

balar-mmio: add support for cudaThreadSynchronize

b4b03dc

balar-mmio: create subdirectory for each testcase

66bab78

balar-mmio: add refFiles

df5a6a8

balar-mmio: add support for cudaMemcpyToSymbol

33b012f

balar-mmio: add rodinia heartwall

f1efc17

balar-mmio: add texture api support

65b5917

balar-mmio: add heartwell ref file

6526407

balar-mmio: remove some comments

8459025

balar-mmio: clean up unused files and update dist

cf077bc

hughes-c added Enhancement SST-balar labels Dec 18, 2024

hughes-c added this to the SST v15.0.0 milestone Dec 18, 2024

hughes-c reviewed Dec 18, 2024

View reviewed changes

hughes-c self-assigned this Dec 18, 2024

hughes-c added the SST-vanadis label Dec 18, 2024

hughes-c mentioned this pull request Dec 18, 2024

Balar does not compile as written #2404

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Balar mmio with Vanadis #2428

Balar mmio with Vanadis #2428

William-An commented Dec 12, 2024 •

edited

Loading

sst-autotester commented Dec 12, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024

hughes-c Dec 18, 2024


		- Tested on commit `0f358dda178f96db3b0da88b2b965492c4be187d`
		- Use `./configure --prefix=$SST_CORE_HOME --disable-mpi --disable-mem-pools` for sst-core config

		@@ -43,7 +48,8 @@ int main( int argc, char* argv[] ) {

		// Preparing the data

Balar mmio with Vanadis #2428

Are you sure you want to change the base?

Balar mmio with Vanadis #2428

Conversation

William-An commented Dec 12, 2024 • edited Loading

sst-autotester commented Dec 12, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

William-An commented Dec 12, 2024 •

edited

Loading