RapidStream is fully compatible with TAPA. In this recipe, we illustrate how to create a Xilinx object file (.xo) using TAPA, then optimize the .xo file with RapidStream, and finally use the optimized output in the subsequent Vitis development flow.
Vitis compiled object files (.xo) are IP packages used in the AMD Vitis kernel development flow for programming the programmable logic (PL) region of target devices. These files can be generated from HLS C++ code using the v++ command, packaged from RTL code, or created using third-party frameworks like RapidStream TAPA. In this example, we use RapidStream TAPA to generate the VecAdd.xo file, but the same flow applies to object files generated through other methods.
Since our design uses Xilinx libraries, we need to source the Vitis environment before running the simulation.
source <Vitis_install_path>/Vitis/2023.2/settings64.sh
Before generating the .xo file, we recommend running a C++ simulation to verify the correctness of the design. This step is optional but highly recommended. Run the following command or make csim to perform the C++ simulation:
tapa g++ design/main.cpp design/VecAdd.cpp \
-I /opt/tools/xilinx/Vitis_HLS/2023.2/include \
-o build/run_u55c.py/main.exe -O2
./build/run_u55c.py/main.exe
You should see output similar to the following:
I20241010 15:14:52.494259 4113880 task.h:66] running software simulation with TAPA library
kernel time: 0.0197967 s
PASS!
We use TAPA on top of Vitis 2023.2 to generate the .xo file. Run the following command or run make xo:
source <Vitis_install_path>/Vitis/2023.2/settings64.sh
mkdir -p build/run_u55c.py
cd build/run_u55c.py && tapa compile \
--top VecAdd \
--part-num xcu280-fsvh2892-2L-e \
--clock-period 3.33 \
-o VecAdd.xo \
-f ../../design/VecAdd.cpp \
2>&1 | tee tapa.log
With the .xo file generated, you can use v++ --link to generate the .xclbin file. Run the following command or execute make hw:
v++ -l -t hw \
--platform xilinx_u280_gen3x16_xdma_1_202211_1 \
--kernel VecAdd \
--connectivity.nk VecAdd:1:VecAdd \
--config design/link_config.ini \
--temp_dir build \
-o build/VecAdd.xclbin \
build/VecAdd.xo
If your machine is equipped with the target FPGA device, you can deploy the design on the FPGA by running the following command:
./app.exe <path_to_vitis_xclbin>
If you are familiar with the AMD Vitis flow, you can also skip this step and proceed directly with the .xo file instead of generating the .xclbin file.
In this tutorial, we use the Alveo U55C as an example. The device is organized into six slots, each containing 16 clock regions of logic. In actual implementations, the available slots are reduced based on the platform specifics, as some resources are reserved for shell logic.
To generate a device.json file that details the device features, such as slot resources and locations, you can either run the run_u55c.py script by invoking RapidStream as shown below or simply enter make device in the terminal.
rapidstream run_u55c.py
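For reference, below is a minimal sketch of what a script like run_u55c.py might contain. The factory function, platform string, and method names here are assumptions about the RapidStream Python API and may differ in your installed version; consult the RapidStream documentation for the exact interface.

```python
# run_u55c.py -- minimal sketch only; the helper and method names below
# are assumptions about the RapidStream Python API.
from rapidstream import get_u55c_vitis_device_factory  # assumed factory helper

# Build a virtual device description for the target Vitis platform
# (platform string assumed for illustration).
factory = get_u55c_vitis_device_factory("xilinx_u55c_gen3x16_xdma_3_202210_1")

# Write the device.json consumed later by rapidstream-tapaopt.
factory.generate_virtual_device("device.json")
```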
The RapidStream flow conducts design space exploration and generates solutions by taking the TAPA-generated .xo file as input.
The RapidStream flow for TAPA requires the following key inputs:
- tapa-xo-path: the path to the TAPA-generated .xo file (VecAdd.xo).
- device-config: the virtual device (device.json) generated in the previous step by calling the RapidStream APIs for the target platform.
- floorplan-config: the configuration file (floorplan_config.json) that guides the integrated AutoBridge to floorplan the design.
- implementation-config: the configuration file (impl_config.json) that guides Vitis to implement the design (e.g., kernel clock, vitis_platform, etc.).
- connectivity-ini: the link configuration file (link_config.ini) that specifies how the kernel interfaces are connected to the memory controller. This is the same as the Vitis link configuration file.
In floorplan_config.json, we intentionally assign the cell "add_kernel" to "SLOT_X1Y1" with the following configuration. You can also specify other cell assignments using similar regular expressions.
"cell_pre_assignments": {
".*add_kernel.*": "SLOT_X1Y1_TO_SLOT_X1Y1"
}
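The key of each entry is a regular expression matched against instance names in the design. The sketch below, using hypothetical instance names for illustration only, shows which cells a pattern like the one above would capture:

```python
import re

# Pattern from floorplan_config.json: any instance whose name contains
# "add_kernel" is pre-assigned to SLOT_X1Y1.
pattern = re.compile(r".*add_kernel.*")

# Hypothetical instance names, for illustration only.
candidates = [
    "VecAdd/add_kernel_0",  # matches -> pinned to SLOT_X1Y1_TO_SLOT_X1Y1
    "VecAdd/read_mem_0",    # no match -> placement left to the floorplanner
    "VecAdd/write_mem_0",   # no match
]

for name in candidates:
    status = "pre-assigned" if pattern.fullmatch(name) else "free"
    print(f"{name}: {status}")
```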
We encapsulate the RapidStream command for TAPA as rapidstream-tapaopt. You can run the command below or execute make all as provided by our Makefile:
rapidstream-tapaopt --work-dir build/run_u55c.py \
--tapa-xo-path ./VecAdd.xo \
--device-config build/run_u55c.py/device.json \
--run-impl \
--floorplan-config ../../design/config/run_u55c.py/ab_config.json \
--implementation-config ../../design/config/run_u55c.py/impl_config.json \
--connectivity-ini ../../design/config/run_u55c.py/link_config.ini
When it finishes, you can locate the generated .xo files using the following command:
find ./build/run_u55c.py/ -name "*.xo"
If everything is successful, you should get at least one optimized .xo file located at build/run_u55c.py/dse/solution_0/updated.xo.
Since we enabled the --run-impl option, RapidStream will launch Vitis to generate the .xclbin file for the optimized .xo file.
You can locate the report (.xclbin.info) for the optimized .xclbin file by running the following command:
find ./build -name "*.xclbin.info"
Since we assigned the cell "add_kernel" to "SLOT_X1Y1" in the configuration file (floorplan_config.json), we can check the actual floorplan report by running the following command or make check_floorplan:
rapidstream ../../common/util/get_slot.py \
-i build/run_u55c.py \
-o build/run_u55c.py
You can open build/run_u55c.py/floorplan_solution_.csv to check the actual floorplan report. You may find more than one .csv file, depending on the number of solutions.
name | floorplan | ff | lut | bram_18k | dsp | uram | unpipelinable |
---|---|---|---|---|---|---|---|
add_kernel_0 | SLOT_X1Y1_TO_SLOT_X1Y1 | 54 | 65 | 0 | 0 | 0 | |
control_s_axi_U | SLOT_X1Y0_TO_SLOT_X1Y0 | 245 | 223 | 0 | 0 | 0 | |
read_mem_0 | SLOT_X1Y0_TO_SLOT_X1Y0 | 790 | 546 | 1 | 0 | 0 | |
read_mem_1 | SLOT_X1Y0_TO_SLOT_X1Y0 | 790 | 546 | 1 | 0 | 0 | |
stream_in1 | SLOT_X1Y0_TO_SLOT_X1Y0 | 10 | 53 | 0 | 0 | 0 | |
stream_in2 | SLOT_X1Y1_TO_SLOT_X1Y1 | 10 | 53 | 0 | 0 | 0 | |
stream_out | SLOT_X1Y0_TO_SLOT_X1Y0 | 10 | 53 | 0 | 0 | 0 | |
write_mem_0 | SLOT_X1Y0_TO_SLOT_X1Y0 | 1024 | 768 | 1 | 0 | 0 | |
__tapa_fsm_unit_write_mem_0 | SLOT_X1Y0_TO_SLOT_X1Y0 | 0 | 0 | 0 | 0 | 0 | non_pipeline |
... | ... | ... | ... | ... | ... | ... | ... |
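If you prefer to inspect the solutions programmatically, the sketch below (assuming the CSV columns match the table above) loads every floorplan_solution CSV and prints each module's slot assignment:

```python
import csv
import glob

# Each DSE solution produces one floorplan_solution_*.csv under the work dir
# (column names assumed to match the report shown above).
for path in sorted(glob.glob("build/run_u55c.py/floorplan_solution_*.csv")):
    print(f"=== {path} ===")
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            print(f'{row["name"]:<30} -> {row["floorplan"]}')
```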
RapidStream mandates a clear distinction between communication and computation within user designs.
- In Group modules, users are tasked solely with defining inter-submodule communication. For those familiar with the Vivado IP Integrator (IPI) flow, crafting a Group module mirrors the process of connecting IPs in IPI. RapidStream subsequently integrates appropriate pipeline registers into these Group modules.
- In Leaf modules, users retain the flexibility to implement diverse computational patterns, as RapidStream leaves these Leaf modules unchanged.
For further details, please consult the code style section in our Documentation.
To generate a report on group types, execute the commands below or run make show_groups:
rapidstream ../../common/util/get_group.py \
-i build/passes/0-imported.json \
-o build/module_types.csv
The module types for your design can be found in build/module_types.csv. Below, we list the four Group modules. In this design, VecAdd serves as a Group module, while the other three modules are added by RapidStream.
Module Name | Group Type |
---|---|
VecAdd | grouped_module |
__rs_VecAdd_aux | aux_module |
... | verilog_module |
With the optimized .xo file generated, you can use v++ --link to generate the .xclbin file. Run the following command or run make:
v++ -l -t hw \
--platform xilinx_u280_gen3x16_xdma_1_202211_1 \
--kernel VecAdd \
--connectivity.nk VecAdd:1:VecAdd \
--config design/link_config.ini \
--temp_dir build/rapidstream \
-o build/VecAdd_rs_opt.xclbin \
./build/dse/candidate_0/exported/VecAdd.xo
To examine the timing results for each design point, use this command:
find ./build -name "*.xclbin.info"
If your machine is equipped with the target FPGA device, you can deploy the optimized design on the FPGA by running the following commands:
make host
./app.exe <path_to_optimized_xclbin>