Add Linux build on GPU on Github Actions #9335

Closed

luhenry wants to merge 19 commits

Conversation

luhenry (Contributor) commented Apr 2, 2024

No description provided.

facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Apr 2, 2024
netlify bot commented Apr 2, 2024

Deploy Preview for meta-velox canceled.

Latest commit: 6fbfa92
Latest deploy log: https://app.netlify.com/sites/meta-velox/deploys/661e9cc989da3c0008892f05

luhenry (Contributor, Author) commented Apr 2, 2024

cc @assignUser @Yuhta @pedroerp @kgpai

It's building on all pull requests. The testing would happen on a subset of PRs or on nightlies, as discussed previously, and would be done in a different job. I've tested locally with fmt version 9.1.0 as well, and it compiles with both CUDA 11.8 and 12.4.

Could you please also let me know which CUDA versions are used internally, so we can add them to the test matrix here and not break the build?

facebook-github-bot (Contributor) commented

@Yuhta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor) commented

@Yuhta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

assignUser (Collaborator) left a comment

Looks good, but I think we should just integrate it into the adapters build, as the only difference between the two is the additional installation of CUDA plus -DVELOX_ENABLE_GPU=ON.

That way we only add one additional build instead of two. Another option would be to build only the GPU parts, skipping anything not needed (e.g. the connectors, I would assume?).

If I understand correctly, the tests will be skipped automatically when no CUDA device is detected anyway, right?
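
A rough sketch of what folding this into the adapters job could look like (job layout, step names, and the install helper are illustrative assumptions, not the exact workflow in this PR; only -DVELOX_ENABLE_GPU=ON comes from the discussion above):

```yaml
# Hypothetical fragment of .github/workflows/linux-build.yml:
# the existing adapters job gains a CUDA install step and the extra CMake flag.
adapters:
  runs-on: ubuntu-latest
  steps:
    - uses: actions/checkout@v4
    - name: Install CUDA toolkit
      run: ./scripts/setup-cuda.sh   # hypothetical helper; the real setup step may differ
    - name: Build with GPU support
      run: |
        cmake -B _build -DVELOX_ENABLE_GPU=ON
        cmake --build _build -j "$(nproc)"
```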

Review threads (outdated, resolved): scripts/setup-centos8.sh, .github/workflows/linux-build.yml (×2)
luhenry (Contributor, Author) commented Apr 3, 2024

@assignUser ok, sounds good, I'll fold it into the adapters build.

assignUser (Collaborator) commented

@luhenry Thinking on this a bit more, what are the minimal flags you need? If the build is much smaller (and we potentially skip building the tests), it might be faster to do two standalone builds vs. one additional adapters build.

luhenry (Contributor, Author) commented Apr 9, 2024

@assignUser the build isn't much smaller, as Wave depends (directly or indirectly) on most of the other Velox libraries. Also, it doesn't add much to the existing build in terms of compile time or produced binaries.

If you're concerned about resource/machine usage, we can build against a limited set of CUDA versions (which one(s) would you like to focus on?) and build the larger set on the nightlies.

assignUser (Collaborator) commented

> If you're concerned about resource/machine usage, we can build against a limited set of CUDA versions (which one(s) would you like to focus on?) and build the larger set on the nightlies.

Yep, that was my concern. Sounds like the most efficient way would be adding the Wave build to the existing adapters build with one CUDA version (someone else should choose which one ^^) plus a nightly matrix build with multiple versions. cc @kgpai
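
Such a nightly matrix might look roughly like this (the schedule, job name, and runner are assumptions; 11.8 and 12.4 are the versions mentioned earlier in the thread):

```yaml
# Hypothetical nightly workflow building against several CUDA versions.
name: linux-gpu-nightly
on:
  schedule:
    - cron: "0 3 * * *"   # assumed nightly time
jobs:
  gpu-build:
    runs-on: ubuntu-latest
    strategy:
      fail-fast: false
      matrix:
        cuda: ["11.8", "12.4"]   # extend as needed
    env:
      CUDA_VERSION: ${{ matrix.cuda }}
    steps:
      - uses: actions/checkout@v4
      - name: Build with GPU support
        run: |
          cmake -B _build -DVELOX_ENABLE_GPU=ON
          cmake --build _build -j "$(nproc)"
```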

luhenry (Contributor, Author) commented Apr 10, 2024

> If you're concerned about resource/machine usage, we can build against a limited set of CUDA versions (which one(s) would you like to focus on?) and build the larger set on the nightlies.

> Yep, that was my concern. Sounds like the most efficient way would be adding the Wave build to the existing adapters build with one CUDA version (someone else should choose which one ^^) plus a nightly matrix build with multiple versions. cc @kgpai

I've changed it to a CUDA_VERSION env variable, currently set to 11.8. Happy to change that value to anything else. I'll add the nightly in a follow-up PR since it will require working with GPU runners, which I don't know how to access yet.
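
For reference, a minimal sketch of such an env variable and the matching install step (the fragment layout and the bash version-to-package mapping are illustrative assumptions; the package names match the yum install mentioned later in the thread):

```yaml
# Hypothetical fragment: pin one CUDA version for the per-PR adapters build.
env:
  CUDA_VERSION: "11.8"
steps:
  - name: Install CUDA compiler and runtime
    run: |
      # e.g. 11.8 -> 11-8, matching the RPM package naming
      suffix="${CUDA_VERSION//./-}"
      yum install -y "cuda-nvcc-${suffix}" "cuda-cudart-${suffix}"
```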

assignUser (Collaborator) commented

@luhenry looks good, but could you change the test-skipping code so it doesn't cause gtest to report a failed test? Otherwise this will always fail the adapters job, which is a bit unfortunate ^^

I was about to suggest installing CUDA in the Docker image, but it seems that GHA has great mirrors and the install only takes 27s, so that's probably not much slower than the additional ~1 GB to download if we add it to the container :D
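
As a minimal sketch of that kind of skip (not the actual change that landed in this PR; the suite and test names here are made up), a gtest-based GPU test can bail out with a skip so gtest reports it as skipped rather than failed:

```cpp
#include <cuda_runtime.h>
#include <gtest/gtest.h>

// Hypothetical GPU test: skip instead of failing when the runner has no
// usable CUDA device (e.g. the CPU-only adapters job).
TEST(WaveGpuTest, HashTable) {
  int deviceCount = 0;
  if (cudaGetDeviceCount(&deviceCount) != cudaSuccess || deviceCount == 0) {
    GTEST_SKIP() << "No CUDA device available, skipping GPU test";
  }
  // ... the actual GPU assertions would go here ...
}
```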

luhenry (Contributor, Author) commented Apr 11, 2024

> @luhenry looks good, but could you change the test-skipping code so it doesn't cause gtest to report a failed test? Otherwise this will always fail the adapters job, which is a bit unfortunate ^^

Fixed with 5c3780e.

> I was about to suggest installing CUDA in the Docker image, but it seems that GHA has great mirrors and the install only takes 27s, so that's probably not much slower than the additional ~1 GB to download if we add it to the container :D

It should be adding ~217MB, as that's what gets installed by `yum install cuda-nvcc-11-8 cuda-cudart-11-8` (per the logs: https://github.com/facebookincubator/velox/actions/runs/8627576322/job/23662809679#step:5:89)

assignUser (Collaborator) commented

> It should be adding ~217MB

I did local testing (with both versions, though) and that added ~800 MB to the final image, but in any case it's fine as it is for now. :)

luhenry (Contributor, Author) commented Apr 11, 2024

@assignUser I've just merged in the main branch, as the build is failing with https://github.com/facebookincubator/velox/actions/runs/8643569195

assignUser (Collaborator) commented

@luhenry yeah, the fix is in #9451; feel free to apply it to this PR. I hope it gets merged soon ^^

luhenry (Contributor, Author) commented Apr 11, 2024

> @luhenry yeah, the fix is in #9451; feel free to apply it to this PR. I hope it gets merged soon ^^

Done.

luhenry (Contributor, Author) commented Apr 11, 2024

@assignUser all tests are now passing.

assignUser (Collaborator) left a comment

Looks good on the CMake and workflow front. I'd like a C++ approval as well, but then this should be ready to merge :)

assignUser requested a review from Yuhta on April 12, 2024, 16:00
facebook-github-bot (Contributor) commented

@Yuhta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Yuhta (Contributor) commented Apr 15, 2024

@luhenry Can you rebase onto the latest main?

facebook-github-bot (Contributor) commented

@Yuhta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Review threads (outdated, resolved): velox/experimental/gpu/tests/HashTableTest.cu (×2), velox/experimental/wave/common/Cuda.cu
luhenry (Contributor, Author) commented Apr 16, 2024

@Yuhta fixed.

facebook-github-bot (Contributor) commented

@Yuhta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor) commented

@Yuhta merged this pull request in 1e901c9.


Conbench analyzed the 1 benchmark run on commit 1e901c9d.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details.

Joe-Abraham pushed a commit to Joe-Abraham/velox that referenced this pull request Jun 7, 2024
Summary: Pull Request resolved: facebookincubator#9335

Reviewed By: kagamiori

Differential Revision: D55646566

Pulled By: Yuhta

fbshipit-source-id: 7c1cc2ef8db3da4e9f9889b9c7bcde52283e6bb2
Labels: CLA Signed, Merged
4 participants