[R-package] [ci] Add test on R package with sanitizers #3439

jameslamb · 2020-10-06T03:36:07Z

This PR adds tests with the sanitizers run by CRAN on package submissions. See this blog post for a lot more background.

The tests take 22 minutes to run, so in this PR I'm proposing that we add them as a manual test that can be triggered by a comment (copying @StrikerRUS 's great work on #3424 ).

This can be triggered by commenting /gha run r-sanitizers-check on a PR.

How this makes `LightGBM` better

catches issues like memory violations in lib_lightgbm
allows us to catch issues in CI to improve the likelihood of CRAN accepting submissions of the R package

.github/workflows/r-package-ubsan.yml

StrikerRUS · 2020-10-06T15:15:10Z

@jameslamb

The tests take 22 minutes to run, so in this PR I'm proposing that we add them as a manual test that can be triggered by a comment

I don't think that 22 min is something unacceptable for us. For example, right now duration of some tests is about 17 min.

So, given that we have 20 free parallel builds for GitHub Actions, I believe we can run new test just as a normal check.

I remember old times before rebalancing CI jobs when we had to wait about 40min-1h 😄 .

jameslamb · 2020-10-06T15:18:38Z

haha ok! I'll move it back to the main R GitHub Actions then

StrikerRUS · 2020-10-06T17:47:26Z

@jameslamb BTW, haven't you find a way to make one particular check "optional, but if run then required to be passed"?

jameslamb · 2020-10-07T04:34:05Z

@jameslamb BTW, haven't you find a way to make one particular check "optional, but if run then required to be passed"?

no, I'm not sure how to do that, sorry!

src/c_api.cpp

StrikerRUS

Wow! Amazing job as always! I'm pleasantly surprised that we can use container and write just few lines to have a sanitizers checks. Looks great overall, just check my minor comments below.

.github/workflows/r_package.yml

StrikerRUS · 2020-10-08T21:39:47Z

.github/workflows/r_package.yml

+        run: |
+          cd R-package/tests
+          Rscriptdevel testthat.R 2>&1 > ubsan-tests.log
+          cat ubsan-tests.log


Was it left intentionally or it was for debugging purposes?

I left it in intentionally, so that if the job fails you have enough logs to be able to tell what went wrong

StrikerRUS · 2020-10-08T23:37:47Z

@jameslamb

no, I'm not sure how to do that, sorry!

No problem! Just wanted to be sure that I'm not discovering something that is already known 🙂 .

Looks like it is impossible to do such things with standard GitHub Actions API. But I have an idea to use REST API and query a status of required workflow via it in our cool all-successful job:

LightGBM/.github/workflows/r_package.yml

Lines 162 to 168 in 186711d

    
             all-successful: 
        
               # https://gh.neting.ccmunity/t/is-it-possible-to-require-all-github-actions-tasks-to-pass-without-enumerating-them/117957/4?u=graingert 
        
               runs-on: ubuntu-latest 
        
               needs: [test] 
        
               steps: 
        
               - name: Note that all tests succeeded 
        
                 run: echo "🎉"

https://docs.github.com/en/free-pro-team@latest/rest/reference/actions#get-a-workflow-run

Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

StrikerRUS

+1 step towards CRAN, nice!

StrikerRUS · 2020-10-09T13:56:49Z

@jameslamb What do you think about moving all out R jobs into containers? It looks quite comfortable to have all R infrastructure and do not waste CI time to install it (very often with network errors, BTW) by our own. 🙂

StrikerRUS · 2020-10-10T22:47:42Z

But I have an idea to use REST API and query a status of required workflow via it in our cool all-successful job:

OK, today I went ahead and prepared a draft for this feature. I tested it a little bit and it looks like it can work 👍 .
But it requires two actions from maintainers: drop a comment to trigger optional workflow and re-run (if already completed) main workflow to fetch the latest status of optional one.

# .ci/check_workflow_status.py
import json
from os import environ
from sys import exit
from time import sleep
from urllib import request


def get_runs():
    with request.urlopen("https://api.github.com/repos/microsoft/LightGBM/actions/workflows/test_1.yml/runs") as url:
        data = json.loads(url.read().decode())
    pr_runs = []
    if environ.get("GITHUB_EVENT_NAME", "") == "pull_request":
        pr_runs = [i for i in data['workflow_runs']
                   if i['event'] == 'pull_request_review_comment' and
                   (i.get('pull_requests') and
                    i['pull_requests'][0]['number'] == int(environ.get("GITHUB_REF").split('/')[-2]) or
                    i['head_branch'] == environ.get("GITHUB_HEAD_REF").split('/')[-1])]
    return sorted(pr_runs, key=lambda i: i['run_number'], reverse=True)


def get_status(runs):
    status = 'ok'
    for run in runs:
        if run['status'] == 'completed':
            if run['conclusion'] in {'cancelled', 'skipped'}:
                continue
            if run['conclusion'] in {'failure', 'timed_out'}:
                status = 'fail'
                break
            if run['conclusion'] == 'success':
                break
        if run['status'] in {'in_progress', 'queued'}:
            status = 'rerun'
            break
    return status


if __name__ == "__main__":
    while True:
        status = get_status(get_runs())
        if status != 'rerun':
            break
        sleep(60)
    if status == 'fail':
        exit(1)

# .github/workflows/test_1.yml
name: Test 1

on:
  pull_request_review_comment:
    types: [created]

jobs:
  test:
    name: Test 1
    runs-on: ubuntu-latest
    if: github.event.comment.body == '/gha run' && contains('OWNER,MEMBER,COLLABORATOR', github.event.comment.author_association)
    timeout-minutes: 60
    strategy:
      fail-fast: false
    steps:
      - name: Checkout repository
        uses: actions/checkout@v1
        with:
          fetch-depth: 5
          submodules: true
      - name: Test
        run: |
            sleep 2m
            exit -1

# .github/workflows/r_package.yml

...

  all-successful:
    # https://gh.neting.ccmunity/t/is-it-possible-to-require-all-github-actions-tasks-to-pass-without-enumerating-them/117957/4?u=graingert
    runs-on: ubuntu-latest
    needs: [test]
    steps:
      - name: Checkout repository
        uses: actions/checkout@v1
        with:
          fetch-depth: 5
          submodules: false
      - name: Install Python
        uses: actions/setup-python@v2
        with:
          python-version: '3.x'
      - name: Note that all tests succeeded
        run: python "$GITHUB_WORKSPACE/.ci/check_workflow_status.py"

StrikerRUS · 2020-10-11T00:16:23Z

But it requires two actions from maintainers: ...

Probably it is possible to simplify it to just one comment with the help of https://docs.github.com/en/free-pro-team@latest/rest/reference/actions#re-run-a-workflow. But I'm not sure about the required permissions though.

jameslamb · 2020-10-12T00:06:22Z

@jameslamb What do you think about moving all out R jobs into containers? It looks quite comfortable to have all R infrastructure and do not waste CI time to install it (very often with network errors, BTW) by our own. 🙂

I think that would only work for Linux builds, and that that wouldn't help much unfortunately...since most of the temporary errors in CI have been on Mac and Windows.

jameslamb · 2020-10-12T00:07:37Z

But I have an idea to use REST API and query a status of required workflow via it in our cool all-successful job:

OK, today I went ahead and prepared a draft for this feature. I tested it a little bit and it looks like it can work 👍 .
But it requires two actions from maintainers: drop a comment to trigger optional workflow and re-run (if already completed) main workflow to fetch the latest status of optional one.

# .ci/check_workflow_status.py
import json
from os import environ
from sys import exit
from time import sleep
from urllib import request


def get_runs():
    with request.urlopen("https://api.github.com/repos/microsoft/LightGBM/actions/workflows/test_1.yml/runs") as url:
        data = json.loads(url.read().decode())
    pr_runs = []
    if environ.get("GITHUB_EVENT_NAME", "") == "pull_request":
        pr_runs = [i for i in data['workflow_runs']
                   if i['event'] == 'pull_request_review_comment' and
                   (i.get('pull_requests') and
                    i['pull_requests'][0]['number'] == int(environ.get("GITHUB_REF").split('/')[-2]) or
                    i['head_branch'] == environ.get("GITHUB_HEAD_REF").split('/')[-1])]
    return sorted(pr_runs, key=lambda i: i['run_number'], reverse=True)


def get_status(runs):
    status = 'ok'
    for run in runs:
        if run['status'] == 'completed':
            if run['conclusion'] in {'cancelled', 'skipped'}:
                continue
            if run['conclusion'] in {'failure', 'timed_out'}:
                status = 'fail'
                break
            if run['conclusion'] == 'success':
                break
        if run['status'] in {'in_progress', 'queued'}:
            status = 'rerun'
            break
    return status


if __name__ == "__main__":
    while True:
        status = get_status(get_runs())
        if status != 'rerun':
            break
        sleep(60)
    if status == 'fail':
        exit(1)

# .github/workflows/test_1.yml
name: Test 1

on:
  pull_request_review_comment:
    types: [created]

jobs:
  test:
    name: Test 1
    runs-on: ubuntu-latest
    if: github.event.comment.body == '/gha run' && contains('OWNER,MEMBER,COLLABORATOR', github.event.comment.author_association)
    timeout-minutes: 60
    strategy:
      fail-fast: false
    steps:
      - name: Checkout repository
        uses: actions/checkout@v1
        with:
          fetch-depth: 5
          submodules: true
      - name: Test
        run: |
            sleep 2m
            exit -1

# .github/workflows/r_package.yml

...

  all-successful:
    # https://gh.neting.ccmunity/t/is-it-possible-to-require-all-github-actions-tasks-to-pass-without-enumerating-them/117957/4?u=graingert
    runs-on: ubuntu-latest
    needs: [test]
    steps:
      - name: Checkout repository
        uses: actions/checkout@v1
        with:
          fetch-depth: 5
          submodules: false
      - name: Install Python
        uses: actions/setup-python@v2
        with:
          python-version: '3.x'
      - name: Note that all tests succeeded
        run: python "$GITHUB_WORKSPACE/.ci/check_workflow_status.py"

oooo interesting! Are you thinking about something like this for the checks in #3424 and #3443 ?

StrikerRUS · 2020-10-12T12:20:34Z

@jameslamb

Are you thinking about something like this for the checks in #3424 and #3443 ?

Yeah, definitely!

StrikerRUS · 2020-10-18T21:06:32Z

@guolinke Could you please generate a secret access token for LightGBM repository with public_repo and workflows permissions? I'm working on allowing optional checks to be in either skipped or succeeded status (#3439 (comment)). And I need that token to re-trigger our main GitHub Actions workflow from optional one to workaround additional manual step for maintainer.
Steps 1 and 2 from this guide: https://stevenmortimer.com/running-github-actions-sequentially/#step-1---create-a-personal-access-token-pat.

guolinke · 2020-10-19T04:45:39Z

@StrikerRUS you can check WORKFLOW in https://github.com/microsoft/LightGBM/settings/secrets

StrikerRUS · 2020-10-19T04:57:44Z

@guolinke Thanks a lot! Unfortunately, I don't have an access to this tab of LightGBM repo settings:

But I think it shouldn't block me from using WORKFLOW name as a secret value in appropriate place in the workflow 🙂 .

guolinke · 2020-10-19T06:04:45Z

yeah, you can use it as ${{ secrets.WORKFLOW }}

github-actions · 2023-08-24T04:40:24Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

jameslamb added 8 commits September 28, 2020 23:46

[ci] add R CI job with UBSAN

03b9032

stuff

c2b264b

fix command

4d1ffb9

stuff

231ca27

update template

44aabab

fail on errors

29d121d

spaces

d8ffbbe

trigger by comment

c61fab0

jameslamb added in progress maintenance labels Oct 6, 2020

jameslamb requested review from Laurae2 and StrikerRUS as code owners October 6, 2020 03:36

stuff

bc4df98

jameslamb commented Oct 6, 2020

View reviewed changes

.github/workflows/r-package-ubsan.yml Outdated Show resolved Hide resolved

jameslamb added 2 commits October 5, 2020 22:37

Merge branch 'master' into ci/r-ubsan-gcc

b462ea9

add all CI back

0ffb520

jameslamb mentioned this pull request Oct 6, 2020

[R-package] miscellaneous changes to comply with CRAN requirements #3338

Merged

trying things

9a4b11e

jameslamb added 4 commits October 6, 2020 23:37

run sanitizers as a regular job

0d22def

remove comments

7b71fd2

sanitizers

ea24d06

try to trigger UBSAN

c6503bc

jameslamb requested review from btrotta, chivee and guolinke as code owners October 7, 2020 05:33

jameslamb commented Oct 7, 2020

View reviewed changes

src/c_api.cpp Outdated Show resolved Hide resolved

jameslamb added 2 commits October 7, 2020 21:52

Merge branch 'master' into ci/r-ubsan-gcc

bc1b107

remove testing change

4e098a0

jameslamb added awaiting review and removed in progress labels Oct 8, 2020

jameslamb changed the title ~~[WIP] [R-package] [ci] Add manual test on R package with sanitizers~~ [R-package] [ci] Add manual test on R package with sanitizers Oct 8, 2020

jameslamb mentioned this pull request Oct 8, 2020

[R-Package] CRAN issues #629

Closed

12 tasks

StrikerRUS changed the title ~~[R-package] [ci] Add manual test on R package with sanitizers~~ [R-package] [ci] Add test on R package with sanitizers Oct 8, 2020

StrikerRUS reviewed Oct 8, 2020

View reviewed changes

Apply suggestions from code review

e5cf2c9

Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

guolinke approved these changes Oct 9, 2020

View reviewed changes

StrikerRUS approved these changes Oct 9, 2020

View reviewed changes

StrikerRUS merged commit 3f71d75 into master Oct 9, 2020

StrikerRUS removed the awaiting review label Oct 9, 2020

StrikerRUS deleted the ci/r-ubsan-gcc branch October 9, 2020 13:51

jameslamb mentioned this pull request Oct 17, 2020

[ci] ignore R CMD CHECK warnings on new R version #3468

Merged

StrikerRUS mentioned this pull request Jan 8, 2021

[ci] improve experience with optional GitHub workflows #3740

Merged

jameslamb mentioned this pull request Oct 13, 2021

[ci] [R-package] stack-use-after-scope issues detected by address sanitizer #4674

Closed

github-actions bot locked as resolved and limited conversation to collaborators Aug 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[R-package] [ci] Add test on R package with sanitizers #3439

[R-package] [ci] Add test on R package with sanitizers #3439

jameslamb commented Oct 6, 2020

StrikerRUS commented Oct 6, 2020

jameslamb commented Oct 6, 2020

StrikerRUS commented Oct 6, 2020

jameslamb commented Oct 7, 2020

StrikerRUS left a comment

StrikerRUS Oct 8, 2020

jameslamb Oct 9, 2020

StrikerRUS Oct 9, 2020

StrikerRUS commented Oct 8, 2020 •

edited

Loading

StrikerRUS left a comment

StrikerRUS commented Oct 9, 2020

StrikerRUS commented Oct 10, 2020 •

edited

Loading

StrikerRUS commented Oct 11, 2020

jameslamb commented Oct 12, 2020

jameslamb commented Oct 12, 2020

StrikerRUS commented Oct 12, 2020

StrikerRUS commented Oct 18, 2020

guolinke commented Oct 19, 2020

StrikerRUS commented Oct 19, 2020 •

edited

Loading

guolinke commented Oct 19, 2020 •

edited

Loading

github-actions bot commented Aug 24, 2023

[R-package] [ci] Add test on R package with sanitizers #3439

[R-package] [ci] Add test on R package with sanitizers #3439

Conversation

jameslamb commented Oct 6, 2020

How this makes LightGBM better

StrikerRUS commented Oct 6, 2020

jameslamb commented Oct 6, 2020

StrikerRUS commented Oct 6, 2020

jameslamb commented Oct 7, 2020

StrikerRUS left a comment

Choose a reason for hiding this comment

StrikerRUS Oct 8, 2020

Choose a reason for hiding this comment

jameslamb Oct 9, 2020

Choose a reason for hiding this comment

StrikerRUS Oct 9, 2020

Choose a reason for hiding this comment

StrikerRUS commented Oct 8, 2020 • edited Loading

StrikerRUS left a comment

Choose a reason for hiding this comment

StrikerRUS commented Oct 9, 2020

StrikerRUS commented Oct 10, 2020 • edited Loading

StrikerRUS commented Oct 11, 2020

jameslamb commented Oct 12, 2020

jameslamb commented Oct 12, 2020

StrikerRUS commented Oct 12, 2020

StrikerRUS commented Oct 18, 2020

guolinke commented Oct 19, 2020

StrikerRUS commented Oct 19, 2020 • edited Loading

guolinke commented Oct 19, 2020 • edited Loading

github-actions bot commented Aug 24, 2023

How this makes `LightGBM` better

StrikerRUS commented Oct 8, 2020 •

edited

Loading

StrikerRUS commented Oct 10, 2020 •

edited

Loading

StrikerRUS commented Oct 19, 2020 •

edited

Loading

guolinke commented Oct 19, 2020 •

edited

Loading