Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include both benchmarks and tests in the numbers given to TeFiltered{,Out} #103795

Merged
merged 4 commits into from
Oct 31, 2022

Conversation

thomcc
Copy link
Member

@thomcc thomcc commented Oct 31, 2022

Fixes #103794

#[bench] is broken on nightly without this, sadly. It apparently has no test coverage. In addition to manually testing, I've added a run-make smokecheck for this (which would have caught the issue), but it would be nice to have a better way to test, err, libtest. For now we should get this in ASAP IMO

@rustbot
Copy link
Collaborator

rustbot commented Oct 31, 2022

r? @Mark-Simulacrum

(rustbot has picked a reviewer for you, use r? to override)

@rustbot rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Oct 31, 2022
@rustbot
Copy link
Collaborator

rustbot commented Oct 31, 2022

Hey! It looks like you've submitted a new PR for the library teams!

If this PR contains changes to any rust-lang/rust public library APIs then please comment with @rustbot label +T-libs-api -T-libs to tag it appropriately. If this PR contains changes to any unstable APIs please edit the PR description to add a link to the relevant API Change Proposal or create one if you haven't already. If you're unsure where your change falls no worries, just leave it as is and the reviewer will take a look and make a decision to forward on if necessary.

Examples of T-libs-api changes:

  • Stabilizing library features
  • Introducing insta-stable changes such as new implementations of existing stable traits on existing stable types
  • Introducing new or changing existing unstable library APIs (excluding permanently unstable features / features without a tracking issue)
  • Changing public documentation in ways that create new stability guarantees
  • Changing observable runtime behavior of library APIs

@Mark-Simulacrum
Copy link
Member

@bors r+ p=5

@bors
Copy link
Contributor

bors commented Oct 31, 2022

📌 Commit 8c71820 has been approved by Mark-Simulacrum

It is now in the queue for this repository.

@bors bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 31, 2022
@Dylan-DPC
Copy link
Member

@bors p=10

@bors
Copy link
Contributor

bors commented Oct 31, 2022

⌛ Testing commit 8c71820 with merge 3882ba490d61277e7ee92301eb86df397ee79c44...

@bors
Copy link
Contributor

bors commented Oct 31, 2022

💔 Test failed - checks-actions

@bors bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Oct 31, 2022
@Dylan-DPC
Copy link
Member

Closing tree till this is merged
@bors tree-closed=1000
@bors p=1000

@Dylan-DPC
Copy link
Member

@bors treeclosed=1000

@Mark-Simulacrum
Copy link
Member

@bors treeclosed-

1 similar comment
@Dylan-DPC
Copy link
Member

@bors treeclosed-

@thomcc
Copy link
Member Author

thomcc commented Oct 31, 2022

Unsure why this failed -- seems to pass locally. Guessing this either needs to be run-make-fulldeps (I usually avoid the compiler tests suites), or perhaps just to exclude those platforms? Pushed up a patch moving it to run-make-fulldeps.

@thomcc
Copy link
Member Author

thomcc commented Oct 31, 2022

LMK if you want me to split the test into a different PR so that you can land it without waiting for me to figure out how to where to put the test for this.

@rust-log-analyzer

This comment has been minimized.

@Mark-Simulacrum
Copy link
Member

It failed for a cross-compile case, I think if you add # ignore-cross-compile to the test it will work. I'm not sure fulldeps will have the same effect.

r=me with that done

@ehuss
Copy link
Contributor

ehuss commented Oct 31, 2022

For the test, if you want to keep it in run-make, then it needs something like ignore-cross-compile so you don't have to deal with all the various targets that need extra setup, or "none" targets which can't generate executables.

EDIT: oops, mark beat me to it 😛

@thomcc
Copy link
Member Author

thomcc commented Oct 31, 2022

Oh, yeah, thanks; this tries to run the output, so indeed cross compile doesn't make sense. I had tried fulldeps since most of the other Makefiles that have $(RUSTC) --test were in there, and github wouldn't let me download the failure log for some reason (and I wanted to do something rather than block all merges into rust-lang/rust... 😅)

I've moved it back and added # ignore-cross-compile (I also waited for the tests to pass locally).

r=me with that done

@bors r=Mark-Simulacrum

@bors
Copy link
Contributor

bors commented Oct 31, 2022

📌 Commit bf88755 has been approved by Mark-Simulacrum

It is now in the queue for this repository.

@bors bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 31, 2022
@bors
Copy link
Contributor

bors commented Oct 31, 2022

⌛ Testing commit bf88755 with merge 95a3a72...

@bors
Copy link
Contributor

bors commented Oct 31, 2022

☀️ Test successful - checks-actions
Approved by: Mark-Simulacrum
Pushing 95a3a72 to master...

@bors bors added the merged-by-bors This PR was explicitly merged by bors. label Oct 31, 2022
@bors bors merged commit 95a3a72 into rust-lang:master Oct 31, 2022
@rustbot rustbot added this to the 1.67.0 milestone Oct 31, 2022
@thomcc thomcc deleted the untest branch October 31, 2022 22:56
@rust-timer
Copy link
Collaborator

Finished benchmarking commit (95a3a72): comparison URL.

Overall result: ❌ regressions - ACTION NEEDED

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression
cc @rust-lang/wg-compiler-performance

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

mean range count
Regressions ❌
(primary)
1.3% [1.2%, 1.4%] 2
Regressions ❌
(secondary)
3.6% [3.2%, 4.1%] 6
Improvements ✅
(primary)
- - 0
Improvements ✅
(secondary)
- - 0
All ❌✅ (primary) 1.3% [1.2%, 1.4%] 2

Max RSS (memory usage)

This benchmark run did not return any relevant results for this metric.

Cycles

This benchmark run did not return any relevant results for this metric.

@rustbot rustbot added the perf-regression Performance regression. label Nov 1, 2022
@thomcc
Copy link
Member Author

thomcc commented Nov 1, 2022

Seems pretty likely to be noise.

@lqd
Copy link
Member

lqd commented Nov 1, 2022

Indeed, these 2 benchmarks are currently noisy.

@rustbot label: +perf-regression-triaged

@rustbot rustbot added the perf-regression-triaged The performance regression has been triaged. label Nov 1, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
merged-by-bors This PR was explicitly merged by bors. perf-regression Performance regression. perf-regression-triaged The performance regression has been triaged. S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. T-libs Relevant to the library team, which will review and decide on the PR/issue.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

assertion failed: st.current_test_count() == st.total when running benchmarks
9 participants