-
Notifications
You must be signed in to change notification settings - Fork 13.3k
do not count non-codegen statement as estimzed cgu element #103803
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@bors try @rust-timer queue |
Awaiting bors try build completion. @rustbot label: +S-waiting-on-perf |
⌛ Trying commit 7ac8751 with merge 23e8280651de33e8cdd2cf18fc22ea3130d18aa3... |
☀️ Try build successful - checks-actions |
Queued 23e8280651de33e8cdd2cf18fc22ea3130d18aa3 with parent 4596f4f, future comparison URL. |
Finished benchmarking commit (23e8280651de33e8cdd2cf18fc22ea3130d18aa3): comparison URL. Overall result: ❌✅ regressions and improvements - ACTION NEEDEDBenchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never Instruction countThis is a highly reliable metric that was used to determine the overall result at the top of this comment.
Max RSS (memory usage)ResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
CyclesResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
|
The mem usage metric changes more than the others as this patch effects the cgu merging steps. And this makes the size estimation more accurate, seems reasonable to merge it although it has some regressions, how do you think? @bjorn3 |
@bjorn3 another perf-run requested, add back the previous included terminator. |
This comment has been minimized.
This comment has been minimized.
@bors try @rust-timer queue |
Awaiting bors try build completion. @rustbot label: +S-waiting-on-perf |
⌛ Trying commit e5d4856 with merge 392e416a17fde7a21323e4d632402619c9770e51... |
☀️ Try build successful - checks-actions |
Queued 392e416a17fde7a21323e4d632402619c9770e51 with parent c493bae, future comparison URL. |
Finished benchmarking commit (392e416a17fde7a21323e4d632402619c9770e51): comparison URL. Overall result: ❌ regressions - ACTION NEEDEDBenchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never Instruction countThis is a highly reliable metric that was used to determine the overall result at the top of this comment.
Max RSS (memory usage)ResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
CyclesResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
|
cc @rust-lang/wg-compiler-performance any advise on this? this pr makes the size estimation more accurate, but introduces a new loop on each basicblock, thus instr regressed and mem improved. |
Both cycles and wall-time are slightly worse with this change, which suggests it's not a win overall. |
Found that some statements doesn't codegen nowadays, so they shouldn't be count as a element in cgu size estimation.
cc #69382
r?@bjorn3 may I have a perf-run?