Skip to content

Commit 6846f22

Browse files
committed
Auto merge of #51946 - japaric:emit-stack-sizes, r=nikomatsakis
[eRFC] add -Z emit-stack-sizes # What This PR exposes LLVM's ability to report the stack usage of each function through the unstable / experimental `-Z emit-stack-sizes` flag. # Motivation The end goal is to enable whole program analysis of stack usage to prove absence of stack overflows at compile time. Such property is important in systems that lack a MMU / MPU and where stack overflows can corrupt memory. And in systems that have protection against stack overflows such proof can be used to opt out of runtime checks (e.g. stack probes or the MPU). Such analysis requires the call graph of the program, which can be obtained from MIR, and the stack usage of each function in the program. Precise information about the later later can only be obtained from LLVM as it depends on the optimization level and optimization options like LTO. This PR does **not** attempt to add the ability to perform such whole program analysis to rustc; it simply does the minimal amount of work to enable such analysis to be implemented out of tree. # Implementation This PR exposes a way to set LLVM's `EmitStackSizeSection` option from the command line. The option is documented [here]; the documentation is copied below for convenience and posteriority: [here]: https://llvm.org/docs/CodeGenerator.html#emitting-function-stack-size-information > A section containing metadata on function stack sizes will be emitted when > TargetLoweringObjectFile::StackSizesSection is not null, and TargetOptions::EmitStackSizeSection > is set (-stack-size-section). The section will contain an array of pairs of function symbol values > (pointer size) and stack sizes (unsigned LEB128). The stack size values only include the space > allocated in the function prologue. Functions with dynamic stack allocations are not included. Where the LLVM feature is not available (e.g. LLVM version < 6.0) or can't be applied (e.g. the output format doesn't support sections e.g. .wasm files) the flag does nothing -- i.e. no error or warning is emitted. # Example usage ``` console $ cargo new --bin hello && cd $_ $ cat >src/main.rs <<'EOF' use std::{mem, ptr}; fn main() { registers(); stack(); } #[inline(never)] fn registers() { unsafe { // values loaded into registers ptr::read_volatile(&(0u64, 1u64)); } } #[inline(never)] fn stack() { unsafe { // array allocated on the stack let array: [i32; 4] = mem::uninitialized(); for elem in &array { ptr::read_volatile(&elem); } } } EOF $ # we need a custom linking step to preserve the .stack_sizes section $ # (see unresolved questions for a solution that doesn't require custom linking) $ cat > keep-stack-sizes.x <<'EOF' SECTIONS { .stack_sizes : { KEEP(*(.stack_sizes)); } } EOF $ cargo rustc --release -- \ -Z emit-stack-sizes \ -C link-arg=-Wl,-Tkeep-stack-sizes.x \ -C link-arg=-N $ size -A target/release/hello | grep stack_sizes .stack_sizes 117 185136 ``` Then a tool like [`stack-sizes`] can be used to print the information in human readable format [`stack-sizes`]: https://github.com/japaric/stack-sizes/#stack-sizes ``` console $ stack-sizes target/release/hello address size name 0x000000000004b0 0 core::array::<impl core::iter::traits::IntoIterator for &'a [T; _]>::into_iter::ha50e6661c0ec84aa 0x000000000004c0 8 std::rt::lang_start::ha02aea783e0e1b3e 0x000000000004f0 8 std::rt::lang_start::{{closure}}::h5115b527d5244952 0x00000000000500 8 core::ops::function::FnOnce::call_once::h6bfa1076da82b0fb 0x00000000000510 0 core::ptr::drop_in_place::hb4de82e57787bc70 0x00000000000520 8 hello::main::h08bb6cec0556bd66 0x00000000000530 0 hello::registers::h9d058a5d765ec1d2 0x00000000000540 24 hello::stack::h88c8cb66adfdc6f3 0x00000000000580 8 main 0x000000000005b0 0 __rust_alloc 0x000000000005c0 0 __rust_dealloc 0x000000000005d0 0 __rust_realloc 0x000000000005e0 0 __rust_alloc_zeroed ``` # Stability Like `-Z sanitize` this is a re-export of an LLVM feature. To me knowledge, we don't have a policy about stabilization of such features as they are incompatible with, or demand extra implementation effort from, alternative backends (e.g. cranelift). As such this feature will remain experimental / unstable for the foreseeable future. # Unresolved questions ## Section name Should we rename the `.stack_sizes` section to `.debug_stacksizes`? With the former name linkers will strip the section unless told otherwise using a linker script, which means getting this information requires both knowledge about linker scripts and a custom linker invocation (see example above). If we use the `.debug_stacksizes` name (I believe) linkers will always keep the section, which means `-Z emit-stack-sizes` is the only thing required to get the stack usage information. # ~TODOs~ ~Investigate why this doesn't work with the `thumb` targets. I get the LLVM error shown below:~ ``` console $ cargo new --lib foo && cd $_ $ echo '#![no_std] pub fn foo() {}' > src/lib.rs $ cargo rustc --target thumbv7m-none-eabi -- -Z emit-stack-sizes LLVM ERROR: unsupported relocation on symbol ``` ~which sounds like it might be related to the `relocation-model` option. Maybe `relocation-model = static` is not supported for some reason?~ This fixed itself after the LLVM upgrade. --- r? @nikomatsakis cc @rust-lang/compiler @perlindgren @whitequark
2 parents e783d2b + 531e356 commit 6846f22

File tree

7 files changed

+221
-2
lines changed

7 files changed

+221
-2
lines changed
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,167 @@
1+
# `emit-stack-sizes`
2+
3+
The tracking issue for this feature is: [#54192]
4+
5+
[#54192]: https://github.com/rust-lang/rust/issues/54192
6+
7+
------------------------
8+
9+
The rustc flag `-Z emit-stack-sizes` makes LLVM emit stack size metadata.
10+
11+
> **NOTE**: This LLVM feature only supports the ELF object format as of LLVM
12+
> 8.0. Using this flag with targets that use other object formats (e.g. macOS
13+
> and Windows) will result in it being ignored.
14+
15+
Consider this crate:
16+
17+
```
18+
#![crate_type = "lib"]
19+
20+
use std::ptr;
21+
22+
pub fn foo() {
23+
// this function doesn't use the stack
24+
}
25+
26+
pub fn bar() {
27+
let xs = [0u32; 2];
28+
29+
// force LLVM to allocate `xs` on the stack
30+
unsafe { ptr::read_volatile(&xs.as_ptr()); }
31+
}
32+
```
33+
34+
Using the `-Z emit-stack-sizes` flag produces extra linker sections in the
35+
output *object file*.
36+
37+
``` console
38+
$ rustc -C opt-level=3 --emit=obj foo.rs
39+
40+
$ size -A foo.o
41+
foo.o :
42+
section size addr
43+
.text 0 0
44+
.text._ZN3foo3foo17he211d7b4a3a0c16eE 1 0
45+
.text._ZN3foo3bar17h1acb594305f70c2eE 22 0
46+
.note.GNU-stack 0 0
47+
.eh_frame 72 0
48+
Total 95
49+
50+
$ rustc -C opt-level=3 --emit=obj -Z emit-stack-sizes foo.rs
51+
52+
$ size -A foo.o
53+
foo.o :
54+
section size addr
55+
.text 0 0
56+
.text._ZN3foo3foo17he211d7b4a3a0c16eE 1 0
57+
.stack_sizes 9 0
58+
.text._ZN3foo3bar17h1acb594305f70c2eE 22 0
59+
.stack_sizes 9 0
60+
.note.GNU-stack 0 0
61+
.eh_frame 72 0
62+
Total 113
63+
```
64+
65+
As of LLVM 7.0 the data will be written into a section named `.stack_sizes` and
66+
the format is "an array of pairs of function symbol values (pointer size) and
67+
stack sizes (unsigned LEB128)".
68+
69+
``` console
70+
$ objdump -d foo.o
71+
72+
foo.o: file format elf64-x86-64
73+
74+
Disassembly of section .text._ZN3foo3foo17he211d7b4a3a0c16eE:
75+
76+
0000000000000000 <_ZN3foo3foo17he211d7b4a3a0c16eE>:
77+
0: c3 retq
78+
79+
Disassembly of section .text._ZN3foo3bar17h1acb594305f70c2eE:
80+
81+
0000000000000000 <_ZN3foo3bar17h1acb594305f70c2eE>:
82+
0: 48 83 ec 10 sub $0x10,%rsp
83+
4: 48 8d 44 24 08 lea 0x8(%rsp),%rax
84+
9: 48 89 04 24 mov %rax,(%rsp)
85+
d: 48 8b 04 24 mov (%rsp),%rax
86+
11: 48 83 c4 10 add $0x10,%rsp
87+
15: c3 retq
88+
89+
$ objdump -s -j .stack_sizes foo.o
90+
91+
foo.o: file format elf64-x86-64
92+
93+
Contents of section .stack_sizes:
94+
0000 00000000 00000000 00 .........
95+
Contents of section .stack_sizes:
96+
0000 00000000 00000000 10 .........
97+
```
98+
99+
It's important to note that linkers will discard this linker section by default.
100+
To preserve the section you can use a linker script like the one shown below.
101+
102+
``` text
103+
/* file: keep-stack-sizes.x */
104+
SECTIONS
105+
{
106+
/* `INFO` makes the section not allocatable so it won't be loaded into memory */
107+
.stack_sizes (INFO) :
108+
{
109+
KEEP(*(.stack_sizes));
110+
}
111+
}
112+
```
113+
114+
The linker script must be passed to the linker using a rustc flag like `-C
115+
link-arg`.
116+
117+
```
118+
// file: src/main.rs
119+
use std::ptr;
120+
121+
#[inline(never)]
122+
fn main() {
123+
let xs = [0u32; 2];
124+
125+
// force LLVM to allocate `xs` on the stack
126+
unsafe { ptr::read_volatile(&xs.as_ptr()); }
127+
}
128+
```
129+
130+
``` console
131+
$ RUSTFLAGS="-Z emit-stack-sizes" cargo build --release
132+
133+
$ size -A target/release/hello | grep stack_sizes || echo section was not found
134+
section was not found
135+
136+
$ RUSTFLAGS="-Z emit-stack-sizes" cargo rustc --release -- \
137+
-C link-arg=-Wl,-Tkeep-stack-sizes.x \
138+
-C link-arg=-N
139+
140+
$ size -A target/release/hello | grep stack_sizes
141+
.stack_sizes 90 176272
142+
143+
$ # non-allocatable section (flags don't contain the "A" (alloc) flag)
144+
$ readelf -S target/release/hello
145+
Section Headers:
146+
[Nr] Name Type Address Offset
147+
Size EntSize Flags Link Info Align
148+
(..)
149+
[1031] .stack_sizes PROGBITS 000000000002b090 0002b0f0
150+
000000000000005a 0000000000000000 L 5 0 1
151+
152+
$ objdump -s -j .stack_sizes target/release/hello
153+
154+
target/release/hello: file format elf64-x86-64
155+
156+
Contents of section .stack_sizes:
157+
2b090 c0040000 00000000 08f00400 00000000 ................
158+
2b0a0 00080005 00000000 00000810 05000000 ................
159+
2b0b0 00000000 20050000 00000000 10400500 .... ........@..
160+
2b0c0 00000000 00087005 00000000 00000080 ......p.........
161+
2b0d0 05000000 00000000 90050000 00000000 ................
162+
2b0e0 00a00500 00000000 0000 ..........
163+
```
164+
165+
> Author note: I'm not entirely sure why, in *this* case, `-N` is required in
166+
> addition to `-Tkeep-stack-sizes.x`. For example, it's not required when
167+
> producing statically linked files for the ARM Cortex-M architecture.

src/librustc/session/config.rs

+2
Original file line numberDiff line numberDiff line change
@@ -1385,6 +1385,8 @@ options! {DebuggingOptions, DebuggingSetter, basic_debugging_options,
13851385
"run the self profiler"),
13861386
profile_json: bool = (false, parse_bool, [UNTRACKED],
13871387
"output a json file with profiler results"),
1388+
emit_stack_sizes: bool = (false, parse_bool, [UNTRACKED],
1389+
"emits a section containing stack size metadata"),
13881390
}
13891391

13901392
pub fn default_lib_output() -> CrateType {

src/librustc_codegen_llvm/back/write.rs

+2
Original file line numberDiff line numberDiff line change
@@ -196,6 +196,7 @@ pub fn target_machine_factory(sess: &Session, find_features: bool)
196196
let features = CString::new(features).unwrap();
197197
let is_pie_binary = !find_features && is_pie_binary(sess);
198198
let trap_unreachable = sess.target.target.options.trap_unreachable;
199+
let emit_stack_size_section = sess.opts.debugging_opts.emit_stack_sizes;
199200

200201
let asm_comments = sess.asm_comments();
201202

@@ -213,6 +214,7 @@ pub fn target_machine_factory(sess: &Session, find_features: bool)
213214
trap_unreachable,
214215
singlethread,
215216
asm_comments,
217+
emit_stack_size_section,
216218
)
217219
};
218220

src/librustc_codegen_llvm/llvm/ffi.rs

+2-1
Original file line numberDiff line numberDiff line change
@@ -1460,7 +1460,8 @@ extern "C" {
14601460
DataSections: bool,
14611461
TrapUnreachable: bool,
14621462
Singlethread: bool,
1463-
AsmComments: bool)
1463+
AsmComments: bool,
1464+
EmitStackSizeSection: bool)
14641465
-> Option<&'static mut TargetMachine>;
14651466
pub fn LLVMRustDisposeTargetMachine(T: &'static mut TargetMachine);
14661467
pub fn LLVMRustAddAnalysisPasses(T: &'a TargetMachine, PM: &PassManager<'a>, M: &'a Module);

src/rustllvm/PassWrapper.cpp

+4-1
Original file line numberDiff line numberDiff line change
@@ -373,7 +373,8 @@ extern "C" LLVMTargetMachineRef LLVMRustCreateTargetMachine(
373373
bool DataSections,
374374
bool TrapUnreachable,
375375
bool Singlethread,
376-
bool AsmComments) {
376+
bool AsmComments,
377+
bool EmitStackSizeSection) {
377378

378379
auto OptLevel = fromRust(RustOptLevel);
379380
auto RM = fromRust(RustReloc);
@@ -411,6 +412,8 @@ extern "C" LLVMTargetMachineRef LLVMRustCreateTargetMachine(
411412
}
412413

413414
#if LLVM_VERSION_GE(6, 0)
415+
Options.EmitStackSizeSection = EmitStackSizeSection;
416+
414417
Optional<CodeModel::Model> CM;
415418
#else
416419
CodeModel::Model CM = CodeModel::Model::Default;
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,31 @@
1+
-include ../tools.mk
2+
3+
# This feature only works when the output object format is ELF so we ignore
4+
# macOS and Windows
5+
ifdef IS_WINDOWS
6+
# Do nothing on Windows.
7+
all:
8+
exit 0
9+
else ifneq (,$(filter $(TARGET),i686-apple-darwin x86_64-apple-darwin))
10+
# Do nothing on macOS.
11+
all:
12+
exit 0
13+
else
14+
# check that the .stack_sizes section is generated
15+
# this test requires LLVM >= 6.0.0
16+
vers = $(shell $(RUSTC) -Vv)
17+
ifneq (,$(findstring LLVM version: 3,$(vers)))
18+
all:
19+
exit 0
20+
else ifneq (,$(findstring LLVM version: 4,$(vers)))
21+
all:
22+
exit 0
23+
else ifneq (,$(findstring LLVM version: 5,$(vers)))
24+
all:
25+
exit 0
26+
else
27+
all:
28+
$(RUSTC) -C opt-level=3 -Z emit-stack-sizes --emit=obj foo.rs
29+
size -A $(TMPDIR)/foo.o | $(CGREP) .stack_sizes
30+
endif
31+
endif
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
// Copyright 2018 The Rust Project Developers. See the COPYRIGHT
2+
// file at the top-level directory of this distribution and at
3+
// http://rust-lang.org/COPYRIGHT.
4+
//
5+
// Licensed under the Apache License, Version 2.0 <LICENSE-APACHE or
6+
// http://www.apache.org/licenses/LICENSE-2.0> or the MIT license
7+
// <LICENSE-MIT or http://opensource.org/licenses/MIT>, at your
8+
// option. This file may not be copied, modified, or distributed
9+
// except according to those terms.
10+
11+
#![crate_type = "lib"]
12+
13+
pub fn foo() {}

0 commit comments

Comments
 (0)