Performance regression in "Add externfn macro and correctly label fixed_stack_segments" #8782

I've noticed a minor performance regression introduced in 0479d946c83cf9ed90bba5b33820ea4118dd8f9e, or possibly in 303f650ecfb5580f3d89aa662fbd164b849c8eff. Unfortunately, 303f650ecfb5580f3d89aa662fbd164b849c8eff doesn't compile, so I can't be sure which of the two is responsible. The revision before that, c178b52fe594c6724d0cf9124665de7e627899a9, is fine. I've also tested against current master, 1ac7d809af161abc7fb73d76bae4fd0fb2a4826d.

I don't have a small test case that demonstrates the issue, only a large one. Check out the "performance-regression" branch of https://github.com/DaGenix/rust-working and run the benchmarks in the test.rc file; the relevant one is aes::bench::aes_bench_x8. (There is a lot of messy code in there, and I apologize for that. I use that repository mostly as a backup for my main computer.)

All tests were run with --opt-level 3 on a Core i5 under 64-bit Ubuntu. Performance is likely drastically different on architectures where LLVM can't use SSE instructions, so I don't know how well this reproduces outside of x86_64.

c178b52fe594c6724d0cf9124665de7e627899a9 (last fast revision):

```
83.49 MB/s
```

0479d946c83cf9ed90bba5b33820ea4118dd8f9e (first slow revision that compiles):

```
80.60 MB/s
```

1ac7d809af161abc7fb73d76bae4fd0fb2a4826d (current master):

```
79.60 MB/s
```
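To put the three figures above in perspective, here is a small sketch that converts them into a relative slowdown against the last fast revision. The helper name `slowdown_percent` is made up for illustration and is not part of the benchmarked code. The results work out to roughly 3.5% for 0479d94 and 4.7% for current master.

```rust
/// Relative slowdown of `measured` versus `baseline`, in percent.
/// Both arguments are throughputs in MB/s (higher is faster).
/// Hypothetical helper, used here only to compare the numbers above.
fn slowdown_percent(baseline: f64, measured: f64) -> f64 {
    (baseline - measured) / baseline * 100.0
}

fn main() {
    // Throughput of c178b52, the last fast revision.
    let baseline = 83.49;

    // Throughputs reported above for the two slow revisions.
    let runs = [("0479d94", 80.60), ("1ac7d80 (master)", 79.60)];

    for (rev, mb_per_s) in runs.iter() {
        println!(
            "{}: {:.2}% slower than c178b52",
            rev,
            slowdown_percent(baseline, *mb_per_s)
        );
    }
}
```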

Here is where it gets interesting: the code being benchmarked is mostly just the contents of the aessafe.rs file. I took the code from that file, plus everything it depends on, and moved it into a single file: https://gist.github.com/DaGenix/6348471. When I benchmark that file using master, I get the faster performance (83.71 MB/s).

This certainly isn't an ideal test case for this issue, but at this point I'm not sure how to cut the code down into a better one.

