Cache and buffer stdout per-task for printing #10060

alexcrichton · 2013-10-25T00:55:09Z

Almost all languages provide some form of buffering of the stdout stream, and
this commit adds this feature for Rust. A handle to stdout is lazily initialized
in the Task structure as a buffered owned Writer trait object. The buffering
behavior depends on where stdout is directed. Like C, this line-buffers the
stream when the output goes to a terminal (flushing on newlines), and also like
C it uses a fixed-size buffer when output is not directed at a terminal.
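
The policy described above can be sketched with today's standard library. This is only an illustration, not the code in this PR: `buffer_for` is a hypothetical helper, and `LineWriter`, `BufWriter`, and `IsTerminal` (Rust 1.70+) are modern `std::io` names that did not exist in this form in 2013.

```rust
use std::io::{self, BufWriter, IsTerminal, LineWriter, Write};

/// Hypothetical helper: line-buffer when the destination is a terminal,
/// otherwise use a fixed-size buffer (BufWriter's default capacity).
fn buffer_for<W: Write + 'static>(inner: W, is_terminal: bool) -> Box<dyn Write> {
    if is_terminal {
        // Flushes to `inner` whenever a newline is written.
        Box::new(LineWriter::new(inner))
    } else {
        // Flushes to `inner` only when the buffer fills or on `flush()`.
        Box::new(BufWriter::new(inner))
    }
}

fn main() -> io::Result<()> {
    // Decide once, based on where stdout currently points.
    let is_tty = io::stdout().is_terminal();
    let mut out = buffer_for(io::stdout(), is_tty);
    for _ in 0..3 {
        writeln!(out, "what")?;
    }
    // Make sure nothing is left in the buffer before exiting.
    out.flush()
}
```

Returning a boxed trait object mirrors the "buffered owned Writer trait object" wording above; modern `io::stdout()` is already line-buffered on its own, so the extra wrapper here is purely for demonstration.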

We may decide the fixed-size buffering is overkill, but it certainly does reduce
write syscall counts when piping output elsewhere. This is a huge benefit to
any code using logging macros or the printing macros. Formatting emits calls to
write very frequently, and to have each of them backed by a write syscall was
very expensive.
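
To make the cost concrete, here is a rough sketch that counts `write` calls with and without a buffer in front of the sink. `CountingSink`, `unbuffered_writes`, and `buffered_writes` are hypothetical names for illustration, and the counter only stands in for real write syscalls; it uses modern `std::io`, not the 2013 API.

```rust
use std::io::{self, BufWriter, Write};

/// Hypothetical sink that counts calls to `write`, standing in for the
/// number of write syscalls an unbuffered stdout would perform.
struct CountingSink {
    writes: usize,
}

impl Write for CountingSink {
    fn write(&mut self, buf: &[u8]) -> io::Result<usize> {
        self.writes += 1;
        Ok(buf.len()) // pretend everything was written
    }

    fn flush(&mut self) -> io::Result<()> {
        Ok(())
    }
}

/// Format `lines` lines straight into the sink; return the write count.
fn unbuffered_writes(lines: usize) -> io::Result<usize> {
    let mut sink = CountingSink { writes: 0 };
    for i in 0..lines {
        writeln!(sink, "line {}", i)?;
    }
    Ok(sink.writes)
}

/// Same output, but with a fixed-size buffer in front of the sink.
fn buffered_writes(lines: usize) -> io::Result<usize> {
    let mut out = BufWriter::new(CountingSink { writes: 0 });
    for i in 0..lines {
        writeln!(out, "line {}", i)?;
    }
    out.flush()?;
    Ok(out.get_ref().writes)
}

fn main() -> io::Result<()> {
    println!("unbuffered: {} writes", unbuffered_writes(10_000)?);
    println!("buffered:   {} writes", buffered_writes(10_000)?);
    Ok(())
}
```

The unbuffered count is at least one per line, since every formatted piece reaches the sink separately, while the buffered count depends only on total bytes divided by the buffer capacity.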

In a local benchmark of printing 10000 lines of "what" to stdout, I got the
following timings:

| when   | terminal | redirected |
|--------|----------|------------|
| before | 0.575s   | 0.525s     |
| after  | 0.197s   | 0.013s     |
| C      | 0.019s   | 0.004s     |

I can also confirm that we're buffering the output appropriately in both
situations. We're still far slower than C, but I believe much of that has to do
with the "homing" that all tasks do; we're still performing an order of
magnitude more write syscalls than C does.

brson · 2013-10-25T01:32:36Z

Super-nice work. 🍭


jdm · 2013-10-25T09:40:14Z

Is there a way to force a buffer flush? I could imagine being confused by debug output not appearing before a task failure.

alexcrichton · 2013-10-25T15:45:27Z

The `with_task_stdout` function is public, and writers expose a `flush` function, so that's how I figured you would flush the task's stdout. It doesn't quite make sense for us to provide `stdio::flush()`, because that doesn't mean much when you're not using `println` and friends, so I figured that explicitly naming it as the task's stdout makes it clear what you're flushing.
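
As a rough illustration of why an explicit flush matters for the debug-output case, the sketch below shows buffered bytes staying invisible to the underlying sink until `flush` is called. It uses modern `std::io::BufWriter` over an in-memory `Vec<u8>` as a stand-in; `pending_then_flushed` is a hypothetical name, and this is not the `with_task_stdout` API from this PR.

```rust
use std::io::{self, BufWriter, Write};

/// Write a short debug message through a buffer and report how many bytes
/// the underlying sink holds before and after an explicit flush.
fn pending_then_flushed() -> io::Result<(usize, usize)> {
    let mut out = BufWriter::new(Vec::new());
    write!(out, "debug: about to fail")?;
    // The message is still sitting in the buffer at this point.
    let before = out.get_ref().len();
    // An explicit flush pushes it through to the sink.
    out.flush()?;
    let after = out.get_ref().len();
    Ok((before, after))
}

fn main() -> io::Result<()> {
    let (before, after) = pending_then_flushed()?;
    println!("bytes in sink before flush: {}", before);
    println!("bytes in sink after flush:  {}", after);
    Ok(())
}
```

With a fixed-size buffer, a message shorter than the buffer capacity is only guaranteed to reach its destination once the writer is flushed, which is the situation the question above is worried about.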



bors closed this Oct 25, 2013

bors merged commit e8f72c3 into rust-lang:master Oct 25, 2013
