Commit bef5701
docs(hardware): add 5090 + Gemma 4 + MTP cross-rig anchor rows (apnar disc #86)
Three operating points from @apnar's full 21-cap 10W sweep:
- 400W (efficiency winner): 571 narr / 701 code, 1.429 TPS/W
- 510W (narr peak): 619 narr / 724 code, 1.215 TPS/W
- 600W (stock baseline): 601 narr / 757 code, 1.103 TPS/W
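The TPS/W column can be sanity-checked with a little arithmetic. The sketch below assumes (the commit does not say this) that TPS/W is narrative TPS divided by measured draw rather than by the cap, and back-computes the draw each figure would imply. The dictionary layout and the function name `implied_draw_watts` are illustrative, not taken from the repo.

```python
# Operating points copied from the commit message.
# Layout: cap_watts -> (narr_tps, code_tps, tps_per_watt)
POINTS = {
    400: (571, 701, 1.429),
    510: (619, 724, 1.215),
    600: (601, 757, 1.103),
}


def implied_draw_watts(narr_tps, tps_per_watt):
    """Back out the draw implied by a TPS/W figure.

    Assumption: TPS/W = narrative TPS / actual draw in watts.
    """
    return narr_tps / tps_per_watt


for cap, (narr, _code, eff) in POINTS.items():
    print(f"{cap}W cap: implied draw ~{implied_draw_watts(narr, eff):.0f}W")
```

If that assumption holds, the 400W and 510W points draw roughly their cap, while the 600W point implies about 545W, which is close to the ~547W hardware ceiling reported further down.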
A cross-workload pattern emerges when combining apnar's two 5090 sweeps:
both Qwen3.6-27B AutoRound and Gemma 4 31B + MTP land at the same
~400W efficiency sweet spot despite ~5× different absolute TPS scales.
Updated the 5090 compute-saturation note to reflect that this is
workload-independent on a consumer air-cooled 5090.
Hardware-physical ceiling for Gemma 4 + MTP at concurrency=4:
~547W actual draw, no thermal throttle (66°C peak). Above 530W
cap = wasted budget.
Validates the calibration fix shipped at 29e7de5: at 600W cap with
new logic (N=4 plateau-detected), TPS jumps from 499/616 (old N=6)
to 600/757: pure calibration win, +20-25% same-cap TPS.
1 parent dfceccb commit bef5701
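The +20-25% figure can be reproduced from the before/after TPS pairs quoted in the commit message (499/616 with the old N=6 logic, 600/757 with the new N=4 logic). A minimal sketch; the helper name `pct_gain` is illustrative:

```python
def pct_gain(old, new):
    """Percentage increase going from old to new."""
    return (new - old) / old * 100.0


# narr / code TPS at the 600W cap, before and after the calibration fix
narr_gain = pct_gain(499, 600)
code_gain = pct_gain(616, 757)

print(f"narr: +{narr_gain:.1f}%, code: +{code_gain:.1f}%")
```

This gives roughly +20.2% for the narrative workload and +22.9% for the code workload, both inside the quoted range.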
1 file changed
Lines changed: 4 additions & 1 deletion