Replies: 2 comments 1 reply
-
|
The machine is running since 4 hours. No reboot. But the noisy fan suddenly rotates at 41%. |
Beta Was this translation helpful? Give feedback.
-
|
Yeah — this is a known 3090 failure mode, and the 0%-feedback / 100%-physical / 30%-target combo is the tell. A model that fits every symptom you listed: NVML "target" ≠ what the fan actually does. NVML reports the driver's requested duty (your 30%); the actual PWM is set by the card's onboard fan controller, which closes a loop on each fan's tach (RPM-sense) signal. When that controller stops seeing a valid tach pulse, most 3090 AIB controllers treat it as "fan stalled" and ramp that fan to 100% as a thermal fail-safe — exactly "target 30%, feedback 0%, physical 100%." So this almost certainly isn't software or even the controller — it's an intermittent tach/connector fault on that specific rear fan. Two things confirm it: it needed reseating after you opened the card, and it recovers (your 4-hr "settled to 41%" snapshot). A dead controller wouldn't intermittently close the loop again — intermittent = marginal connector or a nicked sense wire. Isolate it before touching the PCB:
PCB/controller replacement is a last resort; the vast majority resolve at the connector or fan cable. Curious what the swap test shows. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
RTX 3090 rear fan intermittently runs at 100% although NVML target is 30%
I am looking for help from people who know RTX 3090 fan behavior and fan-control diagnostics well.
Context
I have a machine with three NVIDIA GeForce RTX 3090 GPUs.
The issue concerns GPU 2, identified as:
For GPU 2:
Driver version:
Symptom
The behavior is intermittent.
Sometimes the GPU fan behavior appears normal after boot.
On other boots, the rear fan of GPU 2 physically runs at 100%. Then fan is very noisy at that speed.
I have not found a clear trigger yet.
Earlier observed state
At one point, there was no meaningful GPU load, but the rear fan problem appeared.
Command:
Output:
At the same time, this custom NVML fan feedback tool reported:
So for GPU 2 / fan 1, the NVML target was 30%, while feedback was 100%.
Later observed state
On another boot, the driver and
nvidia-smishowed GPU 2 at 30% fan speed globally:Output:
Then the custom NVML fan feedback tool reported:
At that exact moment, the rear fan of GPU 2 was physically spinning at 100%, even though the NVML feedback tool reported 0% feedback for GPU 2 / fan 1.
Current load snapshot during the issue
Command:
nvidia-smi --query-gpu=index,uuid,temperature.gpu,pstate,power.draw,fan.speed,utilization.gpu --format=csv; nvml-fan-feedbackOutput:
Again, at that moment, the rear fan of GPU 2 was physically at 100%.
Additional note
This same rear fan had already shown a previous issue. After physically opening the GPU and reseating/repositioning the fan cable, the behavior became normal again for a while.
Now the problem appears intermittently depending on reboot.
What I am trying to understand
Based on the data above:
I am deliberately avoiding assumptions here and would appreciate feedback from people who have direct RTX 3090 repair or fan-control experience.
Beta Was this translation helpful? Give feedback.
All reactions