I previously ran 150,000 AMD GPUs in all conditions at 100% utilization for years. I currently have a multi-million-dollar cluster of enterprise AMD GPUs.
A couple real world points:
1. They generally don't just fail. More likely a repairable component on a board fails and you can send it out to be repaired.
2. For my current stuff, I have a 3 year pro support contract that can be extended. Anything happens, Dell goes and fixes it. We also haven't had someone in our cage at the DC in over 6 months now.
I have to maintain our GPUs. Generally the worst parts are the water-cooling pressure, the HVAC, and the power. I can run them stably only at 300 W per GPU; the nominal max is 310 W. Throttled to 300 W, the system has been stable for a year; before that it burned through two mainboards, with a lot of downtime.
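For reference, capping power on AMD GPUs under ROCm is typically done with `rocm-smi`. A minimal sketch, assuming a ROCm install with `rocm-smi` on the path; the 300 W value and device index 0 are taken from the anecdote above, and flag names may vary between ROCm versions:

```shell
# Show the current power draw and cap for all GPUs
rocm-smi --showpower

# Cap GPU 0 at 300 W (requires root; value is in watts)
sudo rocm-smi -d 0 --setpoweroverdrive 300

# Revert GPU 0 to its default power cap
sudo rocm-smi -d 0 --resetpoweroverdrive
```

The cap does not persist across reboots, so it generally needs to be reapplied from a boot script or systemd unit.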
My experience is that most hardware problems stem from poor power quality and/or poor airflow.
I'm convinced that this is why we haven't had any issues in our current location: zero outside air, zero dust, and insanely well-built, no-expense-spared airflow and power supply/management.