Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> > [train]: "The training runs on 64 A100 GPUs over nine days",

> How is that a "year and half of GPU time".

64 GPUs × 9 days = 576 GPU-days ≈ 1.577 GPU-years



Doh, that's entirely fair: haven't been in this thread yet, but would echo what I perceive as implicit puzzlement re: this amount of GPU time being described as bitter-lesson-y.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: