Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Fundamentally I don't believe second-order methods get better data efficiency by itself, but changes to the optimizer can because the convergence behavior changes. ML theory lags behind the results in practice.
 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: