Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

“You don’t need a fancy model, you can just find a similar example directly from the training data and get similar results”


If you look at how they actually "translate" the fancy model to the simple one, it requires fully fitting the original model (and keeping track of the evolution of gradients over the training). So it wouldn't make training more efficient, but perhaps it would be useful in inference or probing the characteristics of the original model.


This has always anecdotally appeared to be the case when investigating the predictions of neural nets. Particularly when it comes time to answer the question “what does this model not handle”


Defining ‘similar’ robustly is the meat of the problem, and what we’re finding deep NNs to do well.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: