Hacker Newsnew | past | comments | ask | show | jobs | submit | Moosturm's commentslogin

Shouldn't A.I. not be used in a way that it only tries to assist? E.g. a doctor takes a look first and if (s)he can't find anything then A.I. is checking as well (or in parallel).


My personal opinion: AI should be still kept out of anything mission critical, in all stages, except for evaluation.

There is other comment very correctly noting that this result is on 100% positive input. Same AI in “real life” would score probably much better eventually. But as you point out, if used as a confirmation tool, is definitely bad.


> Same AI in “real life” would score probably much better eventually

Either I don't understand your reasoning or you are very much wrong. A "real life" dataset would contain real negatives too and the result would be equal if false positive rate was zero and strictly worse if the rate was any higher. One should expect the same AI to score significantly worse in a real life setting.


Depends on what you call better or worse. In real life positives (TN) are far less common than negatives (TN), if this system does not have lots of FP (which is very possible), the accuracy will be much better than you may expect.

What I mean with “score” is having a relatively high accuracy.

Come let’s do the math: incidence of BC is 1 every 12, lets say. Now let’s say we have 12000 patients:

Acuracy = (TP + TN) / (TP + TN + FP + FN) = (1000 + 11000) / (1000 + 11000 + 300 + 0) = 12000 / 12300 =0.976 the test is 97.6% accurate… pretty impressive huh?

Tell me if I’m wrong. Is a know fact that you have to be careful when doctor speak of % accuracy.


There was a study that found that, in radiology, human-first assessment resulted in worse outcomes that human-alone. Possibly the human's letting borderline cases through, on the assumption that the machine will catch them.

There's a roundup of such findings here, but they're a mixed bag: https://www.uxtigers.com/post/humans-negative-value I suspect you need careful process design to get better outcomes, and it's not one-size-fits-all.


In case you missed it, they weren't using AI to make these diagnoses.


My old website from the 90s looks disturbingly similar to this one.


I love "bat". It just give readability in the terminal a boost.


Are there also historical pages like that of IBM to see how good the predictions were?


Now someone has to find interesting ROMs in Pi.


Just took a look at the list and all I said out loud: WOW.


Those are the things I like about John GC: still having fun with the small things in life.


Life is far to short to take it seriously all the time.


Those authors are the elite of DB mastery.


What an absolutely fantastic read.


This is phenomenal


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: