Shouldn't A.I. be used in a way where it only assists? E.g. a doctor takes a look first, and if (s)he can't find anything, then A.I. checks as well (or in parallel).
My personal opinion: AI should still be kept out of anything mission-critical, at all stages, except for evaluation.
There is another comment, very correctly, noting that this result is on 100% positive input. The same AI in "real life" would probably score much better in the end. But, as you point out, if used as a confirmation tool it is definitely bad.
> The same AI in "real life" would probably score much better in the end
Either I don't understand your reasoning or you are very much wrong. A "real life" dataset would contain real negatives too, and the result would be equal if the false-positive rate were zero and strictly worse if the rate were any higher. One should expect the same AI to score significantly worse in a real-life setting.
Depends on what you call better or worse. In real life, positives are far less common than negatives; if this system does not produce lots of FPs (which is very possible), the accuracy will be much higher than you might expect.
What I mean by "score" is having a relatively high accuracy.
Come, let's do the math: incidence of BC is 1 in every 12, let's say. Now say we have 12,000 patients: that's 1,000 true positives and 11,000 negatives.
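To make that concrete, here's a minimal sketch of the accuracy arithmetic. The 70% sensitivity and 95%/70% specificity figures are purely hypothetical stand-ins, since the thread doesn't give the model's actual operating point:

```python
# Minimal sketch: how overall accuracy depends on disease prevalence,
# for assumed (hypothetical) sensitivity and specificity.

def accuracy(n_patients, prevalence, sensitivity, specificity):
    """Overall accuracy = (TP + TN) / total for a given prevalence."""
    positives = n_patients * prevalence
    negatives = n_patients - positives
    tp = positives * sensitivity      # sick patients correctly flagged
    tn = negatives * specificity      # healthy patients correctly cleared
    return (tp + tn) / n_patients

n = 12_000
prev = 1 / 12                         # the 1-in-12 incidence assumed above

# On an all-positive benchmark, accuracy is just sensitivity:
print(accuracy(n, 1.0, 0.70, 0.95))  # 0.70

# On a realistic mix, low prevalence lets specificity dominate:
print(accuracy(n, prev, 0.70, 0.95)) # ~0.93

# ...but only while false positives stay rare; a sloppier model drops fast:
print(accuracy(n, prev, 0.70, 0.70)) # 0.70
```

So whether the "real life" score looks better or worse than the benchmark hinges almost entirely on the false-positive rate, which is exactly the point of contention above.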
There was a study that found that, in radiology, human-first assessment resulted in worse outcomes than human-alone. Possibly the humans were letting borderline cases through, on the assumption that the machine would catch them.
There's a roundup of such findings here, but they're a mixed bag: https://www.uxtigers.com/post/humans-negative-value I suspect you need careful process design to get better outcomes, and it's not one-size-fits-all.