I used Claude Code to get a second opinion on my MRI

AceJohnny2 · 2026-06-28T18:12:34 1782670354

> There's something incredibly peaceful about being in the hands of an expert you trust. [...] AI can absolutely shatter that feeling in an uncomfortable way [...] but I don't know if I can fully trust AI either.

This really is key. We know we can't trust the AI, but at the same time we're also more comfortable asking the AI for clarifications or confronting it. Not having a time-bound appointment or paying by the hour helps a lot. But even then, more information doesn't necessarily help!

I once brought my 11-year-old car, a Civic with 150k miles, to multiple garages. I figured I'd play the "second opinion" game to correlate what the garages recommended to decide on what needed to be done...

I got 3 completely unrelated recommendations, including one that I knew was invalid! I felt worse off than when I started!

The solution to uncertain information isn't more information, which the AI can certainly provide, it's better information, and AI cannot currently provide that.

rockostrich · 2026-06-29T15:02:46 1782745366

There are 3 kinds of mechanics:

Scammers who do the lowest effort diagnostic and "fix" to get you to pay a smaller amount of money to fix the problem in the short term even though it'll re-present itself a week/month/year later.

Upsellers who will find other things "wrong" with your car and pressure you into paying to fix them because they sound a lot worse than they are.

Good mechanics that will explain what they did to diagnose the issue and recommend different options depending on what the issue is.

Funnily enough, I've found that doctors tend to also fit into these 3 archetypes.

Aurornis · 2026-06-28T18:39:59 1782671999

I have multiple LLM subscriptions at any given time, plus an array of local models.

When I ask a question outside of my domain of expertise I like to ask all of the LLMs I have access to. I also create separate sessions and ask the same question multiple ways.

It’s revealing to see how many different and contradictory answers I get, most of which are presented confidently.

The last time I ran a medical question through Claude I couldn’t even get consistent answers between sessions.

It’s also scary how easily you can lead each LLM to the answer you have in mind. When I would start asking questions about different options that other LLMs had presented, each session would drift toward that explanation.

marcus_holmes · 2026-06-29T02:25:33 1782699933

In my day job we tried creating a credit assessor tool using an LLM as the credit assessor.

It did great, generated a report on the assessed business that was incredibly detailed and plausible.

Then I started running tests and getting into the details, and found that if you ran the same report on the same data, it generated completely different, still very plausible, results. I could run the same source data through the assessment process 10 times and get 10 very different results. We had to can the project and go a different route.

LLMs are designed to produce plausible results, not factual results. We can fix this when using them for software dev by using linters and tests (though we've all had the experience where the LLM invents an API endpoint). I would not trust raw LLM output in any situation where that kind of testing and verification capability isn't present.
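A minimal sketch of the "run the same data 10 times" check described above, in Python. The names `repeat_assessment`, `agreement_rate`, and `make_flaky_assessor` are hypothetical, and the assessor here is a canned stand-in rather than a real LLM call. It only measures how often repeated runs agree with each other.

```python
import itertools
from collections import Counter
from typing import Callable, Sequence


def repeat_assessment(assess: Callable[[str], str], data: str, runs: int = 10) -> list[str]:
    """Run the same assessor on the same input several times and collect the verdicts."""
    return [assess(data) for _ in range(runs)]


def agreement_rate(outputs: Sequence[str]) -> float:
    """Fraction of runs that match the most common verdict (1.0 means fully consistent)."""
    if not outputs:
        raise ValueError("need at least one output")
    most_common_count = Counter(outputs).most_common(1)[0][1]
    return most_common_count / len(outputs)


def make_flaky_assessor(verdicts: Sequence[str]) -> Callable[[str], str]:
    """Hypothetical stand-in for an LLM call: ignores the input and rotates through canned verdicts."""
    rotation = itertools.cycle(verdicts)
    return lambda data: next(rotation)


if __name__ == "__main__":
    assessor = make_flaky_assessor(["approve", "decline", "approve", "refer"])
    verdicts = repeat_assessment(assessor, "same source data", runs=8)
    print(verdicts)
    print(f"agreement: {agreement_rate(verdicts):.2f}")
```

A low agreement rate flags the instability described here. A high one only shows consistency: the runs could still agree on the same wrong answer.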

yubblegum · 2026-06-29T12:53:28 1782737608

> LLMs are designed to produce plausible results, not factual results.

They are true to their name: Language models. It is precisely the same problem in a language: a grammatically correct sentence is not necessarily true.

Suppafly · 2026-06-29T06:26:06 1782714366

What's crazy is that there are a ton of businesses building processes around LLMs that haven't done this exercise and fully believe the LLM is giving them accurate data.

bondarchuk · 2026-06-29T13:17:50 1782739070

It's funny that if the LLMs had all given the same result each time (it sounds like) you would have considered it more valid, even though it might just be giving a single wrong answer more consistently.

coderatlarge · 2026-06-29T15:03:20 1782745400

surely getting the same answer multiple ways suggests it is more likely in the probability space?

xbmcuser · 2026-06-29T06:46:48 1782715608

Yup, I use LLMs to write scripts that process data for me; I don't ask the LLMs to process the data themselves. Even when I wrote something for my day trading, I used an LLM to write scripts that do all the processing, and predict price movement from that. The more the data is pre-processed, the more all the LLMs come up with similar trades.

adamddev1 · 2026-06-29T05:29:08 1782710948

Linters and tests help of course, but they cannot "fix" the problem since tests cannot prove the absence of bugs.

marcus_holmes · 2026-06-29T05:35:35 1782711335

agree, and I think we'll see more use of formal methods with LLMs for this reason

ryukoposting · 2026-06-29T05:53:08 1782712388

At a certain point it just feels like we're reinventing the concept of programming languages from first principles.

ncruces · 2026-06-29T08:48:28 1782722908

I'm using this new programming language: it's called LLM prompting, and everything is undefined behavior.

marcus_holmes · 2026-06-29T09:10:02 1782724202

Hardly the first time we've done this - we had to do it with compilers too

dirkt · 2026-06-29T05:13:19 1782709999

What happened to VERIFYING an answer? Does nobody do that anymore?

When I ask an LLM, I trace the sources, and see if they make sense.

More often than not the sources don't actually say anything about the topic in particular...

> It’s also scary how easily you can lead each LLM to the answer you have in mind.

Exactly. Which is why "treat an LLM like a human expert who can answer your question" doesn't work. It's more like a human bullshitter who makes up convincing looking answers, and tries to please you. If the answers have actually some grounding in the training material, that's useful as some kind of holistic google, but often it's not.

palata · 2026-06-29T08:19:45 1782721185

> What happened to VERIFYING an answer? Does nobody do that anymore?

The problem with medical advice is that you may not be competent to verify the answer, right?

I agree that asking 5 LLMs to vote and trusting the answer is totally the wrong approach, of course. But LLMs (and traditional material) can help you get more informed. For instance, instead of going to your doctor with the LLM diagnosis and trying to convince the doctor that the LLM is right, you can try to build your own understanding of the problem and go ask the doctor to explain to you what you understood correctly and what you misunderstood.

If you have some understanding, it's harder for a specialist to bullshit you. But you need your own critical thinking and you need to put effort into actually learning something, blindly trusting and repeating what LLMs say doesn't help.

spwa4 · 2026-06-29T14:52:07 1782744727

Or more specifically in this case: the patient was obviously insisting on a diagnosis and treatment based on ... a slightly hurting shoulder, with zero visible or detectable phenomena.

So the doctors gave him what he wanted: a treatment ... and Claude told him the treatment was a placebo. Correctly, I might add.

Yeah, it is absolutely not what the patient wanted to hear. BAD doctors! Except ... no, not really.

Does that explain what happened here?

prmph · 2026-06-29T09:15:49 1782724549

I've also noticed the opposite problem: Sometimes the LLM, when asked a detailed question (probably with some lead-in), pushes back in a way that betrays that they fell back to general tropes without really considering the nuances of your specific context.

This happens many times, and I usually have to lead the LLM through a chain of reasoning to prove to it that its objections, though generally sound, do not apply to my specific situation.

Someone not as well versed in the subject matter would think the LLM found a smoking gun (which they love to do), and be led on a wild goose chase.

wizzwizz4 · 2026-06-29T10:58:47 1782730727

> I usually have to lead the LLM through a chain of reasoning to prove to it

What's the point of doing this?

prmph · 2026-06-29T11:34:50 1782732890

So that hopefully we can go further in the discussion without it having to repeatedly bring up those (discredited) objections.

But it does forget, and I'd have to prime it again for another session.

mathieuh · 2026-06-29T07:35:03 1782718503

As you say, often you check up on the LLM's "reasoning" and it doesn't follow at all, or you can easily get it to contradict itself with just as much certainty as it had about its previous convictions.

It is very scary to me that people are entrusting potentially life-altering decisions to these things.

otabdeveloper4 · 2026-06-29T05:30:33 1782711033

> When I ask an LLM, I trace the sources, and see if they make sense.

Professional tip: you can cut out the LLM middleman here and save a lot of time and money.

microgpt · 2026-06-29T06:26:26 1782714386

What would you use then? Google Search, which is just a shittier LLM?

base698 · 2026-06-29T10:55:31 1782730531

My step mom was having debilitating pain. A year of going to doctors and no one was able to find a cause. I scanned her discharge paperwork, which had her prescriptions on it, and gave it to Claude. It identified a prescription that had that exact side effect. They later confronted her primary care doctor, who concurred and took her off it.

A friend of mine's wife recently passed. They were chasing a suspected heart defect for over a year. She had been intermittently fainting. At about the year mark they decided to scope her digestive tract. They found bleeding ulcers from cancer that was all over her body. I input her fainting symptoms into Claude and gastro impact was number two suspected after heart issues.

I have a few other cases it's helped with. I'm not sure it could do worse than my own experience with the medical system. This is doubly true in places that lack any sort of medical care.

LogicFailsMe · 2026-06-29T12:45:16 1782737116

As someone who uses Claude Code to summarize published research, you have to ground it in peer-reviewed results or it gets lost. But also, I am grounded with two degrees in the source material. So I am feeding it my views and asking if the published work agrees or disagrees with my opinions, and I get fantastic results that way, to the point of knowing current clinical trials and treatment regimens better than most of the oncologists, which led to a great conversation with the clinical trials team. This doesn't replace people, but it augments existing expertise amazingly well.

But also, I hear so many tales of running out of tokens. I ask Claude Code to build a tool to perform a task. I review the tool and then I let it rip if I'm happy with it. As I understand things, most just ask Claude Code to do the task. That seems a bit fraught.

Anyway, you have to impose constraints IMO and ask the right questions to get the answers you need or yes Claude Code (or any other LLM) will eventually just agree with you.

parpfish · 2026-06-29T13:58:30 1782741510

LLMs are well suited to my (some would say annoyingly) curious nature.

when i get an answer, my first instinct is to ask a ton of follow-ups and "what about"s. i've learned to tamp this down with fellow humans, but with LLMs it's great because most of the time the response is "you're right, something doesn't add up... let me try again". i think we eventually converge on something reasonably true

palata · 2026-06-29T08:16:28 1782720988

> It’s also scary how easily you can lead each LLM to the answer you have in mind.

Scary in this context of course, but I find that it is an interesting thought for coding: it suggests that maybe, a developer who knows what they are doing will end up leading the LLM to coding something that makes more sense than a developer who doesn't know and just vibe-codes blindly.

Sounds pretty obvious, but I wanted to say it.

ncruces · 2026-06-29T08:52:50 1782723170

And all it takes is not blindly accepting the first thing it spews if you suspect there's a better answer (and are in a position to evaluate that better answer).

Esophagus4 · 2026-06-28T20:16:02 1782677762

Have you ever let the LLMs “discuss” with each other to see if that would give better answers?

You might end up with the answer from the most persuasive LLM, but you might also end up with better results.

Wonder if there is a paper out there on this.

scheme271 · 2026-06-28T20:42:04 1782679324

The problem is how do you know whether the answer is just the most persuasive or actually the most accurate one? It's hard to figure this out without domain knowledge.

sizzle · 2026-06-29T04:14:27 1782706467

Take the output to a Radiologist and verify the veracity of the statements.

RussianCow · 2026-06-29T06:03:39 1782713019

At that point, cut out the LLM and just see the radiologist.

tuxguy · 2026-06-29T06:33:18 1782714798

there is often discordance between radiologists (& doctors in general) when reading the same scan (same case vignette) as well!

yen223 · 2026-06-29T07:24:06 1782717846

Do people here not realise that "second opinions" are a thing because humans disagree with each other when presented with the same case all the time? It's not just an LLM thing!

marcta · 2026-06-29T05:24:33 1782710673

Why should a radiologist have to debunk AI slop? They have enough to do already. That's the same mentality that is frustrating open-source repositories with sloppy pull requests, and saying "here, sort this out for me".

dumb1224 · 2026-06-29T08:58:23 1782723503

Depending on the disease, even in cancer there's myeloma which may cause bone metastasis in many parts of the body with very focal lesions. Radiologists can't assess each and every one of them, or even find them all. So AI can definitely help in these scenarios.

wizzwizz4 · 2026-06-29T11:00:06 1782730806

And that AI will not be fancy autocomplete: it will be some kind of image classifier that is not trained on Reddit.

Esophagus4 · 2026-06-29T02:25:06 1782699906

I dunno, I could see it working.

I do something similar with reviewing code: I have one agent write the code and another reviews it, then they go back and forth for a bit improving the code. Seems to yield better results than one agent alone.

Seems like a similar principle.

scheme271 · 2026-06-29T02:45:25 1782701125

The difference is that in the code situation, you can run unit tests on the code, compile it, etc. Unless your LLMs are ordering diagnostics and reviewing the results, there is no further information that the LLMs have on the situation. Having a second LLM review the first is counterproductive, if the 2nd LLM is better, why not use it directly? If not, then what prevents it from sending the first on some incorrect tangent?

RussianCow · 2026-06-29T06:05:40 1782713140

Also, there are multiple "correct" ways to code something, so imperfect code that solves the problem is still useful. A medical diagnosis is either correct or incorrect.

Esophagus4 · 2026-06-29T13:00:39 1782738039

Eh, I think you’re just trying to justify your pre-existing position that this can’t work.

https://www.nature.com/articles/s41746-026-02619-0

https://www.nature.com/articles/s44360-025-00007-8?fromPaywa...

Different prompt approaches and training doctors to use LLMs can improve accuracy of LLM-assisted diagnosis. It’s pretty reasonable to hypothesize that LLM “peer review” could improve that as well.

XorNot · 2026-06-28T22:35:33 1782686133

Worse is that LLMs are trained to be persuasive by default. The "you're absolutely right..." stereotype is because these things are A/B tested on response quality and we know from studies people reliably rate vibes better than anything else - e.g. while the quality of hospital accommodations likely has some impact on patient outcomes, the view and decor of the room certainly did not fundamentally change the quality of the care provided but it is the largest determinant in how well people rate that care.

mncharity · 2026-06-29T04:42:01 1782708121

With direct discussion, the same tendency to harmonize towards groupthink applies.

Aside from the statelessness GP mentioned, one can insert anti-conciliatory intermediation. "I saw a random claim go by, but something about it seems not quite right. What am I missing? They said: [...]." Weaponizing the bias, and orchestrating the discourse from the harness.

cadamsdotcom · 2026-06-28T21:18:35 1782681515

The problem with trying to write a paper is the results depend on RNG.

red75prime · 2026-06-29T06:20:32 1782714032

Run it with temperature 0 if you want to minimize randomness. Sampling from a probability distribution is not a problem by itself. The problem is when the probability distribution prioritizes wrong answers.
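A minimal sketch of what temperature does to the sampling distribution, using made-up logits for three candidate answers. The function name and numbers are illustrative assumptions, not any vendor's API; real services typically treat temperature 0 as picking the highest-scoring token, and may still not be fully deterministic in practice.

```python
import math
from typing import Sequence


def softmax_with_temperature(logits: Sequence[float], temperature: float) -> list[float]:
    """Turn raw scores into probabilities; lower temperature sharpens the distribution."""
    if temperature <= 0:
        raise ValueError("temperature must be > 0 (temperature 0 is the greedy limit)")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max before exponentiating for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    # Hypothetical scores for three candidate answers; index 0 scores highest.
    logits = [2.0, 1.0, 0.0]
    for t in (1.0, 0.5, 0.1):
        probs = softmax_with_temperature(logits, t)
        print(t, [round(p, 4) for p in probs])
```

As the temperature drops, nearly all of the probability mass moves to the highest-scoring answer. If that answer is the wrong one, lowering the temperature just makes the model wrong more consistently.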

NonHyloMorph · 2026-06-28T21:52:17 1782683537

That doesn't make it different from any other problem measured by statistical significance, averaged over a big enough series of comparisons, no?

jdblair · 2026-06-29T05:01:20 1782709280

The best mechanic I ever had kept my ‘98 Subaru going past 200k miles. Once during a repair I asked him to do an inspection and tell me if there was anything else I should replace. He told me not to do that, and that any mechanic would always find something, but not necessarily the next thing to break.

He said it better using an expression I hadn’t heard before or since, something like “don’t go looking for goats when your herd is already with you.”

dumb1224 · 2026-06-29T09:04:55 1782723895

Exactly. Old parts of the system will be working if you leave them undisturbed. Mechanics have very good intuitions of this sort of thing.

I read before that there's proper engineering / physics theory about this too: a car as a machine is like a linear/smooth physical system with multiple weaknesses. Over a long period of running, many places might weaken, but it still evolves into a slightly different smooth system, until you introduce a replacement which causes an impedance mismatch or something like that.

tass · 2026-06-29T09:55:31 1782726931

Maintenance-induced failures are what it’s called with small aircraft.

You’ll do something to prevent a failure (like, replace an old but functional alternator) but cause an oil leak or engine vibrations because you had to remove the propeller to complete the job.

john-tells-all · 2026-06-28T18:51:47 1782672707

There's a big difference between a _puzzle_ and a _mystery_. In a puzzle, the goal state is known, and as more pieces - data - appears, the goal gets closer. You know how far you are from the goal.

A mystery is worse. With each additional piece of data, the goal gets farther away. Everything is more and more confusing.

(Popularized by Malcolm Gladwell)

mrlongroots · 2026-06-28T21:12:36 1782681156

Maybe I am missing something but I just find this wrong.

Everything is a puzzle: there is one "Truth" or one diagnosis. You (a smart human) should be able to converge on it by cross-examining your LLMs. By themselves, they have no interest in revealing this, no stakes, which makes them tools only useful at the hands of a capable investigator.

Paracompact · 2026-06-28T21:40:50 1782682850

> You (a smart human) should be able to converge on it by cross-examining your LLMs.

What makes you think this is fundamentally different from cross-examining ELIZA? There is no guarantee that the LLM will help you converge on anything. Indeed actually calling out an LLM on BS tends to eventually produce an "I don't know and can't help you further" answer (as it should).

mrlongroots · 2026-06-28T21:58:26 1782683906

> There is no guarantee that the LLM will help you converge on anything.

Absolutely. The guarantee does not come from the LLM. The LLM is simply an improved version of Google Search.

The guarantee can only come from a systemic application of epistemic discipline and reasoning, which is very much (smart) human territory.

Put it another way, I could make good decisions with/without LLMs, with some uncertain diagnostics as input. I would have to trawl through 50 papers myself, and it is possible that my decision arrives 5 years too late as a result. LLMs enable trawling and do some of the legwork in connecting the dots, but are ultimately only as capable as the orchestrating human.

fc417fc802 · 2026-06-28T22:31:40 1782685900

The same goes for a human expert. There's no guarantee of convergence and you could eventually end up at "I don't know".

scheme271 · 2026-06-29T01:48:47 1782697727

The problem is that the diagnosis might not be known for a while. There are a few conditions and diseases that require an autopsy for a guaranteed diagnosis and are therefore diagnosed based on symptoms in clinical settings.

010101010101 · 2026-06-28T18:32:27 1782671547

> The solution to uncertain information isn't more information, which the AI can certainly provide, it's better information, and AI cannot currently provide that.

I'd argue that AI _can_ currently provide that, but that it can't do it _reliably_, and that to non-experts it's impossible to differentiate, which makes it all the more dangerous.

margorczynski · 2026-06-28T18:54:30 1782672870

Isn't that the case with human "experts"? If you had encounters with doctors, mechanics, etc. you'll know you can get a completely different diagnosis for the same problem which obviously means (in most cases) that the person you thought an expert is wrong.

What is needed are studies that will take a cold look at the actual results because AI seems to be required to be perfect or it is useless. It just needs to be as good as a human for most stuff, but in the long run it will be much better. At least that's what extrapolating current reality shows us.

wwweston · 2026-06-28T21:18:48 1782681528

We have systems around humans that exist to manage expertise gaps, credibility signals, and accountability. This is part of what makes humans as good as they are, along with specialized training and some measure of meritocratic selection. We license and regulate and account and litigate to make a system that responds and improves.

Some of this might be applicable to LLMs, but some isn’t and much of it would be resisted. This is one reason we’re not likely to get “as good as a human” because at some level we’re not optimizing for the outcomes; we’re optimizing for speed, convenience, some participant’s economics, and underlying beliefs.

malfist · 2026-06-28T21:36:38 1782682598

I've been going through PT for a hypermobility disorder related injury and I've used an AI to help me figure out "interview questions" to see if a PT knows anything about hypermobility or is willing to learn. I found it helpful to select a new PT after my first PT I trusted made things worse by prescribing stretches and no load progression from rest and recovery back to deadlifts.

kerabatsos · 2026-06-28T23:33:37 1782689617

People put a lot of faith in human “guardrails”, standards, etc. But the same argument could be made that trusting human experts without discernment is as dangerous as trusting AI or Google or whatever other non-human source. It’s always been the case.

malfist · 2026-06-29T13:45:30 1782740730

On the plus side, it's not like it was blind faith. Human judgement led me to seek out another expert when I didn't like what the current PT was suggesting I do (no practicing hip hinge movements before moving to deadlift, advising against Valsalva for heavy lifts, ignoring feedback that a movement was causing pain, prescribing stretches to increase flexibility in a hypermobile person).

The LLM also gave me a bunch of questions to ask a new PT but I didn't have the understanding to judge the responses so I did more research. One of the things the LLM wanted me to ask about was questions about form and force closure and ideally would get a response about the oblique sling across the back. My PT didn't give me that exact response, but explained it in much more lay person terms, but because I had done my research I was able to validate their response was directionally correct. And so far, my experience with this PT has been much better. We're doing block pulls at 70% of my prior deadlift weight, next week we're going to go way back on weight and lower it some to get closer to proper deadlifting and work in some asymmetric loading exercises.

draftsman · 2026-06-29T13:33:00 1782739980

May I ask, is the hypermobility disorder you refer to EDS? If so, what was the injury?

malfist · 2026-06-29T13:43:59 1782740639

My doc didn't do the full set of tests/exams for an hEDS diagnosis, so technically it's just generalized hypermobility spectrum disorder based on past medical history and a near "perfect" Beighton score. It could be hEDS though; as a lay person reading the diagnostic criteria, I either fit it or am borderline.

Injury was to my SI joint, I've historically always irritated it lifting, but I set a new PR for deadlifts and it was debilitating for 3 weeks before my other half made me stop being stubborn and see a doctor about it.

I've also had my left shoulder joint surgically repaired after multiple dislocations.

ed_elliott_asc · 2026-06-28T18:39:48 1782671988

The soothing sound of ChatGPT telling us how right and clever we are…how could it possibly hallucinate, certainly not 5.5

nonethewiser · 2026-06-29T01:05:11 1782695111

You’ve really honed in on the key issue. This is exactly how keen Hacker News commenters approach this.

Bratmon · 2026-06-28T20:06:42 1782677202

To provide a competing point of anecdata: A Gemini diagnosis saved me $3,000 in unnecessary repairs on my Civic.

fluidcruft · 2026-06-28T21:27:15 1782682035

YouTube has saved me at least that much in appliance repairs... and it doesn't even have an AI. It's amazing how valuable access to information can be.

ahepp · 2026-06-28T22:45:17 1782686717

I would love to hear more about this

dyauspitr · 2026-06-28T21:05:42 1782680742

Saved me $2000 on a koi pond pump and filtration system

dumb1224 · 2026-06-29T08:49:44 1782722984

I tried that AI diagnosis for my 15-year-old Ford C-Max too. However, with a diagnostic problem the issue is that unless you've got the ground truth, there's simply no way to verify any tool / human with a metric that you can compare and use to decide on future tasks.

The AI might be very good at diagnosing all minor issues, but might not lead to a successful repair, whereas human mechanics are extremely good at the 80% of major issues, with a diagnosis that may not be the ground truth but will lead to successful repairs (that might not address the root cause but simply patch it). So it comes down to managing expectations / outcomes.

throwaway2037 · 2026-06-29T11:26:00 1782732360

You nerd sniped me with the story about your used car. What happened in the end? I really want to know! There are some fun YouTube channels that basically do the same. Someone who is an expert auto mechanic takes a used car to various repair garages and asks them to recommend a course of action.

namelessone · 2026-06-29T11:41:38 1782733298

Sounds like a fun watch! What is the name of the channel?

clates · 2026-06-29T14:04:05 1782741845

> The solution to uncertain information isn't more information, which the AI can certainly provide, it's better information, and AI cannot currently provide that.

Aside from the LLM-ism (it isn't foo, it's bar) - this is a thought terminating cliche. You definitionally don't know if some information is better or not given that you were uncertain about the information in the first case.

"I went to three mechanics and got three different answers" - your takeaway is just "Ah - I clearly need better informed mechanics."

Which is on its face absurd because if you could clearly judge the ability of the mechanics you wouldn't need their evaluation. You'd just do the evaluation yourself.

serial_dev · 2026-06-28T21:31:40 1782682300

These tools can’t reliably fix a 4px misalignment on my icon, better ask them about a medical report… but honestly, I would do the same.

Gigachad · 2026-06-28T23:50:21 1782690621

Tbh LLMs pulling data out of medical documents in their training set and searchable online is likely a much easier task than fixing some weird CSS alignment issue.

dd8601fn · 2026-06-29T14:11:54 1782742314

Also most of them can’t actually see what they’re doing. It’s hard for me to get things pixel perfect while blindfolded, too.

darkwater · 2026-06-29T09:43:09 1782726189

> I got 3 completely unrelated recommendations, including one that I knew was invalid! I felt worse off than when I started!

I would frame it differently: you now know which shops are not to be trusted. So, next time you need one, you will make a better decision.

abirch · 2026-06-29T10:06:57 1782727617

There are few things better in this world than having a car shop you can trust. I found one and pray that management doesn't change.

jbs789 · 2026-06-29T12:23:46 1782735826

Especially in the medical field where the placebo effect / mindset shapes outcomes.

ryukoposting · 2026-06-29T05:50:45 1782712245

> I got 3 completely unrelated recommendations, including one that I knew was invalid! I felt worse off than when I started!

I almost had a very similar experience with my beater Lexus. It took 2 independent shops and 3 dealers to finally figure out what was causing the ABS to go off randomly at low speeds. Turns out there's some obscure Toyota-specific tool from the late '90s that picked up a proprietary diagnostic code, and the third dealer was the only one that still had that particular piece of equipment.

...and of course, the thing that's broken has been out of production for 20 years and remanufactured ones cost more than the car is worth. I ended up just unplugging the ABS control module.

Point being: once I knew what was wrong, all the seemingly contradictory information from the other 4 shops suddenly fit together. It's just such a weird thing to go wrong that no reasonable tech would ever have considered it.

weatherlite · 2026-06-29T07:50:41 1782719441

> it's better information, and AI cannot currently provide that

It sometimes can. If it flat out never could, no one would use it. People use it, lots of them.

UltraSane · 2026-06-28T23:56:04 1782690964

> There's something incredibly peaceful about being in the hands of an expert you trust

This is the primary business model of enterprise IT and is why companies pay so much for 4 hour disk replacement.

nonethewiser · 2026-06-29T01:03:08 1782694988

You only got 3 opinions on your car? Why not 50? You could have found a more useful signal by getting more information.

I get it - getting an opinion from a mechanic is time consuming. Not true of AI though.

kgeist · 2026-06-28T21:14:00 1782681240

A few years ago (before the AI craze), I was misdiagnosed with tuberculosis. I had a chronic cough, and an outsourced radiologist at a clinic found signs of tuberculosis. The findings were sent to the city's tuberculosis hospital, as required by the country's law. The doctors there took the radiologist's conclusion at face value and required me to stay at their hospital for at least 8 months under a strict, prison-like regime. There was no option to say no, because I was considered some kind of biohazard, and by law I had to comply.

Before I was admitted, I quickly found another radiologist, who diagnosed pneumonia instead. I sent his report to the chief doctor at the tuberculosis hospital, and after some deliberation they concluded that the original reading was wrong. Turns out the doctors there can't read scans at all and just believe whatever a radiologist says...

The funny thing is, they had already officially put me on the tuberculosis register and didn't want to admit they had made a mistake. So instead, they simply gave me another paper saying that I had been cured of tuberculosis by them... in 7 days. I'm probably the only person in the country to defeat tuberculosis in a week :)

So if you don't trust the radiologist/doctor, maybe find another doctor if you can afford it? You can compare their conclusions and see if they match. Two unrelated doctors or radiologists saying the same thing is probably about as close to the truth as you're going to get. I'm not sure though whether I should trust AI or humans more. AI can hallucinate, but I've been misdiagnosed by humans so many times too...

azan_ · 2026-06-28T21:22:01 1782681721

How is that possible? You can't diagnose tuberculosis based on imaging alone, and a tuberculosis hospital has to know that!

kgeist · 2026-06-28T21:33:12 1782682392

Yeah, I know! It was strange. They gave me a test, and it came back negative, but they insisted it was negative because I had "latent tuberculosis," which supposedly wasn't detectable by the test yet but was about to become active.

I forgot to mention that, besides getting a second opinion from another radiologist, I also took a more modern test at another private clinic. That test has better detection rates than the one the state clinic used, and it came back negative too.

I have suspicions they had some kind of government quota to keep the hospital staffed with patients in order to receive funding. Or they were just completely incompetent. I pushed back by bringing them another radiologist's report and the results of a better test that I paid for myself, so I guess they decided to back down.

spwa4 · 2026-06-29T10:56:49 1782730609

You'll find doctors always believe and treat the worst diagnosis any professional has put on a case. That's a legal thing, not a skill issue.

Think about the consequences of mistakes in both directions ...

shiandow · 2026-06-29T06:40:17 1782715217

Not only that, what is the point of confining someone to prevent the spread of a disease that about a quarter of the world is already infected with?

I suppose there could be reasons, but I don't know them.

kgeist · 2026-06-29T14:36:40 1782743800

Some countries and jurisdictions still have laws that allow for the involuntary confinement of tuberculosis patients, I guess dating back to the times when tuberculosis was rampant in those countries? And most professionals seem to be okay with the policy:

https://theunion.org/news/is-involuntary-incarceration-of-tb...

> 17% said that, as a matter of principle, the involuntary incarceration of TB patients was inappropriate on any grounds.

> Regionally, members from Europe Region had the highest percentage of respondents objecting to the policy as a matter of principle (26.2%) while the North America Region had the lowest (3%).

The emergence of multi-drug resistant tuberculosis in the 1990s is probably one of the reasons:

> Respondents most strongly supported the policy of incarceration for patients known to have multidrug-resistant TB (49.7%)

ryan_n · 2026-06-29T14:06:21 1782741981

Yeah, I find that a lot of stories on the web about doctors misdiagnosing things have oddities like this that don't seem to make sense. It often seems like the author is leaving something out. Not saying OP is lying, but TB is a very, very weird conclusion to come to from just one radiology report...

kgeist · 2026-06-29T14:08:13 1782742093

See my answer in this same subthread. I was perplexed myself as to why I was diagnosed based on just one radiology report. But the moral of my story is that you can always try to obtain a second opinion from another doctor. I'm not saying doctors shouldn't be trusted in general.

comboy · 2026-06-29T08:31:08 1782721868

Incentives.

engeljohnb · 2026-06-29T11:37:44 1782733064

A second opinion is a smart move if one has doubts about their diagnosis. Doctors make mistakes, and even though I've worked with countless great doctors, I've never worked a job where there wasn't at least one who was undiscerning, or downright lazy and negligent. It's hard to tell people to trust their doctor when I know there are plenty of doctors out there like this.

But AI as of right now is worse than any bad doctor I've ever worked with.

CodingJeebus · 2026-06-29T14:48:40 1782744520

The healthcare affordability crisis is only going to exacerbate the trend of using AI as a replacement for a real doctor. I went to urgent care a few months ago to get tested for COVID and two other flu strains and it came out to almost $500.

Anecdotally, several people in my life who embrace less traditional (and sometimes more conspiratorial) views on modern healthcare tend to be the ones that can't afford it. A confident-sounding chatbot to answer questions day and night about what's going on with your body is very seductive in a world where access to real healthcare is getting further and further out of reach.

engeljohnb · 2026-06-29T15:02:27 1782745347

That's the balance I'm finding it very hard to strike when talking to my family about doctors.

Everyone is either an "all doctors are scams" QAnon type, or they blindly trust everything their doctor says, no matter how fishy, in fear of coming off as one of the former group.

And, to use a phrase we all hate by now, you're absolutely right. When most people have to go into debt to even see a doctor, what can people possibly conclude from that besides "all doctors are out to scam you?"

igortg · 2026-06-28T21:37:47 1782682667

I had a similar experience. My son had pneumonia and was still feeling pain after 10 days of antibiotics. I took an X-ray to three different doctors, and only one got the right diagnosis (pleural effusion). We really should have a central place with top-notch professionals looking at these, instead of having each doctor work it out by themselves.

mncharity · 2026-06-29T05:36:47 1782711407

I once worked on a medical hackathon concept for computer-assisted population screening for cervical cancer in a developing nation. Community health workers take photos. The AI would look at the images, and make a call of "clearly negative" vs "clearly positive" vs "needs (scarce) expert review". But taking good photos is hard, so it's also "photos insufficient" and "worker needs additional mentorship on taking photos". Only by computers reducing all three costs - expert workload, exam success, and quality-control/training - might successful deployment be financially and logistically plausible for that nation.
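
A rough sketch of that routing in Python, purely as an illustration. It assumes the model emits a probability and a photo-quality score; every name and threshold below is hypothetical and not taken from the hackathon project.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum


class Route(Enum):
    CLEARLY_NEGATIVE = "clearly negative"
    CLEARLY_POSITIVE = "clearly positive"
    EXPERT_REVIEW = "needs (scarce) expert review"
    PHOTOS_INSUFFICIENT = "photos insufficient"


@dataclass
class Reading:
    p_positive: float     # hypothetical model probability in [0, 1]
    photo_quality: float  # hypothetical image-quality score in [0, 1]


def triage(r: Reading, min_quality: float = 0.6,
           low: float = 0.05, high: float = 0.95) -> Route:
    """Route one exam. Thresholds are made-up placeholders."""
    # Bad photos are rejected first: a confident call on an unusable image is worthless.
    if r.photo_quality < min_quality:
        return Route.PHOTOS_INSUFFICIENT
    if r.p_positive <= low:
        return Route.CLEARLY_NEGATIVE
    if r.p_positive >= high:
        return Route.CLEARLY_POSITIVE
    # Everything in between goes to the scarce human expert.
    return Route.EXPERT_REVIEW


def needs_mentorship(history: list[Route], max_bad_fraction: float = 0.3) -> bool:
    """Flag a worker whose exams are too often rejected for photo quality."""
    if not history:
        return False
    bad = sum(1 for route in history if route is Route.PHOTOS_INSUFFICIENT)
    return bad / len(history) > max_bad_fraction
```

The only design point the sketch tries to capture is that the quality check runs before any diagnosis, and that the expert only sees the uncertain middle band.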

beacon294 · 2026-06-29T01:28:46 1782696526

What country / municipality are you in? This is not my understanding of Tuberculosis...

rpastuszak · 2026-06-29T08:36:06 1782722166

Asking for a friend, who is in a somewhat similar predicament — it wasn’t Portugal, was it?

themantalope · 2026-06-29T03:02:10 1782702130

Radiologist. I don’t read MR shoulder exams in my day-to-day practice, but from the few pictures shown, I can’t conclusively disagree with the original report.

These models are generally terrible at reading medical images. The amount of public training data on the internet compared to the number of scans a radiologist reads in training is minuscule. There are obviously a ton of medical images in general, but very few of them, and even fewer accompanied by a report, are publicly available for download on the internet.

There are vision language models coming out of research labs that are excellent in describing and localizing findings. Still at the level of a 1st or 2nd year radiology resident, but as we all say - this is the worst the models will ever be.

deaux · 2026-06-29T07:50:38 1782719438

Absolutely. It's very unfortunate that this post used the worst example possible of using LLMs for medical purposes.

General-purpose LLMs are _fantastic_ at medical diagnosis that do not involve imaging. I am completely convinced that given enough information and time, frontier models already outperform >90% of doctors on initial diagnosis of internal issues and suggesting medical tests to further reject or confirm the most likely theories. To the point where I'm eagerly waiting for the first hospital in the world that's willing to be open and honest about using them for that first step, and then proceeding from there. I'll be on a flight there as soon as one arrives.

At the same time, they're worse than useless at anything involving medical imaging. Asking them to interpret them is worse than trying to interpret them yourself as a layman. And you surely wouldn't interpret them yourself.

throwaway2037 · 2026-06-29T12:00:45 1782734445

> General-purpose LLMs are _fantastic_ at medical diagnosis that do not involve imaging.

Can you share the reasons that you believe this?

> At the same time, they're worse than useless at anything involving medical imaging.

What is special about medical imaging that makes AI/LLMs specifically bad?

riahi · 2026-06-29T13:22:07 1782739327

You can see it in just this PDF report.

It's multiple things. It never shows the subscapularis in the way that people actually look at the tendon. It hyperfixates on the axial when I find the sagittal much more useful for the subscapularis.

Figure 7. There's an arrow pointing "to the acromial undersurface". The arrow is not pointed to that location.

Figure 5. "thin bursal fluid". This is within physiologic variation, but it is calling it bursitis.

It keeps bringing up irrelevant normal things like the shape of the coracoacromial arch, I assume because lots of websites have information about that as a patient-focused possible cause for rotator cuff impingement.

I am reminded of the recent Stanford MIRAGE study which found that LLMs will happily hallucinate answers about medical images if the medical images are omitted.

https://arxiv.org/html/2603.21687v2

jawilson2 · 2026-06-29T14:14:29 1782742469

I don't understand why this is still confusing to people. The second "L" in LLM is language; these things are AWESOME at producing things that SOUND like language, including code. They have so much training data that the output is almost always grammatically correct, and often makes sense. Extending this, it has obviously been trained on data containing phrases like "acromial undersurface", "thin bursal fluid", and "coracoacromial arch", in the context of shoulder injury and related imaging. BUT IT DOES NOT KNOW HOW TO DIAGNOSE ANYTHING. So, it SOUNDS like a radiologist or specialist, and might be in the ballpark of correct-ish-ness, but ultimately is a fancy Markov model.

Maro · 2026-06-29T13:42:21 1782740541

I don't have insider information, but: if one of the AI companies really wants their models to become really good at this and publicly available datasets are scarce, they can probably just buy anonymized X-ray/MRI scans paired with the human doctor's diagnosis, and train on them. I don't know what the legal story is around this, but AI companies have near infinite money, so I'm sure they can buy their way around regulations (e.g. by buying the scans from a less regulated country).

yfontana · 2026-06-29T09:38:04 1782725884

Yeah, medical computer vision is a (fascinating) field with a lot of ongoing research. SOTA models are highly specialized, and are only getting good enough to be used by actual doctors and patients. Using a general purpose LLM to do this is similar to giving a credit card to Openclaw and telling it to make you rich through the stock market & cryptos.

throwaway2037 · 2026-06-29T11:55:58 1782734158

No trolling here: Do you feel threatened by the advance of AI/LLMs with respect to your field? I would. I am a computer programmer, and it absolutely feels threatening.

zoul · 2026-06-29T15:05:33 1782745533

As a programmer, I don’t feel threatened by the technology itself, but I do feel threatened by the second-degree effects such as what the technology does to our field, especially in the wrong hands.

odiroot · 2026-06-29T10:15:14 1782728114

I can see how your thesis is valid.

Like OP, I also had a shoulder MRI, and asked two AIs for an opinion (while awaiting a follow-up appointment to discuss the results).

They both suggested a much more serious problem than there actually was (as judged by an orthopaedic doctor).

billynomates · 2026-06-29T07:37:06 1782718626

Anecdotally, I've had Claude (Sonnet and Opus latest) consistently misread numbers from screenshots of my macro tracking app. Makes me skeptical of claims about its usefulness for anything requiring accurate image interpretation, let alone MRI analysis.

sxg · 2026-06-28T16:53:07 1782665587

I'm a radiologist but can't really weigh in without seeing the full 3D MRI dataset. Regarding this point:

> They performed shockwave therapy on my shoulder even though a recent clinical practice guideline says clinicians should not use or recommend shockwave therapy for rotator-cuff tendinopathy without calcification; I was told during ultrasound that there was no calcification.

Ultrasound isn't a great way to assess for calcification. It'll find large calcifications but easily miss small ones. A plain radiograph would be more helpful, but the MRI may have revealed it as well. Either way, shockwave therapy isn't harmful in the absence of calcification--it's just not helpful.

Edit: when a radiology report says something isn't present, there's always an implicit caveat that the finding isn't present within the context of the modality and images obtained. So an ultrasound report can state there are no calcifications while a plain radiograph can report the presence of calcifications without being inconsistent. Obviously very confusing to patients and people unfamiliar with medical jargon, but clarifying this in reports would make them sound even more qualified, "hedgey", and annoying to read than they already are.

ambicapter · 2026-06-28T19:20:21 1782674421

> So an ultrasound report can state there are no calcifications while a plain radiograph can report the presence of calcifications without being inconsistent. Obviously very confusing to patients and people unfamiliar with medical jargon

This is being overly nice, I think. Anyone who doesn't understand this is an idiot imo. You would have to assume that every type of diagnosis instrument has infinite clarity and is always correct to be confused in this case.

Reminds me of the Babbage quote where somebody asked him, "If I put the wrong question into this computing device, will it still give me the right answer?" His response, paraphrased: "I cannot fathom the logic of the minds which would come up with such a question".

MattyMc · 2026-06-28T23:37:21 1782689841

> Anyone who doesn't understand this is an idiot imo

I don’t think that’s true. Avoiding this mistake requires knowing that an ultrasound may not detect calcification. For a patient reading their own report, I don’t think that’s intuitive. I would expect most people to read “no calcifications” and assume that their joint has no calcifications.

Fr0styMatt88 · 2026-06-29T03:33:45 1782704025

Exactly. I was about to reply to the comment with “perfect example of not knowing what you don’t know” in terms of self-diagnosis.

My internal model is/was “if the scan wasn’t set up / can’t detect the thing, why would the statement be present at all?”.

That implicit assumption is really subtle.

nkrisc · 2026-06-29T00:06:36 1782691596

Most people should have learned at a young age that absence of evidence is not evidence of absence. My 8 year old understands this. After all, you can rarely ever prove something does not exist, only that it is unlikely to exist.

If a report states that X was not found, it does not mean X did not exist, it means it was not found.

What may be lost on the layperson is the nuance and understanding of how thorough or not a particular scan is and how much weight to give the findings and thus the odds that the report is correct.

sjducb · 2026-06-29T10:13:59 1782728039

> Most people should have learned at a young age that absence of evidence is not evidence of absence.

I’m fairly sure that there are no lions in my house. Lions are quite large and I’m capable of detecting lion sized objects with my eyes.

To demonstrate that something is not present you first define the object, then come up with a test that will reliably detect the object. If the test comes back negative then the object is not there.

In a strict philosophical sense I cannot prove that there are no lions in my house, the external world might not exist! A hypothesis that no one has thought of might be correct and that hypothesis could show that there are invisible lions in my house!

However I intend to act with the certainty that there are no lions in my house. Because I have no evidence of lions in my house.

Absence of evidence is evidence of absence.
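
How much evidence, though, depends on how reliably the test detects the thing. A quick Bayes' rule sketch in Python; the prior, sensitivity, and specificity values are made-up illustrations, not measured figures for any real scan.

```python
def p_present_after_negative(prior: float, sensitivity: float, specificity: float) -> float:
    """P(thing is present | test came back negative), by Bayes' rule.

    prior:       P(present) before testing
    sensitivity: P(test positive | present)
    specificity: P(test negative | absent)
    """
    missed = prior * (1.0 - sensitivity)          # present, but the test missed it
    true_negative = (1.0 - prior) * specificity   # absent, and the test agreed
    return missed / (missed + true_negative)


# Illustrative numbers only (assumptions, not clinical data).
reliable_test = p_present_after_negative(prior=0.2, sensitivity=0.99, specificity=0.95)
weak_test = p_present_after_negative(prior=0.2, sensitivity=0.50, specificity=0.95)
print(f"after a negative from a reliable test: {reliable_test:.1%}")
print(f"after a negative from a weak test:     {weak_test:.1%}")
```

With these made-up inputs, a negative from the reliable test leaves about a 0.3% chance the thing is there, while a negative from the weak test still leaves about 12%. My eyes are the reliable test for lions; whether a given scan is the reliable test for a given finding is exactly the context a layperson lacks.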

aforwardslash · 2026-06-29T00:52:27 1782694347

This is - by far - the most stupid stuff I've read on the internet the past few days. They didn't find cancer either (as well as a plethora of diseases that could be related to the symptoms), and afaik it's not in the report.

Yah you can argue that the tool is not ideal for that diagnostic, yadda yadda. I get it, and in the end I agree with the subtle difference you highlight, because it is something that makes sense to a certain kind of people. You know how many medics would read the report exactly like the author did? Too many.

How do I know? I'm not in a wheelchair, after being constantly misdiagnosed with the wrong imaging technique, thanks (mostly) to chance and a good deal of help from friends, including a surgeon. This seems to be a case where AI would be a valuable doctor's tool for differential diagnosis; instead we have know-it-alls who can't be bothered to verify, and AI that often gets details wrong. That is the problem.

Fr0styMatt88 · 2026-06-29T03:36:10 1782704170

I think it’s the combined depth AND breadth of knowledge that can be captured by AI models that is going to make them way better than most humans at this kind of stuff.

Sabinus · 2026-06-29T00:55:06 1782694506

It's like when finding out about the sex of your baby via ultrasound before they're born. If you're told it's a boy, you can be pretty certain you're getting a boy. If you're told it's a girl, you shouldn't get too attached to the idea. The ultrasound tech might just have missed the evidence your baby was a boy.

ytoawwhra92 · 2026-06-29T05:50:50 1782712250

"Calcifications not found" is a different statement from "no calcifications".

Even then, the context that "ultrasound isn't a great way to assess for calcification" is important when reading either statement. Laypeople don't necessarily have that context.

mewpmewp2 · 2026-06-29T03:53:18 1782705198

But the problem was that the report is not saying "not found", it is saying "is not present" or "there is no X".

And I think we can easily have examples where we can reasonably trust this, and a spectrum of such.

E.g. there is a math solution and the report says "there are no errors in this solution", you would imagine that to be quite reliable, no?

O_H_E · 2026-06-29T01:58:48 1782698328

> Most people should have learned at a young age that absence of evidence is not evidence of absence.

That might be true, but it is definitely not the world we live in.

ambicapter · 2026-06-29T13:39:30 1782740370

Not really, it just requires assuming an ultrasound has infinite, perfect resolution when you are faced with a different imaging tech which reports things that didn't appear in the first one. That's just stupid.

eqmvii · 2026-06-29T02:17:11 1782699431

It’s 2026 and my computer will happily give me the right answer even when i make typos. I love it.

tomlockwood · 2026-06-28T23:40:51 1782690051

It's a fatal flaw to think counter-intuitive == wrong.

Georgelemental · 2026-06-29T00:18:55 1782692335

> You would have to assume that every type of diagnosis instrument has infinite clarity and is always correct to be confused in this case.

There's a difference between 99.9% clarity and 50% clarity. Even if neither exactly equals 100%, it's understandable that a layperson would expect different language between them

dd8601fn · 2026-06-29T14:28:30 1782743310

It’s funny that the answer to this has increasingly become “yes” over the last few decades.

BrokenCogs · 2026-06-29T02:26:43 1782700003

This comment sounds like it's written by someone who doesn't interact with real people very often

DrewADesign · 2026-06-29T02:58:19 1782701899

I’ll bet they’ve got a debilitating case of engineer’s disease, too.

Paracompact · 2026-06-28T21:26:51 1782682011

"On two occasions I have been asked [by members of Parliament], 'Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?' I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question."

IanCal · 2026-06-28T21:37:07 1782682627

Off topic but I have always felt this seemed like his misunderstanding rather than theirs. It’s an odd question, but it’s a very sensible point to make if Babbage has just told you this will solve the problem of mistakes in calculations - humans being involved at the start means human error still plagues the output.

jrumbut · 2026-06-29T03:50:51 1782705051

> I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.

Well, he did diagnose the situation correctly. He couldn't comprehend the confusion of ideas that provoked the question.

I'm also not entirely sure it's an odd question to ask. To this day, users are surprised when their software produces garbage output instead of failing. Perhaps the members of parliament were expecting some form of input validation or sanity checking of output.

Paracompact · 2026-06-28T23:05:29 1782687929

Looking into his biography, it seems that he was indeed pitching the engine not as a means of efficiency, but as a means of avoiding mistakes in mathematical tables. It would have done Babbage well to insist he couldn't possibly solve all classes of mistakes, but would have solved a great many of them! "Why yes Senator, you are quite intelligent and handsome and make a fair point, allow me to give you the finer picture..."

Would have also been a fair point if Babbage had channeled his inner techbro and insisted it would directly replace human calculators; simple machines like Babbage's will chug along blindly on obviously erroneous data, but humans for all their sloppiness can often backtrack on errors.

areoform · 2026-06-28T22:00:22 1782684022

To quote the LLM-ism, they were making a sharp point. It doesn't matter how precise the calculations are if you're calculating the wrong thing.

I suspect their sarcasm might have escaped Babbage who seems to have been on what we now call "the spectrum."

Fr0styMatt88 · 2026-06-29T03:39:26 1782704366

Actually, I would be really pleased if a member of Parliament asked that. That shows a level of deeper consideration.

Isn’t there a saying about there being no stupid questions, only stupid answers or something?

akoboldfrying · 2026-06-28T23:29:08 1782689348

> Anyone who doesn't understand this is an idiot imo

I disagree. A priori it's not obvious to a layperson whether or not a statement that uses unconditional phrasing is intended to be authoritative or conditional on something unspecified, like the resolution of the measuring device. This goes for any sufficiently technical field.

If you got the brakes checked on your car, and the mechanic did <something> and told you there are no issues with them, and you then took your car to a different mechanic who did <something else> and told you there is a problem, you would not be an idiot for thinking that these conclusions contradict one another.

DrewADesign · 2026-06-29T02:55:36 1782701736

I don’t think people are idiots if they don’t understand how a normally intelligent person might not intuit that. I do think they have a seriously underdeveloped theory of mind.

ambicapter · 2026-06-29T13:42:09 1782740529

> idiots

> seriously underdeveloped

What's the difference?

BurningFrog · 2026-06-29T02:55:03 1782701703

> Anyone who doesn't understand this is an idiot imo

Even if this is true, so what?

Idiots get sick at least as often as others, and the medical system needs to work as well as it can for that population too.

rylando · 2026-06-28T22:27:27 1782685647

As a rad tech, YOU TELL ‘EM DOC! I do like some uses of AI I’ve seen that help patients advocate for themselves or understand basic things like blood panel numbers, but it’s really bad about glazing people and leading them down medical rabbit holes, kind of like the OP.

You would think that the AI would point out that calcium is best demonstrated on Radiographs/CT imaging vs Ultrasound or something to that effect.

garciasn · 2026-06-28T23:11:00 1782688260

Semi-related: my father has complications from a motorcycle accident ~25y ago that crushed arteries in his leg coupled with diabetes (insulin / kept sugar at ~100 and his A1C was kept under 6.7 for ~15y). 6w ago had to have his toes removed due to dry gangrene; they eventually (2.5w ago) had to remove his leg below the knee because of the severe blood flow issues below the knee.

Between the toes and the below-the-knee amputation, there were no fewer than 15 different doctors and PAs / related personnel who COULD NOT COME TO A CONSENSUS. They would just tell my mother and me (PoA) the details; they refused to come up with a singular plan of action moving forward, leaving it up to us to make 'an informed decision,' something that's IMPOSSIBLE when you have to take up to 15 different opinions into consideration.

What exactly are we supposed to do as patients/family members when medical personnel cannot give reasonable paths forward and instead just throw a bunch of shit over the fence at you and tell you, "you decide what to do from here," regardless of how many VERY DIRECT conversations I had w/the 'care team' on doing better to provide a limited array of options and reasons/likelihood of 'positive outcomes'?

I'm used to dealing with a wide variety of stakeholders/SMEs in decision-making; it's my job to apply my extensive industry experience to present our clients with their options, ranked and reasoned. Doctors, in my experience and most recently with my father, clearly do NOT do that (I assume due to liability; but, no real idea, honestly). So; when dealing with LIFE CHANGING circumstances, what are we supposed to do except rely on what might be able to offer more analysis and option narrowing w/AI?

I certainly don't want to make the job of medical staff more difficult by putting out crazy theories I found on the interwebbernets through my own research, etc; but, when we're having to deal with uncertainty and insanity, what else can we do?

resonious · 2026-06-29T00:03:07 1782691387

This lines up with my experience with my mother, though it played out differently. In her case, she would switch doctors every ~5-10 years and each time they'd basically say everything the previous doctor said was wrong. First it was "you have Lupus", second it was "actually it's some other autoimmune disease", then it was "actually whatever you had has been in remission for some time now and you've been taking brain-numbing medicine for no reason." Then it was "you have cancer", "it's a rare one", and "oh turns out the brain-numbing meds have a correlation with rare cancers". The cancer part was handled well (albeit unsuccessfully) though. After such a bad time with rheumatologists, I was shocked by how competent people were when it came to cancer.

All of the above was intertwined with brief stints with doctors that would just berate her for being a painkiller junkie, even though she hated the stuff and just wanted to find/fix the problem.

Kind of a rant, really. I'm not sure how to tie it back into AI. I do wish we had AI at the time so that we could at least cross-check, but I also understand that doctors are already sick of patients self-diagnosing on the web and that AI probably just makes that worse. At the same time, if our medical system could catch up a bit (more doctors? less corruption/paperwork? not sure what it needs) then maybe people would be less inclined to take matters into their own hands.

anon84873628 · 2026-06-29T03:48:25 1782704905

I'm sorry to hear that. The accusations of drug seeking are particularly galling.

AI is absolutely a godsend for patients navigating the medical system.

I know the US system is horrible and I sympathize with doctors doing their best within it. But we must admit, they are also responsible for the countless stories just like yours, and have contributed to the public's deteriorating trust of medical institutions. It's not just the insurance companies and conglomerate CEOs.

osmano807 · 2026-06-29T00:33:33 1782693213

Probably liability... On the amputations I've indicated and contraindicated, it's increasingly difficult to navigate through patient perceptions while not disclosing so much as to give them rope to hang us. Some decisions are a game of probability for which we often don't have clear numbers. In trauma, I have both cases where I recommended an amputation and at the last minute decided to see what happened, and the patient is walking on their leg today; and cases where I didn't recommend one and later had to amputate as the lesion evolved. With cancer it's more straightforward; the cancer is what dictates the surgery... Some cancers respond poorly to other treatments, so we amputate. Some cancers have invaded the neurovascular bundle, so curative options necessarily involve amputation to get good margins. In cancer there's less doubt in the prognosis, so less chance of legal ramifications.

rylando · 2026-06-29T13:31:15 1782739875

Keep in mind I’m not a doctor, just the guy who takes your images, but I believe it’s all liability. I really wouldn’t want to be a doctor from all the crazy stories of liability stuff I hear everyday.

I empathize with you, as my grandmother is also being, what feels like, gaslit by several doctors: she is told her symptoms are dementia and not from the chronic UTIs she suffers from, but when the UTIs clear, all of a sudden there are no symptoms. Our medical system is very frustrating, between doctors who don’t have the time needed for complicated cases, and the threat of every patient suing them causing them to be overly cautious.

I think as long as you’re aware of the pitfalls of AI, as you seem to be, it’s a solid tool for helping to understand medical situations with the right amount of double checking.

I don’t think our system will improve until 1) we increase the number of doctors in our country and tell Congress to quit limiting the number we can train yearly, and 2) the system of liability for doctors is changed to be more like New Zealand’s, where their liability insurance is nationalized and they’re not at constant threat of losing everything to a lawsuit over a patient who wouldn’t take their meds and got worse (the cases are generally much more complex than that, but the idea is it’s not always the doctor’s fault as media would have us believe).

Fr0styMatt88 · 2026-06-29T03:43:40 1782704620

You see this in coding agents too. The only times so far I’ve really seen Opus tie itself into a knot is where I’ve asked it to fix something that I thought was broken but actually wasn’t in the way I had described. It will bias towards your description (I’m guessing because that’s the most recent context it has?).

mring33621 · 2026-06-29T01:30:24 1782696624

I'm sorry, but AIs only "know" about stuff that they have been trained on.

If we would allow AIs to be trained on the petabytes of medical data hidden in hospital systems, they would most likely be much better at diagnosing illnesses and conditions than the average doctor.

(Justifiable) Privacy around medical records so far prevents this.

You think you're cheering for humans, but in fact you are gatekeeping healthcare.

prirun · 2026-06-29T02:32:56 1782700376

I dunno... if we gave an AI all of these medical records as training data, wouldn't it be trained to give the same answers as the doctors already gave, without knowing whether those diagnoses were correct or not?

anon84873628 · 2026-06-29T03:57:09 1782705429

Except it would see all the times similar starting conditions led to different diagnoses and recognize those contradictions. Or all the different treatments and their outcomes. And it would never forget or have bias.

It would be like the sum of all medical professors in existence.

Eufrat · 2026-06-29T00:10:44 1782691844

I feel like the promise of these models is to help people make more informed decisions. Improving the knowledge economy and general understanding.

The problem is these are just statistical models at the end of the day, so you need to know something to be able to identify the errors. You can’t let them really be autonomous and you also can’t really have people turn into glorified approvers. If the machine is correct 89% of the time, you cannot make people responsible for that 11%. It’ll just cause automation fatigue.
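To put rough numbers on that 11%, here is a minimal sketch. It assumes errors are independent of each other and uses a hypothetical caseload of 200 outputs per day; neither assumption comes from this thread, only the 89% figure does.

```python
def expected_errors(cases: int, accuracy: float) -> float:
    """Expected number of wrong outputs among `cases` independent outputs."""
    return cases * (1.0 - accuracy)


def prob_at_least_one_error(cases: int, accuracy: float) -> float:
    """Probability that at least one of `cases` independent outputs is wrong."""
    return 1.0 - accuracy ** cases


if __name__ == "__main__":
    accuracy = 0.89      # figure from the comment above
    daily_cases = 200    # hypothetical caseload, for illustration only
    # Expected wrong outputs an approver would have to catch per day.
    print(round(expected_errors(daily_cases, accuracy), 1))
    # Chance that a batch of just 10 outputs contains at least one error.
    print(round(prob_at_least_one_error(10, accuracy), 3))
```

Under these assumptions the approver is hunting for a steady trickle of errors hidden among mostly correct outputs, which is the setting where attention tends to lapse.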

tl;dr: the actual use cases of these LLMs (or generative AI in general) are rather limited, so it is offensive how much hay has been made about them eating the entire capitalist system. They are not fit for purpose.

anon84873628 · 2026-06-29T03:52:48 1782705168

Why should we not expect a computer vision model to outperform humans on reading medical images?

The human experts are literally just a trained biological neural network. In this domain they are not capable of anything a computer can't already do.

Eufrat · 2026-06-29T05:09:31 1782709771

> Why should we not expect a computer vision model to outperform humans on reading medical images?

Humans can identify. A computer vision model can return a statistical value. Both can make errors, but these errors are orthogonal to how we work and what is being asked of them. I think a CV model can absolutely provide value as augmentation. Identifying possible misses or a different diagnosis worth considering, but that is not what is being asked of them here. The pitch by Altman and Amodei is not to say, “This tool that might cost $1,000/month can help increase the accuracy of your diagnoses by 10%,” instead it’s, “This tool can allow you to keep 10% of your workers to monitor it and you can fire the rest. Also, the workers carry all the liability.”

> The human experts are literally just a trained biological neural network. In this domain they are not capable of anything a computer can't already do.

People need to stop making this baseless claim. Human beings are not stochastic computing devices, we are not neural networks. We don’t fully understand human cognition and intelligence. I have the highest confidence we will figure it out one day, though.

Yes, neural networks were based on a superficial view of the human brain, that’s it. For instance, it is biologically impossible for the human brain to do backpropagation, which is kind of important for a modern neural net.

This really rubs me the wrong way because it's objectively false, but people keep bringing it up because I think people want it to be true rather than accepting generative AI for what it is: a tool with a bunch of caveats.

2ap · 2026-06-28T19:28:36 1782674916

Agreed. Not a radiologist, but I do a fair bit of MRI research. Experts vs lay people probably have different success with getting the right diagnosis out of a frontier model. Subtle changes in prompts can cause different diagnoses [1].

[1] https://www.nature.com/articles/s41591-026-04501-8

haldujai · 2026-06-29T03:46:51 1782704811

Radiologist who does read shoulder MRI would like to add that over half the annotations are wrong, with glaring mistakes in anatomy and cardinal direction, which begs the question of how it is making these findings without knowing what it’s looking at (here’s a hint, it’s hallucinated based on reports it sees).

red75prime · 2026-06-29T06:32:38 1782714758

What is "it"? Claude Opus 4.x? ChatGPT-5.x? GLM? DeepSeek? RadFM? Med-PaLM?

odiroot · 2026-06-29T10:18:34 1782728314

Can vouch for it. Ultrasound hasn't found calcification in my shoulder but MRI did. Exactly as you said, because it was very small.

foobarian · 2026-06-28T17:52:38 1782669158

Huh, I'm reading and looking up these words you guys are saying and it is starting to look exactly like the symptoms I have been having with my own right shoulder! I feel like a giant gaping rabbit hole just opened up next to my desk.

sxg · 2026-06-28T18:13:42 1782670422

We're discussing calcific tendinitis (https://radiopaedia.org/articles/calcific-tendinitis?lang=us). If you think you have it, you can see a doctor and consider shoulder radiographs to start.

YeGoblynQueenne · 2026-06-28T21:47:11 1782683231

If you think you have it, then you don't. If you have it, you won't think, you'll know.

Spoiler: because it hurts like hell.

CyberDildonics · 2026-06-29T14:22:34 1782742954

That doesn't make any sense, it would imply that anything that hurts would automatically be the same diagnosis.

Both inductive and abductive reasoning would say that just because something hurts that doesn't mean that everything that hurts is that thing.

tiahura · 2026-06-28T16:58:32 1782665912

Why isn’t diagnostic ultrasound used in orthopedics? They inspect fetal hearts and other organs every day, why not shoulders? Seems much cheaper and faster.

sxg · 2026-06-28T17:04:10 1782666250

They do. Ultrasound in orthopedics is a relatively newer field, and there aren't quite as many sonography techs and radiologists experienced in reading these studies, which is likely why you don't see it offered more widely.

Edit: I should mention that ultrasound is basically unusable for evaluating bones. Sound waves can't penetrate bone, and so you end up just seeing a huge black void. That's a huge orthopedics use case that ultrasound just can't benefit. However, ultrasound is fantastic for evaluating muscles, ligaments, tendons, and other superficial soft tissues.

VoidWhisperer · 2026-06-29T03:21:58 1782703318

Serious question: If the bones specifically show up as black on ultrasound but the surrounding (muscle, etc) don't, wouldn't that be an option that could be used to try to determine a broken/fractured bone without the radiation from an xray? Or are the gaps in those cases usually too small to pick up?

scrollop · 2026-06-28T18:18:47 1782670727

We order ultrasounds all the time for shoulders (for like soft tissue issues; for trauma, you'd start with an xray). For other joints, such as the knee, MRIs are a better choice (unless there has been substantial trauma, in which case xray initially or further), though more expensive, unless you're excluding a Baker's cyst, in which case an ultrasound is fine.

Since MRIs are more expensive, private doctors might order them instead of ultrasounds.

(I'm a doctor)

tiahura · 2026-06-28T22:16:24 1782684984

Where are you? I'm a PI and workers' comp attorney in a medium-sized US Midwest metro. I've never seen one in 20 years: not from HCA ERs, not from Medicaid ER visits to university-affiliated ERs, nor from prestige practices.

trentor · 2026-06-29T00:21:06 1782692466

Ultrasound was overlooked by US medicine as a first line imaging tool for a long time because it takes real skill and experience to do it right. But it's making a comeback. We've had Chinese, Indian, Australian, and American doctors visit us for one to two month stints to build up their skills.

Given the skill involved, it's probably a liability concern they don't want the exposure over there.

prdonahue · 2026-06-28T21:46:47 1782683207

They're used quite a bit for nerve entrapment—both in diagnosing and treating.

bflesch · 2026-06-28T18:09:52 1782670192

It's a manual, non-standardized process without a standardized output. Image quality depends both on user skills (how deeply they press the sensor on the skin) and the machine they have. Unlike CT/MRI the examination results cannot be easily shared and compared between patients for studies.

RA_Fisher · 2026-06-28T19:30:26 1782675026

So Opus might be correct?

engeljohnb · 2026-06-28T18:03:12 1782669792

> I'm a radiologist

Any comment that doesn't start with this or similar qualification should be taken with a grain of salt (yes, including this one).

Medical imaging is one of those things everyone thinks is simple because they don't know what they don't know. I'm a cardiac sonographer, and I have to assume radiologists hear at least as many eye-rolling takes on AI coming for their job as I do.

lostlogin · 2026-06-28T18:17:00 1782670620

Ahh, AI is coming for your job.

Full sarcasm, is there one that’s more immune?

engeljohnb · 2026-06-28T19:14:25 1782674065

I don't completely understand what you mean, but I can tell you for my job, having AI tell you how to get the images is (without exaggeration) like putting someone who's never played an instrument on stage and saying "don't worry, the AI will show you how to do it."

lostlogin · 2026-06-28T21:38:53 1782682733

I did a lot of cardiac MR and often GA cases. Sometimes after the scan an echo would be done.

I know my anatomy etc. and have done a short stint in ultrasound. I have no idea what you are doing or looking at and can identify pretty much nothing.

Echo techs are going to be around a lot longer than MR techs.

LearnYouALisp · 2026-06-28T19:04:42 1782673482

cough Immunology

2ap · 2026-06-28T19:23:37 1782674617

I mean, probably not. No expert, but every time I go to an immunology meeting (I'm a paediatrician) they've got a whole stack of new diseases. The field is moving fast, and there has to be a careful amount of shared decision making about when to test, what a positive test means and so on. I reckon they're as safe as any of us.

LearnYouALisp · 2026-06-28T22:04:09 1782684249

yeah, you said "one that is more immune"

backtoyoujim · 2026-06-28T19:35:29 1782675329

Does radiology really make +$700,000.00 a year ?

Someone on reddit claiming to be a radiologist claimed that.

I wonder where the savings will go when those jobs are gone.

Eji1700 · 2026-06-28T19:44:01 1782675841

> Does radiology really make +$700,000.00 a year ?

The radiologist I know does not, but they are paid very well (and these numbers are always dumb when you're not sure if they're living in Manhattan vs literally anywhere in Kentucky)

Like most medicine, a large % of the job could be done by any decently talented person willing to follow instructions and shadow for a few months.

Like most medicine, the remaining % is what you're paying for, because it is literally life and death and you can't do things like "pull the logs" or "let's turn it off and take it apart" or "huh, I need to put this down and come back later". Even in radiology, because "well, let's just do it again to be sure" is often not a viable option.

While there are problems with how we have inflated the cost of education for medical fields and with the insane health insurance issues (US obviously, but it does have some effect globally when the expert radiologist you hire from the US to help with research costs that much), and there are probably better ways to approach splitting the work for the entire field, medicine, like most professions dealing in life or death, will likely always be paid well.

sarchertech · 2026-06-28T21:06:05 1782680765

Physicians' salaries account for about 8% of healthcare costs in the US.

recursive · 2026-06-28T22:21:01 1782685261

The savings go straight into patients' worse outcomes.

blanched · 2026-06-28T19:51:53 1782676313

You know the radiologist you're responding to is a real person? Your last line seems needlessly callous.

the_real_cher · 2026-06-28T19:39:04 1782675544

To the consumer! Haha just kidding. We all know where they'll go.

piterrro · 2026-06-28T20:25:00 1782678300

It's funny to see the community here expects the human body to be treated like a deterministic function: for input X expect output Y - and that transfers to diagnosis - people expect to receive the same diagnosis from different specialists for the same issue.

Given the human body's complexity, the diagnosis is a compound output of experience, knowledge gained throughout a career, and diagnostic methods/equipment. The title (like Dr) is a certification imposed by the state so it's "safe" to let people practice since they passed "the bar" - but that doesn't imply everyone will treat the same way.

Some specialists update their knowledge monthly, some yearly and some don't do it at all, there are so many variables in play here (geo, politics, even weather haha).

Having said that, choosing the specialist is really important. By getting opinions about their practice and their speciality, you can maximize your chance of getting the right diagnosis, but don't expect to get it right just because somebody is called a Dr.

charles_f · 2026-06-28T21:23:52 1782681832

> It's funny to see the community here expects the human body to be treated like a deterministic function

In a community largely made of people whose job it is to produce such functions, I'd say it's to be expected

KingMob · 2026-06-29T05:58:07 1782712687

It's funny (and a little depressing), because HN routinely assumes that their world view, and thus, their domain expertise, transfers.

There's no shortage of tech people convinced they deeply understand law, medicine, philosophy, etc. despite never having read much on the topics.

johnwalkr · 2026-06-29T15:15:31 1782746131

Most of my "favorited" comments on here are by software people with confident yet incorrect statements (usually by way of vastly underestimating complexity) about one of my domains of expertise.

I can't find it, but one of the greatest Show HN posts was a blog post by someone who was annoyed by his inconsistent shower temperature control. From memory, he spent a full weekend adjusting it, taking measurements, making graphs, and proposed "next steps" about prototyping better temperature control with microcontrollers and servos and pontificated about developing a product, of course controlled by software. He skipped the part where a bit of research leads you to the already common "thermostatic mixing valve".

bpicolo · 2026-06-29T12:04:01 1782734641

The internet at large is full of armchair experts, it's not just a tech thing.

b800h · 2026-06-28T21:06:17 1782680777

I'm not sure what your point is. Are you saying that medicine is inherently fallible and therefore AI is more likely to make a good diagnosis - particularly a cluster of specialist AIs?

mrlongroots · 2026-06-28T21:16:55 1782681415

Yeah I think the OP is muddling the point by conflating "physician's version of the diagnosis" with "The Diagnosis".

There is absolutely one "The Diagnosis". Human body is a machine, albeit a very complex one, and all measurement sources have noise. But they are all measuring one reality, and if there is a problem, there should be one explanation that all measurements align with. They can be noisy but can never be conflicting (instrument error notwithstanding).

Physicians' ability to arrive at "The Diagnosis" would vary, but it does not mean one does not exist. I am not sure if characterizing the human body as deterministic or not is relevant here.

piterrro · 2026-06-28T22:01:59 1782684119

I think „the diagnosis” is over simplification and lots of professionals would disagree that there’s always a single one. As a patient your goal is to eliminate the symptoms of whatever is going on in your system. Oftentimes there could be many reasons for it, and curing only one can already help you. The diagnosis is a tool to help choose the right treatment method.

Thus, chasing the „right” diagnosis (whatever that is?) is pointless, as only the outcome (reducing symptoms, stopping the damage) can tell you if the diagnosis was right, and even then it may not be the only right one.

mrlongroots · 2026-06-28T22:05:11 1782684311

> I think „the diagnosis” is over simplification and lots of professionals would disagree that there’s always a single one.

"The Diagnosis" does not mean "one root cause".

Situation: my car has some unexplained vibrations.

1. Mechanic A says that it is the engine mounts

2. Mechanic B says that it is some weirdness in how the exhaust assembly is hanging to the underbody

3. Mechanic C says that it is just my wife farting

I replace engine mounts and 40% of the problem is reduced. I then drive without my wife and the remaining 60% is solved.

"The Diagnosis" was: 40% mounts, 60% wife, 0% exhaust.

There is always one "The Diagnosis".

exmadscientist · 2026-06-28T22:51:24 1782687084

> There is always one "The Diagnosis".

No, that is not true at all.

This is a kind of thinking a lot of programmers fall prey to. The real world, outside of code, is a very fuzzy and inherently analog place. There is very rarely one in any complex system having a complex problem needing a complex solution. At some point even the definition of diagnosis gets fuzzy.

The best demonstration of this in medicine is probably the DSM-5. What, really, is the difference between Narcissistic Personality Disorder and Borderline Personality Disorder and Generalized Anxiety Disorder? Can they overlap? (Yes.) How do you treat them? (It's not easy.) What about depression: how do you tell if someone has Major Depressive Disorder or Bipolar Depression? (Again: not easy.) In some circumstances the only way to tell the difference between the two is what drugs work: if antidepressants help, it's Major Depression; if mood stabilizers help, it's Bipolar Depression. It's kind of odd to define a One True Diagnosis by "well we fixed it this way, so it must have been that", with no other way to do it, isn't it? (What if both work? What if one works for a while, then the other works? What if treatment with antidepressants induces bipolar (hypo)mania? All of those happen!)

And that's just a few examples.

mrlongroots · 2026-06-29T01:18:44 1782695924

Psychiatry gets complicated because the failures are not mechanical. Even if you could image every single neuron in a person's head we do not have a very good way to define an algorithm for these issues. I do not have a good answer for psychiatry.

> This is a kind of thinking a lot of programmers fall prey to. The real world, outside of code, is a very fuzzy and inherently analog place.

Having said that, I would vehemently reject and push back against this, and without doubting your sincerity, characterize it as an ad hominem.

The vast majority of issues with the human body are mechanical in nature. Restricted blood flow, unwanted tissue, a broken bone, a bad valve etc. These are causal descriptions of "disease". Where causal descriptions exist, the "One True Diagnosis" principle holds. Psychiatry just happens to be unique in that it is a fuzzy science where we rely on checklists and ultimately all diagnosis is probabilistic.

EDIT:

> This is a kind of thinking a lot of programmers fall prey to. The real world, outside of code, is a very fuzzy and inherently analog place. There is very rarely one in any complex system having a complex problem needing a complex solution. At some point even the definition of diagnosis gets fuzzy.

I would also push back against this mindset in general. This is not a falsifiable claim, it is incoherence as an argument, and I do not need to be a programmer to hold this position.

That the real world is analog is irrelevant to its amenability to causal explanations. Or "fuzzy": "fuzzy" in this context just does not mean anything.

I am not trying to sound exasperated or win internet points, just impress this point on you and anyone reading this. We can write math to predict weather, make it tractable to solve using approximations, tolerate IEEE 754 weirdness, and finally tell what the clouds will do a week from now. This is nature telling us that there is a pattern to how it behaves, and it is the only weapon we have as scientists.
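For readers who have not run into the "IEEE 754 weirdness" mentioned above, a minimal Python sketch: binary floating point cannot represent most decimal fractions exactly, so numerical code compares values within a tolerance instead of testing exact equality. The helper name `nearly_equal` is hypothetical; `math.isclose` is the standard library call doing the work.

```python
import math


def nearly_equal(a: float, b: float, rel_tol: float = 1e-9) -> bool:
    """Compare two floats with a relative tolerance instead of ==."""
    return math.isclose(a, b, rel_tol=rel_tol)


total = 0.1 + 0.2
print(total == 0.3)              # False: 0.1 and 0.2 are not exact in binary
print(nearly_equal(total, 0.3))  # True: equal within the tolerance
```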

To say that nature is not amenable to explanations is a very defeatist thing to say: neither Newton nor Einstein nor any of the million-odd people that have built modern society would exist if nature did not have causal explanations. I urge you to reject this defeatist thinking.

scheme271 · 2026-06-29T01:56:19 1782698179

There's quite a few diseases and conditions that don't have definitive tests. For example, Alzheimer's and Parkinson's are diagnosed based on medical history and symptoms. With Alzheimer's an autopsy can tell for sure but that's not much help for a patient. I'm sure there's other things out there with similar situations. Hard to come up with "the one true" diagnosis without a definitive way to determine it.

mrlongroots · 2026-06-29T02:12:52 1782699172

> With Alzheimer's an autopsy can tell for sure but that's not much help for a patient.

Ok let us unpack this statement.

For your point to hold, I would have to be saying "all kinds of practical diagnostics are invented now. No progress can be made in better diagnostics".

If Alzheimer's can be validated by slicing open a dead patient, there is a causal mechanical explanation for the disease. If we can not confirm that defect without slicing open the patient, that is a limitation of 2026 tools. The "One True Diagnosis" is an Oracle explanation that all real diagnostic techniques try to approach in the asymptotic sense, and it is helpful exactly because it clarifies in discussions like this.

There are going to be diseases where we do not yet have causal explanations. Or where we treat them without establishing them. Hypertension is one example: while technically it can be caused by vascular stiffness, some weirdness with the RAAS system, some hyperadrenergic weirdness, practically you get a lot of mileage out of just prescribing people telmisartan if they're old.

That does not mean the frontier of hypertension is settled, or the 10% who do not have a vascular stiffness problem would not benefit from better causal models of hypertension. Science is us continuously pushing back against the fog: of the tools we have in 2026, some are great, some are imperfect, some are promising etc.

scheme271 · 2026-06-29T02:42:15 1782700935

There might be "one true diagnosis" but there's no reason to believe that we'll have practical diagnostic tools to get it. If we need to sample the brain chemistry to diagnose a neurochemical disorder, it's probably not too useful in a clinical setting. The world makes no guarantees that we will be able to differentiate between certain situations with tools that we can realistically access and build.

mrlongroots · 2026-06-29T03:29:24 1782703764

Today's limits are known and undisputable. Tomorrow's limits are a promise: some promises over-deliver, others under-deliver. :)

Regardless, to bring the discussion back to the claim at hand: at all points in future, we will need the ability to reason under partial information. "Absolutely flawlessly complete diagnostics" is an asymptotic goal we get closer to but never reach. This is both very doable for a disciplined human, and very hard to outsource completely to an LLM. Treated as tools operated by competent users, they are magical. But they can not outperform their user.

movpasd · 2026-06-29T10:12:21 1782727941

Not GP, but I'd argue that over-rationalism and underestimating both the complexity of the real world and the theory-ladenness of one's perspective is just as dangerous. The point is not to be paralysed by complexity, but to acknowledge it and acknowledge the reality of unknowable unknowns in our decision-making. I don't consider that defeatist in the least. Epistemic humility is the rational response to a complex world; courage is to act anyway.

KronisLV · 2026-06-29T07:43:45 1782719025

> We can write math to predict weather, make it tractable to solve using approximations, tolerate IEEE 754 weirdness, and finally tell what the clouds will do a week from now.

Even so, we’re operating on approximate datasets and sometimes our predictions are wrong. I think a lot of the medical field is like that - people are doing the best they can with what they have.

It’s entirely possible that DSM-5 will be viewed as flawed and inaccurate in a century, but it’s better than nothing.

Similarly, for every possible medical affliction there could be “The Diagnosis” that would describe how to treat it, we’re just unable to be that accurate and thorough. The fuzziness just means that you’d need 10’000 data points about the state of the body instead of 10-100 and also be able to reason about them.

Paracompact · 2026-06-29T07:58:47 1782719927

Most disorders in the DSM-5 are defined by polythetic criteria, i.e. meeting X out of Y symptoms from a list for a given duration of time, or by conjunction of polythetic criteria. These definitions are socially constructed and statistically validated for pragmatic use, but very rarely have definite underlying biological markers. Especially as concerns personality disorders, these disorders can also simply be an inheritance of cultural or political baggage and prior psychoanalytic theory.
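As a rough illustration of what "polythetic" means, here is a minimal Python sketch of an "X out of Y symptoms for a given duration" rule. The class name, symptom labels, threshold, and duration are all made up for illustration and are not taken from the DSM-5 or any real diagnostic criteria.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PolytheticRule:
    symptoms: frozenset  # the Y candidate symptoms
    min_count: int       # X: how many of them must be present
    min_days: int        # minimum duration, in days, for a symptom to count

    def is_met(self, observed: dict) -> bool:
        """`observed` maps symptom name -> number of days it has been present."""
        qualifying = [s for s in self.symptoms
                      if observed.get(s, 0) >= self.min_days]
        return len(qualifying) >= self.min_count


# Hypothetical rule: any 3 of 5 placeholder symptoms, each for at least 14 days.
rule = PolytheticRule(
    symptoms=frozenset({"a", "b", "c", "d", "e"}),
    min_count=3,
    min_days=14,
)

patient_1 = {"a": 20, "b": 15, "c": 30}
patient_2 = {"c": 14, "d": 40, "e": 21}
print(rule.is_met(patient_1), rule.is_met(patient_2))
```

In this sketch both patients satisfy the rule while sharing only one symptom, which gives a feel for why a category defined this way need not correspond to a single underlying marker.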

> In some circumstances the only way to tell the difference between the two is what drugs work: if antidepressants help, it's Major Depression; if mood stabilizers help, it's Bipolar Depression.

This is ridiculous. There is zero mention in the DSM-5 or ICD-11 of "if these drugs work, it's this, otherwise it's this." I would question a psychiatrist dispositively making a diagnosis on such grounds.