• Home
  • Politics
  • Health
  • World
  • Business
  • Finance
  • Tech
  • More
    • Sports
    • Entertainment
    • Lifestyle
What's Hot

Trump Mulls At Ending His Signature Trade Deal

June 11, 2026

Natalie Portman Backs Israeli Director Nadav Lapid, Who Bashed Israel’s War Against Hamas, After He’s Booted from French Film Festival ‘Because He’s Israeli’

June 11, 2026

Aaron Sorkin Makes Mark Zuckerberg a ‘Free Speech’ Boogeyman in ‘The Social Reckoning’ Teaser

June 11, 2026
Facebook Twitter Instagram
  • Contact
  • Privacy Policy
  • Terms & Conditions
Thursday, June 11
Patriot Now NewsPatriot Now News
  • Home
  • Politics

    Trump Mulls At Ending His Signature Trade Deal

    June 11, 2026

    Democrats Have All The Info They Need To End Trump And Vance With A Real Epstein Investigation

    June 11, 2026

    Some Senate Dems still won’t commit to Graham Platner

    June 10, 2026

    Iowa’s Rob Sand Vows To Force Republicans To ‘Take Their Medicine’

    June 10, 2026

    Trump Claims He Loves High Inflation In New Disaster For Republicans

    June 10, 2026
  • Health

    The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus

    June 11, 2026

    Humana To Divest End-Of-Life Care Business For $900 Million

    June 11, 2026

    Diabetes association CEO apologizes for conference expulsions

    June 10, 2026

    Before Getting An Operation, Ask Your Surgeon This Question

    June 10, 2026

    OB-GYN group issues vaccine recommendations, deviating from CDC

    June 10, 2026
  • World

    More Than a Million People Visit Serbian Church to Witness Holy Relic of Virgin Mary

    June 11, 2026

    Trump’s Attendance At Big Games Often Jinxes Home Team

    June 11, 2026

    Man Arrested After Apparent Beheading Attempt in Belfast

    June 11, 2026

    House Passes $70 Billion Bill To Fund Trump’s Immigration Crackdown

    June 11, 2026

    Iran-Backed Houthi Terrorists Threaten to ‘Ban’ Israel from Red Sea

    June 10, 2026
  • Business

    Pilot Union Members Orchestrate Coup Against Labor Bosses

    June 9, 2026

    Jobs Report Blows Past Expectations In Welcome Bright Spot For Inflation-Plagued Economy

    June 5, 2026

    Wall Street Giants Bet Big On Tech As The Iran War Roils Global Markets

    June 4, 2026

    Harley-Davidson Backsliding On Wokeness Despite Previous Policy Reversal

    June 3, 2026

    Another Major Company Flees From Blue State To Texas

    June 3, 2026
  • Finance

    Is IQVIA Holdings Stock Outperforming the Dow?

    June 11, 2026

    Citigroup shares outperform down market after Trump endorsement

    June 10, 2026

    How to file a travel insurance claim: A step-by-step guide

    June 10, 2026

    North Carolina treasurer passes on SpaceX citing valuation concerns; favors OpenAI, Anthropic

    June 10, 2026

    1 Underappreciated Energy Stock You Won’t Want to Overlook

    June 10, 2026
  • Tech

    Aaron Sorkin Makes Mark Zuckerberg a ‘Free Speech’ Boogeyman in ‘The Social Reckoning’ Teaser

    June 11, 2026

    Trump Asks for Short-Term Extension of Key Spy Power Authority

    June 11, 2026

    Chrysler Recalls 17,000 Pacifica Plug-In Hybrid Minivans over Battery Fire Risk, Advises Owners to Park Outside

    June 10, 2026

    Bill Gates to Face U.S. Congress Questioning over Epstein Links

    June 10, 2026

    Kamala Harris Prompts 2028 Run Chatter After Appearing in Netflix Doc ‘The American Experiment’

    June 10, 2026
  • More
    • Sports
    • Entertainment
    • Lifestyle
Patriot Now NewsPatriot Now News
Home»Health»How The ARISE Network Is Rethinking Clinical AI
Health

How The ARISE Network Is Rethinking Clinical AI

May 20, 2026No Comments9 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
How The ARISE Network Is Rethinking Clinical AI
Share
Facebook Twitter LinkedIn Pinterest Email

The Arise Network aims to understand and explain what AI can do in healthcare.

getty

You’ve seen the headlines: AI aces the medical boards. AI outperforms expert physicians. But what does this actually mean? And how do we evaluate technology that’s advancing faster than we can fully make sense of it?

The AI Research and Science Evaluation (ARISE) Healthcare Network was formed to help answer these questions. Spanning multiple medical centers and led by physicians at Harvard and Stanford with diverse and complementary backgrounds, ARISE is trying to understand what AI systems can do in medicine and how we can evaluate and explain their performance.

They are working to define what holds up in real-world medicine, what we mean by clinical reasoning, how clinicians and AI should work together, when either may perform better alone, and how we might recognize if AI approaches “medical superintelligence.”

The Physician Data Scientist And AI Magic Tricks

Physician, magician, and data scientist Jonathan H. Chen is working to demystify clinical AI.

ARISE Network

Arthur C. Clarke famously wrote, “Any sufficiently advanced technology is indistinguishable from magic.”

Decades later, many people see A.I. as magic. So, who better to spot a magic trick than Jonathan H. Chen, a physician, data scientist, and performing magician?

Chen’s path is not typical. He started college at 13 and worked as a software engineer before returning to school to earn an MD and PhD in computer science and then training in internal medicine. Since joining the Stanford faculty in 2017, he’s been evaluating how AI applies to medical problems.

He points out that the first rule of magic is (mis)directing the audience to look where you want them to look. So, when LLMs like ChatGPT arrived, he knew to look in the other direction to understand what they’re doing, where they fail, and how clinicians might use them.

A core theme of his research is understanding how physicians and AI can best work together.

In late 2024, his team made headlines after finding that, on diagnostic reasoning tasks, LLMs alone outperformed both physicians using AI and physicians working alone. This ran counter to the long-held “fundamental theorem” of informatics that physicians plus AI will outperform either alone.

Part of the explanation was timing. It was still early, and many physicians used LLMs like search engines.

So, in a follow-up trial, the team tested a customized LLM tailored for clinical collaboration that taught clinicians in real time how to use it. This time, physician-plus-AI outperformed physicians alone, while matching—but still not surpassing—AI alone in diagnostic reasoning.

The group later reported similar results in another study on management reasoning tasks. Through a new ARPA-H grant, ARISE is now building a “flight simulator” for medicine to study and improve how clinicians and AI work together.

Taken together, these findings raise a deeper question: if AI alone sometimes outperforms physicians working with AI on reasoning tasks, what exactly are we measuring when we talk about “clinical reasoning” in the first place?

The Physician Historian And The Nature of Reasoning

A medical historian, Dr. Adam Rodman draws on the past to understand the present and predict the future.

Danielle Duffey

Adam Rodman, a fast-talking and even faster-thinking Harvard internist, medical historian, and clinical educator, has spent the past two decades studying clinical reasoning and decision-making.

The first thing he will tell you is that none of this is new. Technology has always changed what it means to be a doctor. Think of the stethoscope, anesthesia, penicillin, MRI, and the electronic health record.

What’s different this time is that AI moves up the cognitive stack, shifting knowledge and even thinking to machines. Yet there’s also a long history behind this work, and clinical reasoning may not be what medicine portrays it to be.

Rodman points out that modern ideas about both clinical reasoning and AI surprisingly share common roots in World War II-era signal detection theory, which gave rise to frameworks such as sensitivity, specificity, and ROC curves.

Building on this tradition, pioneers such as Robert Ledley and Lee Lusted argued in a landmark 1959 article that medical decision-making could be understood through logic, probability, and value theory. Their work laid the groundwork for a series of computerized diagnostic tools like INTERNIST-1 and Isabel that sought to model the clinical reasoning of expert physicians.

Rodman believes that computer science, in turn, shaped how medical schools teach clinical reasoning, using frameworks such as Fagan’s nomogram, pretest probabilities, and rule-based heuristics. While these approaches are useful for teaching and assessing trainees, he believes they may not fully capture how experts actually practice.

In his words, “We train doctors in ways that reflect the appearance of expertise, based on cognitive models and computer-era abstractions, rather than how real experts behave, which is often fast, intuitive, and non-linear. And we are now building AI systems that mimic that same abstraction.”

These same traditions shaped how medical AI systems came to be evaluated, often using complex clinical case vignettes drawn from the New England Journal of Medicine clinicopathological case conference series.

Following this tradition, when LLMs emerged, Rodman and colleagues were the first to report that GPT-4 provided the correct diagnosis in its differential in two-thirds of these challenging cases. Still, he was quick to admit that studies like his are limited by saturated benchmarks and a lack of physician comparators.

So, Rodman and the ARISE team went a few steps further in a set of experiments recently published in Science. They found that OpenAI’s o1 reasoning model outperformed physicians across multiple historical clinical reasoning tasks.

More notably, o1 performed as well or better than two Harvard internists in generating differential diagnoses based on EHR data for 76 real-world emergency cases.

While the study captured widespread attention, Rodman sees this as an incremental step on a much longer journey.

“What we need now,” he told me, “are prospective clinical trials in real-world patient care settings.”

Of course, diagnosis is just one aspect of clinical reasoning. And clinical reasoning is just one domain in which medical AI is being developed. As AI systems begin to perform differently across tasks—and sometimes outperform physicians—how should we evaluate them in ways that actually matter?

The Physician Bridge Builder And Communicating Science

Dr. Ethan Goh aims to clearly explain what AI can do in healthcare.

ARISE Network

Ethan Goh is a thoughtful, mild-mannered physician with a diverse range of experience beyond his years. After starting his career as a hospitalist in Europe and Asia, he served as a policymaker in Singapore, an advisor to the UK National Health Service, and an executive at a digital health startup before joining Stanford as a postdoctoral fellow in informatics.

Now serving as ARISE’s Executive Director, he draws on his diverse background to connect AI development, academic research, and real-world clinical care.

Goh sees ARISE’s main role as understanding and clearly explaining what AI can do in healthcare.

Traditionally, AI was evaluated using medical exam questions, such as those on the USMLE. Yet while these exams assess knowledge, they do not reflect real-world practice, where clinicians iteratively gather information from patients who often present differently than textbook descriptions. And a machine that performs well on a standardized knowledge test will not necessarily provide good clinical care.

The field is now moving toward simulations that more closely mirror clinical practice, often using rubrics rather than single correct answers.

Still, even these newer benchmarks typically lack context and focus on isolated cognition rather than actual clinical work.

Because medicine is not a single task and “doctoring” is not a single function, Goh argues that benchmarks must be framed around precise tasks such as triage, diagnosis, treatment, and communication, each with different thresholds for AI readiness.

Accordingly, ARISE introduced the Medical AI Superintelligence Test (MAST), which combines multiple domains of clinical competence—including diagnosis, management, reasoning, safety, and agentic workflow use—all benchmarked against realistic physician baselines and incorporating physicians working with AI, not just models alone.

As Goh explained, “Our goal is to open-source benchmarks and constantly index the latest models to find out where they are strong or weak on key clinical tasks, rather than the industry relying on its own limited benchmarks.”

One MAST component benchmark is NOHARM, which quantifies how often an LLM makes potentially harmful recommendations. Recently, the ARISE team reported that even top models generate potentially harmful advice in up to 22% of cases, typically due to errors of omission. Still, the best models outperformed generalist physicians on safety, and ensembles of models made fewer errors than individual models.

Another MAST component is MedAgentBench, which assesses models’ ability to independently perform 300 patient-specific, clinically relevant tasks—like ordering medications and aggregating test results—in a realistic FHIR-based EHR setting.

In mid-2025, the ARISE team reported that the best-performing model achieved a 70% success rate, with most failures clustering around tasks requiring three or more steps. However, just six months later, Anthropic announced that its Opus 4.6 model achieved a 92% success rate, underscoring how quickly these capabilities are advancing and how quickly benchmarks themselves may become outdated.

In response, ARISE developed PhysicianBench, a new benchmark designed to evaluate how well AI agents complete multi-step medical consultation and execution tasks in realistic EHR settings.

Yet AI may quickly outpace this benchmark, too. If these trends continue and AI approaches superintelligence—which ARISE defines as outperforming top clinicians across a range of clinically meaningful tasks under real-world conditions—evaluating AI based on concordance with physician experts will break down, just as experts in the game of Go were confounded by AlphaGo’s winning Move 37.

This will force a shift to real-world randomized controlled trials with hard clinical outcomes.

Where Are We Going?

Chen, Rodman, and Goh each believe we are on the verge of a fundamental shift in what it means to be a doctor. Whether they are right or wrong, AI is forcing medicine to reconsider some of its deepest assumptions about clinical reasoning and expertise, human-machine interaction, and how we define good care.

In the process, AI is pushing us to think more carefully about what physicians do, where AI helps, and how and when the two should work together. These questions are no longer theoretical.

See also  Bite your nails or pick at your skin? A new study has a solution for that
ARISE clinical Network rethinking
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus

June 11, 2026

Humana To Divest End-Of-Life Care Business For $900 Million

June 11, 2026

Diabetes association CEO apologizes for conference expulsions

June 10, 2026

Before Getting An Operation, Ask Your Surgeon This Question

June 10, 2026
Add A Comment

Leave A Reply Cancel Reply

Top Posts

‘Pride & Prejudice’ Star Anna Chancellor Mourns Death of 36-Year-Old Daughter

October 2, 2023

Google Blocked Christian ‘TruPlay’ App for ‘Inappropriate’ Imagery of Jesus Christ, then Backtracked When Breitbart Asked Why

May 14, 2026

Startup Darling WeWork Teeters on the Brink of Bankruptcy

November 2, 2023

New Shows and Films for October 2023

September 29, 2023
Don't Miss

Trump Mulls At Ending His Signature Trade Deal

Politics June 11, 2026

President Donald Trump says he is “not looking to renew” the US-Mexico-Canada Agreement (USMCA), a…

Natalie Portman Backs Israeli Director Nadav Lapid, Who Bashed Israel’s War Against Hamas, After He’s Booted from French Film Festival ‘Because He’s Israeli’

June 11, 2026

Aaron Sorkin Makes Mark Zuckerberg a ‘Free Speech’ Boogeyman in ‘The Social Reckoning’ Teaser

June 11, 2026

More Than a Million People Visit Serbian Church to Witness Holy Relic of Virgin Mary

June 11, 2026
About
About

This is your World, Tech, Health, Entertainment and Sports website. We provide the latest breaking news straight from the News industry.

We're social. Connect with us:

Facebook Twitter Instagram Pinterest
Categories
  • Business (4,379)
  • Entertainment (5,013)
  • Finance (3,728)
  • Health (2,251)
  • Lifestyle (1,892)
  • Politics (3,510)
  • Sports (4,460)
  • Tech (2,245)
  • Uncategorized (4)
  • World (4,885)
Our Picks

Elon Musk’s Tesla ‘Autopilot’ System Linked to 736 Crashes and 17 Fatalities Since 2019

June 13, 2023

Vermont Snowboarding Coach Sues School That Fired Him for Saying Men Should Not Compete in Women’s Sports

July 21, 2023

United Airlines pilots reach labor agreement, boost pay

July 17, 2023
Popular Posts

Trump Mulls At Ending His Signature Trade Deal

June 11, 2026

Natalie Portman Backs Israeli Director Nadav Lapid, Who Bashed Israel’s War Against Hamas, After He’s Booted from French Film Festival ‘Because He’s Israeli’

June 11, 2026

Aaron Sorkin Makes Mark Zuckerberg a ‘Free Speech’ Boogeyman in ‘The Social Reckoning’ Teaser

June 11, 2026
© 2026 Patriotnownews.com - All rights reserved.
  • Contact
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.