Researchers from Mass General Brigham determined that ChatGPT achieved an accuracy rate of almost 72 percent across all medical specialties and phases of clinical care, and 77 percent accuracy in making final diagnoses.
Researchers from Mass General Brigham have conducted a study showing that ChatGPT was approximately 72 percent accurate in overall clinical decision-making, from suggesting possible diagnoses to making final diagnoses and care management decisions. The large language model (LLM)-based AI chatbot performed consistently in both primary care and emergency settings and across medical specialties. The findings were recently published in the Journal of Medical Internet Research.
“Our paper comprehensively assesses decision support via ChatGPT from the very beginning of working with a patient through the entire care scenario, from differential diagnosis all the way through testing, diagnosis, and management,” said corresponding author Marc Succi, MD, associate chair of innovation and commercialization and strategic innovation leader at Mass General Brigham and executive director of the MESH Incubator.
“No real benchmarks exist, but we estimate this performance to be at the level of someone who has just graduated from medical school, such as an intern or resident. This tells us that LLMs, in general, have the potential to be an augmenting tool for the practice of medicine and support clinical decision-making with impressive accuracy.”
The study was done by pasting successive portions of 36 standardized, published clinical vignettes into ChatGPT. The tool first was asked to come up with a set of possible, or differential, diagnoses based on the patient’s initial information, which included age, gender, symptoms, and whether the case was an emergency. ChatGPT was then given additional pieces of information and asked to make management decisions as well as give a final diagnosis—simulating the entire process of seeing a real patient. The team compared ChatGPT’s accuracy on differential diagnosis, diagnostic testing, final diagnosis, and management in a structured blinded process, awarding points for correct answers and using linear regressions to assess the relationship between ChatGPT’s performance and the vignette’s demographic information.
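The scoring approach described above can be sketched in a few lines. This is only an illustrative sketch, not the authors' code: the function names, the question-type labels, and every number in it are hypothetical, and the one-variable least-squares fit is a simplification that may differ from the regression model actually used in the study.

```python
from statistics import mean

# Hypothetical graded answers: (question_type, points_awarded, points_possible).
# These values are invented for illustration and are NOT data from the study.
graded = [
    ("differential", 3, 5),
    ("differential", 4, 5),
    ("diagnostic_testing", 2, 3),
    ("final_diagnosis", 1, 1),
    ("final_diagnosis", 0, 1),
    ("management", 2, 4),
]


def accuracy_by_type(rows):
    """Return points awarded divided by points possible for each question type."""
    totals = {}
    for qtype, got, possible in rows:
        earned, available = totals.get(qtype, (0, 0))
        totals[qtype] = (earned + got, available + possible)
    return {q: earned / available for q, (earned, available) in totals.items()}


def ols_slope(xs, ys):
    """Slope b of the simple least-squares line y = a + b*x."""
    x_bar, y_bar = mean(xs), mean(ys)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    den = sum((x - x_bar) ** 2 for x in xs)
    return num / den


per_type = accuracy_by_type(graded)

# Hypothetical vignette ages and per-vignette accuracy (also invented).
ages = [20, 40, 60, 80]
scores = [0.70, 0.72, 0.74, 0.76]
slope = ols_slope(ages, scores)
```

In a sketch like this, a slope near zero would suggest that accuracy does not vary much with the demographic variable, which is the kind of relationship the regressions in the study were used to assess.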
The researchers found that overall, ChatGPT was about 72 percent accurate and that it was best in making a final diagnosis, where it was 77 percent accurate. It was lowest-performing in making differential diagnoses, where it was only 60 percent accurate. And it was only 68 percent accurate in clinical management decisions, such as figuring out what medications to treat the patient with after arriving at the correct diagnosis. Other notable findings from the study included that ChatGPT’s answers did not show gender bias and that its overall performance was steady across both primary and emergency care.
“ChatGPT struggled with differential diagnosis, which is the meat and potatoes of medicine when a physician has to figure out what to do,” said Succi. “That is important because it tells us where physicians are truly experts and adding the most value—in the early stages of patient care with little presenting information, when a list of possible diagnoses is needed.”
The authors note that before tools like ChatGPT can be considered for integration into clinical care, more benchmark research and regulatory guidance are needed. Next, Succi’s team is looking at whether AI tools can improve patient care and outcomes in hospitals’ resource-constrained areas.
The emergence of artificial intelligence tools in health has been groundbreaking and has the potential to positively reshape the continuum of care. Mass General Brigham, as one of the nation’s top integrated academic health systems and largest innovation enterprises, is leading the way in conducting rigorous research on new and emerging technologies to inform the responsible incorporation of AI into care delivery, workforce support, and administrative processes.
“Mass General Brigham sees great promise for LLMs to help improve care delivery and clinician experience,” said co-author Adam Landman, MD, MS, MIS, MHS, chief information officer and senior vice president of digital at Mass General Brigham. “We are currently evaluating LLM solutions that assist with clinical documentation and draft responses to patient messages with a focus on understanding their accuracy, reliability, safety, and equity. Rigorous studies like this one are needed before we integrate LLM tools into clinical care.”
Reference: “Assessing the Utility of ChatGPT Throughout the Entire Clinical Workflow: Development and Usability Study” by Arya Rao, Michael Pang, John Kim, Meghana Kamineni, Winston Lie, Anoop K Prasad, Adam Landman, Keith Dreyer and Marc D Succi, 22 August 2023, Journal of Medical Internet Research.
DOI: 10.2196/48659
The study was funded by the National Institute of General Medical Sciences.