Close Menu
Ztoog
    What's Hot
    AI

    Meet Objaverse-XL: An Open Dataset of Over 10 Million 3D Objects

    Mobile

    OnePlus Watch 2 leak reveals sleek new design, could launch with Wear OS 4

    Technology

    After 25 years, you can finally unlock all of Castlevania 64’s playable characters with a Konami Code variant

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

      India-Pak conflict: Pak appoints ISI chief, appointment comes in backdrop of the Pahalgam attack

    • Technology

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

      The more Google kills Fitbit, the more I want a Fitbit Sense 3

    • Gadgets

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

      Snapdragon X Plus Could Bring Faster, More Powerful Chromebooks

    • Mobile

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

      Chinese tech icon is about to raise the stakes in a battle with US chipmaker over AI processors

    • Science

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

      ‘Dark photon’ theory of light aims to tear up a century of physics

      Signs of alien life on exoplanet K2-18b may just be statistical noise

    • AI

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

      The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    • Crypto

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

      Speak at Ztoog Disrupt 2025: Applications now open

    Ztoog
    Home » New method assesses and improves the reliability of radiologists’ diagnostic reports | Ztoog
    AI

    New method assesses and improves the reliability of radiologists’ diagnostic reports | Ztoog

    Facebook Twitter Pinterest WhatsApp
    New method assesses and improves the reliability of radiologists’ diagnostic reports | Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Due to the inherent ambiguity in medical pictures like X-rays, radiologists usually use phrases like “may” or “likely” when describing the presence of a sure pathology, comparable to pneumonia.

    But do the phrases radiologists use to specific their confidence stage precisely mirror how usually a specific pathology happens in sufferers? A brand new research exhibits that when radiologists specific confidence a couple of sure pathology utilizing a phrase like “very likely,” they are typically overconfident, and vice-versa after they specific much less confidence utilizing a phrase like “possibly.”

    Using medical knowledge, a multidisciplinary staff of MIT researchers in collaboration with researchers and clinicians at hospitals affiliated with Harvard Medical School created a framework to quantify how dependable radiologists are after they specific certainty utilizing pure language phrases.

    They used this method to offer clear options that assist radiologists select certainty phrases that will enhance the reliability of their medical reporting. They additionally confirmed that the identical approach can successfully measure and enhance the calibration of massive language fashions by higher aligning the phrases fashions use to specific confidence with the accuracy of their predictions.

    By serving to radiologists extra precisely describe the probability of sure pathologies in medical pictures, this new framework may enhance the reliability of vital medical data.

    “The words radiologists use are important. They affect how doctors intervene, in terms of their decision making for the patient. If these practitioners can be more reliable in their reporting, patients will be the ultimate beneficiaries,” says Peiqi Wang, an MIT graduate scholar and lead writer of a paper on this analysis.

    He is joined on the paper by senior writer Polina Golland, a Sunlin and Priscilla Chou Professor of Electrical Engineering and Computer Science (EECS), a principal investigator in the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL), and the chief of the Medical Vision Group; in addition to Barbara D. Lam, a medical fellow at the Beth Israel Deaconess Medical Center; Yingcheng Liu, at MIT graduate scholar; Ameneh Asgari-Targhi, a analysis fellow at Massachusetts General Brigham (MGB); Rameswar Panda, a analysis employees member at the MIT-IBM Watson AI Lab; William M. Wells, a professor of radiology at MGB and a analysis scientist in CSAIL; and Tina Kapur, an assistant professor of radiology at MGB. The analysis might be introduced at the International Conference on Learning Representations.

    Decoding uncertainty in phrases

    A radiologist writing a report a couple of chest X-ray may say the picture exhibits a “possible” pneumonia, which is an an infection that inflames the air sacs in the lungs. In that case, a physician may order a follow-up CT scan to substantiate the prognosis.

    However, if the radiologist writes that the X-ray exhibits a “likely” pneumonia, the physician may start remedy instantly, comparable to by prescribing antibiotics, whereas nonetheless ordering extra checks to evaluate severity.

    Trying to measure the calibration, or reliability, of ambiguous pure language phrases like “possibly” and “likely” presents many challenges, Wang says.

    Existing calibration strategies usually depend on the confidence rating supplied by an AI mannequin, which represents the mannequin’s estimated probability that its prediction is right.

    For occasion, a climate app may predict an 83 p.c likelihood of rain tomorrow. That mannequin is well-calibrated if, throughout all situations the place it predicts an 83 p.c likelihood of rain, it rains roughly 83 p.c of the time.

    “But humans use natural language, and if we map these phrases to a single number, it is not an accurate description of the real world. If a person says an event is ‘likely,’ they aren’t necessarily thinking of the exact probability, such as 75 percent,” Wang says.

    Rather than attempting to map certainty phrases to a single share, the researchers’ method treats them as chance distributions. A distribution describes the vary of potential values and their likelihoods — suppose of the basic bell curve in statistics.

    “This captures more nuances of what each word means,” Wang provides.

    Assessing and enhancing calibration

    The researchers leveraged prior work that surveyed radiologists to acquire chance distributions that correspond to every diagnostic certainty phrase, starting from “very likely” to “consistent with.”

    For occasion, since extra radiologists imagine the phrase “consistent with” means a pathology is current in a medical picture, its chance distribution climbs sharply to a excessive peak, with most values clustered round the 90 to 100% vary.

    In distinction the phrase “may represent” conveys better uncertainty, resulting in a broader, bell-shaped distribution centered round 50 p.c.

    Typical strategies consider calibration by evaluating how properly a mannequin’s predicted chance scores align with the precise quantity of constructive outcomes.

    The researchers’ method follows the identical basic framework however extends it to account for the proven fact that certainty phrases symbolize chance distributions somewhat than chances.

    To enhance calibration, the researchers formulated and solved an optimization downside that adjusts how usually sure phrases are used, to higher align confidence with actuality.

    They derived a calibration map that implies certainty phrases a radiologist ought to use to make the reports extra correct for a selected pathology.

    “Perhaps, for this dataset, if every time the radiologist said pneumonia was ‘present,’ they changed the phrase to ‘likely present’ instead, then they would become better calibrated,” Wang explains.

    When the researchers used their framework to guage medical reports, they discovered that radiologists had been typically underconfident when diagnosing widespread situations like atelectasis, however overconfident with extra ambiguous situations like an infection.

    In addition, the researchers evaluated the reliability of language fashions utilizing their method, offering a extra nuanced illustration of confidence than classical strategies that depend on confidence scores. 

    “A lot of times, these models use phrases like ‘certainly.’ But because they are so confident in their answers, it does not encourage people to verify the correctness of the statements themselves,” Wang provides.

    In the future, the researchers plan to proceed collaborating with clinicians in the hopes of enhancing diagnoses and remedy. They are working to develop their research to incorporate knowledge from belly CT scans.

    In addition, they’re considering finding out how receptive radiologists are to calibration-improving options and whether or not they can mentally regulate their use of certainty phrases successfully.

    “Expression of diagnostic certainty is a crucial aspect of the radiology report, as it influences significant management decisions. This study takes a novel approach to analyzing and calibrating how radiologists express diagnostic certainty in chest X-ray reports, offering feedback on term usage and associated outcomes,” says Atul B. Shinagare, affiliate professor of radiology at Harvard Medical School, who was not concerned with this work. “This approach has the potential to improve radiologists’ accuracy and communication, which will help improve patient care.”

    The work was funded, partially, by a Takeda Fellowship, the MIT-IBM Watson AI Lab, the MIT CSAIL Wistrom Program, and the MIT Jameel Clinic.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    Crypto

    Speak at Ztoog Disrupt 2025: Applications now open

    AI

    Seeing AI as a collaborator, not a creator

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    AI

    This robot can tidy a room without any help

    To develop the system, researchers from New York University and Meta examined Stretch, a commercially…

    Gadgets

    15 Best Mattresses You Can Buy Online (2023)

    Searching for the(*15*) greatest mattress on-line is a waking nightmare, and choosing the fallacious one…

    Technology

    Asian American Officials Cite Unfair Scrutiny and Lost Jobs in China Spy Tensions

    When Thomas Wong set foot in the United States Embassy in Beijing this summer season…

    AI

    Top AI Translation Software/Tools (September 2023)

    Almost each enterprise sector, together with translation providers, is being reworked by synthetic intelligence (AI).…

    Science

    NASA HQ picked their best photos of the year. Here are our 13 favorites.

    On September 24, 2023, a capsule from NASA’s OSIRIS-REx mission floated again to Earth, touchdown…

    Our Picks
    Science

    Swimming behind someone cuts drag by up to 40 per cent

    Mobile

    COROS Heart Rate Monitor review: A seamless band I didn’t know I needed

    Technology

    China and Norway Lead the World’s EV Switchover

    Categories
    • AI (1,482)
    • Crypto (1,744)
    • Gadgets (1,795)
    • Mobile (1,838)
    • Science (1,852)
    • Technology (1,789)
    • The Future (1,635)
    Most Popular
    Technology

    Today Only: Upgrade Your Kicks With This 25% Off Deal at Nike

    Mobile

    Samsung Galaxy Tab A8 vs Samsung Galaxy Tab S9 FE: Should you upgrade?

    Mobile

    Android 14 QPR2 beta 1.1 rolling out now

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.