Close Menu
Ztoog
    What's Hot
    Gadgets

    The best smart refrigerators for 2024

    Science

    The Secret of How Cells Make ‘Dark Oxygen’ Without Light

    AI

    Meet Wisdom AI: An AI Startup that Bring Insights at your Fingertips with AI-Powered Analytics

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      What is Project Management? 5 Best Tools that You Can Try

      Operational excellence strategy and continuous improvement

      Hannah Fry: AI isn’t as powerful as we think

      FanDuel goes all in on responsible gaming push with new Play with a Plan campaign

      Gettyimages.com Is the Best Website on the Internet Right Now

    • Technology

      Iran war: How could it end?

      Democratic senators question CFTC staffing cuts in Chicago enforcement office

      Google’s Cloud AI lead on the three frontiers of model capability

      AMD agrees to backstop a $300M loan from Goldman Sachs for Crusoe to buy AMD AI chips, the first known case of AMD chips used as debt collateral (The Information)

      Productivity apps failed me when I needed them most

    • Gadgets

      macOS Tahoe 26.3.1 update will “upgrade” your M5’s CPU to new “super” cores

      Lenovo Shows Off a ThinkBook Modular AI PC Concept With Swappable Ports and Detachable Displays at MWC 2026

      POCO M8 Review: The Ultimate Budget Smartphone With Some Cons

      The Mission: Impossible of SSDs has arrived with a fingerprint lock

      6 Best Phones With Headphone Jacks (2026), Tested and Reviewed

    • Mobile

      Android’s March update is all about finding people, apps, and your missing bags

      Watch Xiaomi’s global launch event live here

      Our poll shows what buyers actually care about in new smartphones (Hint: it’s not AI)

      Is Strava down for you? You’re not alone

      The Motorola Razr FIFA World Cup 2026 Edition was literally just unveiled, and Verizon is already giving them away

    • Science

      Big Tech Signs White House Data Center Pledge With Good Optics and Little Substance

      Inside the best dark matter detector ever built

      NASA’s Artemis moon exploration programme is getting a major makeover

      Scientists crack the case of “screeching” Scotch tape

      Blue-faced, puffy-lipped monkey scores a rare conservation win

    • AI

      Online harassment is entering its AI era

      Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

      New method could increase LLM training efficiency | Ztoog

      The human work behind humanoid robots is being hidden

      NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    • Crypto

      Google paid startup Form Energy $1B for its massive 100-hour battery

      Ethereum Breakout Alert: Corrective Channel Flip Sparks Impulsive Wave

      Show Your ID Or No Deal

      Jane Street sued for alleged front-running trades that accelerated Terraform Labs meltdown

      Bitcoin Trades Below ETF Cost-Basis As MVRV Signals Mounting Pressure

    Ztoog
    Home » Personalization features can make LLMs more agreeable | Ztoog
    AI

    Personalization features can make LLMs more agreeable | Ztoog

    Facebook Twitter Pinterest WhatsApp
    Personalization features can make LLMs more agreeable | Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Many of the newest giant language fashions (LLMs) are designed to recollect particulars from previous conversations or retailer person profiles, enabling these fashions to personalize responses.

    But researchers from MIT and Penn State University discovered that, over lengthy conversations, such personalization features usually improve the chance an LLM will change into overly agreeable or start mirroring the person’s perspective.

    This phenomenon, often known as sycophancy, can forestall a mannequin from telling a person they’re flawed, eroding the accuracy of the LLM’s responses. In addition, LLMs that mirror somebody’s political views or worldview can foster misinformation and deform a person’s notion of actuality.

    Unlike many previous sycophancy research that consider prompts in a lab setting with out context, the MIT researchers collected two weeks of dialog information from people who interacted with an actual LLM throughout their every day lives. They studied two settings: agreeableness in private recommendation and mirroring of person beliefs in political explanations.

    Although interplay context elevated agreeableness in 4 of the 5 LLMs they studied, the presence of a condensed person profile within the mannequin’s reminiscence had the best influence. On the opposite hand, mirroring conduct solely elevated if a mannequin might precisely infer a person’s beliefs from the dialog.

    The researchers hope these outcomes encourage future analysis into the event of personalization strategies which can be more sturdy to LLM sycophancy.

    “From a user perspective, this work highlights how important it is to understand that these models are dynamic and their behavior can change as you interact with them over time. If you are talking to a model for an extended period of time and start to outsource your thinking to it, you may find yourself in an echo chamber that you can’t escape. That is a risk users should definitely remember,” says Shomik Jain, a graduate pupil within the Institute for Data, Systems, and Society (IDSS) and lead writer of a paper on this analysis.

    Jain is joined on the paper by Charlotte Park, {an electrical} engineering and pc science (EECS) graduate pupil at MIT; Matt Viana, a graduate pupil at Penn State University; in addition to co-senior authors Ashia Wilson, the Lister Brothers Career Development Professor in EECS and a principal investigator in LIDS; and Dana Calacci PhD ’23, an assistant professor on the Penn State. The analysis will probably be offered on the ACM CHI Conference on Human Factors in Computing Systems.

    Extended interactions

    Based on their very own sycophantic experiences with LLMs, the researchers began desirous about potential advantages and penalties of a mannequin that’s overly agreeable. But once they searched the literature to broaden their evaluation, they discovered no research that tried to know sycophantic conduct throughout long-term LLM interactions.

    “We are using these models through extended interactions, and they have a lot of context and memory. But our evaluation methods are lagging behind. We wanted to evaluate LLMs in the ways people are actually using them to understand how they are behaving in the wild,” says Calacci.

    To fill this hole, the researchers designed a person research to discover two forms of sycophancy: settlement sycophancy and perspective sycophancy.

    Agreement sycophancy is an LLM’s tendency to be overly agreeable, typically to the purpose the place it provides incorrect info or refuses the inform the person they’re flawed. Perspective sycophancy happens when a mannequin mirrors the person’s values and political beliefs.

    “There is a lot we know about the benefits of having social connections with people who have similar or different viewpoints. But we don’t yet know about the benefits or risks of extended interactions with AI models that have similar attributes,” Calacci provides.

    The researchers constructed a person interface centered on an LLM and recruited 38 contributors to speak with the chatbot over a two-week interval. Each participant’s conversations occurred in the identical context window to seize all interplay information.

    Over the two-week interval, the researchers collected a median of 90 queries from every person.

    They in contrast the conduct of 5 LLMs with this person context versus the identical LLMs that weren’t given any dialog information.

    “We found that context really does fundamentally change how these models operate, and I would wager this phenomenon would extend well beyond sycophancy. And while sycophancy tended to go up, it didn’t always increase. It really depends on the context itself,” says Wilson.

    Context clues

    For occasion, when an LLM distills details about the person into a selected profile, it results in the biggest positive aspects in settlement sycophancy. This person profile characteristic is more and more being baked into the latest fashions.

    They additionally discovered that random textual content from artificial conversations additionally elevated the chance some fashions would agree, despite the fact that that textual content contained no user-specific information. This suggests the size of a dialog might typically influence sycophancy more than content material, Jain provides.

    But content material issues significantly relating to perspective sycophancy. Conversation context solely elevated perspective sycophancy if it revealed some details about a person’s political perspective.

    To acquire this perception, the researchers fastidiously queried fashions to deduce a person’s beliefs then requested every particular person if the mannequin’s deductions had been right. Users stated LLMs precisely understood their political beliefs about half the time.

    “It is easy to say, in hindsight, that AI companies should be doing this kind of evaluation. But it is hard and it takes a lot of time and investment. Using humans in the evaluation loop is expensive, but we’ve shown that it can reveal new insights,” Jain says.

    While the goal of their analysis was not mitigation, the researchers developed some suggestions.

    For occasion, to cut back sycophancy one might design fashions that higher determine related particulars in context and reminiscence. In addition, fashions can be constructed to detect mirroring behaviors and flag responses with extreme settlement. Model builders might additionally give customers the power to reasonable personalization in lengthy conversations.

    “There are many ways to personalize models without making them overly agreeable. The boundary between personalization and sycophancy is not a fine line, but separating personalization from sycophancy is an important area of future work,” Jain says.

    “At the end of the day, we need better ways of capturing the dynamics and complexity of what goes on during long conversations with LLMs, and how things can misalign during that long-term process,” Wilson provides.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Online harassment is entering its AI era

    AI

    Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

    AI

    New method could increase LLM training efficiency | Ztoog

    AI

    The human work behind humanoid robots is being hidden

    AI

    NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    AI

    AI is already making online crimes easier. It could get much worse.

    AI

    NVIDIA Researchers Introduce KVTC Transform Coding Pipeline to Compress Key-Value Caches by 20x for Efficient LLM Serving

    AI

    Study: Platforms that rank the latest LLMs can be unreliable | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    The Future

    Software usage tracking: Turn visibility into performance

    20 Do you actually understand how efficient your software program usage monitoring is? As a…

    Gadgets

    Best MacBook Accessories (2023): Keyboards, External Monitors, and Sleeves

    The MacGuide is a robust machine. Whether you are utilizing a MacGuide Air for net…

    Crypto

    Crypto Analyst Says February Will Be A Bullish Month For Bitcoin, Here’s Why

    Crypto analyst and long-term crypto investor Jelle has highlighted an fascinating historic sample that implies…

    Gadgets

    Lenovo Unveils AI-Enhanced Legion Y700 (2026): A New Benchmark For Compact Gaming Tablets

    Lenovo appears to be pushing the boundaries of the small-form-factor gaming market with its fifth-generation…

    The Future

    Most large fishing boats go untracked as ‘dark vessels’

    The majority of the world’s industrial fishing vessels are usually not publicly trackedThree-quarters of the…

    Our Picks
    Mobile

    Galaxy Tab S9 FE and S9 FE Plus leak reveals prices, colors, and versions

    The Future

    The Samsung Galaxy S24 is here

    Crypto

    Ethereum End Of Month Challenge: Can ETH Hit $2,000?

    Categories
    • AI (1,560)
    • Crypto (1,826)
    • Gadgets (1,870)
    • Mobile (1,910)
    • Science (1,939)
    • Technology (1,862)
    • The Future (1,716)
    Most Popular
    Technology

    Nvidia’s stellar 2023 performance: A decade’s best in stock market

    Gadgets

    The best compact treadmills of 2023

    Technology

    The not-so-secret-anymore lunar mining startup

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2026 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.