    Unveiling the Shortcuts: How Retrieval Augmented Generation (RAG) Influences Language Model Behavior and Memory Utilization


    Researchers from Microsoft, the University of Massachusetts, Amherst, and the University of Maryland, College Park, tackle the problem of understanding how Retrieval Augmented Generation (RAG) affects the reasoning and factual accuracy of language models (LMs). The study focuses on whether LMs rely more on the external context provided by RAG than on their parametric memory when generating responses to factual queries.

    Current methods for improving the factual accuracy of LMs usually involve either editing the internal parameters of the models or using external retrieval systems to provide additional context during inference. Techniques like ROME and MEMIT focus on editing the model's internal parameters to update knowledge. However, there has been limited exploration of how these models balance the use of internal (parametric) knowledge and external (non-parametric) context in RAG.

    The researchers propose a mechanistic examination of RAG pipelines to determine how much LMs depend on external context versus their internal memory when answering factual queries. They conduct their analysis on two advanced LMs, LLaMa-2 and Phi-2, using Causal Mediation Analysis, Attention Contributions, and Attention Knockouts.

    The researchers used three key techniques to examine the inner workings of LMs under RAG:

    1. Causal tracing identifies which hidden states in the model are crucial for factual predictions. By comparing a corrupted run (where part of the input is deliberately altered) with a clean run and a restoration run (where clean activations are reintroduced into the corrupted run), the researchers measure the Indirect Effect (IE) to determine the importance of specific hidden states.

    2. Attention contributions examine the attention weights between the subject token and the last token in the output. Analyzing how much attention each token receives shows whether the model relies more on the external context provided by RAG or on its internal knowledge.

    3. Attention knockouts involve setting critical attention weights to negative infinity to block information flow between specific tokens. By observing the drop in prediction quality when these attention weights are knocked out, the researchers can identify which connections are essential for accurate predictions.
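
    The attention-based techniques in items 2 and 3 can be illustrated with a minimal sketch. This is not the authors' code: it is a toy single-head attention in plain Python, and the three token positions, the key/value vectors, and the simplified contribution score (attention weight times value norm) are all assumptions chosen for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax; a score of -inf gets exactly zero weight."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values, blocked=()):
    """Single-head attention for one query position (here: the last token).

    `blocked` lists source positions whose pre-softmax score is set to
    -inf, i.e. the "knockout" described in the text.
    Returns (attention weights, output vector).
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    for pos in blocked:
        scores[pos] = -math.inf
    weights = softmax(scores)
    dim = len(values[0])
    output = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return weights, output

# Hypothetical toy setup (assumed, not taken from the paper):
# position 0 = attribute token in the retrieved context,
# position 1 = subject token in the query, position 2 = the last token.
query = [1.0, 0.0]
keys = [[2.0, 0.0], [0.5, 0.0], [0.0, 0.0]]
values = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]  # dimension 0 = "answer direction"

base_weights, base_out = attend(query, keys, values)

# Simplified attention contribution: weight times value norm for each source.
contributions = [w * math.hypot(*v) for w, v in zip(base_weights, values)]

# Knock out the edge from the context attribute token to the last token.
knocked_weights, knocked_out = attend(query, keys, values, blocked=[0])

# Drop in the answer-direction component caused by the knockout.
drop = base_out[0] - knocked_out[0]
print(contributions, drop)
```

    In this toy setup the last token attends mostly to the context attribute token, so knocking out that single edge removes the answer-direction component of the output entirely. In a real model the same measurement would be taken on the probability of the correct answer rather than on a hand-picked vector component.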

    The results revealed that in the presence of RAG context, both LLaMa-2 and Phi-2 showed a significant decrease in reliance on their internal parametric memory. The Average Indirect Effect of subject tokens in the query was notably lower when RAG context was present. Additionally, the last token's residual stream drew more enriched information from the attribute tokens in the context than from the subject tokens in the query. Attention Contributions and Knockouts further confirmed that the models prioritized external context over internal memory for factual predictions. However, the precise mechanism behind this behavior is not yet clearly understood.
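
    The Indirect Effect behind these numbers can be sketched on a toy model. The snippet below is an illustrative assumption, not the paper's setup: the "model" is a one-layer stand-in with one scalar hidden state per token, the corruption is a fixed offset instead of random noise, and the inputs and readout weights are made up. The IE of a hidden state is computed as the answer probability in the restored run minus the answer probability in the corrupted run.

```python
import math

def hidden_states(token_inputs):
    """Toy 'layer': one scalar hidden state per token."""
    return [math.tanh(x) for x in token_inputs]

def answer_prob(hidden, readout):
    """Toy 'head': probability of the correct answer given hidden states."""
    logit = sum(w * h for w, h in zip(readout, hidden))
    return 1.0 / (1.0 + math.exp(-logit))

def indirect_effects(clean_inputs, subject_positions, readout, noise=-3.0):
    """Return the Indirect Effect of restoring each hidden state.

    IE_i = P(answer | corrupted run with hidden state i restored)
           - P(answer | corrupted run)
    `noise` is a fixed offset added to subject inputs (an assumption made
    to keep the example deterministic).
    """
    clean_hidden = hidden_states(clean_inputs)                # clean run
    corrupted_inputs = [
        x + noise if i in subject_positions else x
        for i, x in enumerate(clean_inputs)
    ]
    corrupted_hidden = hidden_states(corrupted_inputs)        # corrupted run
    p_corrupted = answer_prob(corrupted_hidden, readout)
    effects = []
    for i in range(len(clean_inputs)):
        restored = list(corrupted_hidden)                     # restoration run
        restored[i] = clean_hidden[i]
        effects.append(answer_prob(restored, readout) - p_corrupted)
    return effects

def average_indirect_effect(per_prompt_effects):
    """Average IE per position over several prompts of equal length."""
    n = len(per_prompt_effects)
    return [sum(col) / n for col in zip(*per_prompt_effects)]

# Hypothetical prompts: position 0 is the subject token in each.
readout = [2.0, 0.5, 0.1]
effects_a = indirect_effects([1.0, 0.5, 1.5], {0}, readout)
effects_b = indirect_effects([0.8, 0.2, 1.0], {0}, readout)
aie = average_indirect_effect([effects_a, effects_b])
print(aie)
```

    In this toy model only the subject position is corrupted, so restoring any other hidden state changes nothing and its IE is zero, while restoring the subject state recovers most of the answer probability. A lower Average Indirect Effect at the subject tokens, as reported above for the RAG setting, would indicate that the prediction depends less on what the model recalls from the subject.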

    In conclusion, the proposed method demonstrates that language models exhibit a “shortcut” behavior, relying heavily on the external context provided by RAG over their internal parametric memory for factual queries. By mechanistically analyzing how LMs process and prioritize information, the researchers provide valuable insights into the interplay between parametric and non-parametric knowledge in retrieval-augmented generation. The study highlights the need to understand these dynamics in order to improve model performance and reliability in practical applications.



    Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is currently pursuing her B.Tech at the Indian Institute of Technology (IIT), Kharagpur. She is a tech enthusiast with a keen interest in software and data science applications, and she is always reading about developments in different fields of AI and ML.
