Close Menu
Ztoog
    What's Hot
    Mobile

    Switching from a small iPhone to iPhone 15 Pro Max: The best or worst mistake one can make?

    Mobile

    OnePlus Ace 3’s flagship-tier OLED display teased

    Technology

    Bengaluru-based fintech Perfios, which offers real-time credit underwriting solutions to financial companies, raised a $229M Series D led by Kedaara Capital (Manish Singh/Ztoog)

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How to Get Bot Lobbies in Fortnite? (2025 Guide)

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

    • Technology

      What does a millennial midlife crisis look like?

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

    • Gadgets

      Watch Apple’s WWDC 2025 keynote right here

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

    • Mobile

      YouTube is testing a leaderboard to show off top live stream fans

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

    • Science

      Some parts of Trump’s proposed budget for NASA are literally draconian

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

    • AI

      Fueling seamless AI at scale

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    • Crypto

      JPMorgan Chase set to accept Bitcoin, crypto ETFs as loan collateral

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

    Ztoog
    Home » Unveiling the Shortcuts: How Retrieval Augmented Generation (RAG) Influences Language Model Behavior and Memory Utilization
    AI

    Unveiling the Shortcuts: How Retrieval Augmented Generation (RAG) Influences Language Model Behavior and Memory Utilization

    Facebook Twitter Pinterest WhatsApp
    Unveiling the Shortcuts: How Retrieval Augmented Generation (RAG) Influences Language Model Behavior and Memory Utilization
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Researchers from Microsoft, the University of Massachusetts, Amherst, and the University of Maryland, College Park, tackle the problem of understanding how Retrieval Augmented Generation (RAG) impacts language fashions’ reasoning and factual accuracy (LMs). The research focuses on whether or not LMs rely extra on the exterior context supplied by RAG than their parametric reminiscence when producing responses to factual queries.

    Current strategies for enhancing the factual accuracy of LMs usually contain both enhancing the inner parameters of the fashions or utilizing exterior retrieval methods to offer extra context throughout inference. Techniques like ROME and MEMIT concentrate on enhancing the mannequin’s inner parameters to replace information. However, there was restricted exploration into how these fashions steadiness the use of inner (parametric) information and exterior (non-parametric) context in RAG.

    The researchers suggest a mechanistic examination of RAG pipelines to find out how a lot LMs depend upon exterior context versus their inner reminiscence when answering factual queries. They use two superior LMs, LLaMa-2 and Phi-2, to conduct their evaluation, using strategies like Causal Mediation Analysis, Attention Contributions, and Attention Knockouts.

    The researchers utilized three key methods to handle the internal workings of LMs below RAG:

    1. Causal tracing identifies which hidden states in the mannequin are essential for factual predictions. By evaluating a corrupted run (the place a part of the enter is intentionally altered) with a clear run and a restoration run (the place clear activations are reintroduced into the corrupted run), the researchers measure the Indirect Effect (IE) to find out the significance of particular hidden states.

    2. Attention contributions look into the consideration weights between the topic token and the final token in the output. This helps by analyzing how a lot consideration every token receives to see if the mannequin depends extra on the exterior context supplied by RAG or its inner information.

    3. Attention knockouts contain setting essential consideration weights to detrimental infinity to dam info circulate between particular tokens. By observing the drop in prediction high quality when these consideration weights are knocked out, the researchers can determine which connections are important for correct predictions.

    The outcomes revealed that in the presence of RAG context, each LLaMa-2 and Phi-2 fashions confirmed a major lower in reliance on their inner parametric reminiscence. The Average Indirect Effect of topic tokens in the question was notably decrease when RAG context was current. Additionally, the final token residual stream derived extra enriched info from the attribute tokens in the context relatively than the topic tokens in the question. Attention Contributions and Knockouts additional confirmed that the fashions prioritized exterior context over inner reminiscence for factual predictions. However, the precise nature of how this strategy works isn’t clearly understood.

    In conclusion, the proposed methodology demonstrates that language fashions current a “shortcut” habits, closely counting on the exterior context supplied by RAG over their inner parametric reminiscence for factual queries. By mechanistically analyzing how LMs course of and prioritize info, the researchers present invaluable insights into the interaction between parametric and non-parametric information in retrieval-augmented technology. The research highlights the want for understanding these dynamics to enhance mannequin efficiency and reliability in sensible functions.


    Check out the Paper. All credit score for this analysis goes to the researchers of this venture. Also, don’t overlook to comply with us on Twitter. 

    Join our Telegram Channel and LinkedIn Group.

    If you want our work, you’ll love our publication..

    Don’t Forget to hitch our 44k+ ML SubReddit


    Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is at the moment pursuing her B.Tech from the Indian Institute of Technology(IIT), Kharagpur. She is a tech fanatic and has a eager curiosity in the scope of software program and information science functions. She is at all times studying about the developments in numerous area of AI and ML.

    🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and many others…

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Fueling seamless AI at scale

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    The Future

    ChatGPT-5: release date, price, and what we know so far

    In a current dialog between the CEOs of Microsoft and OpenAI, it was revealed by…

    Crypto

    BNB Going Strong Short-Term Despite Outflows On Binance

    Binance finds itself entangled in a lawsuit filed by the US Securities and Exchange Commission…

    AI

    MIT researchers make language models scalable self-learners | Ztoog

    Socrates as soon as stated: “It is not the size of a thing, but the…

    Gadgets

    The Pixel Fold’s screen repair will cost $900

    The entrance of the Pixel Fold show. iFixit The again of the Pixel Fold show…

    Science

    The Steam Locomotive is Back, Although with an Eco-Friendly Twist

    What would develop into of all these black and white farewells at practice stations with…

    Our Picks
    Mobile

    Which one should you buy?

    Mobile

    Sony Xperia 5 VI leaks in case maker’s images

    Science

    The curious case of the clown wedgefish

    Categories
    • AI (1,494)
    • Crypto (1,755)
    • Gadgets (1,806)
    • Mobile (1,852)
    • Science (1,868)
    • Technology (1,804)
    • The Future (1,650)
    Most Popular
    Technology

    Don’t expect Sony to release games simultaneously on PS5 and PC anytime soon

    The Future

    Intel yet to announce key client for its expanding foundry services

    Technology

    Do You Have ‘Bookshelf Wealth’?

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.