Ztoog

    LongRAG: A New AI Framework that Combines RAG with Long-Context LLMs to Enhance Performance


    Retrieval-Augmented Generation (RAG) methods enhance the capabilities of large language models (LLMs) by incorporating external knowledge retrieved from vast corpora. This approach is especially useful for open-domain question answering, where detailed and accurate responses are crucial. By drawing on external information, RAG systems can overcome the limitations of relying solely on the parametric knowledge embedded in LLMs, making them more effective at handling complex queries.

    A significant challenge in RAG systems is the imbalance between the retriever and reader components. Traditional frameworks often use short retrieval units, such as 100-word passages, which forces the retriever to search through a very large number of units. This design burdens the retriever heavily while the reader's task remains comparatively simple, leading to inefficiency and to semantic incompleteness when documents are truncated into passages. This imbalance limits the overall performance of RAG systems and calls for a re-evaluation of their design.

    Current methods in RAG systems include techniques such as Dense Passage Retrieval (DPR), which focuses on finding precise, short retrieval units in large corpora. These methods often recall many units and apply complex re-ranking steps to reach high accuracy. While effective to some extent, these approaches still suffer from inherent inefficiency and incomplete semantic representation because they rely on short retrieval units.

    To address these challenges, a research team from the University of Waterloo introduced a framework called LongRAG. It consists of a “long retriever” and a “long reader” component, designed to process longer retrieval units of around 4K tokens each. By increasing the size of the retrieval units, LongRAG reduces the number of units from 22 million to 600,000, which significantly eases the retriever's workload and improves retrieval scores. The retriever can then work with more complete units of information, improving the system's efficiency and accuracy.

    The LongRAG framework works by grouping related documents into long retrieval units, which the long retriever then searches to identify relevant information. To extract the final answer, the retriever selects the top 4 to 8 units, which are concatenated and fed into a long-context LLM such as Gemini-1.5-Pro or GPT-4o. This approach relies on the ability of long-context models to process large amounts of text efficiently when extracting the answer.
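The grouping step can be illustrated with a minimal sketch. It assumes that each document already lists the IDs of related documents (for example, via hyperlinks) and has an approximate token count. The `Doc` type, the `group_documents` helper, the greedy merge strategy, and the 4,000-token budget are illustrative assumptions, not the authors' implementation.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    doc_id: str
    tokens: int  # approximate token count of the document
    related: list[str] = field(default_factory=list)  # IDs of linked documents


def group_documents(docs: list[Doc], max_tokens: int = 4000) -> list[list[str]]:
    """Greedily merge each document with its related documents into one
    retrieval unit, skipping any document that would exceed max_tokens."""
    by_id = {d.doc_id: d for d in docs}
    assigned: set[str] = set()
    units: list[list[str]] = []
    for doc in docs:
        if doc.doc_id in assigned:
            continue  # already part of an earlier unit
        unit, size = [doc.doc_id], doc.tokens
        assigned.add(doc.doc_id)
        for rel_id in doc.related:
            rel = by_id.get(rel_id)
            if rel is None or rel_id in assigned:
                continue
            if size + rel.tokens > max_tokens:
                continue  # adding this document would overflow the unit
            unit.append(rel_id)
            size += rel.tokens
            assigned.add(rel_id)
        units.append(unit)
    return units


# Hypothetical toy corpus: "a" links to "b" and "c"; "d" stands alone.
docs = [
    Doc("a", 1500, ["b", "c"]),
    Doc("b", 1200, ["a"]),
    Doc("c", 2000, ["a"]),
    Doc("d", 900),
]
units = group_documents(docs)
print(units)
```

In this toy corpus, "a" and "b" fit together under the budget, while "c" would overflow the unit and therefore starts its own. The result is fewer, longer units than documents, which is the effect the framework aims for at corpus scale.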

    In more detail, the method uses one encoder to map the input question to a vector and a different encoder to map the retrieval units to vectors. The similarity between the question and each retrieval unit is computed to identify the most relevant units. Because the long retriever searches over these longer units, the corpus it has to search is much smaller, which improves its precision. The retrieved units are then concatenated and fed into the long reader, which uses this context to generate the final answer. The reader therefore works from a more complete set of information, which improves the system's overall performance.
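The retrieve-then-read flow described above can be sketched in a few lines. This is a toy illustration under stated assumptions: a real system would use learned dense encoders, whereas `encode_question` and `encode_unit` here are both hypothetical bag-of-words stand-ins, and `retrieve` and `build_reader_prompt` are illustrative helper names rather than part of LongRAG's code.

```python
import math
import re
from collections import Counter


def _bow(text: str) -> dict[str, float]:
    """Toy stand-in for a learned encoder: L2-normalised bag of words."""
    counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {word: c / norm for word, c in counts.items()}


def encode_question(question: str) -> dict[str, float]:
    return _bow(question)  # assumption: a real system uses a question encoder


def encode_unit(unit_text: str) -> dict[str, float]:
    return _bow(unit_text)  # assumption: a real system uses a separate unit encoder


def similarity(q_vec: dict[str, float], u_vec: dict[str, float]) -> float:
    """Dot product between two sparse vectors."""
    return sum(weight * u_vec.get(word, 0.0) for word, weight in q_vec.items())


def retrieve(question: str, units: list[str], k: int = 4) -> list[int]:
    """Return the indices of the k units most similar to the question."""
    q_vec = encode_question(question)
    scored = [(similarity(q_vec, encode_unit(u)), i) for i, u in enumerate(units)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # best score first, ties by index
    return [i for _, i in scored[:k]]


def build_reader_prompt(question: str, units: list[str], top_ids: list[int]) -> str:
    """Concatenate the retrieved units into one long context for the reader."""
    context = "\n\n".join(units[i] for i in top_ids)
    return f"{context}\n\nQuestion: {question}\nAnswer:"


units = [
    "The University of Waterloo is a public research university in Ontario, Canada.",
    "Dense Passage Retrieval encodes short passages for open-domain question answering.",
    "Loofahs grow on vines and are harvested when dry.",
]
question = "Where is the University of Waterloo?"
top = retrieve(question, units, k=2)
prompt = build_reader_prompt(question, units, top)
print(top)
```

The resulting prompt would then be passed to a long-context LLM. That call is omitted here because it depends on the model provider.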

    LongRAG performs strongly in evaluation. On the Natural Questions (NQ) dataset, it achieved an exact match (EM) score of 62.7%, and on the HotpotQA dataset it reached an EM score of 64.3%. These results match the performance of state-of-the-art fine-tuned RAG models. The framework reduced the corpus size by a factor of 30 and improved answer recall by roughly 20 percentage points compared to traditional methods, with an answer recall@1 of 71% on NQ and 72% on HotpotQA.
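For readers unfamiliar with these metrics, here is a minimal sketch of how exact match and answer recall are commonly computed. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and articles) and treats answer recall as a substring check over the retrieved text. The paper's evaluation script may differ in its details, and the function names are illustrative.

```python
import re
import string


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold_answers: list[str]) -> bool:
    """True if the normalised prediction equals any normalised gold answer."""
    return normalize(prediction) in {normalize(g) for g in gold_answers}


def answer_recall(retrieved_units: list[str], gold_answers: list[str]) -> bool:
    """True if any gold answer string appears in the retrieved text."""
    context = normalize(" ".join(retrieved_units))
    return any(normalize(g) in context for g in gold_answers)


def em_score(predictions: list[str], gold: list[list[str]]) -> float:
    """Percentage of questions whose prediction is an exact match."""
    hits = sum(exact_match(p, g) for p, g in zip(predictions, gold))
    return 100.0 * hits / len(predictions)


# Hypothetical predictions and gold answers, for illustration only.
predictions = ["The Eiffel Tower.", "Paris, France"]
gold = [["Eiffel Tower"], ["Paris"]]
score = em_score(predictions, gold)
print(score)
```

Answer recall@1 applies `answer_recall` to only the single top-ranked retrieval unit for each question and averages the result over the dataset.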

    LongRAG's ability to process long retrieval units preserves the semantic integrity of documents, allowing for more accurate and comprehensive responses. By reducing the burden on the retriever and relying on advanced long-context LLMs, LongRAG offers a more balanced and efficient approach to retrieval-augmented generation. The University of Waterloo research offers useful insights into modernizing RAG system design and points to room for further advances in this field.

    In conclusion, LongRAG represents a significant step forward in addressing the inefficiencies and imbalances of traditional RAG systems. Using long retrieval units together with the capabilities of advanced long-context LLMs improves both the accuracy and the efficiency of open-domain question answering. The framework improves retrieval performance and sets the stage for future work on retrieval-augmented generation systems.


    Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.



    Nikhil is an intern consultant at Marktechpost. He is pursuing an integrated dual degree in Materials at the Indian Institute of Technology, Kharagpur. Nikhil is an AI/ML enthusiast who is always researching applications in fields like biomaterials and biomedical science. With a strong background in Material Science, he is exploring new advancements and creating opportunities to contribute.
