Ztoog
    Microsoft AI Research Proposes a New Artificial Intelligence Framework for Collaborative NLP Development (CoDev) that Enables Multiple Users to Align a Model with Their Beliefs


    Although NLP models have demonstrated remarkable strengths, they still face challenges. Unacceptable values buried in their training data, recurring failures, and violations of business rules all highlight the need to teach these models concepts. A concept links a set of inputs to desired behaviors; “religion does not connote sentiment” is one example. Similarly, the broader concept of “downward monotonicity” in natural language inference (NLI) describes entailment relations that hold when certain parts of a statement are made more specific (for example, “All cats like tuna” implies “All small cats like tuna”). The traditional way to teach a concept to a model is to add new training data that demonstrates it, such as neutral sentences containing religious terms or entailment pairs that exhibit downward monotonicity.
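
    As a minimal sketch of that traditional approach, the Python snippet below builds downward-monotone entailment pairs from a single template. The function name `downward_monotone_pairs` and the word lists are hypothetical, chosen only for illustration; they do not come from the paper.

```python
def downward_monotone_pairs(nouns, adjectives, predicate):
    """Build (premise, hypothesis, label) triples of the form
    ("All X <predicate>", "All ADJECTIVE X <predicate>", "entailment")."""
    pairs = []
    for noun in nouns:
        for adjective in adjectives:
            premise = f"All {noun} {predicate}"
            hypothesis = f"All {adjective} {noun} {predicate}"
            # Restricting the subject of "All ..." keeps the statement true,
            # so the premise entails the hypothesis.
            pairs.append((premise, hypothesis, "entailment"))
    return pairs


# Illustrative inputs only.
pairs = downward_monotone_pairs(["cats", "dogs"], ["small"], "like tuna")
for premise, hypothesis, label in pairs:
    print(f"{premise!r} -> {hypothesis!r}: {label}")
```

    Data generated from one template like this covers only a single surface form of the concept.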

    It is difficult to guarantee that the added data does not lead to shortcuts, i.e., spurious correlations or heuristics that let a model make predictions without capturing the underlying concept, such as “all sentences with religious terms are neutral” or “going from general to specific leads to entailment.” The model may also overfit, failing to generalize from the supplied examples to the actual concept, for instance by recognizing only pairs of the form (“all X…”, “all ADJECTIVE X…”) and not pairs like (“all animals eat”, “all cats eat”). Finally, shortcuts and overfitting can both interfere with the original data or with other concepts, for example by causing failures on sentences like “I love Islam” or on pairs like (“Some cats like tuna”, “Some small cats like tuna”), and so on.
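
    To make the shortcut problem concrete, here is a small Python illustration. It is a hand-written rule, not a trained model: `shortcut_sentiment` hard-codes the heuristic “all sentences with religious terms are neutral” on top of a toy lexicon classifier. All word lists and function names are assumptions made for this example.

```python
import re

# Illustrative word lists only; a real system would not use fixed lexicons.
RELIGIOUS_TERMS = {"islam", "church", "muslim", "christian"}
POSITIVE_WORDS = {"love", "great"}
NEGATIVE_WORDS = {"hate", "awful"}


def tokens(sentence):
    """Lowercase the sentence and split it into alphabetic words."""
    return re.findall(r"[a-z]+", sentence.lower())


def base_sentiment(sentence):
    """Toy sentiment classifier based on a tiny lexicon."""
    words = set(tokens(sentence))
    if words & POSITIVE_WORDS:
        return "positive"
    if words & NEGATIVE_WORDS:
        return "negative"
    return "neutral"


def shortcut_sentiment(sentence):
    """Same classifier plus the shortcut: any religious term forces
    'neutral', regardless of the rest of the sentence."""
    if set(tokens(sentence)) & RELIGIOUS_TERMS:
        return "neutral"
    return base_sentiment(sentence)


examples = ["I love Islam", "The church is on Main Street", "I love tuna"]
results = {s: (base_sentiment(s), shortcut_sentiment(s)) for s in examples}
for sentence, (base, shortcut) in results.items():
    print(f"{sentence!r}: base={base}, shortcut={shortcut}")
```

    The shortcut handles the neutral sentence correctly but overrides the clearly positive “I love Islam”, which is the kind of interference with the original task described above.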

    In short, operationalizing concepts is difficult because users often cannot foresee all concept boundaries and interactions in advance. One option is to ask domain experts to produce data that illustrates the concept as completely and accurately as possible, such as the GLUE diagnostics dataset or the FraCaS test suite. These datasets, however, are often expensive to create, small (and therefore unsuitable for training), and incomplete, since even experts sometimes overlook important aspects and nuances of a concept. Another option is adversarial training or adaptive testing, where users provide data incrementally while receiving feedback from the model. These approaches can expose and fix model flaws without requiring users to plan everything ahead of time.

    However, neither adversarial training nor adaptive testing deals directly with the notion of concepts, nor with how one concept interacts with another or with the original data. Users may not explore concept boundaries thoroughly, so they cannot tell when a concept has been adequately covered or whether they have introduced interference with other concepts. In this study, researchers from Microsoft describe Collaborative Development of NLP Models (CoDev). Instead of relying on a single user, CoDev draws on the combined expertise of multiple users to cover a wide range of topics.

    CoDev relies on the observation that models exhibit simpler behavior in small regions. It trains a local model for each concept, in addition to a global model that incorporates the original data and all added concepts. An LLM is then prompted to generate examples on which the local and global models disagree. Such examples are either cases where the local model is not yet fully specified or cases where the global model still makes errors on the concept because of overfitting or reliance on shortcuts. As users label these examples, both models are updated until convergence, i.e., until the concept has been learned in a way that does not conflict with the earlier data or with other concepts (Figure 1).

    Figure 1: CoDev loop for operationalizing a single concept. (a) The user starts by providing some seed data from the concept and their labels, and (b) these are used to learn a local concept model. (c) GPT-3 is then prompted to generate new examples, prioritizing examples where the local model disagrees with the global model. (d) Actual disagreements are shown to the user for labeling, and (e) each label improves either the local or the global model. The loop c-d-e is repeated until convergence, i.e., until the user has operationalized the concept and the global model has learned it.
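
    The loop in Figure 1 can be sketched in a few lines of Python. This is a toy under stated assumptions, not the authors' implementation: `TinyModel` is a hypothetical word-vote classifier that memorizes labeled sentences, `generate` returns a fixed candidate list in place of GPT-3, and `ask_user` simulates the human annotator. For simplicity, each new label is given to both models, so whichever one was wrong is corrected.

```python
import re
from collections import Counter, defaultdict


def tokens(sentence):
    return re.findall(r"[a-z]+", sentence.lower())


class TinyModel:
    """Toy classifier: memorizes labeled sentences, otherwise votes by word."""

    def __init__(self, default="neutral"):
        self.memory = {}
        self.word_votes = defaultdict(Counter)
        self.default = default

    def update(self, sentence, label):
        self.memory[sentence] = label
        for word in tokens(sentence):
            self.word_votes[word][label] += 1

    def predict(self, sentence):
        if sentence in self.memory:
            return self.memory[sentence]
        votes = Counter()
        for word in tokens(sentence):
            votes.update(self.word_votes.get(word, Counter()))
        return votes.most_common(1)[0][0] if votes else self.default


def codev_loop(local_model, global_model, generate, ask_user, max_rounds=50):
    """Repeat steps (c)-(d)-(e) until the two models agree on all candidates."""
    labeled = []
    for _ in range(max_rounds):
        # (c) keep only candidates on which the two models disagree
        disagreements = [s for s in generate()
                         if local_model.predict(s) != global_model.predict(s)]
        if not disagreements:
            break  # converged
        for sentence in disagreements:
            if local_model.predict(sentence) == global_model.predict(sentence):
                continue  # already resolved by an earlier label this round
            label = ask_user(sentence)            # (d) the user labels it
            local_model.update(sentence, label)   # (e) the label corrects
            global_model.update(sentence, label)  #     whichever model was wrong
            labeled.append((sentence, label))
    return labeled


# (a)/(b) original data for the global model, seed data for the local model
global_model = TinyModel()
for s, y in [("I love this movie", "positive"), ("I hate this movie", "negative"),
             ("That Islam documentary was awful", "negative")]:
    global_model.update(s, y)
local_model = TinyModel()
for s, y in [("She studies Islam", "neutral"), ("I am Muslim", "neutral")]:
    local_model.update(s, y)

CANDIDATES = ["Islam is a religion", "I love Islam",
              "He studies Islam", "I hate this church"]


def generate():
    return CANDIDATES  # stand-in for LLM generation


def ask_user(sentence):
    words = set(tokens(sentence))  # simulated annotator
    if "love" in words:
        return "positive"
    if words & {"hate", "awful"}:
        return "negative"
    return "neutral"


labeled = codev_loop(local_model, global_model, generate, ask_user)
print(labeled)
```

    Because every labeled sentence is memorized by both models, the loop stops after at most one label per candidate; a real system would retrain the models and generate fresh candidates each round.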

    Each local model is a cheap specialist in its own concept and keeps evolving. The fast predictions of the local model and the diverse examples generated by the LLM let users explore the boundaries between concepts and the existing data, an exploration that would be difficult for users to carry out on their own. The experimental results demonstrate the effectiveness of CoDev in operationalizing concepts and handling interference. The researchers first show that CoDev outperforms AdaTest, a state-of-the-art GPT-3-based tool for debugging NLP models, by identifying and fixing problems more thoroughly. They then show that CoDev outperforms a model that relies solely on data collection, operationalizing concepts even when the user starts with biased data.

    Using a simplified version of CoDev, in which samples are selected iteratively from a pool of unlabeled data instead of being generated by GPT-3, the researchers compare CoDev's data selection strategy with random selection and uncertainty sampling. They show that CoDev outperforms both baselines when teaching a sentiment analysis model about Amazon product reviews and when teaching an NLI model about downward- and upward-monotone concepts. Finally, a pilot study showed that CoDev helped users refine their concepts.
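
    The three selection strategies compared here can be contrasted with a short Python sketch. The disagreement score below (absolute difference between local and global probabilities) is our reading of “prioritizing examples where the local model disagrees with the global model”, not a formula taken from the paper; the pool and its probabilities are made up for illustration.

```python
import random


def select_random(pool, k, rng):
    """Baseline 1: pick k items uniformly at random."""
    return rng.sample(pool, k)


def select_uncertain(pool, k, global_prob):
    """Baseline 2: uncertainty sampling, i.e. pick the items whose
    global-model probability is closest to 0.5."""
    return sorted(pool, key=lambda x: abs(global_prob[x] - 0.5))[:k]


def select_by_disagreement(pool, k, local_prob, global_prob):
    """CoDev-style selection (assumed scoring): pick the items where the
    local and global models differ the most."""
    return sorted(pool, key=lambda x: abs(local_prob[x] - global_prob[x]),
                  reverse=True)[:k]


# Hypothetical probabilities of the positive class for four unlabeled items.
pool = ["a", "b", "c", "d"]
local_prob = {"a": 0.9, "b": 0.5, "c": 0.2, "d": 0.95}
global_prob = {"a": 0.1, "b": 0.5, "c": 0.45, "d": 0.9}

random_pick = select_random(pool, 2, random.Random(0))
uncertain_pick = select_uncertain(pool, 2, global_prob)
disagreement_pick = select_by_disagreement(pool, 2, local_prob, global_prob)
print(random_pick, uncertain_pick, disagreement_pick)
```

    In this toy pool, uncertainty sampling skips item "a" because the global model is confident about it, while disagreement-based selection ranks it first because the local model strongly contradicts that confident prediction.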




    Aneesh Tickoo is a consulting intern at MarktechPost. He is currently pursuing his undergraduate degree in Data Science and Artificial Intelligence at the Indian Institute of Technology (IIT), Bhilai. He spends most of his time working on projects aimed at harnessing the power of machine learning. His research interest is image processing, and he is passionate about building solutions around it. He loves to connect with people and collaborate on interesting projects.

