    The Alignment Problem Is Not New – O’Reilly


    “Mitigating the risk of extinction from A.I. should be a global priority alongside other societal-scale risks, such as pandemics and nuclear war,” according to a statement signed by more than 350 business and technical leaders, including the developers of today’s most important AI platforms.

    Among the possible risks leading to that outcome is what is known as “the alignment problem.” Will a future super-intelligent AI share human values, or might it consider us an obstacle to fulfilling its own goals? And even if AI is still subject to our wishes, might its creators (or its users) make an ill-considered wish whose consequences turn out to be catastrophic, like the wish of fabled King Midas that everything he touches turn to gold? Oxford philosopher Nick Bostrom, author of the book Superintelligence, once posited as a thought experiment an AI-managed factory given the command to optimize the production of paperclips. The “paperclip maximizer” comes to monopolize the world’s resources and eventually decides that humans are in the way of its master objective.



    Far-fetched as that sounds, the alignment problem is not just a far-future consideration. We have already created a race of paperclip maximizers. Science fiction writer Charlie Stross has noted that today’s corporations can be thought of as “slow AIs.” And much as Bostrom feared, we have given them an overriding command: to increase corporate profits and shareholder value. The consequences, like those of Midas’s touch, aren’t pretty. Humans are seen as a cost to be eliminated. Efficiency, not human flourishing, is maximized.

    In pursuit of this overriding goal, our fossil fuel companies continue to deny climate change and hinder attempts to switch to alternative energy sources, drug companies peddle opioids, and food companies encourage obesity. Even once-idealistic internet companies have been unable to resist the master objective, and in pursuing it have created addictive products of their own, sown disinformation and division, and resisted attempts to restrain their behavior.

    Even if this analogy seems far-fetched to you, it should give you pause when you think about the problems of AI governance.

    Corporations are nominally under human control, with human executives and governing boards responsible for strategic direction and decision-making. Humans are “in the loop,” and generally speaking, they make efforts to restrain the machine, but as the examples above show, they often fail, with disastrous results. The efforts at human control are hobbled because we have given the humans the same reward function as the machine they are asked to govern: we compensate executives, board members, and other key employees with options to profit richly from the stock whose value the corporation is tasked with maximizing. Attempts to add environmental, social, and governance (ESG) constraints have had only limited impact. As long as the master objective remains in place, ESG too often remains something of an afterthought.

    Much as we fear a superintelligent AI might do, our corporations resist oversight and regulation. Purdue Pharma successfully lobbied regulators to limit the risk warnings planned for doctors prescribing Oxycontin and marketed this dangerous drug as non-addictive. While Purdue eventually paid a price for its misdeeds, the damage had largely been done and the opioid epidemic rages unabated.

    What might we learn about AI regulation from failures of corporate governance?

    1. AIs are created, owned, and managed by corporations, and will inherit their objectives. Unless we change corporate objectives to embrace human flourishing, we have little hope of building AI that will do so.
    2. We need research on how best to train AI models to satisfy multiple, sometimes conflicting goals rather than optimizing for a single goal. ESG-style concerns can’t be an add-on, but must be intrinsic to what AI developers call the reward function. As Microsoft CEO Satya Nadella once said to me, “We [humans] don’t optimize. We satisfice.” (This idea goes back to Herbert Simon’s 1956 book Administrative Behavior.) In a satisficing framework, an overriding goal may be treated as a constraint, but multiple goals are always in play. As I once described this theory of constraints, “Money in a business is like gas in your car. You need to pay attention so you don’t end up on the side of the road. But your trip is not a tour of gas stations.” Profit should be an instrumental goal, not a goal in and of itself. And as to our actual goals, Satya put it well in our conversation: “the moral philosophy that guides us is everything.”
    3. Governance is not a “once and done” exercise. It requires constant vigilance, and adaptation to new circumstances at the speed at which those circumstances change. You have only to look at the slow response of bank regulators to the rise of CDOs and other mortgage-backed derivatives in the runup to the 2009 financial crisis to understand that time is of the essence.
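
    The contrast in point 2 between optimizing a single goal and satisficing across several can be sketched in a few lines of Python. This is only an illustrative toy, not a description of how any real AI system is trained: the `Plan` class, the three scores, and the threshold values are all hypothetical, chosen to show how treating profit as a floor rather than a target changes which option is picked.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    """A hypothetical business plan; all scores are made up, higher is better."""
    name: str
    profit: float
    wellbeing: float
    environment: float


def maximize_profit(plans):
    """Single-objective optimizer: every goal except profit is ignored."""
    return max(plans, key=lambda p: p.profit)


def satisfice(plans, thresholds):
    """Return the first plan that is good enough on every goal, else None.

    Each goal, profit included, is a constraint (a floor to clear),
    not a quantity to push as high as possible.
    """
    for plan in plans:
        if all(getattr(plan, goal) >= floor for goal, floor in thresholds.items()):
            return plan
    return None


# Hypothetical options and aspiration levels.
plans = [
    Plan("cut every cost", profit=9.0, wellbeing=2.0, environment=1.0),
    Plan("balanced", profit=6.0, wellbeing=7.0, environment=6.0),
    Plan("charity mode", profit=1.0, wellbeing=9.0, environment=9.0),
]
thresholds = {"profit": 5.0, "wellbeing": 5.0, "environment": 5.0}

print(maximize_profit(plans).name)        # the profit maximizer's choice
print(satisfice(plans, thresholds).name)  # the satisficer's choice
```

    With these made-up numbers, the maximizer selects the plan with the highest profit regardless of its other scores, while the satisficer passes over it because it fails the wellbeing and environment floors. Profit still matters to the satisficer (a plan that runs out of gas is rejected too), but it is one constraint among several.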

    OpenAI CEO Sam Altman has begged for government regulation, but tellingly, has suggested that such regulation apply only to future, more powerful versions of AI. This is a mistake. There is much that can be done right now.

    We should require registration of all AI models above a certain level of power, much as we require corporate registration. And we should define current best practices in the management of AI systems and make them mandatory, subject to regular, consistent disclosures and auditing, much as we require public companies to regularly disclose their financials.

    The work that Timnit Gebru, Margaret Mitchell, and their coauthors have done on the disclosure of training data (“Datasheets for Datasets”) and the performance characteristics and risks of trained AI models (“Model Cards for Model Reporting”) is a good first draft of something much like the Generally Accepted Accounting Principles (and their equivalents in other countries) that guide US financial reporting. Might we call them “Generally Accepted AI Management Principles”?

    It’s essential that these principles be created in close cooperation with the creators of AI systems, so that they reflect actual best practice rather than a set of rules imposed from without by regulators and advocates. But they can’t be developed solely by the tech companies themselves. In his book Voices in the Code, David G. Robinson (now Director of Policy for OpenAI) points out that every algorithm makes moral choices, and explains why those choices must be hammered out in a participatory and accountable process. There is no perfectly efficient algorithm that gets everything right. Listening to the voices of those affected can radically change our understanding of the outcomes we are seeking.

    But there’s another factor too. OpenAI has said that “Our alignment research aims to make artificial general intelligence (AGI) aligned with human values and follow human intent.” Yet many of the world’s ills are the result of the difference between stated human values and the intent expressed by actual human choices and actions. Justice, fairness, equity, respect for truth, and long-term thinking are all in short supply. An AI model such as GPT-4 has been trained on a vast corpus of human speech, a record of humanity’s thoughts and feelings. It is a mirror. The biases that we see there are our own. We need to look deeply into that mirror, and if we don’t like what we see, we need to change ourselves, not just adjust the mirror so it shows us a more pleasing picture!

    To be sure, we don’t want AI models to be spouting hatred and misinformation, but simply fixing the output is insufficient. We have to reconsider the input, both in the training data and in the prompting. The quest for effective AI governance is an opportunity to interrogate our values and to remake our society in line with the values we choose. The design of an AI that will not destroy us may be the very thing that saves us in the end.
