Close Menu
Ztoog
    What's Hot
    Technology

    SEC filing: Alphabet sold ~90% of its stake in Robinhood as of June 30, as it pares back large positions in multiple publicly traded firms, including Duolingo (Rohan Goswami/CNBC)

    AI

    How AI is changing gymnastics judging 

    Crypto

    Token Plummets 60% Following Binance Delisting

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

    • Technology

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » Artificial Intelligence (AI) Researchers from Cornell University Propose a Novel Neural Network Framework to Address the Video Matting Problem
    AI

    Artificial Intelligence (AI) Researchers from Cornell University Propose a Novel Neural Network Framework to Address the Video Matting Problem

    Facebook Twitter Pinterest WhatsApp
    Artificial Intelligence (AI) Researchers from Cornell University Propose a Novel Neural Network Framework to Address the Video Matting Problem
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Image and video modifying are two of the hottest purposes for laptop customers. With the creation of Machine Learning (ML) and Deep Learning (DL), picture and video modifying have been progressively studied by means of a number of neural community architectures. Until very not too long ago, most DL fashions for picture and video modifying have been supervised and, extra particularly, required the coaching knowledge to comprise pairs of enter and output knowledge to be used for studying the particulars of the desired transformation. Lately, end-to-end studying frameworks have been proposed, which require as enter solely a single picture to be taught the mapping to the desired edited output. 

    Video matting is a particular activity belonging to video modifying. The time period “matting “dates again to the nineteenth century when glass plates of matte paint have been set in entrance of a digicam throughout filming to create the phantasm of an atmosphere that was not current at the filming location. Nowadays, the composition of a number of digital photographs follows comparable proceedings. A composite formulation is exploited to shade the depth of the foreground and background of every picture, expressed as a linear mixture of the two parts. 

    Although actually highly effective, this course of has some limitations. It requires an unambiguous factorization of the picture into foreground and background layers, that are then assumed to be independently treatable. In some conditions like video matting, therefore a sequence of temporal- and spatial-dependent frames, the layers decomposition turns into a complicated activity.

    🚀 Build high-quality coaching datasets with Kili Technology and remedy NLP machine studying challenges to develop highly effective ML purposes

    This paper’s targets are the enlightenment of this course of and growing decomposition accuracy. The authors suggest issue matting, a variant of the matting drawback that elements video into extra impartial parts for downstream modifying duties. To deal with this drawback, they then current FactorMatte, an easy-to-use framework that mixes classical matting priors with conditional ones based mostly on anticipated deformations in a scene. The traditional Bayes formulation, for example, referring to the estimation of the most a posteriori likelihood, is prolonged to take away the limiting assumption on the independence of foreground and background. The majority of the approaches moreover assume that background layers stay static over time, which is significantly limiting for many video sequences.

    To overcome these limitations, FactorMatte depends on two modules: a decomposition community that elements the enter video into a number of layers for every part and a set of patch-based discriminators that characterize conditional priors on every part. The structure pipeline is depicted beneath.

    The enter to the decomposition community consists by a video and a tough segmentation masks for the object of curiosity body by body (left, yellow field). With this info, the community produces layers of shade and alpha (center, inexperienced and blue packing containers) based mostly on a reconstruction loss. The foreground layer fashions the foreground part (proper, inexperienced

    field), whereas the atmosphere layer and residual layer collectively mannequin the background part (proper, blue field). The atmosphere layer represents the static-like facets of the background, whereas the residual layer captures extra irregular modifications in the background part due to interactions with the foreground objects (the pillow deformation in the determine). For every of those layers, one discriminator has been educated to be taught the respective marginal priors. 

    The matting end result for some chosen samples is introduced in the determine beneath.

    Although FactorMatte shouldn’t be excellent, the produced outcomes are clearly extra correct than the baseline strategy (OmniMatte). In all given samples, background and foreground layers current a clear separation between one another, which cannot be asserted for the in contrast answer. Furthermore, ablation research have been performed to show the effectiveness of the proposed answer.

    This was the abstract of FactorMatte, a novel framework to deal with the video matting drawback. If you have an interest, you will discover extra info in the hyperlinks beneath.


    Check out the paper, code, and mission All Credit For This Research Goes To Researchers on This Project. Also, don’t overlook to be a part of our Reddit web page and discord channel, the place we share the newest AI analysis information, cool AI tasks, and extra.


    Daniele Lorenzi acquired his M.Sc. in ICT for Internet and Multimedia Engineering in 2021 from the University of Padua, Italy. He is a Ph.D. candidate at the Institute of Information Technology (ITEC) at the Alpen-Adria-Universität (AAU) Klagenfurt. He is at present working in the Christian Doppler Laboratory ATHENA and his analysis pursuits embrace adaptive video streaming, immersive media, machine studying, and QoS/QoE analysis.


    🔥 Gain a aggressive
    edge with knowledge: Actionable market intelligence for international manufacturers, retailers, analysts, and traders. (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Technology

    Prosecutors Turn Sam Bankman-Fried’s Own Words Against Him

    When the cryptocurrency trade FTX was driving excessive, Sam Bankman-Fried, the corporate founder, communicated with…

    AI

    UCL and Imperial College London Researchers Unveil Energy-Efficient Machine Learning through Task-Adaptive Reservoir Computing

    Conventional computer systems use plenty of power; they make up round 10% of the world’s…

    Crypto

    GameStop bought $500 million of bitcoin

    GameStop introduced on Wednesday that it bought 4,710 bitcoin, which is price over $500 million…

    Technology

    OnePlus 12 Deals: Save Up to $800 Off with Trade-In Plus Instant Savings Directly Through OnePlus

    See at Amazon OnePlus 12 at Amazon Save up to $401 in trade-in credit score…

    Technology

    What to know about Zuranolone, the first FDA-approved pill for postpartum depression

    The US Food and Drug Administration has accredited the first pill that can be utilized…

    Our Picks
    Technology

    Unico’s Battery Testing Enters a Competitive Industry

    Technology

    A new generation of storm chasers takes on Mother Nature in Twisters trailer

    Technology

    How military-grade AI, developed by US defense contractors for intelligence, has been repurposed to combat labor organizing, find internal leakers, and more (Wired)

    Categories
    • AI (1,493)
    • Crypto (1,754)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,867)
    • Technology (1,803)
    • The Future (1,649)
    Most Popular
    AI

    Can LLM Already Serve as A Database Interface? Meet BIRD: A Big Bench for Large-scale Database Grounded Text-to-SQLs

    Mobile

    Apple iPhone 15 Pro Max vs iPhone 14 Pro Max: expected differences

    Technology

    Banned robocallers receive record-breaking $300 million fine for auto-warranty scam enterprise

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.