Close Menu
Ztoog
    What's Hot
    Crypto

    FTX misused customer funds, accounting expert who assisted in Enron prosecution testifies

    Technology

    Sources: the Irish DPC is set to hand Meta a record EU privacy fine and order the company to stop all data transfers to the US that rely on certain clauses (Stephanie Bodoni/Bloomberg)

    Mobile

    Indian government moves to ban ProtonMail after bomb threat

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

      LiberNovo Omni: The World’s First Dynamic Ergonomic Chair

      Common Security Mistakes Made By Businesses and How to Avoid Them

    • Technology

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

      5 Skills Kids (and Adults) Need in an AI World – O’Reilly

      How To Come Back After A Layoff

    • Gadgets

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

      The market’s down, but this OpenAI for the stock market can help you trade up

    • Mobile

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

      Forget screens: more details emerge on the mysterious Jony Ive + OpenAI device

      Android 16 QPR1 lets you check what fingerprints you’ve enrolled on your Pixel phone

    • Science

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

      AI Is Eating Data Center Power Demand—and It’s Only Getting Worse

      Liquid physics: Inside the lab making black hole analogues on Earth

    • AI

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

      How AI is introducing errors into courtrooms

    • Crypto

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

      Senate advances GENIUS Act after cloture vote passes

      Is Bitcoin Bull Run Back? Daily RSI Shows Only Mild Bullish Momentum

    Ztoog
    Home » When SAM Meets NeRF: This AI Model Can Segment Anything in 3D
    AI

    When SAM Meets NeRF: This AI Model Can Segment Anything in 3D

    Facebook Twitter Pinterest WhatsApp
    When SAM Meets NeRF: This AI Model Can Segment Anything in 3D
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    We are all amazed by the generative AI developments not too long ago, however that doesn’t imply we don’t get any vital breakthroughs in different purposes. For instance, the pc imaginative and prescient area has been seeing comparatively speedy developments not too long ago as effectively. The Segment Anything Model (SAM) launch by Meta was an enormous success and adjusted the sport in 2D picture segmentation solely. 

    In picture segmentation, the purpose is to detect and type of “paint” all of the objects in the scene. Usually, that is finished by coaching a mannequin on a dataset of objects we need to segmentize. Then, we are able to use the mannequin to phase the very objects in completely different pictures. However, the principle drawback right here is that the mannequin is bounded by the objects we present it throughout the coaching; and it can’t segmentize unseen objects.

    With SAM, that is modified. SAM is the primary mannequin that might segmentize something, actually. This is achieved by coaching the SAM on large-scale information and giving it the flexibility to carry out zero-shot segmentation throughout varied types of picture information. It is designed to mechanically phase objects of curiosity in pictures, no matter their form, dimension, or look. SAM has demonstrated outstanding efficiency in segmenting objects in 2D pictures, revolutionizing the sector of pc imaginative and prescient.

    🚀 JOIN the quickest ML Subreddit Community

    Of course, individuals didn’t merely cease there. They began engaged on methods to increase SAM’s capabilities past 2D. However, a key query has remained unanswered: Can SAM’s segmentation means be prolonged to 3D, thereby bridging the hole between 2D and 3D notion attributable to information shortage? The reply is trying like sure, and it’s time to meet with SA3D.

    SA3D leverages developments in Neural Radiance Fields (NeRF) and the SAM mannequin to revolutionize 3D segmentation. NeRF has emerged as probably the most standard 3D representations in latest years. NeRF builds connections between sparse 2D pictures and actual 3D factors by means of differentiable quantity rendering. It has seen quite a few enhancements, making it a strong device for tackling the challenges of 3D notion.

    There have been some makes an attempt to increase NeRF-based methods for 3D segmentation. These approaches concerned coaching an extra function area aligned with a pre-trained 2D visible spine. While efficient, these strategies undergo from limitations corresponding to excessive reminiscence footprint, artifacts in radiance fields affecting function fields, and inefficiency as a result of want for coaching an extra function area for each scene.

    This is the place SA3D comes into play. Unlike earlier strategies, SA3D doesn’t require coaching an extra function area. Instead, it leverages the ability of SAM and NeRF to phase desired objects from all views mechanically.

    SA3D works by taking user-specified prompts from a single rendered view to provoke the segmentation course of. The segmentation maps generated by SAM are then projected onto 3D masks grids utilizing density-guided inverse rendering, offering preliminary 3D segmentation outcomes. To refine the segmentation, incomplete 2D masks from different views are rendered and used as cross-view self-prompts. These masks are fed into SAM to generate refined masks, that are then projected onto the 3D masks grids. This iterative course of permits for the technology of full 3D segmentation outcomes.

    Overview of how SA3D works. Source: https://arxiv.org/abs/2304.12308

    SA3D gives a number of benefits over earlier approaches. It can simply adapt to any pre-trained NeRF mannequin with out the necessity for adjustments or re-training, making it extremely suitable and adaptable. The whole segmentation course of with SA3D is environment friendly, taking roughly two minutes with out requiring engineering optimization. This velocity makes SA3D a sensible resolution for real-world purposes. Moreover, experimental outcomes have demonstrated that SA3D can generate fine-grained segmentation outcomes for varied forms of 3D objects, opening up new potentialities for purposes corresponding to robotics, augmented actuality, and digital actuality.


    Check out the Paper, Project, and Github hyperlink. Don’t overlook to hitch our 21k+ ML SubReddit, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI tasks, and extra. If you have got any questions relating to the above article or if we missed something, be happy to e mail us at Asif@marktechpost.com

    🚀 Check Out 100’s AI Tools in AI Tools Club


    Ekrem Çetinkaya acquired his B.Sc. in 2018 and M.Sc. in 2019 from Ozyegin University, Istanbul, Türkiye. He wrote his M.Sc. thesis about picture denoising utilizing deep convolutional networks. He is at present pursuing a Ph.D. diploma on the University of Klagenfurt, Austria, and dealing as a researcher on the ATHENA undertaking. His analysis pursuits embody deep studying, pc imaginative and prescient, and multimedia networking.


    ➡️ Meet Bright Data: The World’s #1 Web Data Platform

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    AI

    Study shows vision-language models can’t handle queries with negation words | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Science

    Future of fusion: How the UK’s JET reactor paved the way for ITER

    Earlier this 12 months, the Joint European Torus (JET) turned 40. JET is a fusion…

    Gadgets

    Morphobot, The Versatile Transformative Robot Revolutionizing Mobility

    In a leap in the direction of cutting-edge robotics, the California Institute of Technology (Caltech)…

    Technology

    Meta AI tested: Doesn’t quite justify its own existence, but free is free

    Meta’s new massive language mannequin, Llama 3, powers the imaginatively named “Meta AI,” a newish…

    Gadgets

    7 Best Bike Locks (2023): U-Locks, Chain Locks, and Tips

    Whichever lock you go together with, make sure that it will probably loop round your…

    Technology

    Creating Domestic Robots That Really Help

    Episode 2: How Labrador and iRobot Create Domestic Robots That Really HelpEvan Ackerman: I’m Evan…

    Our Picks
    AI

    LMSYS ORG Introduces Arena-Hard: A Data Pipeline to Build High-Quality Benchmarks from Live Data in Chatbot Arena, which is a Crowd-Sourced Platform for LLM Evals

    Science

    Big Pharma hiked the price of 775 drugs this year so far: Report

    Technology

    Determinism vs. free will: A scientific showdown

    Categories
    • AI (1,492)
    • Crypto (1,752)
    • Gadgets (1,804)
    • Mobile (1,849)
    • Science (1,864)
    • Technology (1,801)
    • The Future (1,647)
    Most Popular
    The Future

    Unlock Donghua Jinlong’s food grade glycine

    Technology

    Elon Musk, Video Game King? Well, Maybe Not.

    AI

    Three ways we can fight deepfake porn

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.