Ztoog
    Research at Stanford Introduces PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking


    Large-scale annotated datasets have served as a highway for building accurate models in many computer vision tasks. In this study, the researchers aim to provide such a highway for fine-grained long-range tracking. Fine-grained long-range tracking aims to follow the matching world surface point for as long as possible, given any pixel location in any frame of a video. There are several generations of datasets aimed at fine-grained short-range tracking (e.g., optical flow) and regularly updated datasets aimed at various forms of coarse-grained long-range tracking (e.g., single-object tracking, multi-object tracking, video object segmentation). However, only a handful of works sit at the interface between these two types of tracking.
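
    To make the task concrete, the sketch below follows one query pixel through a video by chaining short-range flow fields frame by frame. This is a naive baseline for illustration only, not a method from the paper: the names `chain_flow` and `constant_flow` are hypothetical, and the flow fields are toy data rather than the output of a real optical flow model.

```python
from typing import List, Tuple

# flow[y][x] = (dx, dy): displacement of pixel (x, y) from frame t to frame t+1
Flow = List[List[Tuple[float, float]]]


def chain_flow(flows: List[Flow], x: float, y: float) -> List[Tuple[float, float]]:
    """Follow one query pixel through a video by chaining per-frame flow."""
    track = [(x, y)]
    for flow in flows:
        height, width = len(flow), len(flow[0])
        # Nearest-neighbour lookup, clamped to the image bounds.
        xi = min(max(int(round(x)), 0), width - 1)
        yi = min(max(int(round(y)), 0), height - 1)
        dx, dy = flow[yi][xi]
        x, y = x + dx, y + dy
        track.append((x, y))
    return track


def constant_flow(height: int, width: int, dx: float, dy: float) -> Flow:
    """Toy flow field in which every pixel moves by the same (dx, dy)."""
    return [[(dx, dy) for _ in range(width)] for _ in range(height)]


# Three flow fields describe a 4-frame video; every pixel moves one pixel right.
flows = [constant_flow(4, 6, 1.0, 0.0) for _ in range(3)]
track = chain_flow(flows, x=1.0, y=2.0)
print(track)
```

    In a chain like this, any error in one step is carried into every later position, and the chain has no way to recover a point after it has been occluded. Both limitations are part of why long-range tracking is treated as its own problem.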

    Researchers have already tested fine-grained trackers on real-world videos with sparse human-provided annotations (BADJA and TAP-Vid) and trained them on unrealistic synthetic data (FlyingThings++ and Kubric-MOVi-E), which consists of random objects moving in unexpected directions on random backdrops. While it is intriguing that these models can generalize to real videos, such basic training prevents the development of long-range temporal context and scene-level semantic awareness. The authors contend that long-range point tracking should not be treated as an extension of optical flow, where naturalism can be abandoned without negative consequences.

    While a video's pixels may appear to move somewhat randomly, their paths reflect several modellable factors, such as camera shake, object-level motions and deformations, and multi-object relationships, including social and physical interactions. Progress depends on recognizing the scale of the problem, in terms of both data and methodology. Researchers from Stanford University propose PointOdyssey, a large synthetic dataset for training and evaluating long-term fine-grained tracking. Their collection aims to capture the complexity, diversity, and realism of real-world video, with the pixel-perfect annotation that only simulation makes possible.

    They use motions, scene layouts, and camera trajectories mined from real-world videos and motion captures (as opposed to random or hand-designed ones), which distinguishes their work from prior synthetic datasets. They also apply domain randomization to various scene attributes, such as environment maps, lighting, human and animal bodies, camera trajectories, and materials. Thanks to advances in the availability of high-quality content and rendering technologies, they can offer more photorealism than was previously achievable. The motion profiles in their data are derived from large human and animal motion capture datasets. They use these captures to generate realistic long-range trajectories for humanoids and other animals in outdoor scenes.
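
    Domain randomization, as described above, can be pictured as drawing each scene attribute at random before rendering. The following is a minimal sketch of that idea under stated assumptions: `SceneConfig`, `sample_scene`, the attribute lists, and the lighting range are all hypothetical placeholders, not the authors' pipeline or asset names.

```python
import random
from dataclasses import dataclass


@dataclass
class SceneConfig:
    """One randomized scene description (hypothetical fields)."""
    environment_map: str
    light_intensity: float
    material: str
    camera_trajectory: str


# Placeholder asset names, invented for illustration only.
ENVIRONMENT_MAPS = [f"env_{i:02d}" for i in range(50)]
MATERIALS = ["matte", "glossy", "metallic"]
CAMERA_TRAJECTORIES = ["handheld_a", "handheld_b", "head_mounted"]


def sample_scene(rng: random.Random) -> SceneConfig:
    """Draw every attribute independently from its own pool or range."""
    return SceneConfig(
        environment_map=rng.choice(ENVIRONMENT_MAPS),
        light_intensity=rng.uniform(0.1, 2.0),  # assumed range, dark to bright
        material=rng.choice(MATERIALS),
        camera_trajectory=rng.choice(CAMERA_TRAJECTORIES),
    )


# A seeded generator keeps the sampled scenes reproducible.
scenes = [sample_scene(random.Random(seed)) for seed in range(5)]
repeat = [sample_scene(random.Random(seed)) for seed in range(5)]
for scene in scenes:
    print(scene)
```

    Seeding each scene separately is one possible design choice: it lets a single scene be regenerated without replaying the whole sampling sequence.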

    In outdoor scenes, they pair these actors with 3D objects scattered randomly on the ground plane. These objects respond to the actors according to physics, for example being kicked away when feet come into contact with them. For indoor scenes, they use motion captures recorded in indoor settings and manually recreate the capture environments in their simulator. This allows them to reproduce the exact motions and interactions while preserving the scene-aware character of the original data. To provide complex multi-view data of the scenes, they import camera trajectories derived from real footage and attach additional cameras to the heads of the synthetic characters. In contrast to the largely random motion patterns of Kubric and FlyingThings, they take a capture-driven approach.

    Their data is intended to stimulate the development of tracking methods that move beyond the traditional reliance on bottom-up cues alone, such as feature matching, and use scene-level cues to provide strong priors on tracks. A large collection of simulated assets, including 42 humanoid forms with artist-created textures, 7 animals, 1K+ object/background textures, 1K+ objects, 20 original 3D scenes, and 50 environment maps, gives their data its visual diversity. To create a range of dark and bright scenes, they randomize the scene lighting. They also add dynamic fog and smoke effects, introducing a kind of partial occlusion that FlyingThings and Kubric completely lack. One of the new problems that PointOdyssey opens up is how to make use of long-range temporal context.

    For instance, the state-of-the-art tracking algorithm Persistent Independent Particles (PIPs) has an 8-frame temporal window. As a first step towards using arbitrarily long temporal context, they propose several modifications to PIPs, including greatly expanding its 8-frame temporal scope and adding a template-update mechanism. According to the experimental findings, their solution outperforms all others in tracking accuracy, both on the PointOdyssey test set and on real-world benchmarks. In conclusion, the main contribution of this study is PointOdyssey, a large synthetic dataset for long-term point tracking that tries to reflect the difficulties, and opportunities, of real-world fine-grained tracking.
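
    To illustrate what a fixed 8-frame temporal window means in practice, the sketch below covers a longer video by running a windowed tracker repeatedly, seeding each window with the last position from the previous one. It is an assumption-laden illustration, not PIPs itself and not the authors' modification: `track_in_windows` is a hypothetical helper and `constant_velocity_tracker` is a stand-in stub for a real model.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float]
# A window tracker takes (first_frame, end_frame_exclusive, start_point) and
# returns one position per frame in the window; the first equals start_point.
WindowTracker = Callable[[int, int, Point], List[Point]]


def track_in_windows(num_frames: int, start: Point,
                     tracker: WindowTracker, window: int = 8) -> List[Point]:
    """Cover a long video with consecutive fixed-size windows."""
    if window < 2:
        raise ValueError("window must span at least two frames")
    track = [start]
    t = 0
    while t < num_frames - 1:
        end = min(t + window, num_frames)       # window covers frames [t, end)
        positions = tracker(t, end, track[-1])
        track.extend(positions[1:])             # skip the shared seed frame
        t = end - 1                             # last frame seeds the next window
    return track


def constant_velocity_tracker(t0: int, t1: int, start: Point) -> List[Point]:
    """Stub tracker: the point moves one pixel to the right per frame."""
    x, y = start
    return [(x + float(i), y) for i in range(t1 - t0)]


track = track_in_windows(20, (0.0, 0.0), constant_velocity_tracker)
print(len(track), track[-1])
```

    Each window here only sees the previous window's final estimate, so information from earlier frames is discarded. That loss of context is the kind of limitation that a longer temporal scope is meant to address.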


    Check out the Paper, Project, and Dataset. All credit for this research goes to the researchers on this project.



    Aneesh Tickoo is a consulting intern at MarktechPost. He is currently pursuing his undergraduate degree in Data Science and Artificial Intelligence at the Indian Institute of Technology (IIT), Bhilai. He spends most of his time working on projects aimed at harnessing the power of machine learning. His research interest is image processing, and he is passionate about building solutions around it. He loves to connect with people and collaborate on interesting projects.

