    This AI Paper Proposes a Novel Pre-Training Strategy Called Privacy-Preserving MAE-Align to Effectively Combine Synthetic Data and Human-Removed Real Data


    Action recognition, the task of identifying and classifying human actions from video sequences, is a crucial field within computer vision. However, its reliance on large-scale datasets containing images of people raises significant challenges related to privacy, ethics, and data protection. These issues arise from the potential identification of individuals based on personal attributes and from data collection without explicit consent. Moreover, biases related to gender, race, or specific actions performed by certain groups can affect the accuracy and fairness of models trained on such datasets.

    In action recognition, advances in pre-training methodologies on massive video datasets have been pivotal. However, these advances come with challenges, such as ethical considerations, privacy issues, and biases inherent in datasets with human imagery. Existing approaches to tackle these issues include blurring faces, downsampling videos, or using synthetic data for training. Despite these efforts, there has been little analysis of how well privacy-preserving pre-trained models transfer their learned representations to downstream tasks. State-of-the-art models sometimes fail to predict actions accurately because of biases or a lack of diverse representations in the training data. These challenges demand novel approaches that address privacy concerns and enhance the transferability of learned representations to various action recognition tasks.

    To overcome the challenges posed by privacy concerns and biases in human-centric datasets used for action recognition, a new method was recently presented at NeurIPS 2023. The work devises a methodology to pre-train action recognition models using a combination of synthetic videos containing virtual humans and real-world videos with humans removed. With this pre-training strategy, termed Privacy-Preserving MAE-Align (PPMA), the model learns temporal dynamics from synthetic data and contextual features from real videos without humans. The approach helps address privacy and ethical concerns related to human data. It also significantly improves the transferability of learned representations to diverse downstream action recognition tasks, narrowing the performance gap between models trained with and without human-centric data.

    Concretely, the proposed PPMA method follows these key steps:

    1. Privacy-Preserving Real Data: The process begins with the Kinetics dataset, from which humans are removed using the HAT framework, resulting in the No-Human Kinetics dataset.
    2. Synthetic Data Addition: Synthetic videos from SynAPT are included, offering virtual human actions that help the model focus on temporal features.
    3. Downstream Evaluation: Six diverse tasks evaluate the model's transferability across various action recognition challenges.
    4. MAE-Align Pre-training: This two-stage strategy involves:
       • Stage 1: MAE training to predict pixel values, learning real-world contextual features.
       • Stage 2: Supervised alignment using both No-Human Kinetics and synthetic data for action label-based training.
    5. Privacy-Preserving MAE-Align (PPMA): Combining Stage 1 (MAE trained on No-Human Kinetics) with Stage 2 (alignment using both No-Human Kinetics and synthetic data), PPMA aims for robust representation learning while safeguarding privacy.
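
    Stage 1 above relies on masked autoencoding: part of the input is hidden and the model is trained to predict the hidden pixel values. The toy sketch below illustrates that objective in plain Python, assuming (as in the original MAE formulation) that the reconstruction loss is computed only on masked patches. The helper names `random_mask` and `masked_mse`, the 0.75 mask ratio, and the toy data are illustrative assumptions, not details taken from the paper.

```python
import random


def random_mask(num_patches, mask_ratio, rng):
    """Pick which patch indices are hidden from the encoder (hypothetical helper)."""
    num_masked = int(num_patches * mask_ratio)
    return sorted(rng.sample(range(num_patches), num_masked))


def masked_mse(target_patches, predicted_patches, masked_idx):
    """Mean squared pixel error, computed only over the masked patches."""
    total, count = 0.0, 0
    for i in masked_idx:
        for t, p in zip(target_patches[i], predicted_patches[i]):
            total += (t - p) ** 2
            count += 1
    return total / count if count else 0.0


rng = random.Random(0)

# Eight toy "patches" with four pixel values each (stand-ins for video patches).
target = [[float(i + j) for j in range(4)] for i in range(8)]

# A dummy "model output" that is off by exactly 1.0 on every pixel.
predicted = [[v + 1.0 for v in patch] for patch in target]

# Mask ratio of 0.75 is an assumed value for illustration only.
masked = random_mask(len(target), 0.75, rng)
loss = masked_mse(target, predicted, masked)
print(f"masked {len(masked)} of {len(target)} patches, loss = {loss}")
```

    In a real pipeline the prediction would come from a ViT encoder-decoder rather than a fixed offset; the sketch only shows how masking and the masked-only loss fit together.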

    The research team conducted experiments to evaluate the proposed approach. Using ViT-B models trained from scratch without ImageNet pre-training, they employed a two-stage process: MAE training for 200 epochs followed by supervised alignment for 50 epochs. Across six diverse tasks, PPMA outperformed other privacy-preserving methods by 2.5% in finetuning (FT) and 5% in linear probing (LP). Although slightly less effective on tasks with high scene-object bias, PPMA significantly reduced the performance gap compared to models trained on real human-centric data, showing promise in achieving robust representations while preserving privacy. Ablation experiments highlighted the effectiveness of MAE pre-training in learning transferable features, which was particularly evident when the model was finetuned on downstream tasks. Additionally, in exploring how to combine contextual and temporal features, techniques such as averaging model weights and dynamically learning mixing proportions showed potential for improving representations, opening avenues for further exploration.
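
    The weight-averaging idea mentioned above can be pictured as a linear interpolation between the parameters of two models that share an architecture. The sketch below is a minimal illustration under that assumption. The function `average_weights`, the parameter names, and the fixed mixing coefficient `alpha` are hypothetical and do not reproduce the paper's procedure.

```python
def average_weights(weights_a, weights_b, alpha=0.5):
    """Interpolate two parameter dicts: alpha * a + (1 - alpha) * b."""
    if weights_a.keys() != weights_b.keys():
        raise ValueError("models must share the same parameter names")
    mixed = {}
    for name, a in weights_a.items():
        b = weights_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for {name}")
        mixed[name] = [alpha * x + (1.0 - alpha) * y for x, y in zip(a, b)]
    return mixed


# Toy parameters standing in for a context-focused and a temporal-focused model.
context_model = {"encoder.w": [1.0, 2.0], "encoder.b": [0.0]}
temporal_model = {"encoder.w": [3.0, 6.0], "encoder.b": [1.0]}

mixed = average_weights(context_model, temporal_model, alpha=0.5)
print(mixed)
```

    Learning the mixing proportion dynamically would amount to treating `alpha` as a trainable quantity instead of a fixed constant.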

    This article introduces PPMA, a novel privacy-preserving approach for action recognition models that addresses privacy, ethics, and bias challenges in human-centric datasets. Leveraging synthetic and human-free real-world data, PPMA effectively transfers learned representations to diverse action recognition tasks, minimizing the performance gap between models trained with and without human-centric data. The experiments underscore PPMA's effectiveness in advancing action recognition while preserving privacy and mitigating the ethical concerns and biases linked to conventional datasets.


    Check out the Paper and GitHub. All credit for this research goes to the researchers of this project.



    Mahmoud is a PhD researcher in machine learning. He also holds a
    bachelor's degree in physical science and a master's degree in
    telecommunications and networking systems. His current areas of
    research concern computer vision, stock market prediction and deep
    learning. He has produced several scientific articles about person
    re-identification and the study of the robustness and stability of
    deep networks.

