Close Menu
Ztoog
    What's Hot
    Science

    Ketamine pills for depression show positive results in trial—but with caveats

    The Future

    More complaints prompt NHTSA probe of Ford 2022 Mach-E recall

    AI

    LocalMamba: Revolutionizing Visual Perception with Innovative State Space Models for Enhanced Local Dependency Capture

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      OPPO launches A5 Pro 5G: Premium features at a budget price

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

    • Technology

      What It Is and Why It Matters—Part 1 – O’Reilly

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

    • Gadgets

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Nothing is stronger than quantum connections – and now we know why

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

    • AI

      Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

    • Crypto

      Ethereum Breaks Key Resistance In One Massive Move – Higher High Confirms Momentum

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

    Ztoog
    Home » Differentially private heatmaps – Ztoog
    AI

    Differentially private heatmaps – Ztoog

    Facebook Twitter Pinterest WhatsApp
    Differentially private heatmaps – Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Posted by Badih Ghazi, Staff Research Scientist, and Nachiappan Valliappan, Staff Software Engineer, Google Research

    Recently, differential privateness (DP) has emerged as a mathematically sturdy notion of consumer privateness for information aggregation and machine studying (ML), with sensible deployments together with the 2022 US Census and in trade. Over the previous couple of years, we have now open-sourced libraries for privacy-preserving analytics and ML and have been always enhancing their capabilities. Meanwhile, new algorithms have been developed by the analysis group for a number of analytic duties involving private aggregation of knowledge.

    One such necessary information aggregation methodology is the heatmap. Heatmaps are standard for visualizing aggregated information in two or extra dimensions. They are extensively utilized in many fields together with pc imaginative and prescient, picture processing, spatial information evaluation, bioinformatics, and extra. Protecting the privateness of consumer information is important for a lot of purposes of heatmaps. For instance, heatmaps for gene microdata are primarily based on private information from people. Similarly, a heatmap of standard places in a geographic space are primarily based on consumer location check-ins that must be saved private.

    Motivated by such purposes, in “Differentially Private Heatmaps” (offered at AAAI 2023), we describe an environment friendly DP algorithm for computing heatmaps with provable ensures and consider it empirically. At the core of our DP algorithm for heatmaps is an answer to the essential downside of privately combination sparse enter vectors (i.e., enter vectors with a small variety of non-zero coordinates) with a small error as measured by the Earth Mover’s Distance (EMD). Using a hierarchical partitioning process, our algorithm views every enter vector, in addition to the output heatmap, as a chance distribution over a variety of gadgets equal to the dimension of the info. For the issue of sparse aggregation beneath EMD, we give an environment friendly algorithm with error asymptotically near the absolute best.

    Algorithm description

    Our algorithm works by privatizing the aggregated distribution (obtained by averaging over all consumer inputs), which is ample for computing a ultimate heatmap that’s private because of the post-processing property of DP. This property ensures that any transformation of the output of a DP algorithm stays differentially private. Our important contribution is a brand new privatization algorithm for the aggregated distribution, which we are going to describe subsequent.

    The EMD measure, which is a distance-like measure of dissimilarity between two chance distributions initially proposed for pc imaginative and prescient duties, is well-suited for heatmaps because it takes the underlying metric house into consideration and considers “neighboring” bins. EMD is utilized in quite a lot of purposes together with deep studying, spatial evaluation, human mobility, picture retrieval, face recognition, visible monitoring, form matching, and extra.

    To obtain DP, we have to add noise to the aggregated distribution. We would additionally prefer to protect statistics at totally different scales of the grid to attenuate the EMD error. So, we create a hierarchical partitioning of the grid, add noise at every stage, after which recombine into the ultimate DP aggregated distribution. In explicit, the algorithm has the next steps:

    1. Quadtree building: Our hierarchical partitioning process first divides the grid into 4 cells, then divides every cell into 4 subcells; it recursively continues this course of till every cell is a single pixel. This process creates a quadtree over the subcells the place the basis represents all the grid and every leaf represents a pixel. The algorithm then calculates the entire chance mass for every tree node (obtained by including up the aggregated distribution’s chances of all leaves within the subtree rooted at this node). This step is illustrated under.
      In step one, we take the (non-private) aggregated distribution (high left) and repeatedly divide it to create a quadtree. Then, we compute the entire chance mass is every cell (backside).
    2. Noise addition: To every tree node’s mass we then add Laplace noise calibrated to the use case.
    3. Truncation: To assist scale back the ultimate quantity of noise in our DP aggregated distribution, the algorithm traverses the tree ranging from the basis and, at every stage, it discards all however the high w nodes with highest (noisy) plenty along with their descendants.
    4. Reconstruction: Finally, the algorithm solves a linear program to get well the aggregated distribution. This linear program is impressed by the sparse restoration literature the place the noisy plenty are seen as (noisy) measurements of the info.
    In step 2, noise is added to every cell’s chance mass. Then in step 3, solely top-w cells are saved (inexperienced) whereas the remaining cells are truncated (purple). Finally, within the final step, we write a linear program on these high cells to reconstruct the aggregation distribution, which is now differentially private.

    Experimental outcomes

    We consider the efficiency of our algorithm in two totally different domains: real-world location check-in information and picture saliency information. We take into account as a baseline the ever present Laplace mechanism, the place we add Laplace noise to every cell, zero out any detrimental cells, and produce the heatmap from this noisy combination. We additionally take into account a “thresholding” variant of this baseline that’s extra suited to sparse information: solely maintain high t% of the cell values (primarily based on the chance mass in every cell) after noising whereas zeroing out the remaining. To consider the standard of an output heatmap in comparison with the true heatmap, we use Pearson coefficient, KL-divergence, and EMD. Note that when the heatmaps are extra comparable, the primary metric will increase however the latter two lower.

    The places dataset is obtained by combining two datasets, Gowalla and Brightkite, each of which include check-ins by customers of location-based social networks. We pre-processed this dataset to think about solely check-ins within the continental US leading to a ultimate dataset consisting of ~500,000 check-ins by ~20,000 customers. Considering the highest cells (from an preliminary partitioning of all the house right into a 300 x 300 grid) which have check-ins from not less than 200 distinctive customers, we partition every such cell into subgrids with a decision of ∆ × ∆ and assign every check-in to considered one of these subgrids.

    In the primary set of experiments, we repair ∆ = 256. We take a look at the efficiency of our algorithm for various values of ε (the privateness parameter, the place smaller ε means stronger DP ensures), starting from 0.1 to 10, by operating our algorithms along with the baseline and its variants on all cells, randomly sampling a set of 200 customers in every trial, after which computing the gap metrics between the true heatmap and the DP heatmap. The common of those metrics is offered under. Our algorithm (the purple line) performs higher than all variations of the baseline throughout all metrics, with enhancements which can be particularly vital when ε shouldn’t be too massive or small (i.e., 0.2 ≤ ε ≤ 5).

    Metrics averaged over 60 runs when various ε for the placement dataset. Shaded areas point out 95% confidence interval.

    Next, we examine the impact of various the quantity n of customers. By fixing a single cell (with > 500 customers) and ε, we fluctuate n from 50 to 500 customers. As predicted by idea, our algorithms and the baseline carry out higher as n will increase. However, the conduct of the thresholding variants of the baseline are much less predictable.

    We additionally run one other experiment the place we repair a single cell and ε, and fluctuate the decision ∆ from 64 to 256. In settlement with idea, our algorithm’s efficiency stays almost fixed for all the vary of ∆. However, the baseline suffers throughout all metrics as ∆ will increase whereas the thresholding variants often enhance as ∆ will increase.

    Effect of the variety of customers and grid decision on EMD.

    We additionally experiment on the Salicon picture saliency dataset (SALICON). This dataset is a set of saliency annotations on the Microsoft Common Objects in Context picture database. We downsized the photographs to a hard and fast decision of 320 × 240 and every [user, image] pair consists of a sequence of coordinates within the picture the place the consumer appeared. We repeat the experiments described beforehand on 38 randomly sampled photos (with ≥ 50 customers every) from SALICON. As we will see from the examples under, the heatmap obtained by our algorithm could be very near the bottom reality.

    Example visualization of various algorithms for 2 totally different pure photos from SALICON for ε = 10 and n = 50 customers. The algorithms from left to proper are: unique heatmap (no privateness), baseline, and ours.

    Additional experimental outcomes, together with these on different datasets, metrics, privateness parameters and DP fashions, could be discovered within the paper.

    Conclusion

    We offered a privatization algorithm for sparse distribution aggregation beneath the EMD metric, which in flip yields an algorithm for producing privacy-preserving heatmaps. Our algorithm extends naturally to distributed fashions that may implement the Laplace mechanism, together with the safe aggregation mannequin and the shuffle mannequin. This doesn’t apply to the extra stringent native DP mannequin, and it stays an attention-grabbing open query to plot sensible native DP heatmap/EMD aggregation algorithms for “moderate” variety of customers and privateness parameters.

    Acknowledgments

    This work was completed collectively with Junfeng He, Kai Kohlhoff, Ravi Kumar, Pasin Manurangsi, and Vidhya Navalpakkam.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Technology

    Chia Seed Water Can Help You Boost Your Hydration. The Benefits, Risks and How to Make It

    Chia seeds is likely to be small, however they pack an enormous punch. These small,…

    Technology

    An atheist chaplain visits death row

    When you hear the phrase “chaplain,” you most likely consider a priest or an Imam…

    AI

    This AI Research Introduces TinyGPT-V: A Parameter-Efficient MLLMs (Multimodal Large Language Models) Tailored for a Range of Real-World Vision-Language Applications

    The improvement of multimodal massive language fashions (MLLMs) represents a vital leap ahead. These superior…

    The Future

    Sam Altman gives up control of OpenAI Startup Fund, resolving unusual corporate venture structure

    OpenAI CEO Sam Altman has transferred formal control of the eponymously agency’s named corporate venture…

    Technology

    Remembering IEEE Director Emeritus Theodore W. Hissey

    IEEE Life Fellow Theodore W. “Ted” Hissey died on 14 October on the age of…

    Our Picks
    Technology

    Oregon’s opioid crisis: Why the state is going to recriminalize all drugs, including psychedelics like LSD, MDMA, and ketamine

    AI

    CMU Researchers Propose Test-Time Adaptation with Slot-Centric Models (Slot-TTA): A Semi-Supervised Model Equipped with a Slot-Centric Bottleneck that Jointly Segments and Reconstructs Scenes

    Crypto

    Will It Retake $28K Before August Ends?

    Categories
    • AI (1,483)
    • Crypto (1,745)
    • Gadgets (1,796)
    • Mobile (1,839)
    • Science (1,854)
    • Technology (1,790)
    • The Future (1,636)
    Most Popular
    Mobile

    CMF Buds by Nothing in for review

    The Future

    Mahbod Moghadam, who rose to fame as the co-founder of Genius, has died

    Science

    Draconid meteor shower: How to see the Draconids this October 2023

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.