Close Menu
Ztoog
    What's Hot
    Gadgets

    Streaming apps are trying to bundle their way out of customer disenchantment

    AI

    How does Bing Chat Surpass ChatGPT in Providing Up-to-Date Real-Time Knowledge? Meet Retrieval Augmented Generation (RAG)

    The Future

    Confused between No Caller ID vs. Unknown Caller? Not Anymore

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

      India-Pak conflict: Pak appoints ISI chief, appointment comes in backdrop of the Pahalgam attack

    • Technology

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

      The more Google kills Fitbit, the more I want a Fitbit Sense 3

    • Gadgets

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

      ‘Dark photon’ theory of light aims to tear up a century of physics

    • AI

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

      The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    • Crypto

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

      Speak at Ztoog Disrupt 2025: Applications now open

    Ztoog
    Home » Google at ICCV 2023 – Google Research Blog
    AI

    Google at ICCV 2023 – Google Research Blog

    Facebook Twitter Pinterest WhatsApp
    Google at ICCV 2023 – Google Research Blog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Google is proud to be a Platinum Sponsor of the International Conference on Computer Vision (ICCV 2023), a premier annual convention, which is being held this week in Paris, France. As a pacesetter in pc imaginative and prescient analysis, Google has a robust presence at this yr’s convention with 60 accepted papers and energetic involvement in 27 workshops and tutorials. Google can also be proud to be a Platinum Sponsor for the LatinX in CV workshop. We look ahead to sharing a few of our in depth pc imaginative and prescient analysis and increasing our partnership with the broader analysis neighborhood.

    Attending ICCV 2023? We hope you’ll go to the Google sales space to speak with researchers who’re actively pursuing the newest improvements in pc imaginative and prescient, and take a look at a few of the scheduled sales space actions (e.g., demos and Q&A classes listed under). Visit the @GoogleAI Twitter account to seek out out extra in regards to the Google sales space actions at ICCV 2023.

    Take a glance under to be taught extra in regards to the Google analysis being offered at ICCV 2023 (Google affiliations in daring).

    Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor

    Xinyang Liu, Yijin Li, Yanbin Teng, Hujun Bao, Guofeng Zhang, Yinda Zhang, Zhaopeng Cui

    ITI-GEN: Inclusive Text-to-Image Generation

    Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre

    ASIC: Aligning Sparse in-the-wild Image Collections

    Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar

    VQ3D: Learning a 3D-Aware Generative Model on ImageWeb

    Kyle Sargent, Jing Yu Koh, Han Zhang, Huiwen Chang, Charles Herrmann, Pratul Srinivasan, Jiajun Wu, Deqing Sun

    Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities

    Hexiang Hu, Yi Luan, Yang Chen*, Urvashi Khandelwal, Mandar Joshi, Kenton Lee, Kristina Toutanova, Ming-Wei Chang

    Sigmoid Loss for Language Image Pre-training

    Xiaohua Zhai, Basil Mustafa, Alexander Kolesnikov, Lucas Beyer

    Tracking Everything Everywhere All at Once

    Qianqian Wang, Yen-Yu Chang, Ruojin Cai, Zhengqi Li, Bharath Hariharan, Aleksander Holynski, Noah Snavely

    Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields

    Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman

    Delta Denoising Score

    Amir Hertz*, Kfir Aberman, Daniel Cohen-Or*

    DreamBooth3D: Subject-Driven Text-to-3D Generation

    Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan Barron, Yuanzhen Li, Varun Jampani

    Encyclopedic VQA: Visual Questions about Detailed Properties of Fine-grained Categories

    Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel*, Felipe Cadar*, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

    GECCO: Geometrically-Conditioned Point Diffusion Models

    Michał J. Tyszkiewicz, Pascal Fua, Eduard Trulls

    Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition

    Qitong Wang, Long Zhao, Liangzhe Yuan, Ting Liu, Xi Peng

    Neural Microfacet Fields for Inverse Rendering

    Alexander Mai, Dor Verbin, Falko Kuester, Sara Fridovich-Keil

    Rosetta Neurons: Mining the Common Units in a Model Zoo

    Amil Dravid, Yossi Gandelsman, Alexei A. Efros, Assaf Shocher

    Teaching CLIP to Count to Ten

    Roni Paiss*, Ariel Ephrat, Omer Tov, Shiran Zada, Inbar Mosseri, Michal Irani, Tali Dekel

    Vox-E: Text-guided Voxel Editing of 3D Objects

    Etai Sella, Gal Fiebelman, Peter Hedman, Hadar Averbuch-Elor

    CC3D: Layout-Conditioned Generation of Compositional 3D Scenes

    Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Xingguang Yan, Gordon Wetzstein, Leonidas Guibas, Andrea Tagliasacchi

    Delving into Motion-Aware Matching for Monocular 3D Object Tracking

    Kuan-Chih Huang, Ming-Hsuan Yang, Yi-Hsuan Tsai

    Generative Multiplane Neural Radiance for 3D-Aware Image Generation

    Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

    M2T: Masking Transformers Twice for Faster Decoding

    Fabian Mentzer, Eirikur Agustsson, Michael Tschannen

    MULLER: Multilayer Laplacian Resizer for Vision

    Zhengzhong Tu, Peyman Milanfar, Hossein Talebi

    SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

    Ligong Han*, Yinxiao Li, Han Zhang, Peyman Milanfar, Dimitris Metaxas, Feng Yang

    Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond

    Yang Zhao, Tingbo Hou, Yu-Chuan Su, Xuhui Jia, Yandong Li, Matthias Grundmann

    Unified Visual Relationship Detection with Vision and Language Models

    Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

    3D Motion Magnification: Visualizing Subtle Motions from Time-Varying Radiance Fields

    Brandon Y. Feng, Hadi Alzayer, Michael Rubinstein, William T. Freeman, Jia-Bin Huang

    Global Features are All You Need for Image Retrieval and Reranking

    Shihao Shao, Kaifeng Chen, Arjun Karpur, Qinghua Cui, André Araujo, Bingyi Cao

    Introducing Language Guidance in Prompt-Based Continual Learning

    Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Didier Stricker, Federico Tombari, Muhammad Zeshan Afzal

    Multiscale Structure Guided Diffusion for Image Deblurring

    Mengwei Ren*, Mauricio Delbracio, Hossein Talebi, Guido Gerig, Peyman Milanfar

    Robust Monocular Depth Estimation underneath Challenging Conditions

    Stefano Gasperini, Nils Morbitzer, HyunJun Jung, Nassir Navab, Federico Tombari

    Score-Based Diffusion Models as Principled Priors for Inverse Imaging

    Berthy T. Feng*, Jamie Smith, Michael Rubinstein, Huiwen Chang, Katherine L. Bouman, William T. Freeman

    Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations

    Nikolaos-Antonios Ypsilantis, Kaifeng Chen, Bingyi Cao, Mario Lipovsky, Pelin Dogan-Schonberger, Grzegorz Makosa, Boris Bluntschli, Mojtaba Seyedhosseini, Ondrej Chum, André Araujo

    U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds

    Yan Di, Chenyangguang Zhang, Ruida Zhang, Fabian Manhardt, Yongzhi Su, Jason Rambach, Didier Stricker, Xiangyang Ji, Federico Tombari

    AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control

    Ruixiang Jiang, Can Wang, Jingbo Zhang, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao

    Learning Versatile 3D Shape Generation with Improved AR Models

    Simian Luo, Xuelin Qian, Yanwei Fu, Yinda Zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, Xiangyang Xue

    Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views

    Wentian Qu, Zhaopeng Cui, Yinda Zhang, Chenyu Meng, Cuixia Ma, Xiaoming Deng, Hongan Wang

    PreSTU: Pre-Training for Scene-Text Understanding

    Jihyung Kil*, Soravit Changpinyo, Xi Chen, Hexiang Hu, Sebastian Goodman, Wei-Lun Chao, Radu Soricut

    Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects

    Baowen Zhang, Jiahe Li, Xiaoming Deng, Yinda Zhang, Cuixia Ma, Hongan Wang

    Self-regulating Prompts: Foundational Model Adaptation with out Forgetting

    Muhammad Uzair Khattak, Syed Talal Wasi, Muzammal Nasee, Salman Kha, Ming-Hsuan Yan, Fahad Shahbaz Khan

    Spectral Graphormer: Spectral Graph-Based Transformer for Egocentric Two-Hand Reconstruction utilizing Multi-View Color Images

    Tze Ho Elden Tse*, Franziska Mueller, Zhengyang Shen, Danhang Tang, Thabo Beeler, Mingsong Dou, Yinda Zhang, Sasa Petrovic, Hyung Jin Chang, Jonathan Taylor, Bardia Doosti

    Synthesizing Diverse Human Motions in 3D Indoor Scenes

    Kaifeng Zhao, Yan Zhang, Shaofei Wang, Thabo Beeler, Siyu Tang

    Tracking by 3D Model Estimation of Unknown Objects in Videos

    Denys Rozumnyi, Jiri Matas, Marc Pollefeys, Vittorio Ferrari, Martin R. Oswald

    UnLoc: A Unified Framework for Video Localization Tasks

    Shen Yan, Xuehan Xiong, Arsha Nagrani, Anurag Arnab, Zhonghao Wang*, Weina Ge, David Ross, Cordelia Schmid

    Verbs in Action: Improving Verb Understanding in Video-language Models

    Liliane Momeni, Mathilde Caron, Arsha Nagrani, Andrew Zisserman, Cordelia Schmid

    VLSlice: Interactive Vision-and-Language Slice Discovery

    Eric Slyman, Minsuk Kahng, Stefan Lee

    Yes, we CANN: Constrained Approximate Nearest Neighbors for Local Feature-Based Visual Localization

    Dror Aiger, André Araujo, Simon Lynen

    Audiovisual Masked Autoencoders

    Mariana-Iuliana Georgescu*, Eduardo Fonseca, Radu Tudor Ionescu, Mario Lucic, Cordelia Schmid, Anurag Arnab

    CLR: Channel-wise Lightweight Reprogramming for Continual Learning

    Yunhao Ge, Yuecheng Li, Shuo Ni, Jiaping Zhao, Ming-Hsuan Yang, Laurent Itti

    LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs

    Zezhou Cheng*, Carlos Esteves, Varun Jampani, Abhishek Kar, Subhransu Maji, Ameesh Makadia

    Multiscale Representation for Real-Time Anti-Aliasing Neural Rendering

    Dongting Hu, Zhenkai Zhang, Tingbo Hou, Tongliang Liu, Huan Fu, Mingming Gong

    Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs

    Frederik Warburg, Ethan Weber, Matthew Tancik, Aleksander Holynski, Angjoo Kanazawa

    Segmenting Known Objects and Unseen Unknowns with out Prior Knowledge

    Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari

    SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection

    Yichen Xie, Chenfeng Xu, Marie-Julie Rakotosaona, Patrick Rim, Federico Tombari, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan

    SwiftFormer: Efficient Additive Attention for Transformer-Based Real-time Mobile Vision Applications

    Abdelrahman Shaker, Muhammad Maa, Hanoona Rashee, Salman Kha, Ming-Hsuan Yan, Fahad Shahbaz Kha

    Agile Modeling: From Concept to Classifier in Minutes

    Otilia Stretcu, Edward Vendrow, Kenji Hata, Krishnamurthy Viswanathan, Vittorio Ferrari, Sasan Tavakkol, Wenlei Zhou, Aditya Avinash, Enming Luo, Neil Gordon Alldrin, MohammadHossein Bateni, Gabriel Berger, Andrew Bunner, Chun-Ta Lu, Javier A Rey, Giulia DeSalvo, Ranjay Krishna, Ariel Fuxman

    CAD-Estate: Large-Scale CAD Model Annotation in RGB Videos

    Kevis-Kokitsi Maninis, Stefan Popov, Matthias Niessner, Vittorio Ferrari

    Counting Crowds in Bad Weather

    Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Sy-Yen Kuo, Ming-Hsuan Yang

    DreamPose: Fashion Video Synthesis with Stable Diffusion

    Johanna Karras, Aleksander Holynski, Ting-Chun Wang, Ira Kemelmacher-Shlizerman

    InfiniCity: Infinite-Scale City Synthesis

    Chieh Hubert Lin, Hsin-Ying Lee, Willi Menapace, Menglei Chai, Aliaksandr Siarohin, Ming-Hsuan Yang, Sergey Tulyakov

    SAMPLING: Scene-Adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image

    Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    AI

    “Periodic table of machine learning” could fuel AI discovery | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Technology

    The art and science of swearing

    When you hear somebody casually drop the phrase “fuck,” what’s your response? Offended? Surprised? Confused?…

    Crypto

    Bitcoin Rebounds Strongly, Crosses $42,000 Post Fed Rate Decision

    In an announcement as we speak, the Federal Reserve has determined to uphold its benchmark…

    Science

    Amazon’s Project Kuiper satellites add to astronomers’ light-pollution woes

    Alan Dyer/Getty Images Amazon is ready to launch two satellite tv for pc prototypes for…

    The Future

    Samsung Galaxy Watch6 review: A polished and more refined fashinable choice

    In the latter half of the 12 months, Samsung launched a brand new addition to…

    Technology

    Permission denied for reentry of Varda’s orbiting experiment capsule

    Enlarge / Varda’s reentry capsule measures almost 3 toes (1 meter) in diameter, and can…

    Our Picks
    Mobile

    Wade through that busy group chat as Google assistant helps Android Auto summarize texts

    The Future

    Epic Games Store and Fortnite are coming to iPhones in 2024

    Science

    Astronomers discover new moons orbiting Uranus and Neptune

    Categories
    • AI (1,482)
    • Crypto (1,744)
    • Gadgets (1,796)
    • Mobile (1,839)
    • Science (1,853)
    • Technology (1,789)
    • The Future (1,635)
    Most Popular
    Gadgets

    Review: BYD Atto 3 | WIRED

    Crypto

    Options Traders Target $4,000 Mark Amid Market Optimism

    Technology

    QR Codes Can Hide Risky Links, F.T.C. Warns

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.