All CVPR 2021 Accepted Papers!

Invertible Denoising Network: A Light Solution for Real Noise Removal Yang Liu, Zhenyue Qin, Saeed Anwar, Pan Ji, Dongwoo Kim, Sabrina Caldwell, Tom Gedeon [pdf] [arXiv] [bibtex]

Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction Bohan Wu, Suraj Nair, Roberto Martin-Martin, Li Fei-Fei, Chelsea Finn [pdf] [supp] [bibtex]

Over-the-Air Adversarial Flickering Attacks Against Video Recognition Networks Roi Pony, Itay Naeh, Shie Mannor [pdf] [supp] [arXiv] [bibtex]

Encoder Fusion Network With Co-Attention Embedding for Referring Image Segmentation Guang Feng, Zhiwei Hu, Lihe Zhang, Huchuan Lu [pdf] [arXiv] [bibtex]

Polka Lines: Learning Structured Illumination and Reconstruction for Active Stereo Seung-Hwan Baek, Felix Heide [pdf] [supp] [arXiv] [bibtex]

Image Inpainting With External-Internal Learning and Monochromic Bottleneck Tengfei Wang, Hao Ouyang, Qifeng Chen [pdf] [supp] [arXiv] [bibtex]

Patch2Pix: Epipolar-Guided Pixel-Level Correspondences Qunjie Zhou, Torsten Sattler, Laura Leal-Taixe [pdf] [supp] [bibtex]

Diverse Part Discovery: Occluded Person Re-Identification With Part-Aware Transformer Yulin Li, Jianfeng He, Tianzhu Zhang, Xiang Liu, Yongdong Zhang, Feng Wu [pdf] [supp] [bibtex]

Counterfactual Zero-Shot and Open-Set Visual Recognition Zhongqi Yue, Tan Wang, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang [pdf] [supp] [arXiv] [bibtex]

Person30K: A Dual-Meta Generalization Network for Person Re-Identification Yan Bai, Jile Jiao, Wang Ce, Jun Liu, Yihang Lou, Xuetao Feng, Ling-Yu Duan [pdf] [bibtex]

Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition Stephen Hausler, Sourav Garg, Ming Xu, Michael Milford, Tobias Fischer [pdf] [supp] [bibtex]

Visually Informed Binaural Audio Generation without Binaural Audios Xudong Xu, Hang Zhou, Ziwei Liu, Bo Dai, Xiaogang Wang, Dahua Lin [pdf] [supp] [arXiv] [bibtex]

Dual Attention Guided Gaze Target Detection in the Wild Yi Fang, Jiapeng Tang, Wang Shen, Wei Shen, Xiao Gu, Li Song, Guangtao Zhai [pdf] [bibtex]

Privacy Preserving Localization and Mapping From Uncalibrated Cameras Marcel Geppert, Viktor Larsson, Pablo Speciale, Johannes L. Schonberger, Marc Pollefeys [pdf] [supp] [bibtex]

Learning Calibrated Medical Image Segmentation via Multi-Rater Agreement Modeling Wei Ji, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Qi Bi, Jingjing Li, Hanruo Liu, Li Cheng, Yefeng Zheng [pdf] [bibtex]

Points As Queries: Weakly Semi-Supervised Object Detection by Points Liangyu Chen, Tong Yang, Xiangyu Zhang, Wei Zhang, Jian Sun [pdf] [arXiv] [bibtex]

Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Chen Change Loy, Jinwei Gu [pdf] [supp] [arXiv] [bibtex]

iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li [pdf] [supp] [arXiv] [bibtex]

Pose Recognition With Cascade Transformers Ke Li, Shijie Wang, Xiang Zhang, Yifan Xu, Weijian Xu, Zhuowen Tu [pdf] [arXiv] [bibtex]

Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection Zhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang [pdf] [supp] [arXiv] [bibtex]

Prototype-Guided Saliency Feature Learning for Person Search Hanjae Kim, Sunghun Joung, Ig-Jae Kim, Kwanghoon Sohn [pdf] [bibtex]

Contrastive Learning for Compact Single Image Dehazing Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma [pdf] [supp] [arXiv] [bibtex]

I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors Chaoqi Chen, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu [pdf] [arXiv] [bibtex]

Body Meshes as Points Jianfeng Zhang, Dongdong Yu, Jun Hao Liew, Xuecheng Nie, Jiashi Feng [pdf] [supp] [arXiv] [bibtex]

Pixel-Aligned Volumetric Avatars Amit Raj, Michael Zollhofer, Tomas Simon, Jason Saragih, Shunsuke Saito, James Hays, Stephen Lombardi [pdf] [supp] [bibtex]

UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu [pdf] [supp] [arXiv] [bibtex]

Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification Jianwen Xie, Yifei Xu, Zilong Zheng, Song-Chun Zhu, Ying Nian Wu [pdf] [arXiv] [bibtex]

Blur, Noise, and Compression Robust Generative Adversarial Networks Takuhiro Kaneko, Tatsuya Harada [pdf] [arXiv] [bibtex]

Invisible Perturbations: Physical Adversarial Examples Exploiting the Rolling Shutter Effect Athena Sayles, Ashish Hooda, Mohit Gupta, Rahul Chatterjee, Earlence Fernandes [pdf] [supp] [arXiv] [bibtex]

Introvert: Human Trajectory Prediction via Conditional 3D Attention Nasim Shafiee, Taskin Padir, Ehsan Elhamifar [pdf] [supp] [bibtex]

Camouflaged Object Segmentation With Distraction Mining Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, Deng-Ping Fan [pdf] [supp] [arXiv] [bibtex]

RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias Niessner [pdf] [supp] [bibtex]

In the Light of Feature Distributions: Moment Matching for Neural Style Transfer Nikolai Kalischek, Jan D. Wegner, Konrad Schindler [pdf] [supp] [arXiv] [bibtex]

DOTS: Decoupling Operation and Topology in Differentiable Architecture Search Yu-Chao Gu, Li-Juan Wang, Yun Liu, Yi Yang, Yu-Huan Wu, Shao-Ping Lu, Ming-Ming Cheng [pdf] [supp] [arXiv] [bibtex]

DriveGAN: Towards a Controllable High-Quality Neural Simulation Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler [pdf] [supp] [arXiv] [bibtex]

Style-Aware Normalized Loss for Improving Arbitrary Style Transfer Jiaxin Cheng, Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Prem Natarajan [pdf] [supp] [arXiv] [bibtex]

Wide-Depth-Range 6D Object Pose Estimation in Space Yinlin Hu, Sebastien Speierer, Wenzel Jakob, Pascal Fua, Mathieu Salzmann [pdf] [arXiv] [bibtex]

Learning Salient Boundary Feature for Anchor-free Temporal Action Localization Chuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu [pdf] [supp] [arXiv] [bibtex]

Monocular Depth Estimation via Listwise Ranking Using the Plackett-Luce Model Julian Lienen, Eyke Hullermeier, Ralph Ewerth, Nils Nommensen [pdf] [supp] [bibtex]

Holistic 3D Scene Understanding From a Single Image With Implicit Representation Cheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu [pdf] [supp] [arXiv] [bibtex]

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas J. Guibas [pdf] [supp] [arXiv] [bibtex]

Learning Optical Flow From a Few Matches Shihao Jiang, Yao Lu, Hongdong Li, Richard Hartley [pdf] [arXiv] [bibtex]

Learnable Motion Coherence for Correspondence Pruning Yuan Liu, Lingjie Liu, Cheng Lin, Zhen Dong, Wenping Wang [pdf] [supp] [arXiv] [bibtex]

ManipulaTHOR: A Framework for Visual Object Manipulation Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi [pdf] [supp] [arXiv] [bibtex]

DeepI2P: Image-to-Point Cloud Registration via Deep Classification Jiaxin Li, Gim Hee Lee [pdf] [supp] [arXiv] [bibtex]

Scene-Intuitive Agent for Remote Embodied Visual Grounding Xiangru Lin, Guanbin Li, Yizhou Yu [pdf] [supp] [arXiv] [bibtex]

Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu [pdf] [supp] [arXiv] [bibtex]

Enhancing the Transferability of Adversarial Attacks Through Variance Tuning Xiaosen Wang, Kun He [pdf] [supp] [arXiv] [bibtex]

HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown [pdf] [supp] [arXiv] [bibtex]

BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan [pdf] [bibtex]

Probabilistic Model Distillation for Semantic Correspondence Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu [pdf] [bibtex]

OpenRooms: An Open Framework for Photorealistic Indoor Scene Datasets Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, Yuhan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Hong-Xing Yu, Zexiang Xu, Kalyan Sunkavalli, Milos Hasan, Ravi Ramamoorthi, Manmohan Chandraker [pdf] [supp] [bibtex]

SSAN: Separable Self-Attention Network for Video Representation Learning Xudong Guo, Xun Guo, Yan Lu [pdf] [arXiv] [bibtex]

4D Panoptic LiDAR Segmentation Mehmet Aygun, Aljosa Osep, Mark Weber, Maxim Maximov, Cyrill Stachniss, Jens Behley, Laura Leal-Taixe [pdf] [supp] [arXiv] [bibtex]

SceneGen: Learning To Generate Realistic Traffic Scenes Shuhan Tan, Kelvin Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]

Natural Adversarial Examples Dan Hendrycks, Kevin Zhao, Steven Basart, Jacob Steinhardt, Dawn Song [pdf] [supp] [arXiv] [bibtex]

CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models Mengyue Yang, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, Jun Wang [pdf] [supp] [arXiv] [bibtex]

VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples Tian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu [pdf] [arXiv] [bibtex]

Zero-Shot Instance Segmentation Ye Zheng, Jiahong Wu, Yongqiang Qin, Faen Zhang, Li Cui [pdf] [supp] [arXiv] [bibtex]

Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes Julian Chibane, Aayush Bansal, Verica Lazova, Gerard Pons-Moll [pdf] [supp] [arXiv] [bibtex]

Global Transport for Fluid Reconstruction With Learned Self-Supervision Erik Franz, Barbara Solenthaler, Nils Thuerey [pdf] [supp] [arXiv] [bibtex]

SliceNet: Deep Dense Depth Estimation From a Single Indoor Panorama Using a Slice-Based Representation Giovanni Pintore, Marco Agus, Eva Almansa, Jens Schneider, Enrico Gobbetti [pdf] [supp] [bibtex]

Offboard 3D Object Detection From Point Cloud Sequences Charles R. Qi, Yin Zhou, Mahyar Najibi, Pei Sun, Khoa Vo, Boyang Deng, Dragomir Anguelov [pdf] [supp] [arXiv] [bibtex]

STaR: Self-Supervised Tracking and Reconstruction of Rigid Objects in Motion With Neural Rendering Wentao Yuan, Zhaoyang Lv, Tanner Schmidt, Steven Lovegrove [pdf] [supp] [arXiv] [bibtex]

Generalization on Unseen Domains via Inference-Time Label-Preserving Target Projections Prashant Pandey, Mrigank Raman, Sumanth Varambally, Prathosh AP [pdf] [bibtex]

Monocular 3D Object Detection: An Extrinsic Parameter Free Approach Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang [pdf] [bibtex]

Communication Efficient SGD via Gradient Sampling With Bayes Prior Liuyihan Song, Kang Zhao, Pan Pan, Yu Liu, Yingya Zhang, Yinghui Xu, Rong Jin [pdf] [bibtex]

AdaBins: Depth Estimation Using Adaptive Bins Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka [pdf] [supp] [arXiv] [bibtex]

VirFace: Enhancing Face Recognition via Unlabeled Shallow Data Wenyu Li, Tianchu Guo, Pengyu Li, Binghui Chen, Biao Wang, Wangmeng Zuo, Lei Zhang [pdf] [supp] [bibtex]

Pulsar: Efficient Sphere-Based Neural Rendering Christoph Lassner, Michael Zollhofer [pdf] [supp] [arXiv] [bibtex]

Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang [pdf] [arXiv] [bibtex]

Visualizing Adapted Knowledge in Domain Transfer Yunzhong Hou, Liang Zheng [pdf] [arXiv] [bibtex]

Delving into Data: Effectively Substitute Training for Black-box Attack Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue [pdf] [arXiv] [bibtex]

How To Exploit the Transferability of Learned Image Compression to Conventional Codecs Jan P. Klopp, Keng-Chi Liu, Liang-Gee Chen, Shao-Yi Chien [pdf] [supp] [arXiv] [bibtex]

CorrNet3D: Unsupervised End-to-End Learning of Dense Correspondence for 3D Point Clouds Yiming Zeng, Yue Qian, Zhiyu Zhu, Junhui Hou, Hui Yuan, Ying He [pdf] [arXiv] [bibtex]

Single-View Robot Pose and Joint Angle Estimation via Render & Compare Yann Labbe, Justin Carpentier, Mathieu Aubry, Josef Sivic [pdf] [arXiv] [bibtex]

Harmonious Semantic Line Detection via Maximal Weight Clique Selection Dongkwon Jin, Wonhui Park, Seong-Gyun Jeong, Chang-Su Kim [pdf] [supp] [arXiv] [bibtex]

Learning the Non-Differentiable Optimization for Blind Super-Resolution Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao [pdf] [supp] [bibtex]

Progressive Temporal Feature Alignment Network for Video Inpainting Xueyan Zou, Linjie Yang, Ding Liu, Yong Jae Lee [pdf] [supp] [arXiv] [bibtex]

Bottleneck Transformers for Visual Recognition Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani [pdf] [supp] [arXiv] [bibtex]

Calibrated RGB-D Salient Object Detection Wei Ji, Jingjing Li, Shuang Yu, Miao Zhang, Yongri Piao, Shunyu Yao, Qi Bi, Kai Ma, Yefeng Zheng, Huchuan Lu, Li Cheng [pdf] [bibtex]

S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling Ze Yang, Shenlong Wang, Sivabalan Manivasagam, Zeng Huang, Wei-Chiu Ma, Xinchen Yan, Ersin Yumer, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]

OSTeC: One-Shot Texture Completion Baris Gecer, Jiankang Deng, Stefanos Zafeiriou [pdf] [supp] [arXiv] [bibtex]

Learning To Count Everything Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai [pdf] [supp] [arXiv] [bibtex]

Robust Representation Learning With Feedback for Single Image Deraining Chenghao Chen, Hao Li [pdf] [arXiv] [bibtex]

Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction Feng Liu, Luan Tran, Xiaoming Liu [pdf] [supp] [arXiv] [bibtex]

SSN: Soft Shadow Network for Image Compositing Yichen Sheng, Jianming Zhang, Bedrich Benes [pdf] [supp] [arXiv] [bibtex]

MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection Jia-Chang Feng, Fa-Ting Hong, Wei-Shi Zheng [pdf] [supp] [arXiv] [bibtex]

VinVL: Revisiting Visual Representations in Vision-Language Models Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao [pdf] [supp] [arXiv] [bibtex]

Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression Zigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, Jingdong Wang [pdf] [arXiv] [bibtex]

CoMoGAN: Continuous Model-Guided Image-to-Image Translation Fabio Pizzati, Pietro Cerri, Raoul de Charette [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Video Hashing via Bidirectional Transformers Shuyan Li, Xiu Li, Jiwen Lu, Jie Zhou [pdf] [bibtex]

From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation Chen Li, Gim Hee Lee [pdf] [arXiv] [bibtex]

Safe Local Motion Planning With Self-Supervised Freespace Forecasting Peiyun Hu, Aaron Huang, John Dolan, David Held, Deva Ramanan [pdf] [supp] [bibtex]

Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng [pdf] [supp] [arXiv] [bibtex]

CondenseNet V2: Sparse Feature Reactivation for Deep Networks Le Yang, Haojun Jiang, Ruojin Cai, Yulin Wang, Shiji Song, Gao Huang, Qi Tian [pdf] [supp] [arXiv] [bibtex]

Learning Graphs for Knowledge Transfer With Limited Labels Pallabi Ghosh, Nirat Saini, Larry S. Davis, Abhinav Shrivastava [pdf] [supp] [bibtex]

DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation Seunghun Lee, Sunghyun Cho, Sunghoon Im [pdf] [supp] [arXiv] [bibtex]

Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding Binbin Huang, Dongze Lian, Weixin Luo, Shenghua Gao [pdf] [arXiv] [bibtex]

Information Bottleneck Disentanglement for Identity Swapping Gege Gao, Huaibo Huang, Chaoyou Fu, Zhaoyang Li, Ran He [pdf] [supp] [bibtex]

DualGraph: A Graph-Based Method for Reasoning About Label Noise HaiYang Zhang, XiMing Xing, Liang Liu [pdf] [bibtex]

Automatic Correction of Internal Units in Generative Neural Networks Ali Tousi, Haedong Jeong, Jiyeon Han, Hwanil Choi, Jaesik Choi [pdf] [arXiv] [bibtex]

Generating Manga From Illustrations via Mimicking Manga Creation Workflow Lvmin Zhang, Xinrui Wang, Qingnan Fan, Yi Ji, Chunping Liu [pdf] [bibtex]

Multi-Decoding Deraining Network and Quasi-Sparsity Based Training Yinglong Wang, Chao Ma, Bing Zeng [pdf] [bibtex]

Open-Vocabulary Object Detection Using Captions Alireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang [pdf] [supp] [arXiv] [bibtex]

Unveiling the Potential of Structure Preserving for Weakly Supervised Object Localization Xingjia Pan, Yingguo Gao, Zhiwen Lin, Fan Tang, Weiming Dong, Haolei Yuan, Feiyue Huang, Changsheng Xu [pdf] [supp] [arXiv] [bibtex]

From Points to Multi-Object 3D Reconstruction Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari [pdf] [arXiv] [bibtex]

Dual-Stream Multiple Instance Learning Network for Whole Slide Image Classification With Self-Supervised Contrastive Learning Bin Li, Yin Li, Kevin W. Eliceiri [pdf] [supp] [arXiv] [bibtex]

Regressive Domain Adaptation for Unsupervised Keypoint Detection Junguang Jiang, Yifei Ji, Ximei Wang, Yufeng Liu, Jianmin Wang, Mingsheng Long [pdf] [arXiv] [bibtex]

Mask Guided Matting via Progressive Refinement Network Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille [pdf] [arXiv] [bibtex]

Monocular Reconstruction of Neural Face Reflectance Fields Mallikarjun B R, Ayush Tewari, Tae-Hyun Oh, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]

SelfSAGCN: Self-Supervised Semantic Alignment for Graph Convolution Network Xu Yang, Cheng Deng, Zhiyuan Dang, Kun Wei, Junchi Yan [pdf] [bibtex]

ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-Shot Learning Chaofan Chen, Xiaoshan Yang, Changsheng Xu, Xuhui Huang, Zhe Ma [pdf] [bibtex]

Coarse-Fine Networks for Temporal Activity Detection in Videos Kumara Kahatapitiya, Michael S. Ryoo [pdf] [arXiv] [bibtex]

Can Audio-Visual Integration Strengthen Robustness Under Multimodal Attacks? Yapeng Tian, Chenliang Xu [pdf] [supp] [arXiv] [bibtex]

Deep Gradient Projection Networks for Pan-sharpening Shuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang [pdf] [arXiv] [bibtex]

ReNAS: Relativistic Evaluation of Neural Architecture Search Yixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu [pdf] [supp] [arXiv] [bibtex]

When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo [pdf] [supp] [arXiv] [bibtex]

ReMix: Towards Image-to-Image Translation With Limited Data Jie Cao, Luanxuan Hou, Ming-Hsuan Yang, Ran He, Zhenan Sun [pdf] [supp] [arXiv] [bibtex]

Adaptive Rank Estimate in Robust Principal Component Analysis Zhengqin Xu, Rui He, Shoulie Xie, Shiqian Wu [pdf] [supp] [bibtex]

Continual Adaptation of Visual Representations via Domain Randomization and Meta-Learning Riccardo Volpi, Diane Larlus, Gregory Rogez [pdf] [supp] [arXiv] [bibtex]

DeepACG: Co-Saliency Detection via Semantic-Aware Contrast Gromov-Wasserstein Distance Kaihua Zhang, Mingliang Dong, Bo Liu, Xiao-Tong Yuan, Qingshan Liu [pdf] [bibtex]

SurFree: A Fast Surrogate-Free Black-Box Attack Thibault Maho, Teddy Furon, Erwan Le Merrer [pdf] [arXiv] [bibtex]

Beyond Image to Depth: Improving Depth Prediction Using Echoes Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma [pdf] [supp] [arXiv] [bibtex]

Rich Features for Perceptual Quality Assessment of UGC Videos Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang [pdf] [supp] [bibtex]

Sequential Graph Convolutional Network for Active Learning Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim [pdf] [arXiv] [bibtex]

Generative Classifiers as a Basis for Trustworthy Image Classification Radek Mackowiak, Lynton Ardizzone, Ullrich Kothe, Carsten Rother [pdf] [supp] [arXiv] [bibtex]

EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation Yang Jiao, Trac D. Tran, Guangming Shi [pdf] [arXiv] [bibtex]

Localizing Visual Sounds the Hard Way Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman [pdf] [arXiv] [bibtex]

Synthesize-It-Classifier: Learning a Generative Classifier Through Recurrent Self-Analysis Arghya Pal, Raphael C.-W. Phan, KokSheik Wong [pdf] [supp] [bibtex]

Self-Point-Flow: Self-Supervised Scene Flow Estimation From Point Clouds With Optimal Transport and Random Walk Ruibo Li, Guosheng Lin, Lihua Xie [pdf] [supp] [bibtex]

Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation Yunhang Shen, Liujuan Cao, Zhiwei Chen, Feihong Lian, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji [pdf] [bibtex]

Intelligent Carpet: Inferring 3D Human Pose From Tactile Signals Yiyue Luo, Yunzhu Li, Michael Foshey, Wan Shou, Pratyusha Sharma, Tomas Palacios, Antonio Torralba, Wojciech Matusik [pdf] [supp] [bibtex]

Railroad Is Not a Train: Saliency As Pseudo-Pixel Supervision for Weakly Supervised Semantic Segmentation Seungho Lee, Minhyun Lee, Jongwuk Lee, Hyunjung Shim [pdf] [supp] [arXiv] [bibtex]

Stable View Synthesis Gernot Riegler, Vladlen Koltun [pdf] [arXiv] [bibtex]

Deep Two-View Structure-From-Motion Revisited Jianyuan Wang, Yiran Zhong, Yuchao Dai, Stan Birchfield, Kaihao Zhang, Nikolai Smolyanskiy, Hongdong Li [pdf] [supp] [arXiv] [bibtex]

Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes Dmytro Kotovenko, Matthias Wright, Arthur Heimbrecht, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]

Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation Rui Gong, Yuhua Chen, Danda Pani Paudel, Yawei Li, Ajad Chhatkuli, Wen Li, Dengxin Dai, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang [pdf] [arXiv] [bibtex]

PointDSC: Robust Point Cloud Registration Using Deep Spatial Consistency Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai [pdf] [supp] [arXiv] [bibtex]

Task Programming: Learning Data Efficient Behavior Representations Jennifer J. Sun, Ann Kennedy, Eric Zhan, David J. Anderson, Yisong Yue, Pietro Perona [pdf] [supp] [arXiv] [bibtex]

ACRE: Abstract Causal REasoning Beyond Covariation Chi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu [pdf] [supp] [arXiv] [bibtex]

DeepLM: Large-Scale Nonlinear Least Squares on Deep Learning Frameworks Using Stochastic Domain Decomposition Jingwei Huang, Shan Huang, Mingwei Sun [pdf] [supp] [bibtex]

TDN: Temporal Difference Networks for Efficient Action Recognition Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu [pdf] [supp] [arXiv] [bibtex]

LiBRe: A Practical Bayesian Approach to Adversarial Detection Zhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu [pdf] [supp] [arXiv] [bibtex]

ArtCoder: An End-to-End Method for Generating Scanning-Robust Stylized QR Codes Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren [pdf] [bibtex]

Self-Supervised Pillar Motion Learning for Autonomous Driving Chenxu Luo, Xiaodong Yang, Alan Yuille [pdf] [supp] [arXiv] [bibtex]

Quantum Permutation Synchronization Tolga Birdal, Vladislav Golyanik, Christian Theobalt, Leonidas J. Guibas [pdf] [supp] [arXiv] [bibtex]

QAIR: Practical Query-Efficient Black-Box Attacks for Image Retrieval Xiaodan Li, Jinfeng Li, Yuefeng Chen, Shaokai Ye, Yuan He, Shuhui Wang, Hang Su, Hui Xue [pdf] [supp] [arXiv] [bibtex]

MagFace: A Universal Representation for Face Recognition and Quality Assessment Qiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou [pdf] [supp] [arXiv] [bibtex]

Wasserstein Barycenter for Multi-Source Domain Adaptation Eduardo Fernandes Montesuma, Fred Maurice Ngole Mboula [pdf] [supp] [bibtex]

Unsupervised Hyperbolic Metric Learning Jiexi Yan, Lei Luo, Cheng Deng, Heng Huang [pdf] [bibtex]

Improving Sign Language Translation With Monolingual Data by Sign Back-Translation Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li [pdf] [arXiv] [bibtex]

Background Splitting: Finding Rare Classes in a Sea of Background Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian [pdf] [supp] [arXiv] [bibtex]

Adaptive Convolutions for Structure-Aware Style Transfer Prashanth Chandran, Gaspard Zoss, Paulo Gotardo, Markus Gross, Derek Bradley [pdf] [supp] [bibtex]

Few-Shot Incremental Learning With Continually Evolved Classifiers Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu [pdf] [supp] [arXiv] [bibtex]

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions Junbin Xiao, Xindi Shang, Angela Yao, Tat-Seng Chua [pdf] [supp] [bibtex]

LayoutGMN: Neural Graph Matching for Structural Layout Similarity Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang [pdf] [supp] [arXiv] [bibtex]

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li [pdf] [supp] [bibtex]

ArtEmis: Affective Language for Visual Art Panos Achlioptas, Maks Ovsjanikov, Kilichbek Haydarov, Mohamed Elhoseiny, Leonidas J. Guibas [pdf] [arXiv] [bibtex]

Sketch, Ground, and Refine: Top-Down Dense Video Captioning Chaorui Deng, Shizhe Chen, Da Chen, Yuan He, Qi Wu [pdf] [bibtex]

Learning Normal Dynamics in Videos With Meta Prototype Network Hui Lv, Chen Chen, Zhen Cui, Chunyan Xu, Yong Li, Jian Yang [pdf] [supp] [arXiv] [bibtex]

Graph-Based High-Order Relation Discovery for Fine-Grained Recognition Yifan Zhao, Ke Yan, Feiyue Huang, Jia Li [pdf] [bibtex]

Normal Integration via Inverse Plane Fitting With Minimum Point-to-Plane Distance Xu Cao, Boxin Shi, Fumio Okura, Yasuyuki Matsushita [pdf] [supp] [bibtex]

NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration Zhengang Li, Geng Yuan, Wei Niu, Pu Zhao, Yanyu Li, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin [pdf] [arXiv] [bibtex]

Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation Minghan Li, Shuai Li, Lida Li, Lei Zhang [pdf] [supp] [arXiv] [bibtex]

Learning Asynchronous and Sparse Human-Object Interaction in Videos Romero Morais, Vuong Le, Svetha Venkatesh, Truyen Tran [pdf] [supp] [arXiv] [bibtex]

Single Image Reflection Removal With Absorption Effect Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot [pdf] [supp] [bibtex]

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking Minghao Chen, Jianlong Fu, Haibin Ling [pdf] [supp] [arXiv] [bibtex]

Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo [pdf] [supp] [arXiv] [bibtex]

M3DSSD: Monocular 3D Single Stage Object Detector Shujie Luo, Hang Dai, Ling Shao, Yong Ding [pdf] [arXiv] [bibtex]

Structure-Aware Face Clustering on a Large-Scale Graph With 10^7 Nodes Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou [pdf] [supp] [bibtex]

Objects Are Different: Flexible Monocular 3D Object Detection Yunpeng Zhang, Jiwen Lu, Jie Zhou [pdf] [arXiv] [bibtex]

Permuted AdaIN: Reducing the Bias Towards Global Statistics in Image Classification Oren Nuriel, Sagie Benaim, Lior Wolf [pdf] [arXiv] [bibtex]

Pixel Codec Avatars Shugao Ma, Tomas Simon, Jason Saragih, Dawei Wang, Yuecheng Li, Fernando De la Torre, Yaser Sheikh [pdf] [supp] [arXiv] [bibtex]

SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification Zijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia [pdf] [supp] [arXiv] [bibtex]

Context-Aware Layout to Image Generation With Enhanced Object Appearance Sen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang [pdf] [arXiv] [bibtex]

Mask-Embedded Discriminator With Region-Based Semantic Regularization for Semi-Supervised Class-Conditional Image Synthesis Yi Liu, Xiaoyang Huo, Tianyi Chen, Xiangping Zeng, Si Wu, Zhiwen Yu, Hau-San Wong [pdf] [bibtex]

LEAP: Learning Articulated Occupancy of People Marko Mihajlovic, Yan Zhang, Michael J. Black, Siyu Tang [pdf] [supp] [arXiv] [bibtex]

ANR: Articulated Neural Rendering for Virtual Avatars Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, Christoph Lassner [pdf] [supp] [arXiv] [bibtex]

Flow-Based Kernel Prior With Application to Blind Super-Resolution Jingyun Liang, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte [pdf] [supp] [arXiv] [bibtex]

Probabilistic Selective Encryption of Convolutional Neural Networks for Hierarchical Services Jinyu Tian, Jiantao Zhou, Jia Duan [pdf] [supp] [arXiv] [bibtex]

Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images Florian Kluger, Hanno Ackermann, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn [pdf] [supp] [arXiv] [bibtex]

Dive Into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition Jiahui She, Yibo Hu, Hailin Shi, Jun Wang, Qiu Shen, Tao Mei [pdf] [supp] [arXiv] [bibtex]

Attention-Guided Image Compression by Deep Reconstruction of Compressive Sensed Saliency Skeleton Xi Zhang, Xiaolin Wu [pdf] [supp] [arXiv] [bibtex]

Cluster-Wise Hierarchical Generative Model for Deep Amortized Clustering Huafeng Liu, Jiaqi Wang, Liping Jing [pdf] [supp] [bibtex]

Mirror3D: Depth Refinement for Mirror Surfaces Jiaqi Tan, Weijie Lin, Angel X. Chang, Manolis Savva [pdf] [supp] [bibtex]

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu [pdf] [arXiv] [bibtex]

Reciprocal Transformations for Unsupervised Video Object Segmentation Sucheng Ren, Wenxi Liu, Yongtuo Liu, Haoxin Chen, Guoqiang Han, Shengfeng He [pdf] [supp] [bibtex]

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu [pdf] [arXiv] [bibtex]

Learning Complete 3D Morphable Face Models From Images and Videos Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]

Bottom-Up Shift and Reasoning for Referring Image Segmentation Sibei Yang, Meng Xia, Guanbin Li, Hong-Yu Zhou, Yizhou Yu [pdf] [bibtex]

Sparse Auxiliary Networks for Unified Monocular Depth Prediction and Completion Vitor Guizilini, Rares Ambrus, Wolfram Burgard, Adrien Gaidon [pdf] [arXiv] [bibtex]

DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes With Biharmonic Coordinates Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su [pdf] [supp] [arXiv] [bibtex]

Panoptic Segmentation Forecasting Colin Graber, Grace Tsai, Michael Firman, Gabriel Brostow, Alexander G. Schwing [pdf] [supp] [arXiv] [bibtex]

SRDAN: Scale-Aware and Range-Aware Domain Adaptation Network for Cross-Dataset 3D Object Detection Weichen Zhang, Wen Li, Dong Xu [pdf] [bibtex]

Pedestrian and Ego-Vehicle Trajectory Prediction From Monocular Camera Lukas Neumann, Andrea Vedaldi [pdf] [bibtex]

Globally Optimal Relative Pose Estimation With Gravity Prior Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova [pdf] [supp] [arXiv] [bibtex]

Mutual CRF-GNN for Few-Shot Learning Shixiang Tang, Dapeng Chen, Lei Bai, Kaijian Liu, Yixiao Ge, Wanli Ouyang [pdf] [supp] [bibtex]

Weakly Supervised Action Selection Learning in Video Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu [pdf] [arXiv] [bibtex]

Learning Student Networks in the Wild Hanting Chen, Tianyu Guo, Chang Xu, Wenshuo Li, Chunjing Xu, Chao Xu, Yunhe Wang [pdf] [bibtex]

Distilling Knowledge via Knowledge Review Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia [pdf] [supp] [arXiv] [bibtex]

DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets Jianpeng Zhang, Yutong Xie, Yong Xia, Chunhua Shen [pdf] [arXiv] [bibtex]

Lips Don't Lie: A Generalisable and Robust Approach To Face Forgery Detection Alexandros Haliassos, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic [pdf] [supp] [bibtex]

Exploring Simple Siamese Representation Learning Xinlei Chen, Kaiming He [pdf] [supp] [arXiv] [bibtex]

CAMERAS: Enhanced Resolution and Sanity Preserving Class Activation Mapping for Image Saliency Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian [pdf] [supp] [bibtex]

3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding Shengheng Deng, Xun Xu, Chaozheng Wu, Ke Chen, Kui Jia [pdf] [supp] [arXiv] [bibtex]

Learning To Segment Actions From Visual and Language Instructions via Differentiable Weak Sequence Alignment Yuhan Shen, Lu Wang, Ehsan Elhamifar [pdf] [supp] [bibtex]

Deep Implicit Templates for 3D Shape Representation Zerong Zheng, Tao Yu, Qionghai Dai, Yebin Liu [pdf] [supp] [arXiv] [bibtex]

Semantic Image Matting Yanan Sun, Chi-Keung Tang, Yu-Wing Tai [pdf] [supp] [arXiv] [bibtex]

Semi-Supervised Semantic Segmentation With Cross Pseudo Supervision Xiaokang Chen, Yuhui Yuan, Gang Zeng, Jingdong Wang [pdf] [supp] [arXiv] [bibtex]

Ranking Neural Checkpoints Yandong Li, Xuhui Jia, Ruoxin Sang, Yukun Zhu, Bradley Green, Liqiang Wang, Boqing Gong [pdf] [supp] [arXiv] [bibtex]

SuperMix: Supervising the Mixing Data Augmentation Ali Dabouei, Sobhan Soleymani, Fariborz Taherkhani, Nasser M. Nasrabadi [pdf] [supp] [arXiv] [bibtex]

Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection Luwei Hou, Yu Zhang, Kui Fu, Jia Li [pdf] [bibtex]

Inception Convolution With Efficient Dilation Search Jie Liu, Chuming Li, Feng Liang, Chen Lin, Ming Sun, Junjie Yan, Wanli Ouyang, Dong Xu [pdf] [arXiv] [bibtex]

Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy Federico Paredes-Valles, Guido C. H. E. de Croon [pdf] [supp] [bibtex]

AdderSR: Towards Energy Efficient Image Super-Resolution Dehua Song, Yunhe Wang, Hanting Chen, Chang Xu, Chunjing Xu, Dacheng Tao [pdf] [supp] [arXiv] [bibtex]

Semi-Supervised Domain Adaptation Based on Dual-Level Domain Mixing for Semantic Segmentation Shuaijun Chen, Xu Jia, Jianzhong He, Yongjie Shi, Jianzhuang Liu [pdf] [supp] [arXiv] [bibtex]

Connecting What To Say With Where To Look by Modeling Human Attention Traces Zihang Meng, Licheng Yu, Ning Zhang, Tamara L. Berg, Babak Damavandi, Vikas Singh, Amy Bearman [pdf] [supp] [arXiv] [bibtex]

Shelf-Supervised Mesh Prediction in the Wild Yufei Ye, Shubham Tulsiani, Abhinav Gupta [pdf] [supp] [arXiv] [bibtex]

Learning To Filter: Siamese Relation Network for Robust Tracking Siyuan Cheng, Bineng Zhong, Guorong Li, Xin Liu, Zhenjun Tang, Xianxian Li, Jing Wang [pdf] [arXiv] [bibtex]

Ensembling With Deep Generative Views Lucy Chai, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, Richard Zhang [pdf] [supp] [arXiv] [bibtex]

Accurate Few-Shot Object Detection With Support-Query Mutual Guidance and Hybrid Loss Lu Zhang, Shuigeng Zhou, Jihong Guan, Ji Zhang [pdf] [supp] [bibtex]

Cascaded Prediction Network via Segment Tree for Temporal Video Grounding Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin [pdf] [supp] [bibtex]

Posterior Promoted GAN With Distribution Discriminator for Unsupervised Image Synthesis Xianchao Zhang, Ziyang Cheng, Xiaotong Zhang, Han Liu [pdf] [bibtex]

Toward Accurate and Realistic Outfits Visualization With Attention to Details Kedan Li, Min Jin Chong, Jeffrey Zhang, Jingen Liu [pdf] [supp] [bibtex]

Delving Deep Into Many-to-Many Attention for Few-Shot Video Object Segmentation Haoxin Chen, Hanjie Wu, Nanxuan Zhao, Sucheng Ren, Shengfeng He [pdf] [supp] [bibtex]

MongeNet: Efficient Sampler for Geometric Deep Learning Leo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado [pdf] [arXiv] [bibtex]

Gated Spatio-Temporal Attention-Guided Video Deblurring Maitreya Suin, A. N. Rajagopalan [pdf] [bibtex]

Learning Multi-Scale Photo Exposure Correction Mahmoud Afifi, Konstantinos G. Derpanis, Bjorn Ommer, Michael S. Brown [pdf] [supp] [arXiv] [bibtex]

Learning Semantic Person Image Generation by Region-Adaptive Normalization Zhengyao Lv, Xiaoming Li, Xin Li, Fu Li, Tianwei Lin, Dongliang He, Wangmeng Zuo [pdf] [arXiv] [bibtex]

Rethinking Class Relations: Absolute-Relative Supervised and Unsupervised Few-Shot Learning Hongguang Zhang, Piotr Koniusz, Songlei Jian, Hongdong Li, Philip H. S. Torr [pdf] [supp] [arXiv] [bibtex]

Divergence Optimization for Noisy Universal Domain Adaptation Qing Yu, Atsushi Hashimoto, Yoshitaka Ushiku [pdf] [supp] [arXiv] [bibtex]

Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning Chengming Xu, Yanwei Fu, Chen Liu, Chengjie Wang, Jilin Li, Feiyue Huang, Li Zhang, Xiangyang Xue [pdf] [supp] [arXiv] [bibtex]

Unsupervised Learning of 3D Object Categories From Videos in the Wild Philipp Henzler, Jeremy Reizenstein, Patrick Labatut, Roman Shapovalov, Tobias Ritschel, Andrea Vedaldi, David Novotny [pdf] [supp] [arXiv] [bibtex]

Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing Yu Wu, Yi Yang [pdf] [bibtex]

Dogfight: Detecting Drones From Drones Videos Muhammad Waseem Ashraf, Waqas Sultani, Mubarak Shah [pdf] [arXiv] [bibtex]

PAUL: Procrustean Autoencoder for Unsupervised Lifting Chaoyang Wang, Simon Lucey [pdf] [arXiv] [bibtex]

Group Collaborative Learning for Co-Salient Object Detection Qi Fan, Deng-Ping Fan, Huazhu Fu, Chi-Keung Tang, Ling Shao, Yu-Wing Tai [pdf] [arXiv] [bibtex]

RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening Sungha Choi, Sanghun Jung, Huiwon Yun, Joanne T. Kim, Seungryong Kim, Jaegul Choo [pdf] [supp] [arXiv] [bibtex]

Monocular Real-Time Full Body Capture With Inter-Part Correlations Yuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian Theobalt, Feng Xu [pdf] [supp] [arXiv] [bibtex]

Pre-Trained Image Processing Transformer Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao [pdf] [supp] [arXiv] [bibtex]

Robust and Accurate Object Detection via Adversarial Learning Xiangning Chen, Cihang Xie, Mingxing Tan, Li Zhang, Cho-Jui Hsieh, Boqing Gong [pdf] [supp] [arXiv] [bibtex]

Faster Meta Update Strategy for Noise-Robust Deep Learning Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang [pdf] [supp] [arXiv] [bibtex]

ContactOpt: Optimizing Contact To Improve Grasps Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C. Kemp [pdf] [supp] [arXiv] [bibtex]

Panoptic-PolarNet: Proposal-Free LiDAR Point Cloud Panoptic Segmentation Zixiang Zhou, Yang Zhang, Hassan Foroosh [pdf] [supp] [bibtex]

Source-Free Domain Adaptation for Semantic Segmentation Yuang Liu, Wei Zhang, Jun Wang [pdf] [supp] [arXiv] [bibtex]

Adaptive Weighted Discriminator for Training Generative Adversarial Networks Vasily Zadorozhnyy, Qiang Cheng, Qiang Ye [pdf] [supp] [arXiv] [bibtex]

Depth From Camera Motion and Object Detection Brent A. Griffin, Jason J. Corso [pdf] [supp] [arXiv] [bibtex]

PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency Jie Liang, Hui Zeng, Miaomiao Cui, Xuansong Xie, Lei Zhang [pdf] [supp] [arXiv] [bibtex]

Transformation Driven Visual Reasoning Xin Hong, Yanyan Lan, Liang Pang, Jiafeng Guo, Xueqi Cheng [pdf] [supp] [arXiv] [bibtex]

Sparse R-CNN: End-to-End Object Detection With Learnable Proposals Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei Li, Zehuan Yuan, Changhu Wang, Ping Luo [pdf] [bibtex]

Plan2Scene: Converting Floorplans to 3D Scenes Madhawa Vidanapathirana, Qirui Wu, Yasutaka Furukawa, Angel X. Chang, Manolis Savva [pdf] [supp] [bibtex]

Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham [pdf] [supp] [arXiv] [bibtex]

Towards Open World Object Detection K J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian [pdf] [supp] [arXiv] [bibtex]

Conditional Bures Metric for Domain Adaptation You-Wei Luo, Chuan-Xian Ren [pdf] [supp] [bibtex]

DatasetGAN: Efficient Labeled Data Factory With Minimal Human Effort Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler [pdf] [arXiv] [bibtex]

Repurposing GANs for One-Shot Semantic Part Segmentation Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn [pdf] [supp] [arXiv] [bibtex]

Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time Shaowei Liu, Hanwen Jiang, Jiarui Xu, Sifei Liu, Xiaolong Wang [pdf] [supp] [bibtex]

Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation Yapeng Tian, Di Hu, Chenliang Xu [pdf] [supp] [arXiv] [bibtex]

Digital Gimbal: End-to-End Deep Image Stabilization With Learnable Exposure Times Omer Dahary, Matan Jacoby, Alex M. Bronstein [pdf] [supp] [arXiv] [bibtex]

Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi [pdf] [supp] [arXiv] [bibtex]

SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning Over Traffic Events Li Xu, He Huang, Jun Liu [pdf] [supp] [bibtex]

T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval Xiaohan Wang, Linchao Zhu, Yi Yang [pdf] [arXiv] [bibtex]

Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings Mihai Dusmanu, Johannes L. Schonberger, Sudipta N. Sinha, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]

StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song [pdf] [supp] [arXiv] [bibtex]

Embedding Transfer With Label Relaxation for Improved Metric Learning Sungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak [pdf] [supp] [arXiv] [bibtex]

Beyond Static Features for Temporally Consistent 3D Human Pose and Shape From a Video Hongsuk Choi, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee [pdf] [supp] [arXiv] [bibtex]

Layout-Guided Novel View Synthesis From a Single Indoor Panorama Jiale Xu, Jia Zheng, Yanyu Xu, Rui Tang, Shenghua Gao [pdf] [supp] [arXiv] [bibtex]

STMTrack: Template-Free Visual Tracking With Space-Time Memory Networks Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang [pdf] [arXiv] [bibtex]

Reformulating HOI Detection As Adaptive Set Prediction Mingfei Chen, Yue Liao, Si Liu, Zhiyuan Chen, Fei Wang, Chen Qian [pdf] [arXiv] [bibtex]

Strengthen Learning Tolerance for Weakly Supervised Object Localization Guangyu Guo, Junwei Han, Fang Wan, Dingwen Zhang [pdf] [bibtex]

Mesh Saliency: An Independent Perceptual Measure or a Derivative of Image Saliency? Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu, Paul L. Rosin [pdf] [supp] [bibtex]

Passive Inter-Photon Imaging Atul Ingle, Trevor Seets, Mauro Buttafava, Shantanu Gupta, Alberto Tosi, Mohit Gupta, Andreas Velten [pdf] [supp] [arXiv] [bibtex]

Domain Consensus Clustering for Universal Domain Adaptation Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang [pdf] [supp] [bibtex]

Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations Umberto Michieli, Pietro Zanuttigh [pdf] [supp] [arXiv] [bibtex]

Audio-Driven Emotional Video Portraits Xinya Ji, Hang Zhou, Kaisiyuan Wang, Wayne Wu, Chen Change Loy, Xun Cao, Feng Xu [pdf] [supp] [arXiv] [bibtex]

Pareto Self-Supervised Training for Few-Shot Learning Zhengyu Chen, Jixie Ge, Heshen Zhan, Siteng Huang, Donglin Wang [pdf] [supp] [arXiv] [bibtex]

EnD: Entangling and Disentangling Deep Representations for Bias Correction Enzo Tartaglione, Carlo Alberto Barbano, Marco Grangetto [pdf] [supp] [arXiv] [bibtex]

Recorrupted-to-Recorrupted: Unsupervised Deep Learning for Image Denoising Tongyao Pang, Huan Zheng, Yuhui Quan, Hui Ji [pdf] [supp] [bibtex]

Reconsidering Representation Alignment for Multi-View Clustering Daniel J. Trosten, Sigurd Lokse, Robert Jenssen, Michael Kampffmeyer [pdf] [supp] [arXiv] [bibtex]

Probabilistic Embeddings for Cross-Modal Retrieval Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus [pdf] [supp] [arXiv] [bibtex]

Cloud2Curve: Generation and Vectorization of Parametric Sketches Ayan Das, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song [pdf] [arXiv] [bibtex]

TransFill: Reference-Guided Image Inpainting by Merging Multiple Color and Spatial Transformations Yuqian Zhou, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi [pdf] [supp] [arXiv] [bibtex]

On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective Nontawat Charoenphakdee, Jayakorn Vongkulbhisal, Nuttapong Chairatanakul, Masashi Sugiyama [pdf] [supp] [arXiv] [bibtex]

VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation Siyuan Qiao, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen [pdf] [supp] [bibtex]

Sequence-to-Sequence Contrastive Learning for Text Recognition Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona [pdf] [supp] [arXiv] [bibtex]

Prototype-Supervised Adversarial Network for Targeted Attack of Deep Hashing Xunguang Wang, Zheng Zhang, Baoyuan Wu, Fumin Shen, Guangming Lu [pdf] [arXiv] [bibtex]

PD-GAN: Probabilistic Diverse GAN for Image Inpainting Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao [pdf] [supp] [bibtex]

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph [pdf] [supp] [arXiv] [bibtex]

Learning Deep Latent Variable Models by Short-Run MCMC Inference With Optimal Transport Correction Dongsheng An, Jianwen Xie, Ping Li [pdf] [supp] [bibtex]

MobileDets: Searching for Object Detection Architectures for Mobile Accelerators Yunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Geometric Perception Heng Yang, Wei Dong, Luca Carlone, Vladlen Koltun [pdf] [supp] [arXiv] [bibtex]

CutPaste: Self-Supervised Learning for Anomaly Detection and Localization Chun-Liang Li, Kihyuk Sohn, Jinsung Yoon, Tomas Pfister [pdf] [supp] [arXiv] [bibtex]

Open World Compositional Zero-Shot Learning Massimiliano Mancini, Muhammad Ferjad Naeem, Yongqin Xian, Zeynep Akata [pdf] [supp] [bibtex]

Bi-GCN: Binary Graph Convolutional Network Junfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo [pdf] [supp] [bibtex]

Complementary Relation Contrastive Distillation Jinguo Zhu, Shixiang Tang, Dapeng Chen, Shijie Yu, Yakun Liu, Mingzhe Rong, Aijun Yang, Xiaohua Wang [pdf] [arXiv] [bibtex]

UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification Tianyu Zhang, Lingxi Xie, Longhui Wei, Zijie Zhuang, Yongfei Zhang, Bo Li, Qi Tian [pdf] [supp] [arXiv] [bibtex]

Iterative Filter Adaptive Network for Single Image Defocus Deblurring Junyong Lee, Hyeongseok Son, Jaesung Rim, Sunghyun Cho, Seungyong Lee [pdf] [supp] [bibtex]

UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun [pdf] [arXiv] [bibtex]

House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent for Professional Architects Nelson Nauata, Sepidehsadat Hosseini, Kai-Hung Chang, Hang Chu, Chin-Yi Cheng, Yasutaka Furukawa [pdf] [supp] [bibtex]

HDR Environment Map Estimation for Real-Time Augmented Reality Gowri Somanath, Daniel Kurz [pdf] [supp] [arXiv] [bibtex]

OTA: Optimal Transport Assignment for Object Detection Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Progressive Semantic Segmentation Chuong Huynh, Anh Tuan Tran, Khoa Luu, Minh Hoai [pdf] [supp] [arXiv] [bibtex]

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond Kelvin C.K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy [pdf] [supp] [arXiv] [bibtex]

Efficient Multi-Stage Video Denoising With Recurrent Spatio-Temporal Fusion Matteo Maggioni, Yibin Huang, Cheng Li, Shuai Xiao, Zhongqian Fu, Fenglong Song [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map Elmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart [pdf] [arXiv] [bibtex]

Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking Fatemeh Saleh, Sadegh Aliakbarian, Hamid Rezatofighi, Mathieu Salzmann, Stephen Gould [pdf] [supp] [arXiv] [bibtex]

Stay Positive: Non-Negative Image Synthesis for Augmented Reality Katie Luo, Guandao Yang, Wenqi Xian, Harald Haraldsson, Bharath Hariharan, Serge Belongie [pdf] [supp] [bibtex]

3D-to-2D Distillation for Indoor Scene Parsing Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu [pdf] [arXiv] [bibtex]

Learning the Best Pooling Strategy for Visual Semantic Embedding Jiacheng Chen, Hexiang Hu, Hao Wu, Yuning Jiang, Changhu Wang [pdf] [supp] [arXiv] [bibtex]

GLAVNet: Global-Local Audio-Visual Cues for Fine-Grained Material Recognition Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo [pdf] [supp] [bibtex]

Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li [pdf] [bibtex]

Regularizing Generative Adversarial Networks Under Limited Data Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang [pdf] [supp] [arXiv] [bibtex]

Skeleton Merger: An Unsupervised Aligned Keypoint Detector Ruoxi Shi, Zhengrong Xue, Yang You, Cewu Lu [pdf] [arXiv] [bibtex]

Regularizing Neural Networks via Adversarial Model Perturbation Yaowei Zheng, Richong Zhang, Yongyi Mao [pdf] [supp] [arXiv] [bibtex]

Learning by Aligning Videos in Time Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram N. Syed, Andrey Konin, Zeeshan Zia, Quoc-Huy Tran [pdf] [supp] [arXiv] [bibtex]

Contrastive Neural Architecture Search With Neural Architecture Comparators Yaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, Yaowei Wang, Mingkui Tan [pdf] [supp] [arXiv] [bibtex]

Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo [pdf] [bibtex]

Populating 3D Scenes by Learning Human-Scene Interaction Mohamed Hassan, Partha Ghosh, Joachim Tesch, Dimitrios Tzionas, Michael J. Black [pdf] [supp] [arXiv] [bibtex]

Variational Pedestrian Detection Yuang Zhang, Huanyu He, Jianguo Li, Yuxi Li, John See, Weiyao Lin [pdf] [supp] [arXiv] [bibtex]

SIPSA-Net: Shift-Invariant Pan Sharpening With Moving Object Alignment for Satellite Imagery Jaehyup Lee, Soomin Seo, Munchurl Kim [pdf] [supp] [bibtex]

Large-Scale Localization Datasets in Crowded Indoor Spaces Donghwan Lee, Soohyun Ryu, Suyong Yeon, Yonghan Lee, Deokhwa Kim, Cheolho Han, Yohann Cabon, Philippe Weinzaepfel, Nicolas Guerin, Gabriela Csurka, Martin Humenberger [pdf] [supp] [arXiv] [bibtex]

Distilling Causal Effect of Data in Class-Incremental Learning Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang [pdf] [supp] [arXiv] [bibtex]

Backdoor Attacks Against Deep Learning Systems in the Physical World Emily Wenger, Josephine Passananti, Arjun Nitin Bhagoji, Yuanshun Yao, Haitao Zheng, Ben Y. Zhao [pdf] [supp] [arXiv] [bibtex]

A Multiplexed Network for End-to-End, Multilingual OCR Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner [pdf] [arXiv] [bibtex]

Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency Xin Lai, Zhuotao Tian, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia [pdf] [supp] [bibtex]

Causal Hidden Markov Model for Time Series Disease Forecasting Jing Li, Botong Wu, Xinwei Sun, Yizhou Wang [pdf] [supp] [arXiv] [bibtex]

Generalizable Pedestrian Detection: The Elephant in the Room Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao [pdf] [arXiv] [bibtex]

Focus on Local: Detecting Lane Marker From Bottom Up via Key Point Zhan Qu, Huan Jin, Yang Zhou, Zhen Yang, Wei Zhang [pdf] [arXiv] [bibtex]

Memory-Guided Unsupervised Image-to-Image Translation Somi Jeong, Youngjung Kim, Eungbean Lee, Kwanghoon Sohn [pdf] [arXiv] [bibtex]

Incremental Few-Shot Instance Segmentation Dan Andrei Ganea, Bas Boom, Ronald Poppe [pdf] [supp] [arXiv] [bibtex]

Mining Better Samples for Contrastive Learning of Temporal Correspondence Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn [pdf] [supp] [bibtex]

Scene-Aware Generative Network for Human Motion Synthesis Jingbo Wang, Sijie Yan, Bo Dai, Dahua Lin [pdf] [supp] [arXiv] [bibtex]

Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis Yaxuan Zhu, Ruiqi Gao, Siyuan Huang, Song-Chun Zhu, Ying Nian Wu [pdf] [supp] [arXiv] [bibtex]

PML: Progressive Margin Loss for Long-Tailed Age Classification Zongyong Deng, Hao Liu, Yaoxing Wang, Chenyang Wang, Zekuan Yu, Xuehong Sun [pdf] [arXiv] [bibtex]

Single Image Depth Prediction With Wavelet Decomposition Michael Ramamonjisoa, Michael Firman, Jamie Watson, Vincent Lepetit, Daniyar Turmukhambetov [pdf] [supp] [bibtex]

PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features Zhenwei Miao, Jikai Chen, Hongyu Pan, Ruiwen Zhang, Kaixuan Liu, Peihan Hao, Jun Zhu, Yang Wang, Xin Zhan [pdf] [bibtex]

Exemplar-Based Open-Set Panoptic Segmentation Network Jaedong Hwang, Seoung Wug Oh, Joon-Young Lee, Bohyung Han [pdf] [supp] [arXiv] [bibtex]

KOALAnet: Blind Super-Resolution Using Kernel-Oriented Adaptive Local Adjustment Soo Ye Kim, Hyeonjun Sim, Munchurl Kim [pdf] [supp] [arXiv] [bibtex]

Learning Deep Classifiers Consistent With Fine-Grained Novelty Detection Jiacheng Cheng, Nuno Vasconcelos [pdf] [supp] [bibtex]

Multiple Object Tracking With Correlation Learning Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu [pdf] [arXiv] [bibtex]

SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data Yuan-Ting Hu, Jiahong Wang, Raymond A. Yeh, Alexander G. Schwing [pdf] [arXiv] [bibtex]

PixMatch: Unsupervised Domain Adaptation via Pixelwise Consistency Training Luke Melas-Kyriazi, Arjun K. Manrai [pdf] [supp] [bibtex]

Deep RGB-D Saliency Detection With Depth-Sensitive Attention and Automatic Multi-Modal Fusion Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li [pdf] [supp] [bibtex]

Exploring Sparsity in Image Super-Resolution for Efficient Inference Longguang Wang, Xiaoyu Dong, Yingqian Wang, Xinyi Ying, Zaiping Lin, Wei An, Yulan Guo [pdf] [supp] [arXiv] [bibtex]

Positive Sample Propagation Along the Audio-Visual Event Line Jinxing Zhou, Liang Zheng, Yiran Zhong, Shijie Hao, Meng Wang [pdf] [arXiv] [bibtex]

Understanding the Behaviour of Contrastive Loss Feng Wang, Huaping Liu [pdf] [supp] [arXiv] [bibtex]

Variational Prototype Learning for Deep Face Recognition Jiankang Deng, Jia Guo, Jing Yang, Alexandros Lattas, Stefanos Zafeiriou [pdf] [bibtex]

StylePeople: A Generative Model of Fullbody Human Avatars Artur Grigorev, Karim Iskakov, Anastasia Ianina, Renat Bashirov, Ilya Zakharkin, Alexander Vakhitov, Victor Lempitsky [pdf] [supp] [arXiv] [bibtex]

Optimal Quantization Using Scaled Codebook Yerlan Idelbayev, Pavlo Molchanov, Maying Shen, Hongxu Yin, Miguel A. Carreira-Perpinan, Jose M. Alvarez [pdf] [bibtex]

RPN Prototype Alignment for Domain Adaptive Object Detector Yixin Zhang, Zilei Wang, Yushi Mao [pdf] [bibtex]

Dual Contradistinctive Generative Autoencoder Gaurav Parmar, Dacheng Li, Kwonjoon Lee, Zhuowen Tu [pdf] [supp] [arXiv] [bibtex]

Binary TTC: A Temporal Geofence for Autonomous Navigation Abhishek Badki, Orazio Gallo, Jan Kautz, Pradeep Sen [pdf] [supp] [arXiv] [bibtex]

Semantic-Aware Video Text Detection Wei Feng, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu [pdf] [bibtex]

Real-Time High-Resolution Background Matting Shanchuan Lin, Andrey Ryabtsev, Soumyadip Sengupta, Brian L. Curless, Steven M. Seitz, Ira Kemelmacher-Shlizerman [pdf] [supp] [arXiv] [bibtex]

Interpretable Social Anchors for Human Trajectory Forecasting in Crowds Parth Kothari, Brian Sifringer, Alexandre Alahi [pdf] [arXiv] [bibtex]

Trajectory Prediction With Latent Belief Energy-Based Model Bo Pang, Tianyang Zhao, Xu Xie, Ying Nian Wu [pdf] [supp] [arXiv] [bibtex]

Metadata Normalization Mandy Lu, Qingyu Zhao, Jiequan Zhang, Kilian M. Pohl, Li Fei-Fei, Juan Carlos Niebles, Ehsan Adeli [pdf] [arXiv] [bibtex]

Multi-Objective Interpolation Training for Robustness To Label Noise Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness [pdf] [arXiv] [bibtex]

PhySG: Inverse Rendering With Spherical Gaussians for Physics-Based Material Editing and Relighting Kai Zhang, Fujun Luan, Qianqian Wang, Kavita Bala, Noah Snavely [pdf] [arXiv] [bibtex]

Predator: Registration of 3D Point Clouds With Low Overlap Shengyu Huang, Zan Gojcic, Mikhail Usvyatsov, Andreas Wieser, Konrad Schindler [pdf] [supp] [arXiv] [bibtex]

Hierarchical Motion Understanding via Motion Programs Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu [pdf] [arXiv] [bibtex]

Neural Side-by-Side: Predicting Human Preferences for No-Reference Super-Resolution Evaluation Valentin Khrulkov, Artem Babenko [pdf] [bibtex]

Coordinate Attention for Efficient Mobile Network Design Qibin Hou, Daquan Zhou, Jiashi Feng [pdf] [arXiv] [bibtex]

Stylized Neural Painting Zhengxia Zou, Tianyang Shi, Shuang Qiu, Yi Yuan, Zhenwei Shi [pdf] [supp] [arXiv] [bibtex]

Image Change Captioning by Learning From an Auxiliary Task Mehrdad Hosseinzadeh, Yang Wang [pdf] [bibtex]

Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe [pdf] [supp] [arXiv] [bibtex]

Discriminative Appearance Modeling With Multi-Track Pooling for Real-Time Multi-Object Tracking Chanho Kim, Li Fuxin, Mazen Alotaibi, James M. Rehg [pdf] [supp] [arXiv] [bibtex]

LASR: Learning Articulated Shape Reconstruction From a Monocular Video Gengshan Yang, Deqing Sun, Varun Jampani, Daniel Vlasic, Forrester Cole, Huiwen Chang, Deva Ramanan, William T. Freeman, Ce Liu [pdf] [supp] [arXiv] [bibtex]

FVC: A New Framework Towards Deep Video Compression in Feature Space Zhihao Hu, Guo Lu, Dong Xu [pdf] [arXiv] [bibtex]

Exponential Moving Average Normalization for Self-Supervised and Semi-Supervised Learning Zhaowei Cai, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Zhuowen Tu, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]

Confluent Vessel Trees With Accurate Bifurcations Zhongwen Zhang, Dmitrii Marin, Maria Drangova, Yuri Boykov [pdf] [supp] [arXiv] [bibtex]

Intentonomy: A Dataset and Study Towards Human Intent Understanding Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim [pdf] [supp] [arXiv] [bibtex]

End-to-End Rotation Averaging With Multi-Source Propagation Luwei Yang, Heng Li, Jamal Ahmed Rahim, Zhaopeng Cui, Ping Tan [pdf] [supp] [bibtex]

Controllable Image Restoration for Under-Display Camera in Smartphones Kinam Kwon, Eunhee Kang, Sangwon Lee, Su-Jin Lee, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han [pdf] [supp] [bibtex]

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma [pdf] [supp] [arXiv] [bibtex]

Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie [pdf] [arXiv] [bibtex]

NewtonianVAE: Proportional Control and Goal Identification From Pixels via Physical Latent Spaces Miguel Jaques, Michael Burke, Timothy M. Hospedales [pdf] [arXiv] [bibtex]

Auto-Exposure Fusion for Single-Image Shadow Removal Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang [pdf] [arXiv] [bibtex]

Anticipating Human Actions by Correlating Past With the Future With Jaccard Similarity Measures Basura Fernando, Samitha Herath [pdf] [supp] [arXiv] [bibtex]

LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces From Video Using Pose and Lighting Normalization Avisek Lahiri, Vivek Kwatra, Christian Frueh, John Lewis, Chris Bregler [pdf] [supp] [bibtex]

Simpler Certified Radius Maximization by Propagating Covariances Xingjian Zhen, Rudrasis Chakraborty, Vikas Singh [pdf] [supp] [arXiv] [bibtex]

A 3D GAN for Improved Large-Pose Facial Recognition Richard T. Marriott, Sami Romdhani, Liming Chen [pdf] [arXiv] [bibtex]

Repopulating Street Scenes Yifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely [pdf] [supp] [arXiv] [bibtex]

ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring Dongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li [pdf] [arXiv] [bibtex]

Unsupervised Object Detection With LIDAR Clues Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu [pdf] [supp] [arXiv] [bibtex]

TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking N Dinesh Reddy, Laurent Guigues, Leonid Pishchulin, Jayan Eledath, Srinivasa G. Narasimhan [pdf] [supp] [bibtex]

HVPR: Hybrid Voxel-Point Representation for Single-Stage 3D Object Detection Jongyoun Noh, Sanghoon Lee, Bumsub Ham [pdf] [supp] [arXiv] [bibtex]

SOE-Net: A Self-Attention and Orientation Encoding Network for Point Cloud Based Place Recognition Yan Xia, Yusheng Xu, Shuang Li, Rui Wang, Juan Du, Daniel Cremers, Uwe Stilla [pdf] [bibtex]

Controlling the Rain: From Removal to Rendering Siqi Ni, Xueyun Cao, Tao Yue, Xuemei Hu [pdf] [supp] [bibtex]

KeypointDeformer: Unsupervised 3D Keypoint Discovery for Shape Control Tomas Jakab, Richard Tucker, Ameesh Makadia, Jiajun Wu, Noah Snavely, Angjoo Kanazawa [pdf] [supp] [arXiv] [bibtex]

A2-FPN: Attention Aggregation Based Feature Pyramid Network for Instance Segmentation Miao Hu, Yali Li, Lu Fang, Shengjin Wang [pdf] [supp] [bibtex]

Quasi-Dense Similarity Learning for Multiple Object Tracking Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu [pdf] [supp] [arXiv] [bibtex]

Simultaneously Localize, Segment and Rank the Camouflaged Objects Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan [pdf] [supp] [arXiv] [bibtex]

Hybrid Message Passing With Performance-Driven Structures for Facial Action Unit Detection Tengfei Song, Zijun Cui, Wenming Zheng, Qiang Ji [pdf] [supp] [bibtex]

Distilling Object Detectors via Decoupled Features Jianyuan Guo, Kai Han, Yunhe Wang, Han Wu, Xinghao Chen, Chunjing Xu, Chang Xu [pdf] [supp] [arXiv] [bibtex]

Roof-GAN: Learning To Generate Roof Geometry and Relations for Residential Houses Yiming Qian, Hao Zhang, Yasutaka Furukawa [pdf] [supp] [bibtex]

No Shadow Left Behind: Removing Objects and Their Shadows Using Approximate Lighting and Geometry Edward Zhang, Ricardo Martin-Brualla, Janne Kontkanen, Brian L. Curless [pdf] [supp] [bibtex]

NetAdaptV2: Efficient Neural Architecture Search With Fast Super-Network Training and Architecture Optimization Tien-Ju Yang, Yi-Lun Liao, Vivienne Sze [pdf] [supp] [arXiv] [bibtex]

PhD Learning: Learning With Pompeiu-Hausdorff Distances for Video-Based Vehicle Re-Identification Jianan Zhao, Fengliang Qi, Guangyu Ren, Lin Xu [pdf] [supp] [bibtex]

DeepVideoMVS: Multi-View Stereo on Video With Recurrent Spatio-Temporal Fusion Arda Duzceker, Silvano Galliani, Christoph Vogel, Pablo Speciale, Mihai Dusmanu, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]

Saliency-Guided Image Translation Lai Jiang, Mai Xu, Xiaofei Wang, Leonid Sigal [pdf] [supp] [bibtex]

Weakly Supervised Learning of Rigid 3D Scene Flow Zan Gojcic, Or Litany, Andreas Wieser, Leonidas J. Guibas, Tolga Birdal [pdf] [supp] [arXiv] [bibtex]

InverseForm: A Loss Function for Structured Boundary-Aware Segmentation Shubhankar Borse, Ying Wang, Yizhe Zhang, Fatih Porikli [pdf] [supp] [arXiv] [bibtex]

Towards Accurate Text-Based Image Captioning With Content Diversity Exploration Guanghui Xu, Shuaicheng Niu, Mingkui Tan, Yucheng Luo, Qing Du, Qi Wu [pdf] [supp] [arXiv] [bibtex]

Learning Placeholders for Open-Set Recognition Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan [pdf] [supp] [arXiv] [bibtex]

CodedStereo: Learned Phase Masks for Large Depth-of-Field Stereo Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan [pdf] [supp] [arXiv] [bibtex]

More Photos Are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song [pdf] [supp] [arXiv] [bibtex]

Unsupervised Hyperbolic Representation Learning via Message Passing Auto-Encoders Jiwoong Park, Junho Cho, Hyung Jin Chang, Jin Young Choi [pdf] [supp] [arXiv] [bibtex]

Retinex-Inspired Unrolling With Cooperative Prior Architecture Search for Low-Light Image Enhancement Risheng Liu, Long Ma, Jiaao Zhang, Xin Fan, Zhongxuan Luo [pdf] [supp] [arXiv] [bibtex]

Relevance-CAM: Your Model Already Knows Where To Look Jeong Ryong Lee, Sewon Kim, Inyong Park, Taejoon Eo, Dosik Hwang [pdf] [supp] [bibtex]

Boundary IoU: Improving Object-Centric Image Segmentation Evaluation Bowen Cheng, Ross Girshick, Piotr Dollar, Alexander C. Berg, Alexander Kirillov [pdf] [supp] [arXiv] [bibtex]

KeepAugment: A Simple Information-Preserving Data Augmentation Approach Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, Qiang Liu [pdf] [arXiv] [bibtex]

On Robustness and Transferability of Convolutional Neural Networks Josip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D'Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic [pdf] [supp] [arXiv] [bibtex]

POSEFusion: Pose-Guided Selective Fusion for Single-View Human Volumetric Capture Zhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu [pdf] [supp] [arXiv] [bibtex]

Exploring Adversarial Fake Images on Face Manifold Dongze Li, Wei Wang, Hongxing Fan, Jing Dong [pdf] [arXiv] [bibtex]

Reinforced Attention for Few-Shot Learning and Beyond Jie Hong, Pengfei Fang, Weihao Li, Tong Zhang, Christian Simon, Mehrtash Harandi, Lars Petersson [pdf] [supp] [arXiv] [bibtex]

HOTR: End-to-End Human-Object Interaction Detection With Transformers Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, Hyunwoo J. Kim [pdf] [supp] [arXiv] [bibtex]

Deep Video Matting via Spatio-Temporal Alignment and Aggregation Yanan Sun, Guanzhi Wang, Qiao Gu, Chi-Keung Tang, Yu-Wing Tai [pdf] [supp] [arXiv] [bibtex]

Triple-Cooperative Video Shadow Detection Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin [pdf] [arXiv] [bibtex]

Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation Guo-Sen Xie, Jie Liu, Huan Xiong, Ling Shao [pdf] [bibtex]

Continuous Face Aging via Self-Estimated Residual Age Embedding Zeqi Li, Ruowei Jiang, Parham Aarabi [pdf] [supp] [arXiv] [bibtex]

Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline Lingzhi He, Hongguang Zhu, Feng Li, Huihui Bai, Runmin Cong, Chunjie Zhang, Chunyu Lin, Meiqin Liu, Yao Zhao [pdf] [supp] [bibtex]

Jigsaw Clustering for Unsupervised Visual Representation Learning Pengguang Chen, Shu Liu, Jiaya Jia [pdf] [supp] [arXiv] [bibtex]

DI-Fusion: Online Implicit 3D Reconstruction With Deep Priors Jiahui Huang, Shi-Sheng Huang, Haoxuan Song, Shi-Min Hu [pdf] [supp] [bibtex]

Square Root Bundle Adjustment for Large-Scale Reconstruction Nikolaus Demmel, Christiane Sommer, Daniel Cremers, Vladyslav Usenko [pdf] [supp] [arXiv] [bibtex]

PatchMatch-Based Neighborhood Consensus for Semantic Correspondence Jae Yong Lee, Joseph DeGol, Victor Fragoso, Sudipta N. Sinha [pdf] [supp] [bibtex]

Representative Forgery Mining for Fake Face Detection Chengrui Wang, Weihong Deng [pdf] [arXiv] [bibtex]

Look Closer To Segment Better: Boundary Patch Refinement for Instance Segmentation Chufeng Tang, Hang Chen, Xiao Li, Jianmin Li, Zhaoxiang Zhang, Xiaolin Hu [pdf] [supp] [arXiv] [bibtex]

Adaptive Class Suppression Loss for Long-Tail Object Detection Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang [pdf] [arXiv] [bibtex]

ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References Yannan He, Anqi Pang, Xin Chen, Han Liang, Minye Wu, Yuexin Ma, Lan Xu [pdf] [arXiv] [bibtex]

Automated Log-Scale Quantization for Low-Cost Deep Neural Networks Sangyun Oh, Hyeonuk Sim, Sugil Lee, Jongeun Lee [pdf] [supp] [bibtex]

Hallucination Improves Few-Shot Object Detection Weilin Zhang, Yu-Xiong Wang [pdf] [supp] [arXiv] [bibtex]

Efficient Conditional GAN Transfer With Knowledge Propagation Across Classes Mohamad Shahbazi, Zhiwu Huang, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Fully Convolutional Scene Graph Generation Hengyue Liu, Ning Yan, Masood Mortazavi, Bir Bhanu [pdf] [supp] [arXiv] [bibtex]

Crossing Cuts Polygonal Puzzles: Models and Solvers Peleg Harel, Ohad Ben-Shahar [pdf] [supp] [bibtex]

Graph-Based High-Order Relation Modeling for Long-Term Action Recognition Jiaming Zhou, Kun-Yu Lin, Haoxin Li, Wei-Shi Zheng [pdf] [supp] [bibtex]

Positive-Unlabeled Data Purification in the Wild for Object Detection Jianyuan Guo, Kai Han, Han Wu, Chao Zhang, Xinghao Chen, Chunjing Xu, Chang Xu, Yunhe Wang [pdf] [bibtex]

ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo [pdf] [supp] [arXiv] [bibtex]

Network Quantization With Element-Wise Gradient Scaling Junghyup Lee, Dohyung Kim, Bumsub Ham [pdf] [arXiv] [bibtex]

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation Vitor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner [pdf] [arXiv] [bibtex]

Sparse Multi-Path Corrections in Fringe Projection Profilometry Yu Zhang, Daniel Lau, David Wipf [pdf] [bibtex]

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi [pdf] [supp] [bibtex]

Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder Tal Daniel, Aviv Tamar [pdf] [supp] [bibtex]

Energy-Based Learning for Scene Graph Generation Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni, Leonid Sigal [pdf] [supp] [arXiv] [bibtex]

Zillow Indoor Dataset: Annotated Floor Plans With 360° Panoramas and 3D Room Layouts Steve Cruz, Will Hutchcroft, Yuguang Li, Naji Khosravan, Ivaylo Boyadzhiev, Sing Bing Kang [pdf] [supp] [bibtex]

Progressive Contour Regression for Arbitrary-Shape Scene Text Detection Pengwen Dai, Sanyi Zhang, Hua Zhang, Xiaochun Cao [pdf] [bibtex]

UV-Net: Learning From Boundary Representations Pradeep Kumar Jayaraman, Aditya Sanghi, Joseph G. Lambourne, Karl D.D. Willis, Thomas Davies, Hooman Shayani, Nigel Morris [pdf] [supp] [bibtex]

MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation Sanjay Kariyappa, Atul Prakash, Moinuddin K Qureshi [pdf] [supp] [arXiv] [bibtex]

Universal Spectral Adversarial Attacks for Deformable Shapes Arianna Rampini, Franco Pestarini, Luca Cosmo, Simone Melzi, Emanuele Rodola [pdf] [supp] [arXiv] [bibtex]

Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli [pdf] [supp] [arXiv] [bibtex]

HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu [pdf] [supp] [arXiv] [bibtex]

Human De-Occlusion: Invisible Perception and Recovery for Humans Qiang Zhou, Shiyin Wang, Yitong Wang, Zilong Huang, Xinggang Wang [pdf] [supp] [bibtex]

The Neural Tangent Link Between CNN Denoisers and Non-Local Filters Julian Tachella, Junqi Tang, Mike Davies [pdf] [arXiv] [bibtex]

Achieving Robustness in Classification Using Optimal Transport With Hinge Regularization Mathieu Serrurier, Franck Mamalet, Alberto Gonzalez-Sanz, Thibaut Boissin, Jean-Michel Loubes, Eustasio del Barrio [pdf] [bibtex]

Stochastic Image-to-Video Synthesis Using cINNs Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]

Ego-Exo: Transferring Visual Representations From Third-Person to First-Person Videos Yanghao Li, Tushar Nagarajan, Bo Xiong, Kristen Grauman [pdf] [supp] [bibtex]

Dynamic Slimmable Network Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang [pdf] [supp] [arXiv] [bibtex]

Jo-SRC: A Contrastive Approach for Combating Noisy Labels Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang [pdf] [bibtex]

Deep Lucas-Kanade Homography for Multimodal Image Alignment Yiming Zhao, Xinming Huang, Ziming Zhang [pdf] [arXiv] [bibtex]

clDice - A Novel Topology-Preserving Loss Function for Tubular Structure Segmentation Suprosanna Shit, Johannes C. Paetzold, Anjany Sekuboyina, Ivan Ezhov, Alexander Unger, Andrey Zhylka, Josien P. W. Pluim, Ulrich Bauer, Bjoern H. Menze [pdf] [supp] [bibtex]

Hyper-LifelongGAN: Scalable Lifelong Learning for Image Conditioned Generation Mengyao Zhai, Lei Chen, Greg Mori [pdf] [bibtex]

Semi-Supervised Synthesis of High-Resolution Editable Textures for 3D Humans Bindita Chaudhuri, Nikolaos Sarafianos, Linda Shapiro, Tony Tung [pdf] [supp] [arXiv] [bibtex]

CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback Seungmin Lee, Dongwan Kim, Bohyung Han [pdf] [supp] [bibtex]

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers Antoine Miech, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Andrew Zisserman [pdf] [arXiv] [bibtex]

RGB-D Local Implicit Function for Depth Completion of Transparent Objects Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox [pdf] [supp] [bibtex]

Fingerspelling Detection in American Sign Language Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu [pdf] [supp] [arXiv] [bibtex]

Uncertainty Reduction for Model Adaptation in Semantic Segmentation Prabhu Teja S, Francois Fleuret [pdf] [supp] [bibtex]

Learning Triadic Belief Dynamics in Nonverbal Communication From Videos Lifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, Yixin Zhu [pdf] [supp] [arXiv] [bibtex]

Temporal Modulation Network for Controllable Space-Time Video Super-Resolution Gang Xu, Jun Xu, Zhen Li, Liang Wang, Xing Sun, Ming-Ming Cheng [pdf] [supp] [arXiv] [bibtex]

Zero-Shot Single Image Restoration Through Controlled Perturbation of Koschmieder's Model Aupendu Kar, Sobhan Kanti Dhara, Debashis Sen, Prabir Kumar Biswas [pdf] [supp] [bibtex]

Uncertainty-Aware Camera Pose Estimation From Points and Lines Alexander Vakhitov, Luis Ferraz, Antonio Agudo, Francesc Moreno-Noguer [pdf] [supp] [bibtex]

Temporal Context Aggregation Network for Temporal Action Proposal Refinement Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang [pdf] [arXiv] [bibtex]

Information-Theoretic Segmentation by Inpainting Error Maximization Pedro Savarese, Sunnie S. Y. Kim, Michael Maire, Greg Shakhnarovich, David McAllester [pdf] [supp] [arXiv] [bibtex]

Adaptive Prototype Learning and Allocation for Few-Shot Segmentation Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim [pdf] [supp] [bibtex]

RefineMask: Towards High-Quality Instance Segmentation With Fine-Grained Features Gang Zhang, Xin Lu, Jingru Tan, Jianmin Li, Zhaoxiang Zhang, Quanquan Li, Xiaolin Hu [pdf] [supp] [arXiv] [bibtex]

DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation Xiong Zhang, Hongmin Xu, Hong Mo, Jianchao Tan, Cheng Yang, Lei Wang, Wenqi Ren [pdf] [supp] [arXiv] [bibtex]

Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim [pdf] [supp] [bibtex]

DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation Yufan He, Dong Yang, Holger Roth, Can Zhao, Daguang Xu [pdf] [arXiv] [bibtex]

Im2Vec: Synthesizing Vector Graphics Without Vector Supervision Pradyumna Reddy, Michael Gharbi, Michal Lukac, Niloy J. Mitra [pdf] [supp] [arXiv] [bibtex]

Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing Yuanyuan Yuan, Shuai Wang, Mingyue Jiang, Tsong Yueh Chen [pdf] [supp] [bibtex]

Unsupervised Part Segmentation Through Disentangling Appearance and Shape Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu [pdf] [supp] [arXiv] [bibtex]

Adversarial Imaging Pipelines Buu Phan, Fahim Mannan, Felix Heide [pdf] [supp] [arXiv] [bibtex]

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning Abulikemu Abuduweili, Xingjian Li, Humphrey Shi, Cheng-Zhong Xu, Dejing Dou [pdf] [supp] [arXiv] [bibtex]

GANmut: Learning Interpretable Conditional Space for Gamut of Emotions Stefano d'Apolito, Danda Pani Paudel, Zhiwu Huang, Andres Romero, Luc Van Gool [pdf] [supp] [bibtex]

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation Zongze Wu, Dani Lischinski, Eli Shechtman [pdf] [supp] [arXiv] [bibtex]

Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation Zhengxiong Luo, Zhicheng Wang, Yan Huang, Liang Wang, Tieniu Tan, Erjin Zhou [pdf] [arXiv] [bibtex]

From Semantic Categories to Fixations: A Novel Weakly-Supervised Visual-Auditory Saliency Detection Approach Guotao Wang, Chenglizhao Chen, Deng-Ping Fan, Aimin Hao, Hong Qin [pdf] [bibtex]

High-Fidelity Face Tracking for AR/VR via Deep Lighting Adaptation Lele Chen, Chen Cao, Fernando De la Torre, Jason Saragih, Chenliang Xu, Yaser Sheikh [pdf] [supp] [arXiv] [bibtex]

Mixed-Privacy Forgetting in Deep Networks Aditya Golatkar, Alessandro Achille, Avinash Ravichandran, Marzia Polito, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]

TediGAN: Text-Guided Diverse Face Image Generation and Manipulation Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu [pdf] [arXiv] [bibtex]

Affective Processes: Stochastic Modelling of Temporal Context for Emotion and Facial Expression Recognition Enrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos [pdf] [supp] [arXiv] [bibtex]

ID-Unet: Iterative Soft and Hard Deformation for View Synthesis Mingyu Yin, Li Sun, Qingli Li [pdf] [bibtex]

Positional Encoding As Spatial Inductive Bias in GANs Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy [pdf] [supp] [arXiv] [bibtex]

Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging Ilya Chugunov, Seung-Hwan Baek, Qiang Fu, Wolfgang Heidrich, Felix Heide [pdf] [supp] [bibtex]

QPP: Real-Time Quantization Parameter Prediction for Deep Neural Networks Vladimir Kryzhanovskiy, Gleb Balitskiy, Nikolay Kozyrskiy, Aleksandr Zuruev [pdf] [supp] [bibtex]

Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects Aashish Sharma, Robby T. Tan [pdf] [bibtex]

Self-Supervised Augmentation Consistency for Adapting Semantic Segmentation Nikita Araslanov, Stefan Roth [pdf] [supp] [arXiv] [bibtex]

Patch-VQ: 'Patching Up' the Video Quality Problem Zhenqiang Ying, Maniratnam Mandal, Deepti Ghadiyaram, Alan Bovik [pdf] [supp] [bibtex]

Double Low-Rank Representation With Projection Distance Penalty for Clustering Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang [pdf] [supp] [bibtex]

Towards High Fidelity Face Relighting With Realistic Shadows Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu [pdf] [supp] [arXiv] [bibtex]

Multi-View Multi-Person 3D Pose Estimation With Plane Sweep Stereo Jiahao Lin, Gim Hee Lee [pdf] [arXiv] [bibtex]

Fusing the Old with the New: Learning Relative Camera Pose with Geometry-Guided Uncertainty Bingbing Zhuang, Manmohan Chandraker [pdf] [supp] [arXiv] [bibtex]

CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning Chen Wei, Kihyuk Sohn, Clayton Mellina, Alan Yuille, Fan Yang [pdf] [supp] [arXiv] [bibtex]

Towards Diverse Paragraph Captioning for Untrimmed Videos Yuqing Song, Shizhe Chen, Qin Jin [pdf] [supp] [arXiv] [bibtex]

FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation Yair Kittenplon, Yonina C. Eldar, Dan Raviv [pdf] [arXiv] [bibtex]

Adversarial Robustness Across Representation Spaces Pranjal Awasthi, George Yu, Chun-Sung Ferng, Andrew Tomkins, Da-Cheng Juan [pdf] [supp] [arXiv] [bibtex]

MagDR: Mask-Guided Detection and Reconstruction for Defending Deepfakes Zhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Bo Zhang [pdf] [arXiv] [bibtex]

Neural Deformation Graphs for Globally-Consistent Non-Rigid Reconstruction Aljaz Bozic, Pablo Palafox, Michael Zollhofer, Justus Thies, Angela Dai, Matthias Niessner [pdf] [supp] [arXiv] [bibtex]

Fostering Generalization in Single-View 3D Reconstruction by Learning a Hierarchy of Local and Global Shape Priors Jan Bechtold, Maxim Tatarchenko, Volker Fischer, Thomas Brox [pdf] [supp] [arXiv] [bibtex]

Progressive Semantic-Aware Style Transformation for Blind Face Restoration Chaofeng Chen, Xiaoming Li, Lingbo Yang, Xianhui Lin, Lei Zhang, Kwan-Yee K. Wong [pdf] [supp] [arXiv] [bibtex]

Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association Peisong Wen, Qianqian Xu, Yangbangyan Jiang, Zhiyong Yang, Yuan He, Qingming Huang [pdf] [supp] [arXiv] [bibtex]

Invertible Image Signal Processing Yazhou Xing, Zian Qian, Qifeng Chen [pdf] [supp] [arXiv] [bibtex]

Lighting, Reflectance and Geometry Estimation From 360° Panoramic Stereo Junxuan Li, Hongdong Li, Yasuyuki Matsushita [pdf] [bibtex]

Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation Dohun Lim, Hyeonseok Lee, Sungchan Kim [pdf] [supp] [arXiv] [bibtex]

NeX: Real-Time View Synthesis With Neural Basis Expansion Suttisak Wizadwongsa, Pakkapon Phongthawee, Jiraphon Yenphraphai, Supasorn Suwajanakorn [pdf] [supp] [arXiv] [bibtex]

DAT: Training Deep Networks Robust To Label-Noise by Matching the Feature Distributions Yuntao Qu, Shasha Mo, Jianwei Niu [pdf] [supp] [bibtex]

Repetitive Activity Counting by Sight and Sound Yunhua Zhang, Ling Shao, Cees G. M. Snoek [pdf] [supp] [arXiv] [bibtex]

PointGuard: Provably Robust 3D Point Cloud Classification Hongbin Liu, Jinyuan Jia, Neil Zhenqiang Gong [pdf] [supp] [arXiv] [bibtex]

Unsupervised Multi-Source Domain Adaptation for Person Re-Identification Zechen Bai, Zhigang Wang, Jian Wang, Di Hu, Errui Ding [pdf] [supp] [arXiv] [bibtex]

BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation Jungbeom Lee, Jihun Yi, Chaehun Shin, Sungroh Yoon [pdf] [supp] [arXiv] [bibtex]

Boosting Video Representation Learning With Multi-Faceted Integration Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Xiao-Ping Zhang, Dong Wu, Tao Mei [pdf] [bibtex]

Beyond Bounding-Box: Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection Zonghao Guo, Chang Liu, Xiaosong Zhang, Jianbin Jiao, Xiangyang Ji, Qixiang Ye [pdf] [bibtex]

3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management Tianyi Zhao, Kai Cao, Jiawen Yao, Isabella Nogues, Le Lu, Lingyun Huang, Jing Xiao, Zhaozheng Yin, Ling Zhang [pdf] [arXiv] [bibtex]

Protecting Intellectual Property of Generative Adversarial Networks From Ambiguity Attacks Ding Sheng Ong, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang [pdf] [supp] [arXiv] [bibtex]

End-to-End High Dynamic Range Camera Pipeline Optimization Nicolas Robidoux, Luis E. Garcia Capel, Dong-eun Seo, Avinash Sharma, Federico Ariza, Felix Heide [pdf] [supp] [bibtex]

Parser-Free Virtual Try-On via Distilling Appearance Flows Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo [pdf] [supp] [arXiv] [bibtex]

GIRAFFE: Representing Scenes As Compositional Generative Neural Feature Fields Michael Niemeyer, Andreas Geiger [pdf] [arXiv] [bibtex]

Single-Stage Instance Shadow Detection With Bidirectional Relation Learning Tianyu Wang, Xiaowei Hu, Chi-Wing Fu, Pheng-Ann Heng [pdf] [supp] [bibtex]

High-Speed Image Reconstruction Through Short-Term Plasticity for Spiking Cameras Yajing Zheng, Lingxiao Zheng, Zhaofei Yu, Boxin Shi, Yonghong Tian, Tiejun Huang [pdf] [supp] [bibtex]

Self-Supervised 3D Mesh Reconstruction From Single Images Tao Hu, Liwei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia [pdf] [supp] [bibtex]

Dual-GAN: Joint BVP and Noise Modeling for Remote Physiological Measurement Hao Lu, Hu Han, S. Kevin Zhou [pdf] [bibtex]

Audio-Visual Instance Discrimination with Cross-Modal Agreement Pedro Morgado, Nuno Vasconcelos, Ishan Misra [pdf] [supp] [arXiv] [bibtex]

Combined Depth Space Based Architecture Search for Person Re-Identification Hanjun Li, Gaojie Wu, Wei-Shi Zheng [pdf] [supp] [arXiv] [bibtex]

Rethinking BiSeNet for Real-Time Semantic Segmentation Mingyuan Fan, Shenqi Lai, Junshi Huang, Xiaoming Wei, Zhenhua Chai, Junfeng Luo, Xiaolin Wei [pdf] [arXiv] [bibtex]

The Spatially-Correlative Loss for Various Image Translation Tasks Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai [pdf] [arXiv] [bibtex]

Learning To Restore Hazy Video: A New Real-World Dataset and a New Method Xinyi Zhang, Hang Dong, Jinshan Pan, Chao Zhu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Fei Wang [pdf] [supp] [bibtex]

DyGLIP: A Dynamic Graph Model With Link Prediction for Accurate Multi-Camera Multiple Object Tracking Kha Gia Quach, Pha Nguyen, Huu Le, Thanh-Dat Truong, Chi Nhan Duong, Minh-Triet Tran, Khoa Luu [pdf] [supp] [arXiv] [bibtex]

Towards Efficient Tensor Decomposition-Based DNN Model Compression With Optimization Framework Miao Yin, Yang Sui, Siyu Liao, Bo Yuan [pdf] [supp] [bibtex]

User-Guided Line Art Flat Filling With Split Filling Mechanism Lvmin Zhang, Chengze Li, Edgar Simo-Serra, Yi Ji, Tien-Tsin Wong, Chunping Liu [pdf] [bibtex]

Restore From Restored: Video Restoration With Pseudo Clean Video Seunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim [pdf] [arXiv] [bibtex]

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion Shi Qiu, Saeed Anwar, Nick Barnes [pdf] [supp] [arXiv] [bibtex]

Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection Qize Yang, Xihan Wei, Biao Wang, Xian-Sheng Hua, Lei Zhang [pdf] [bibtex]

DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bin Jiang, Wei Liu [pdf] [supp] [arXiv] [bibtex]

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani [pdf] [supp] [arXiv] [bibtex]

KSM: Fast Multiple Task Adaption via Kernel-Wise Soft Mask Learning Li Yang, Zhezhi He, Junshan Zhang, Deliang Fan [pdf] [arXiv] [bibtex]

Rich Context Aggregation With Reflection Prior for Glass Surface Detection Jiaying Lin, Zebang He, Rynson W.H. Lau [pdf] [bibtex]

Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization Aysim Toker, Qunjie Zhou, Maxim Maximov, Laura Leal-Taixe [pdf] [supp] [arXiv] [bibtex]

AutoInt: Automatic Integration for Fast Neural Volume Rendering David B. Lindell, Julien N. P. Martel, Gordon Wetzstein [pdf] [supp] [arXiv] [bibtex]

Pose-Guided Human Animation From a Single Image in the Wild Jae Shin Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, Hyun Soo Park, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]

Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression Chen Gao, Jinyu Chen, Si Liu, Luting Wang, Qiong Zhang, Qi Wu [pdf] [supp] [bibtex]

Equivariant Point Network for 3D Point Cloud Analysis Haiwei Chen, Shichen Liu, Weikai Chen, Hao Li, Randall Hill [pdf] [supp] [arXiv] [bibtex]

Learning Graph Embeddings for Compositional Zero-Shot Learning Muhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata [pdf] [supp] [arXiv] [bibtex]

NeRD: Neural 3D Reflection Symmetry Detector Yichao Zhou, Shichen Liu, Yi Ma [pdf] [supp] [arXiv] [bibtex]

Checkerboard Context Model for Efficient Learned Image Compression Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin [pdf] [supp] [arXiv] [bibtex]

Zero-Shot Adversarial Quantization Yuang Liu, Wei Zhang, Jun Wang [pdf] [arXiv] [bibtex]

Group Whitening: Balancing Learning Efficiency and Representational Capacity Lei Huang, Yi Zhou, Li Liu, Fan Zhu, Ling Shao [pdf] [supp] [arXiv] [bibtex]

Adversarial Robustness Under Long-Tailed Distribution Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, Dahua Lin [pdf] [supp] [arXiv] [bibtex]

HyperSeg: Patch-Wise Hypernetwork for Real-Time Semantic Segmentation Yuval Nirkin, Lior Wolf, Tal Hassner [pdf] [supp] [arXiv] [bibtex]

Augmentation Strategies for Learning With Noisy Labels Kento Nishi, Yi Ding, Alex Rich, Tobias Hollerer [pdf] [supp] [arXiv] [bibtex]

AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi [pdf] [supp] [arXiv] [bibtex]

ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic Xiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong [pdf] [arXiv] [bibtex]

Partition-Guided GANs Mohammadreza Armandpour, Ali Sadeghian, Chunyuan Li, Mingyuan Zhou [pdf] [supp] [arXiv] [bibtex]

GATSBI: Generative Agent-Centric Spatio-Temporal Object Interaction Cheol-Hui Min, Jinseok Bae, Junho Lee, Young Min Kim [pdf] [supp] [arXiv] [bibtex]

Privacy-Preserving Collaborative Learning With Automatic Transformation Search Wei Gao, Shangwei Guo, Tianwei Zhang, Han Qiu, Yonggang Wen, Yang Liu [pdf] [arXiv] [bibtex]

Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin [pdf] [bibtex]

Point Cloud Instance Segmentation Using Probabilistic Embeddings Biao Zhang, Peter Wonka [pdf] [supp] [arXiv] [bibtex]

pixelNeRF: Neural Radiance Fields From One or Few Images Alex Yu, Vickie Ye, Matthew Tancik, Angjoo Kanazawa [pdf] [supp] [arXiv] [bibtex]

Navigating the GAN Parameter Space for Semantic Image Editing Anton Cherepkov, Andrey Voynov, Artem Babenko [pdf] [supp] [arXiv] [bibtex]

Large-Capacity Image Steganography Based on Invertible Neural Networks Shao-Ping Lu, Rong Wang, Tao Zhong, Paul L. Rosin [pdf] [supp] [bibtex]

Exploiting Edge-Oriented Reasoning for 3D Point-Based Scene Graph Analysis Chaoyi Zhang, Jianhui Yu, Yang Song, Weidong Cai [pdf] [supp] [arXiv] [bibtex]

CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou [pdf] [supp] [arXiv] [bibtex]

MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition Shuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng [pdf] [arXiv] [bibtex]

Limitations of Post-Hoc Feature Alignment for Robustness Collin Burns, Jacob Steinhardt [pdf] [supp] [arXiv] [bibtex]

Every Annotation Counts: Multi-Label Deep Supervision for Medical Image Segmentation Simon Reiss, Constantin Seibold, Alexander Freytag, Erik Rodner, Rainer Stiefelhagen [pdf] [supp] [arXiv] [bibtex]

Roses Are Red, Violets Are Blue... but Should VQA Expect Them To? Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf [pdf] [supp] [arXiv] [bibtex]

FAPIS: A Few-Shot Anchor-Free Part-Based Instance Segmenter Khoi Nguyen, Sinisa Todorovic [pdf] [supp] [arXiv] [bibtex]

Disentangling Label Distribution for Long-Tailed Visual Recognition Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang [pdf] [supp] [arXiv] [bibtex]

Gradient Forward-Propagation for Large-Scale Temporal Video Modelling Mateusz Malinowski, Dimitrios Vytiniotis, Grzegorz Swirszcz, Viorica Patraucean, Joao Carreira [pdf] [supp] [bibtex]

Learning a Non-Blind Deblurring Network for Night Blurry Images Liang Chen, Jiawei Zhang, Jinshan Pan, Songnan Lin, Faming Fang, Jimmy S. Ren [pdf] [supp] [bibtex]

Differentiable Diffusion for Dense Depth Estimation From Multi-View Images Numair Khan, Min H. Kim, James Tompkin [pdf] [supp] [bibtex]

Deep Compositional Metric Learning Wenzhao Zheng, Chengkun Wang, Jiwen Lu, Jie Zhou [pdf] [bibtex]

Representing Videos As Discriminative Sub-Graphs for Action Recognition Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei [pdf] [bibtex]

AIFit: Automatic 3D Human-Interpretable Feedback Models for Fitness Training Mihai Fieraru, Mihai Zanfir, Silviu Cristian Pirlea, Vlad Olaru, Cristian Sminchisescu [pdf] [supp] [bibtex]

Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes Jiashun Wang, Huazhe Xu, Jingwei Xu, Sifei Liu, Xiaolong Wang [pdf] [supp] [arXiv] [bibtex]

How Well Do Self-Supervised Models Transfer? Linus Ericsson, Henry Gouk, Timothy M. Hospedales [pdf] [supp] [arXiv] [bibtex]

Understanding Object Dynamics for Interactive Image-to-Video Synthesis Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Bjorn Ommer [pdf] [supp] [bibtex]

Pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein [pdf] [supp] [bibtex]

Diverse Branch Block: Building a Convolution as an Inception-Like Unit Xiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding [pdf] [arXiv] [bibtex]

Post-Hoc Uncertainty Calibration for Domain Drift Scenarios Christian Tomani, Sebastian Gruber, Muhammed Ebrar Erdem, Daniel Cremers, Florian Buettner [pdf] [supp] [arXiv] [bibtex]

Slimmable Compressive Autoencoders for Practical Neural Image Compression Fei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov [pdf] [supp] [arXiv] [bibtex]

Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors Tao Yu, Zerong Zheng, Kaiwen Guo, Pengpeng Liu, Qionghai Dai, Yebin Liu [pdf] [supp] [arXiv] [bibtex]

LAU-Net: Latitude Adaptive Upscaling Network for Omnidirectional Image Super-Resolution Xin Deng, Hao Wang, Mai Xu, Yichen Guo, Yuhang Song, Li Yang [pdf] [bibtex]

UP-DETR: Unsupervised Pre-Training for Object Detection With Transformers Zhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen [pdf] [supp] [bibtex]

Self-Attention Based Text Knowledge Mining for Text Detection Qi Wan, Haoqin Ji, Linlin Shen [pdf] [supp] [bibtex]

Image De-Raining via Continual Learning Man Zhou, Jie Xiao, Yifan Chang, Xueyang Fu, Aiping Liu, Jinshan Pan, Zheng-Jun Zha [pdf] [bibtex]

Layer-Wise Searching for 1-Bit Detectors Sheng Xu, Junhe Zhao, Jinhu Lu, Baochang Zhang, Shumin Han, David Doermann [pdf] [bibtex]

Distilling Audio-Visual Knowledge by Compositional Contrastive Learning Yanbei Chen, Yongqin Xian, A. Sophia Koepke, Ying Shan, Zeynep Akata [pdf] [supp] [arXiv] [bibtex]

Unsupervised Visual Attention and Invariance for Reinforcement Learning Xudong Wang, Long Lian, Stella X. Yu [pdf] [supp] [arXiv] [bibtex]

CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement Noranart Vesdapunt, Baoyuan Wang [pdf] [supp] [arXiv] [bibtex]

Semantic Audio-Visual Navigation Changan Chen, Ziad Al-Halah, Kristen Grauman [pdf] [supp] [bibtex]

Humble Teachers Teach Better Students for Semi-Supervised Object Detection Yihe Tang, Weifeng Chen, Yijun Luo, Yuting Zhang [pdf] [supp] [bibtex]

One Shot Face Swapping on Megapixels Yuhao Zhu, Qi Li, Jian Wang, Cheng-Zhong Xu, Zhenan Sun [pdf] [supp] [arXiv] [bibtex]

CDFI: Compression-Driven Network Design for Frame Interpolation Tianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov [pdf] [arXiv] [bibtex]

PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds Mutian Xu, Runyu Ding, Hengshuang Zhao, Xiaojuan Qi [pdf] [supp] [arXiv] [bibtex]

End-to-End Object Detection With Fully Convolutional Network Jianfeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng [pdf] [supp] [arXiv] [bibtex]

Efficient Initial Pose-Graph Generation for Global SfM Daniel Barath, Dmytro Mishkin, Ivan Eichhardt, Ilia Shipachev, Jiri Matas [pdf] [supp] [arXiv] [bibtex]

Representative Batch Normalization With Feature Calibration Shang-Hua Gao, Qi Han, Duo Li, Ming-Ming Cheng, Pai Peng [pdf] [bibtex]

VarifocalNet: An IoU-Aware Dense Object Detector Haoyang Zhang, Ying Wang, Feras Dayoub, Niko Sunderhauf [pdf] [arXiv] [bibtex]

Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation Youngmin Oh, Beomjun Kim, Bumsub Ham [pdf] [supp] [arXiv] [bibtex]

Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution Chi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu [pdf] [supp] [arXiv] [bibtex]

Reducing Domain Gap by Reducing Style Bias Hyeonseob Nam, HyunJae Lee, Jongchan Park, Wonjun Yoon, Donggeun Yoo [pdf] [arXiv] [bibtex]

Efficient Regional Memory Network for Video Object Segmentation Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun [pdf] [arXiv] [bibtex]

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-Localization in Large Scenes From Body-Mounted Sensors Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll [pdf] [supp] [arXiv] [bibtex]

Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection Chenchen Zhu, Fangyi Chen, Uzair Ahmed, Zhiqiang Shen, Marios Savvides [pdf] [supp] [arXiv] [bibtex]

Online Multiple Object Tracking With Cross-Task Synergy Song Guo, Jingya Wang, Xinchao Wang, Dacheng Tao [pdf] [arXiv] [bibtex]

Discovering Relationships Between Object Categories via Universal Canonical Maps Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi [pdf] [supp] [bibtex]

Prior Based Human Completion Zibo Zhao, Wen Liu, Yanyu Xu, Xianing Chen, Weixin Luo, Lei Jin, Bohui Zhu, Tong Liu, Binqiang Zhao, Shenghua Gao [pdf] [supp] [bibtex]

Neural Response Interpretation Through the Lens of Critical Pathways Ashkan Khakzar, Soroosh Baselizadeh, Saurabh Khanduja, Christian Rupprecht, Seong Tae Kim, Nassir Navab [pdf] [supp] [arXiv] [bibtex]

Rethinking and Improving the Robustness of Image Style Transfer Pei Wang, Yijun Li, Nuno Vasconcelos [pdf] [supp] [arXiv] [bibtex]

FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang [pdf] [supp] [arXiv] [bibtex]

Cross-Domain Similarity Learning for Face Recognition in Unseen Domains Masoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker [pdf] [arXiv] [bibtex]

Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification Jiaxing Chen, Xinyang Jiang, Fudong Wang, Jun Zhang, Feng Zheng, Xing Sun, Wei-Shi Zheng [pdf] [supp] [bibtex]

Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources Pengyu Li, Biao Wang, Lei Zhang [pdf] [supp] [bibtex]

Multi-Person Implicit Reconstruction From a Single Image Armin Mustafa, Akin Caliskan, Lourdes Agapito, Adrian Hilton [pdf] [arXiv] [bibtex]

OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection Tingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling [pdf] [arXiv] [bibtex]

Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering Jungin Park, Jiyoung Lee, Kwanghoon Sohn [pdf] [arXiv] [bibtex]

Learning Compositional Radiance Fields of Dynamic Human Heads Ziyan Wang, Timur Bagautdinov, Stephen Lombardi, Tomas Simon, Jason Saragih, Jessica Hodgins, Michael Zollhofer [pdf] [supp] [arXiv] [bibtex]

Partial Person Re-Identification With Part-Part Correspondence Learning Tianyu He, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua [pdf] [supp] [bibtex]

Monte Carlo Scene Search for 3D Scene Understanding Shreyas Hampali, Sinisa Stekovic, Sayan Deb Sarkar, Chetan S. Kumar, Friedrich Fraundorfer, Vincent Lepetit [pdf] [supp] [arXiv] [bibtex]

Coarse-To-Fine Person Re-Identification With Auxiliary-Domain Classification and Second-Order Information Bottleneck Anguo Zhang, Yueming Gao, Yuzhen Niu, Wenxi Liu, Yongcheng Zhou [pdf] [supp] [bibtex]

Transformer Tracking Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu [pdf] [arXiv] [bibtex]

Structured Multi-Level Interaction Network for Video Moment Localization via Language Query Hao Wang, Zheng-Jun Zha, Liang Li, Dong Liu, Jiebo Luo [pdf] [bibtex]

Structured Scene Memory for Vision-Language Navigation Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen [pdf] [arXiv] [bibtex]

Unsupervised Pre-Training for Person Re-Identification Dengpan Fu, Dongdong Chen, Jianmin Bao, Hao Yang, Lu Yuan, Lei Zhang, Houqiang Li, Dong Chen [pdf] [supp] [arXiv] [bibtex]

Progressive Stage-Wise Learning for Unsupervised Feature Representation Enhancement Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao [pdf] [bibtex]

Domain-Specific Suppression for Adaptive Object Detection Yu Wang, Rui Zhang, Shuo Zhang, Miao Li, Yangyang Xia, Xishan Zhang, Shaoli Liu [pdf] [arXiv] [bibtex]

Few-Shot Object Detection via Classification Refinement and Distractor Retreatment Yiting Li, Haiyue Zhu, Yu Cheng, Wenxin Wang, Chek Sing Teo, Cheng Xiang, Prahlad Vadakkepat, Tong Heng Lee [pdf] [supp] [bibtex]

D2IM-Net: Learning Detail Disentangled Implicit Fields From Single Images Manyi Li, Hao Zhang [pdf] [bibtex]

Not Just Compete, but Collaborate: Local Image-to-Image Translation via Cooperative Mask Prediction Daejin Kim, Mohammad Azam Khan, Jaegul Choo [pdf] [bibtex]

Behavior-Driven Synthesis of Human Dynamics Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]

GAIA: A Transfer Learning System of Object Detection That Fits Your Needs Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang [pdf] [supp] [bibtex]

IronMask: Modular Architecture for Protecting Deep Face Template Sunpill Kim, Yunseong Jeong, Jinsu Kim, Jungkon Kim, Hyung Tae Lee, Jae Hong Seo [pdf] [supp] [arXiv] [bibtex]

Learning To Recommend Frame for Interactive Video Object Segmentation in the Wild Zhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao [pdf] [supp] [arXiv] [bibtex]

DSRNA: Differentiable Search of Robust Neural Architectures Ramtin Hosseini, Xingyi Yang, Pengtao Xie [pdf] [arXiv] [bibtex]

Reconstructing 3D Human Pose by Watching Humans in the Mirror Qi Fang, Qing Shuai, Junting Dong, Hujun Bao, Xiaowei Zhou [pdf] [arXiv] [bibtex]

Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream Jing Zhao, Ruiqin Xiong, Hangfan Liu, Jian Zhang, Tiejun Huang [pdf] [supp] [bibtex]

MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation Hansheng Chen, Yuyao Huang, Wei Tian, Zhong Gao, Lu Xiong [pdf] [supp] [arXiv] [bibtex]

Complete & Label: A Domain Adaptation Approach to Semantic Segmentation of LiDAR Point Clouds Li Yi, Boqing Gong, Thomas Funkhouser [pdf] [supp] [arXiv] [bibtex]

GMOT-40: A Benchmark for Generic Multiple Object Tracking Hexin Bai, Wensheng Cheng, Peng Chu, Juehuan Liu, Kai Zhang, Haibin Ling [pdf] [supp] [bibtex]

Few-Shot Image Generation via Cross-Domain Correspondence Utkarsh Ojha, Yijun Li, Jingwan Lu, Alexei A. Efros, Yong Jae Lee, Eli Shechtman, Richard Zhang [pdf] [supp] [arXiv] [bibtex]

Hierarchical Lovasz Embeddings for Proposal-Free Panoptic Segmentation Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon [pdf] [supp] [bibtex]

Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans Sida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, Xiaowei Zhou [pdf] [arXiv] [bibtex]

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting Lingbo Liu, Jiaqi Chen, Hefeng Wu, Guanbin Li, Chenglong Li, Liang Lin [pdf] [supp] [arXiv] [bibtex]

Weakly Supervised Video Salient Object Detection Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han [pdf] [supp] [arXiv] [bibtex]

Pixel-Wise Anomaly Detection in Complex Driving Scenes Giancarlo Di Biase, Hermann Blum, Roland Siegwart, Cesar Cadena [pdf] [supp] [arXiv] [bibtex]

Learning To Associate Every Segment for Video Panoptic Segmentation Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon [pdf] [bibtex]

Variational Transformer Networks for Layout Generation Diego Martin Arroyo, Janis Postels, Federico Tombari [pdf] [supp] [arXiv] [bibtex]

Mitigating Face Recognition Bias via Group Adaptive Classifier Sixue Gong, Xiaoming Liu, Anil K. Jain [pdf] [supp] [arXiv] [bibtex]

A Peek Into the Reasoning of Neural Networks: Interpreting With Structural Visual Concepts Yunhao Ge, Yao Xiao, Zhi Xu, Meng Zheng, Srikrishna Karanam, Terrence Chen, Laurent Itti, Ziyan Wu [pdf] [supp] [arXiv] [bibtex]

Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations Zhihui Li, Lina Yao [pdf] [bibtex]

A Dual Iterative Refinement Method for Non-Rigid Shape Matching Rui Xiang, Rongjie Lai, Hongkai Zhao [pdf] [supp] [arXiv] [bibtex]

Image Super-Resolution With Non-Local Sparse Attention Yiqun Mei, Yuchen Fan, Yuqian Zhou [pdf] [supp] [bibtex]

3D Video Stabilization With Depth Estimation by CNN-Based Optimization Yao-Chih Lee, Kuan-Wei Tseng, Yu-Ta Chen, Chien-Cheng Chen, Chu-Song Chen, Yi-Ping Hung [pdf] [supp] [bibtex]

Predicting Human Scanpaths in Visual Question Answering Xianyu Chen, Ming Jiang, Qi Zhao [pdf] [supp] [bibtex]

DetectoRS: Detecting Objects With Recursive Feature Pyramid and Switchable Atrous Convolution Siyuan Qiao, Liang-Chieh Chen, Alan Yuille [pdf] [arXiv] [bibtex]

SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks Shunsuke Saito, Jinlong Yang, Qianli Ma, Michael J. Black [pdf] [supp] [arXiv] [bibtex]

Improving Accuracy of Binary Neural Networks Using Unbalanced Activation Distribution Hyungjun Kim, Jihoon Park, Changhun Lee, Jae-Joon Kim [pdf] [supp] [arXiv] [bibtex]

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Yuexin Ma, Wei Li, Hongsheng Li, Dahua Lin [pdf] [arXiv] [bibtex]

SMPLicit: Topology-Aware Generative Model for Clothed People Enric Corona, Albert Pumarola, Guillem Alenya, Gerard Pons-Moll, Francesc Moreno-Noguer [pdf] [supp] [arXiv] [bibtex]

Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu [pdf] [supp] [arXiv] [bibtex]

Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang [pdf] [arXiv] [bibtex]

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation Xing Shen, Jirui Yang, Chunbo Wei, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Xiaoliang Cheng, Kewei Liang [pdf] [bibtex]

Bridging the Visual Gap: Wide-Range Image Blending Chia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu [pdf] [supp] [arXiv] [bibtex]

A Realistic Evaluation of Semi-Supervised Learning for Fine-Grained Classification Jong-Chyi Su, Zezhou Cheng, Subhransu Maji [pdf] [supp] [arXiv] [bibtex]

Residential Floor Plan Recognition and Reconstruction Xiaolei Lv, Shengchu Zhao, Xinyang Yu, Binqiang Zhao [pdf] [supp] [bibtex]

Dynamic Domain Adaptation for Efficient Inference Shuang Li, JinMing Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li [pdf] [arXiv] [bibtex]

Regularization Strategy for Point Cloud via Rigidly Mixed Sample Dogyoon Lee, Jaeha Lee, Junhyeop Lee, Hyeongmin Lee, Minhyeok Lee, Sungmin Woo, Sangyoun Lee [pdf] [supp] [arXiv] [bibtex]

StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision Yang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu, Hujun Bao [pdf] [arXiv] [bibtex]

Unsupervised Multi-Source Domain Adaptation Without Access to Source Data Sk Miraj Ahmed, Dripta S. Raychaudhuri, Sujoy Paul, Samet Oymak, Amit K. Roy-Chowdhury [pdf] [supp] [arXiv] [bibtex]

On Semantic Similarity in Video Retrieval Michael Wray, Hazel Doughty, Dima Damen [pdf] [supp] [arXiv] [bibtex]

Few-Shot Open-Set Recognition by Transformation Consistency Minki Jeong, Seokeon Choi, Changick Kim [pdf] [supp] [arXiv] [bibtex]

Uncertainty-Guided Model Generalization to Unseen Domains Fengchun Qiao, Xi Peng [pdf] [supp] [arXiv] [bibtex]

Debiased Subjective Assessment of Real-World Image Enhancement Peibei Cao, Zhangyang Wang, Kede Ma [pdf] [bibtex]

Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search Kaicheng Yu, Rene Ranftl, Mathieu Salzmann [pdf] [supp] [arXiv] [bibtex]

Noise-Resistant Deep Metric Learning With Ranking-Based Instance Selection Chang Liu, Han Yu, Boyang Li, Zhiqi Shen, Zhanning Gao, Peiran Ren, Xuansong Xie, Lizhen Cui, Chunyan Miao [pdf] [supp] [arXiv] [bibtex]

Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation Hugo Germain, Vincent Lepetit, Guillaume Bourmaud [pdf] [supp] [arXiv] [bibtex]

Cross Modal Focal Loss for RGBD Face Anti-Spoofing Anjith George, Sebastien Marcel [pdf] [arXiv] [bibtex]

StickyPillars: Robust and Efficient Feature Matching on Point Clouds Using Graph Neural Networks Kai Fischer, Martin Simon, Florian Olsner, Stefan Milz, Horst-Michael Gross, Patrick Mader [pdf] [bibtex]

HoHoNet: 360 Indoor Holistic Understanding With Latent Horizontal Features Cheng Sun, Min Sun, Hwann-Tzong Chen [pdf] [supp] [arXiv] [bibtex]

Online Learning of a Probabilistic and Adaptive Scene Representation Zike Yan, Xin Wang, Hongbin Zha [pdf] [arXiv] [bibtex]

Domain Adaptation With Auxiliary Target Domain-Oriented Classifier Jian Liang, Dapeng Hu, Jiashi Feng [pdf] [arXiv] [bibtex]

Learning To Recover 3D Scene Shape From a Single Image Wei Yin, Jianming Zhang, Oliver Wang, Simon Niklaus, Long Mai, Simon Chen, Chunhua Shen [pdf] [supp] [arXiv] [bibtex]

Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes Zhengqi Li, Simon Niklaus, Noah Snavely, Oliver Wang [pdf] [supp] [arXiv] [bibtex]

FS-Net: Fast Shape-Based Network for Category-Level 6D Object Pose Estimation With Decoupled Rotation Mechanism Wei Chen, Xi Jia, Hyung Jin Chang, Jinming Duan, Linlin Shen, Ales Leonardis [pdf] [supp] [bibtex]

Unsupervised Human Pose Estimation Through Transforming Shape Templates Luca Schmidtke, Athanasios Vlontzos, Simon Ellershaw, Anna Lukens, Tomoki Arichi, Bernhard Kainz [pdf] [supp] [arXiv] [bibtex]

Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship Jing Wang, Jinhui Tang, Mingkun Yang, Xiang Bai, Jiebo Luo [pdf] [bibtex]

Cross-Iteration Batch Normalization Zhuliang Yao, Yue Cao, Shuxin Zheng, Gao Huang, Stephen Lin [pdf] [supp] [arXiv] [bibtex]

Multimodal Contrastive Training for Visual Representation Learning Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta [pdf] [arXiv] [bibtex]

3D Shape Generation With Grid-Based Implicit Functions Moritz Ibing, Isaak Lim, Leif Kobbelt [pdf] [supp] [bibtex]

Tangent Space Backpropagation for 3D Transformation Groups Zachary Teed, Jia Deng [pdf] [supp] [arXiv] [bibtex]

FAIEr: Fidelity and Adequacy Ensured Image Caption Evaluation Sijin Wang, Ziwei Yao, Ruiping Wang, Zhongqin Wu, Xilin Chen [pdf] [supp] [bibtex]

HLA-Face: Joint High-Low Adaptation for Low Light Face Detection Wenjing Wang, Wenhan Yang, Jiaying Liu [pdf] [supp] [bibtex]

Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions Navaneeth Bodla, Gaurav Shrivastava, Rama Chellappa, Abhinav Shrivastava [pdf] [supp] [bibtex]

From Rain Generation to Rain Removal Hong Wang, Zongsheng Yue, Qi Xie, Qian Zhao, Yefeng Zheng, Deyu Meng [pdf] [supp] [arXiv] [bibtex]

Few-Shot Classification With Feature Map Reconstruction Networks Davis Wertheimer, Luming Tang, Bharath Hariharan [pdf] [supp] [arXiv] [bibtex]

Object Classification From Randomized EEG Trials Hamad Ahmed, Ronnie B. Wilbur, Hari M. Bharadwaj, Jeffrey Mark Siskind [pdf] [supp] [arXiv] [bibtex]

Learning Monocular 3D Reconstruction of Articulated Categories From Motion Filippos Kokkinos, Iasonas Kokkinos [pdf] [supp] [arXiv] [bibtex]

De-Rendering the World's Revolutionary Artefacts Shangzhe Wu, Ameesh Makadia, Jiajun Wu, Noah Snavely, Richard Tucker, Angjoo Kanazawa [pdf] [supp] [bibtex]

Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow Shangrong Yang, Chunyu Lin, Kang Liao, Chunjie Zhang, Yao Zhao [pdf] [supp] [arXiv] [bibtex]

DECOR-GAN: 3D Shape Detailization by Conditional Refinement Zhiqin Chen, Vladimir G. Kim, Matthew Fisher, Noam Aigerman, Hao Zhang, Siddhartha Chaudhuri [pdf] [supp] [bibtex]

Model-Aware Gesture-to-Gesture Translation Hezhen Hu, Weilun Wang, Wengang Zhou, Weichao Zhao, Houqiang Li [pdf] [bibtex]

Spatio-temporal Contrastive Domain Adaptation for Action Recognition Xiaolin Song, Sicheng Zhao, Jingyu Yang, Huanjing Yue, Pengfei Xu, Runbo Hu, Hua Chai [pdf] [supp] [bibtex]

Exploiting Semantic Embedding and Visual Feature for Facial Action Unit Detection Huiyuan Yang, Lijun Yin, Yi Zhou, Jiuxiang Gu [pdf] [supp] [bibtex]

Categorical Depth Distribution Network for Monocular 3D Object Detection Cody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander [pdf] [supp] [arXiv] [bibtex]

Learning From the Master: Distilling Cross-Modal Advanced Knowledge for Lip Reading Sucheng Ren, Yong Du, Jianming Lv, Guoqiang Han, Shengfeng He [pdf] [bibtex]

Spatially-Varying Outdoor Lighting Estimation From Intrinsics Yongjie Zhu, Yinda Zhang, Si Li, Boxin Shi [pdf] [supp] [arXiv] [bibtex]

VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization Seunghwan Choi, Sunghyun Park, Minsoo Lee, Jaegul Choo [pdf] [supp] [bibtex]

Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Xiaobin Hu, Tao Wang, Fenglong Song, Xiuyi Jia [pdf] [bibtex]

RankDetNet: Delving Into Ranking Constraints for Object Detection Ji Liu, Dong Li, Rongzhang Zheng, Lu Tian, Yi Shan [pdf] [supp] [bibtex]

Back to the Feature: Learning Robust Camera Localization From Pixels To Pose Paul-Edouard Sarlin, Ajaykumar Unagar, Mans Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler [pdf] [supp] [arXiv] [bibtex]

Learning Parallel Dense Correspondence From Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction Jiapeng Tang, Dan Xu, Kui Jia, Lei Zhang [pdf] [arXiv] [bibtex]

Multi-Modal Fusion Transformer for End-to-End Autonomous Driving Aditya Prakash, Kashyap Chitta, Andreas Geiger [pdf] [arXiv] [bibtex]

LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu [pdf] [supp] [arXiv] [bibtex]

Unsupervised Disentanglement of Linear-Encoded Facial Semantics Yutong Zheng, Yu-Kai Huang, Ran Tao, Zhiqiang Shen, Marios Savvides [pdf] [supp] [arXiv] [bibtex]

Learning Position and Target Consistency for Memory-Based Video Object Segmentation Li Hu, Peng Zhang, Bang Zhang, Pan Pan, Yinghui Xu, Rong Jin [pdf] [arXiv] [bibtex]

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation Pan Zhang, Bo Zhang, Ting Zhang, Dong Chen, Yong Wang, Fang Wen [pdf] [supp] [arXiv] [bibtex]

Deep Denoising of Flash and No-Flash Pairs for Photography in Low-Light Environments Zhihao Xia, Michael Gharbi, Federico Perazzi, Kalyan Sunkavalli, Ayan Chakrabarti [pdf] [supp] [arXiv] [bibtex]

Transformer Interpretability Beyond Attention Visualization Hila Chefer, Shir Gur, Lior Wolf [pdf] [supp] [arXiv] [bibtex]

Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach Giang Truong, Huu Le, David Suter, Erchuan Zhang, Syed Zulqarnain Gilani [pdf] [supp] [arXiv] [bibtex]

Unsupervised Real-World Image Super Resolution via Domain-Distance Aware Training Yunxuan Wei, Shuhang Gu, Yawei Li, Radu Timofte, Longcun Jin, Hengjie Song [pdf] [supp] [arXiv] [bibtex]

Learning to Track Instances without Video Annotations Yang Fu, Sifei Liu, Umar Iqbal, Shalini De Mello, Humphrey Shi, Jan Kautz [pdf] [supp] [arXiv] [bibtex]

Unsupervised Feature Learning by Cross-Level Instance-Group Discrimination Xudong Wang, Ziwei Liu, Stella X. Yu [pdf] [supp] [arXiv] [bibtex]

Representation Learning via Global Temporal Alignment and Cycle-Consistency Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson [pdf] [supp] [arXiv] [bibtex]

Personalized Outfit Recommendation With Learnable Anchors Zhi Lu, Yang Hu, Yan Chen, Bing Zeng [pdf] [supp] [bibtex]

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework Zhizhong Huang, Junping Zhang, Hongming Shan [pdf] [supp] [arXiv] [bibtex]

Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua [pdf] [arXiv] [bibtex]

Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation Yahui Liu, Enver Sangineto, Yajing Chen, Linchao Bao, Haoxian Zhang, Nicu Sebe, Bruno Lepri, Wei Wang, Marco De Nadai [pdf] [supp] [bibtex]

Robust Instance Segmentation Through Reasoning About Multi-Object Occlusion Xiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille [pdf] [arXiv] [bibtex]

Architectural Adversarial Robustness: The Case for Deep Pursuit George Cazenavette, Calvin Murdock, Simon Lucey [pdf] [arXiv] [bibtex]

Multi-Scale Aligned Distillation for Low-Resolution Detection Lu Qi, Jason Kuen, Jiuxiang Gu, Zhe Lin, Yi Wang, Yukang Chen, Yanwei Li, Jiaya Jia [pdf] [supp] [bibtex]

Deep Active Surface Models Udaranga Wickramasinghe, Pascal Fua, Graham Knott [pdf] [supp] [arXiv] [bibtex]

Can We Characterize Tasks Without Labels or Features? Bram Wallace, Ziyang Wu, Bharath Hariharan [pdf] [supp] [bibtex]

Scene Essence Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao [pdf] [bibtex]

Visual Room Rearrangement Luca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi [pdf] [supp] [arXiv] [bibtex]

VDSM: Unsupervised Video Disentanglement With State-Space Modeling and Deep Mixtures of Experts Matthew J. Vowels, Necati Cihan Camgoz, Richard Bowden [pdf] [supp] [arXiv] [bibtex]

Rotation-Only Bundle Adjustment Seong Hun Lee, Javier Civera [pdf] [supp] [arXiv] [bibtex]

Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting With Their Explanations Wolfgang Stammer, Patrick Schramowski, Kristian Kersting [pdf] [arXiv] [bibtex]

Polygonal Point Set Tracking Gunhee Nam, Miran Heo, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim [pdf] [arXiv] [bibtex]

Deformed Implicit Field: Modeling 3D Shapes With Learned Dense Correspondence Yu Deng, Jiaolong Yang, Xin Tong [pdf] [arXiv] [bibtex]

Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing Wen Shen, Zhihua Wei, Shikun Huang, Binbin Zhang, Panyue Chen, Ping Zhao, Quanshi Zhang [pdf] [supp] [arXiv] [bibtex]

Tracking Pedestrian Heads in Dense Crowd Ramana Sundararaman, Cedric De Almeida Braga, Eric Marchand, Julien Pettre [pdf] [arXiv] [bibtex]

Neural Splines: Fitting 3D Surfaces With Infinitely-Wide Neural Networks Francis Williams, Matthew Trager, Joan Bruna, Denis Zorin [pdf] [supp] [arXiv] [bibtex]

Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang [pdf] [bibtex]

Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval Yang Liu, Qingchao Chen, Samuel Albanie [pdf] [bibtex]

Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts Soravit Changpinyo, Piyush Sharma, Nan Ding, Radu Soricut [pdf] [supp] [arXiv] [bibtex]

SetVAE: Learning Hierarchical Composition for Generative Modeling of Set-Structured Data Jinwoo Kim, Jaehoon Yoo, Juho Lee, Seunghoon Hong [pdf] [supp] [arXiv] [bibtex]

Few-Shot 3D Point Cloud Semantic Segmentation Na Zhao, Tat-Seng Chua, Gim Hee Lee [pdf] [supp] [arXiv] [bibtex]

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching Zhelun Shen, Yuchao Dai, Zhibo Rao [pdf] [supp] [arXiv] [bibtex]

Adaptive Consistency Prior Based Deep Network for Image Denoising Chao Ren, Xiaohai He, Chuncheng Wang, Zhibo Zhao [pdf] [supp] [bibtex]

Topological Planning With Transformers for Vision-and-Language Navigation Kevin Chen, Junshen K. Chen, Jo Chuang, Marynel Vazquez, Silvio Savarese [pdf] [supp] [arXiv] [bibtex]

FixBi: Bridging Domain Spaces for Unsupervised Domain Adaptation Jaemin Na, Heechul Jung, Hyung Jin Chang, Wonjun Hwang [pdf] [arXiv] [bibtex]

Generalized Few-Shot Object Detection Without Forgetting Zhibo Fan, Yuchen Ma, Zeming Li, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Truly Shift-Invariant Convolutional Neural Networks Anadi Chaman, Ivan Dokmanic [pdf] [supp] [arXiv] [bibtex]

Leveraging the Availability of Two Cameras for Illuminant Estimation Abdelrahman Abdelhamed, Abhijith Punnappurath, Michael S. Brown [pdf] [supp] [bibtex]

LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu [pdf] [supp] [arXiv] [bibtex]

Towards Accurate 3D Human Motion Prediction From Incomplete Observations Qiongjie Cui, Huaijiang Sun [pdf] [bibtex]

SiamMOT: Siamese Multi-Object Tracking Bing Shuai, Andrew Berneshawi, Xinyu Li, Davide Modolo, Joseph Tighe [pdf] [supp] [arXiv] [bibtex]

Open-Book Video Captioning With Retrieve-Copy-Generate Network Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu [pdf] [supp] [arXiv] [bibtex]

MUST-GAN: Multi-Level Statistics Transfer for Self-Driven Person Image Generation Tianxiang Ma, Bo Peng, Wei Wang, Jing Dong [pdf] [supp] [bibtex]

Learning Camera Localization via Dense Scene Matching Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu, Ping Tan [pdf] [supp] [arXiv] [bibtex]

SDD-FIQA: Unsupervised Face Image Quality Assessment With Similarity Distribution Distance Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang [pdf] [bibtex]

Self-Aligned Video Deraining With Transmission-Depth Consistency Wending Yan, Robby T. Tan, Wenhan Yang, Dengxin Dai [pdf] [supp] [bibtex]

Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning Kai Zhu, Yang Cao, Wei Zhai, Jie Cheng, Zheng-Jun Zha [pdf] [bibtex]

PANDA: Adapting Pretrained Features for Anomaly Detection and Segmentation Tal Reiss, Niv Cohen, Liron Bergman, Yedid Hoshen [pdf] [supp] [arXiv] [bibtex]

Towards Compact CNNs via Collaborative Compression Yuchao Li, Shaohui Lin, Jianzhuang Liu, Qixiang Ye, Mengdi Wang, Fei Chao, Fan Yang, Jincheng Ma, Qi Tian, Rongrong Ji [pdf] [supp] [arXiv] [bibtex]

Embracing Uncertainty: Decoupling and De-Bias for Robust Temporal Grounding Hao Zhou, Chongyang Zhang, Yan Luo, Yanjun Chen, Chuanping Hu [pdf] [arXiv] [bibtex]

Separating Skills and Concepts for Novel Visual Question Answering Spencer Whitehead, Hui Wu, Heng Ji, Rogerio Feris, Kate Saenko [pdf] [supp] [bibtex]

Discrete-Continuous Action Space Policy Gradient-Based Attention for Image-Text Matching Shiyang Yan, Li Yu, Yuan Xie [pdf] [arXiv] [bibtex]

Scalable Differential Privacy With Sparse Network Finetuning Zelun Luo, Daniel J. Wu, Ehsan Adeli, Li Fei-Fei [pdf] [bibtex]

Video Object Segmentation Using Global and Instance Embedding Learning Wenbin Ge, Xiankai Lu, Jianbing Shen [pdf] [supp] [bibtex]

Scene Text Retrieval via Joint Text Detection and Similarity Learning Hao Wang, Xiang Bai, Mingkun Yang, Shenggao Zhu, Jing Wang, Wenyu Liu [pdf] [supp] [arXiv] [bibtex]

Learning Continuous Image Representation With Local Implicit Image Function Yinbo Chen, Sifei Liu, Xiaolong Wang [pdf] [arXiv] [bibtex]

Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration Shaofei Wang, Andreas Geiger, Siyu Tang [pdf] [supp] [arXiv] [bibtex]

Graph Attention Tracking Dongyan Guo, Yanyan Shao, Ying Cui, Zhenhua Wang, Liyan Zhang, Chunhua Shen [pdf] [arXiv] [bibtex]

ReDet: A Rotation-Equivariant Detector for Aerial Object Detection Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia [pdf] [arXiv] [bibtex]

Action Shuffle Alternating Learning for Unsupervised Action Segmentation Jun Li, Sinisa Todorovic [pdf] [arXiv] [bibtex]

Progressive Modality Reinforcement for Human Multimodal Emotion Recognition From Unaligned Multimodal Sequences Fengmao Lv, Xiang Chen, Yanyong Huang, Lixin Duan, Guosheng Lin [pdf] [bibtex]

OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World Zhun Zhong, Linchao Zhu, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe [pdf] [arXiv] [bibtex]

Combining Semantic Guidance and Deep Reinforcement Learning for Generating Human Level Paintings Jaskirat Singh, Liang Zheng [pdf] [supp] [arXiv] [bibtex]

Event-Based Bispectral Photometry Using Temporally Modulated Illumination Tsuyoshi Takatani, Yuzuha Ito, Ayaka Ebisu, Yinqiang Zheng, Takahito Aoto [pdf] [supp] [bibtex]

LiDAR-Aug: A General Rendering-Based Augmentation Framework for 3D Object Detection Jin Fang, Xinxin Zuo, Dingfu Zhou, Shengze Jin, Sen Wang, Liangjun Zhang [pdf] [bibtex]

Semantic-Aware Knowledge Distillation for Few-Shot Class-Incremental Learning Ali Cheraghian, Shafin Rahman, Pengfei Fang, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi [pdf] [arXiv] [bibtex]

General Instance Distillation for Object Detection Xing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou [pdf] [arXiv] [bibtex]

Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification Fengxiang Yang, Zhun Zhong, Zhiming Luo, Yuanzheng Cai, Yaojin Lin, Shaozi Li, Nicu Sebe [pdf] [supp] [arXiv] [bibtex]

Mutual Graph Learning for Camouflaged Object Detection Qiang Zhai, Xin Li, Fan Yang, Chenglizhao Chen, Hong Cheng, Deng-Ping Fan [pdf] [arXiv] [bibtex]

Single Pair Cross-Modality Super Resolution Guy Shacht, Dov Danon, Sharon Fogel, Daniel Cohen-Or [pdf] [supp] [arXiv] [bibtex]

Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation Tianfei Zhou, Jianwu Li, Xueyi Li, Ling Shao [pdf] [arXiv] [bibtex]

Cross-View Regularization for Domain Adaptive Panoptic Segmentation Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu [pdf] [supp] [arXiv] [bibtex]

End-to-End Learning for Joint Image Demosaicing, Denoising and Super-Resolution Wenzhu Xing, Karen Egiazarian [pdf] [supp] [bibtex]

Keep Your Eyes on the Lane: Real-Time Attention-Guided Lane Detection Lucas Tabelini, Rodrigo Berriel, Thiago M. Paixao, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos [pdf] [arXiv] [bibtex]

Lesion-Aware Transformers for Diabetic Retinopathy Grading Rui Sun, Yihao Li, Tianzhu Zhang, Zhendong Mao, Feng Wu, Yongdong Zhang [pdf] [supp] [bibtex]

Involution: Inverting the Inherence of Convolution for Visual Recognition Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen [pdf] [supp] [arXiv] [bibtex]

QPIC: Query-Based Pairwise Human-Object Interaction Detection With Image-Wide Contextual Information Masato Tamura, Hiroki Ohashi, Tomoaki Yoshinaga [pdf] [supp] [arXiv] [bibtex]

Home Action Genome: Cooperative Compositional Action Understanding Nishant Rai, Haofeng Chen, Jingwei Ji, Rishi Desai, Kazuki Kozuka, Shun Ishizaka, Ehsan Adeli, Juan Carlos Niebles [pdf] [supp] [arXiv] [bibtex]

Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies Jinzheng Cai, Youbao Tang, Ke Yan, Adam P. Harrison, Jing Xiao, Gigin Lin, Le Lu [pdf] [supp] [arXiv] [bibtex]

Learning To Warp for Style Transfer Xiao-Chang Liu, Yong-Liang Yang, Peter Hall [pdf] [supp] [bibtex]

Towards Extremely Compact RNNs for Video Recognition With Fully Decomposed Hierarchical Tucker Structure Miao Yin, Siyu Liao, Xiao-Yang Liu, Xiaodong Wang, Bo Yuan [pdf] [arXiv] [bibtex]

Self-Supervised Multi-Frame Monocular Scene Flow Junhwa Hur, Stefan Roth [pdf] [supp] [arXiv] [bibtex]

Enriching ImageNet With Human Similarity Judgments and Psychological Embeddings Brett D. Roads, Bradley C. Love [pdf] [arXiv] [bibtex]

What's in the Image? Explorable Decoding of Compressed Images Yuval Bahat, Tomer Michaeli [pdf] [supp] [bibtex]

Context Modeling in 3D Human Pose Estimation: A Unified Perspective Xiaoxuan Ma, Jiajun Su, Chunyu Wang, Hai Ci, Yizhou Wang [pdf] [arXiv] [bibtex]

Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu [pdf] [supp] [arXiv] [bibtex]

Consensus Maximisation Using Influences of Monotone Boolean Functions Ruwan Tennakoon, David Suter, Erchuan Zhang, Tat-Jun Chin, Alireza Bab-Hadiashar [pdf] [supp] [arXiv] [bibtex]

Meta-Mining Discriminative Samples for Kinship Verification Wanhua Li, Shiwei Wang, Jiwen Lu, Jianjiang Feng, Jie Zhou [pdf] [arXiv] [bibtex]

AQD: Towards Accurate Quantized Object Detection Peng Chen, Jing Liu, Bohan Zhuang, Mingkui Tan, Chunhua Shen [pdf] [supp] [arXiv] [bibtex]

Learning Cross-Modal Retrieval With Noisy Labels Peng Hu, Xi Peng, Hongyuan Zhu, Liangli Zhen, Jie Lin [pdf] [bibtex]

LOHO: Latent Optimization of Hairstyles via Orthogonalization Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi [pdf] [supp] [arXiv] [bibtex]

Single-Shot Freestyle Dance Reenactment Oran Gafni, Oron Ashual, Lior Wolf [pdf] [arXiv] [bibtex]

A Quasiconvex Formulation for Radial Cameras Carl Olsson, Viktor Larsson, Fredrik Kahl [pdf] [bibtex]

Self-Supervised Learning of Depth Inference for Multi-View Stereo Jiayu Yang, Jose M. Alvarez, Miaomiao Liu [pdf] [supp] [arXiv] [bibtex]

BRepNet: A Topological Message Passing System for Solid Models Joseph G. Lambourne, Karl D.D. Willis, Pradeep Kumar Jayaraman, Aditya Sanghi, Peter Meltzer, Hooman Shayani [pdf] [supp] [arXiv] [bibtex]

Learning To Predict Visual Attributes in the Wild Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Tran, Abhinav Shrivastava [pdf] [supp] [bibtex]

Animating Pictures With Eulerian Motion Fields Aleksander Holynski, Brian L. Curless, Steven M. Seitz, Richard Szeliski [pdf] [arXiv] [bibtex]

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang [pdf] [arXiv] [bibtex]

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation Jichang Li, Guanbin Li, Yemin Shi, Yizhou Yu [pdf] [supp] [arXiv] [bibtex]

ST3D: Self-Training for Unsupervised Domain Adaptation on 3D Object Detection Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi [pdf] [arXiv] [bibtex]

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching Vladimir Tankovich, Christian Hane, Yinda Zhang, Adarsh Kowdle, Sean Fanello, Sofien Bouaziz [pdf] [supp] [arXiv] [bibtex]

VaB-AL: Incorporating Class Imbalance and Difficulty With Variational Bayes for Active Learning Jongwon Choi, Kwang Moo Yi, Jihoon Kim, Jinho Choo, Byoungjip Kim, Jinyeop Chang, Youngjune Gwon, Hyung Jin Chang [pdf] [supp] [bibtex]

Exploiting & Refining Depth Distributions With Triangulation Light Curtains Yaadhav Raaj, Siddharth Ancha, Robert Tamburo, David Held, Srinivasa G. Narasimhan [pdf] [bibtex]

DG-Font: Deformable Generative Networks for Unsupervised Font Generation Yangchen Xie, Xinyuan Chen, Li Sun, Yue Lu [pdf] [supp] [bibtex]

Deep Multi-Task Learning for Joint Localization, Perception, and Prediction John Phillips, Julieta Martinez, Ioan Andrei Barsan, Sergio Casas, Abbas Sadat, Raquel Urtasun [pdf] [arXiv] [bibtex]

Deeply Shape-Guided Cascade for Instance Segmentation Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen [pdf] [supp] [arXiv] [bibtex]

MetricOpt: Learning To Optimize Black-Box Evaluation Metrics Chen Huang, Shuangfei Zhai, Pengsheng Guo, Josh Susskind [pdf] [supp] [arXiv] [bibtex]

Multispectral Photometric Stereo for Spatially-Varying Spectral Reflectances: A Well Posed Problem? Heng Guo, Fumio Okura, Boxin Shi, Takuya Funatomi, Yasuhiro Mukaigawa, Yasuyuki Matsushita [pdf] [supp] [bibtex]

Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback Hui Wu, Yupeng Gao, Xiaoxiao Guo, Ziad Al-Halah, Steven Rennie, Kristen Grauman, Rogerio Feris [pdf] [supp] [arXiv] [bibtex]

Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang [pdf] [supp] [arXiv] [bibtex]

HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov [pdf] [supp] [bibtex]

GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving Yun Chen, Frieda Rong, Shivam Duggal, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]

AlphaMatch: Improving Consistency for Semi-Supervised Learning With Alpha-Divergence Chengyue Gong, Dilin Wang, Qiang Liu [pdf] [arXiv] [bibtex]

Unbalanced Feature Transport for Exemplar-Based Image Translation Fangneng Zhan, Yingchen Yu, Kaiwen Cui, Gongjie Zhang, Shijian Lu, Jianxiong Pan, Changgong Zhang, Feiying Ma, Xuansong Xie, Chunyan Miao [pdf] [bibtex]

Self-Generated Defocus Blur Detection via Dual Adversarial Discriminators Wenda Zhao, Cai Shang, Huchuan Lu [pdf] [bibtex]

View Generalization for Single Image Textured 3D Models Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro [pdf] [supp] [bibtex]

Your "Flamingo" is My "Bird": Fine-Grained, or Not Dongliang Chang, Kaiyue Pang, Yixiao Zheng, Zhanyu Ma, Yi-Zhe Song, Jun Guo [pdf] [arXiv] [bibtex]

Anchor-Constrained Viterbi for Set-Supervised Action Segmentation Jun Li, Sinisa Todorovic [pdf] [arXiv] [bibtex]

SOON: Scenario Oriented Object Navigation With Graph-Based Exploration Fengda Zhu, Xiwen Liang, Yi Zhu, Qizhi Yu, Xiaojun Chang, Xiaodan Liang [pdf] [arXiv] [bibtex]

Learning Scalable ℓ∞-Constrained Near-Lossless Image Compression via Joint Lossy Image and Residual Compression Yuanchao Bai, Xianming Liu, Wangmeng Zuo, Yaowei Wang, Xiangyang Ji [pdf] [supp] [bibtex]

Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner Chong Yu [pdf] [supp] [bibtex]

XProtoNet: Diagnosis in Chest Radiography With Global and Local Explanations Eunji Kim, Siwon Kim, Minji Seo, Sungroh Yoon [pdf] [supp] [arXiv] [bibtex]

Learning Scene Structure Guidance via Cross-Task Knowledge Transfer for Single Depth Super-Resolution Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu [pdf] [arXiv] [bibtex]

Visual Navigation With Spatial Attention Bar Mayo, Tamir Hazan, Ayellet Tal [pdf] [supp] [arXiv] [bibtex]

Model-Based 3D Hand Reconstruction via Self-Supervised Learning Yujin Chen, Zhigang Tu, Di Kang, Linchao Bao, Ying Zhang, Xuefei Zhe, Ruizhi Chen, Junsong Yuan [pdf] [supp] [arXiv] [bibtex]

Robust Reflection Removal With Reflection-Free Flash-Only Cues Chenyang Lei, Qifeng Chen [pdf] [arXiv] [bibtex]

Real-Time Selfie Video Stabilization Jiyang Yu, Ravi Ramamoorthi, Keli Cheng, Michel Sarkis, Ning Bi [pdf] [supp] [arXiv] [bibtex]

3D Human Action Representation Learning via Cross-View Consistency Pursuit Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang [pdf] [arXiv] [bibtex]

Differentiable SLAM-Net: Learning Particle SLAM for Visual Navigation Peter Karkus, Shaojun Cai, David Hsu [pdf] [supp] [bibtex]

Learning Goals From Failure Dave Epstein, Carl Vondrick [pdf] [arXiv] [bibtex]

Rank-One Prior: Toward Real-Time Scene Recovery Jun Liu, Wen Liu, Jianing Sun, Tieyong Zeng [pdf] [supp] [arXiv] [bibtex]

Body2Hands: Learning To Infer 3D Hands From Conversational Gesture Body Dynamics Evonne Ng, Shiry Ginosar, Trevor Darrell, Hanbyul Joo [pdf] [arXiv] [bibtex]

Linear Semantics in Generative Adversarial Networks Jianjin Xu, Changxi Zheng [pdf] [supp] [arXiv] [bibtex]

Mesoscopic Photogrammetry With an Unstabilized Phone Camera Kevin C. Zhou, Colin Cooke, Jaehee Park, Ruobing Qian, Roarke Horstmeyer, Joseph A. Izatt, Sina Farsiu [pdf] [supp] [arXiv] [bibtex]

Joint Generative and Contrastive Learning for Unsupervised Person Re-Identification Hao Chen, Yaohui Wang, Benoit Lagadec, Antitza Dantcheva, Francois Bremond [pdf] [supp] [arXiv] [bibtex]

Wide-Baseline Multi-Camera Calibration Using Person Re-Identification Yan Xu, Yu-Jhe Li, Xinshuo Weng, Kris Kitani [pdf] [arXiv] [bibtex]

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation Xinyue Huo, Lingxi Xie, Jianzhong He, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian [pdf] [supp] [bibtex]

Panoramic Image Reflection Removal Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi [pdf] [supp] [bibtex]

OTCE: A Transferability Metric for Cross-Domain Cross-Task Representations Yang Tan, Yang Li, Shao-Lun Huang [pdf] [supp] [arXiv] [bibtex]

Diverse Semantic Image Synthesis via Probability Distribution Modeling Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu [pdf] [supp] [arXiv] [bibtex]

NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth [pdf] [supp] [bibtex]

Learning by Watching Jimuyang Zhang, Eshed Ohn-Bar [pdf] [bibtex]

Pseudo Facial Generation With Extreme Poses for Face Recognition Guoli Wang, Jiaqi Ma, Qian Zhang, Jiwen Lu, Jie Zhou [pdf] [bibtex]

Inverting Generative Adversarial Renderer for Face Reconstruction Jingtan Piao, Keqiang Sun, Quan Wang, Kwan-Yee Lin, Hongsheng Li [pdf] [supp] [arXiv] [bibtex]

Efficient Object Embedding for Spliced Image Retrieval Bor-Chun Chen, Zuxuan Wu, Larry S. Davis, Ser-Nam Lim [pdf] [supp] [arXiv] [bibtex]

GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection Abhinav Kumar, Garrick Brazil, Xiaoming Liu [pdf] [supp] [bibtex]

Flow Guided Transformable Bottleneck Networks for Motion Retargeting Jian Ren, Menglei Chai, Oliver J. Woodford, Kyle Olszewski, Sergey Tulyakov [pdf] [supp] [bibtex]

Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation Weixiang Yang, Qi Li, Wenxi Liu, Yuanlong Yu, Yuexin Ma, Shengfeng He, Jia Pan [pdf] [supp] [bibtex]

Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition Chun-Fu Richard Chen, Rameswar Panda, Kandan Ramakrishnan, Rogerio Feris, John Cohn, Aude Oliva, Quanfu Fan [pdf] [supp] [arXiv] [bibtex]

Generalizable Person Re-Identification With Relevance-Aware Mixture of Experts Yongxing Dai, Xiaotong Li, Jun Liu, Zekun Tong, Ling-Yu Duan [pdf] [arXiv] [bibtex]

Part-Aware Panoptic Segmentation Daan de Geus, Panagiotis Meletis, Chenyang Lu, Xiaoxiao Wen, Gijs Dubbelman [pdf] [supp] [bibtex]

Unsupervised Degradation Representation Learning for Blind Super-Resolution Longguang Wang, Yingqian Wang, Xiaoyu Dong, Qingyu Xu, Jungang Yang, Wei An, Yulan Guo [pdf] [supp] [arXiv] [bibtex]

Convolutional Hough Matching Networks Juhong Min, Minsu Cho [pdf] [supp] [arXiv] [bibtex]

Hierarchical and Partially Observable Goal-Driven Policy Learning With Goals Relational Graph Xin Ye, Yezhou Yang [pdf] [supp] [arXiv] [bibtex]

Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos Hehe Fan, Yi Yang, Mohan Kankanhalli [pdf] [supp] [bibtex]

CoCoNets: Continuous Contrastive 3D Scene Representations Shamit Lal, Mihir Prabhudesai, Ishita Mediratta, Adam W. Harley, Katerina Fragkiadaki [pdf] [supp] [arXiv] [bibtex]

Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Dynamic Class Queue for Large Scale Face Recognition in the Wild Bi Li, Teng Xi, Gang Zhang, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Wenyu Liu [pdf] [arXiv] [bibtex]

3D-MAN: 3D Multi-Frame Attention Network for Object Detection Zetong Yang, Yin Zhou, Zhifeng Chen, Jiquan Ngiam [pdf] [supp] [bibtex]

Cross-Modal Center Loss for 3D Cross-Modal Retrieval Longlong Jing, Elahe Vahdani, Jiaxing Tan, Yingli Tian [pdf] [bibtex]

Learning View Selection for 3D Scenes Yifan Sun, Qixing Huang, Dun-Yu Hsiao, Li Guan, Gang Hua [pdf] [supp] [bibtex]

FESTA: Flow Estimation via Spatial-Temporal Attention for Scene Point Clouds Haiyan Wang, Jiahao Pang, Muhammad A. Lodhi, Yingli Tian, Dong Tian [pdf] [supp] [arXiv] [bibtex]

Semi-Supervised Action Recognition With Temporal Contrastive Learning Ankit Singh, Omprakash Chakraborty, Ashutosh Varshney, Rameswar Panda, Rogerio Feris, Kate Saenko, Abir Das [pdf] [supp] [arXiv] [bibtex]

SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation Dongfang Liu, Yiming Cui, Wenbo Tan, Yingjie Chen [pdf] [bibtex]

Learned Initializations for Optimizing Coordinate-Based Neural Representations Matthew Tancik, Ben Mildenhall, Terrance Wang, Divi Schmidt, Pratul P. Srinivasan, Jonathan T. Barron, Ren Ng [pdf] [supp] [arXiv] [bibtex]

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization Junting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li [pdf] [arXiv] [bibtex]

Cross-View Cross-Scene Multi-View Crowd Counting Qi Zhang, Wei Lin, Antoni B. Chan [pdf] [supp] [bibtex]

Semantic Segmentation With Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization Daiqing Li, Junlin Yang, Karsten Kreis, Antonio Torralba, Sanja Fidler [pdf] [arXiv] [bibtex]

Depth-Aware Mirror Segmentation Haiyang Mei, Bo Dong, Wen Dong, Pieter Peers, Xin Yang, Qiang Zhang, Xiaopeng Wei [pdf] [supp] [bibtex]

You Only Look One-Level Feature Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun [pdf] [arXiv] [bibtex]

Multi-Perspective LSTM for Joint Visual Representation Learning Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad [pdf] [bibtex]

Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search Yibo Yang, Shan You, Hongyang Li, Fei Wang, Chen Qian, Zhouchen Lin [pdf] [supp] [arXiv] [bibtex]

Gaussian Context Transformer Dongsheng Ruan, Daiyin Wang, Yuan Zheng, Nenggan Zheng, Min Zheng [pdf] [supp] [bibtex]

Keypoint-Graph-Driven Learning Framework for Object Pose Estimation Shaobo Zhang, Wanqing Zhao, Ziyu Guan, Xianlin Peng, Jinye Peng [pdf] [supp] [bibtex]

Deep Burst Super-Resolution Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte [pdf] [supp] [arXiv] [bibtex]

Transferable Semantic Augmentation for Domain Adaptation Shuang Li, Mixue Xie, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Wei Li [pdf] [supp] [arXiv] [bibtex]

Patchwise Generative ConvNet: Training Energy-Based Models From a Single Natural Image for Internal Learning Zilong Zheng, Jianwen Xie, Ping Li [pdf] [supp] [bibtex]

Clusformer: A Transformer Based Clustering Approach to Unsupervised Large-Scale Face and Visual Landmark Recognition Xuan-Bac Nguyen, Duc Toan Bui, Chi Nhan Duong, Tien D. Bui, Khoa Luu [pdf] [bibtex]

No Frame Left Behind: Full Video Action Recognition Xin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert [pdf] [arXiv] [bibtex]

ColorRL: Reinforced Coloring for End-to-End Instance Segmentation Tran Anh Tuan, Nguyen Tuan Khoa, Tran Minh Quan, Won-Ki Jeong [pdf] [supp] [bibtex]

Compatibility-Aware Heterogeneous Visual Search Rahul Duggal, Hao Zhou, Shuo Yang, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos Mingfei Gao, Yingbo Zhou, Ran Xu, Richard Socher, Caiming Xiong [pdf] [arXiv] [bibtex]

Deep Dual Consecutive Network for Human Pose Estimation Zhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang [pdf] [arXiv] [bibtex]

Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, Yuchao Dai [pdf] [supp] [arXiv] [bibtex]

HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens Zhaohui Yang, Yunhe Wang, Xinghao Chen, Jianyuan Guo, Wei Zhang, Chao Xu, Chunjing Xu, Dacheng Tao, Chang Xu [pdf] [supp] [arXiv] [bibtex]

Tree-Like Decision Distillation Jie Song, Haofei Zhang, Xinchao Wang, Mengqi Xue, Ying Chen, Li Sun, Dacheng Tao, Mingli Song [pdf] [supp] [bibtex]

GAN Prior Embedded Network for Blind Face Restoration in the Wild Tao Yang, Peiran Ren, Xuansong Xie, Lei Zhang [pdf] [supp] [arXiv] [bibtex]

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation Tianrui Hui, Shaofei Huang, Si Liu, Zihan Ding, Guanbin Li, Wenguan Wang, Jizhong Han, Fei Wang [pdf] [supp] [arXiv] [bibtex]

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao [pdf] [arXiv] [bibtex]

The Lottery Ticket Hypothesis for Object Recognition Sharath Girish, Shishira R Maiya, Kamal Gupta, Hao Chen, Larry S. Davis, Abhinav Shrivastava [pdf] [supp] [arXiv] [bibtex]

Refer-It-in-RGBD: A Bottom-Up Approach for 3D Visual Grounding in RGBD Images Haolin Liu, Anran Lin, Xiaoguang Han, Lei Yang, Yizhou Yu, Shuguang Cui [pdf] [supp] [bibtex]

LQF: Linear Quadratic Fine-Tuning Alessandro Achille, Aditya Golatkar, Avinash Ravichandran, Marzia Polito, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]

Watching You: Global-Guided Reciprocal Learning for Video-Based Person Re-Identification Xuehu Liu, Pingping Zhang, Chenyang Yu, Huchuan Lu, Xiaoyun Yang [pdf] [arXiv] [bibtex]

S3: Learnable Sparse Signal Superdensity for Guided Depth Estimation Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Yu-Cheng Chang, Tsung-Lin Tsou, Yu-An Wang, Winston H. Hsu [pdf] [supp] [bibtex]

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking Ning Wang, Wengang Zhou, Jie Wang, Houqiang Li [pdf] [arXiv] [bibtex]

High-Fidelity Neural Human Motion Transfer From Monocular Video Moritz Kappel, Vladislav Golyanik, Mohamed Elgharib, Jann-Ole Henningson, Hans-Peter Seidel, Susana Castillo, Christian Theobalt, Marcus Magnor [pdf] [supp] [arXiv] [bibtex]

Polygonal Building Extraction by Frame Field Learning Nicolas Girard, Dmitriy Smirnov, Justin Solomon, Yuliya Tarabalka [pdf] [supp] [bibtex]

NeuralFusion: Online Depth Fusion in Latent Space Silvan Weder, Johannes L. Schonberger, Marc Pollefeys, Martin R. Oswald [pdf] [supp] [arXiv] [bibtex]

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation Kehong Gong, Jianfeng Zhang, Jiashi Feng [pdf] [supp] [arXiv] [bibtex]

Depth Completion With Twin Surface Extrapolation at Occlusion Boundaries Saif Imran, Xiaoming Liu, Daniel Morris [pdf] [supp] [arXiv] [bibtex]

Learning the Superpixel in a Non-Iterative and Lifelong Manner Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu [pdf] [arXiv] [bibtex]

Image Generators With Conditionally-Independent Pixel Synthesis Ivan Anokhin, Kirill Demochkin, Taras Khakhulin, Gleb Sterkin, Victor Lempitsky, Denis Korzhenkov [pdf] [supp] [arXiv] [bibtex]

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets Yuan-Hong Liao, Amlan Kar, Sanja Fidler [pdf] [supp] [arXiv] [bibtex]

Seesaw Loss for Long-Tailed Instance Segmentation Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin [pdf] [supp] [arXiv] [bibtex]

Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction Guy Gafni, Justus Thies, Michael Zollhofer, Matthias Niessner [pdf] [arXiv] [bibtex]

PU-GCN: Point Cloud Upsampling Using Graph Convolutional Networks Guocheng Qian, Abdulellah Abualshour, Guohao Li, Ali Thabet, Bernard Ghanem [pdf] [bibtex]

Differentiable Patch Selection for Image Recognition Jean-Baptiste Cordonnier, Aravindh Mahendran, Alexey Dosovitskiy, Dirk Weissenborn, Jakob Uszkoreit, Thomas Unterthiner [pdf] [supp] [arXiv] [bibtex]

MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers Huiyu Wang, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen [pdf] [supp] [bibtex]

Improving Transferability of Adversarial Patches on Face Recognition With Generative Models Zihao Xiao, Xianfeng Gao, Chilin Fu, Yinpeng Dong, Wei Gao, Xiaolu Zhang, Jun Zhou, Jun Zhu [pdf] [supp] [bibtex]

Counterfactual VQA: A Cause-Effect Look at Language Bias Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen [pdf] [supp] [arXiv] [bibtex]

Denoise and Contrast for Category Agnostic Shape Completion Antonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli, Tatiana Tommasi [pdf] [supp] [arXiv] [bibtex]

Transformation Invariant Few-Shot Object Detection Aoxue Li, Zhenguo Li [pdf] [supp] [bibtex]

2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis [pdf] [arXiv] [bibtex]

Temporal Query Networks for Fine-Grained Video Understanding Chuhan Zhang, Ankush Gupta, Andrew Zisserman [pdf] [arXiv] [bibtex]

Adversarial Generation of Continuous Images Ivan Skorokhodov, Savva Ignatyev, Mohamed Elhoseiny [pdf] [supp] [arXiv] [bibtex]

UniT: Unified Knowledge Transfer for Any-Shot Object Detection and Segmentation Siddhesh Khandelwal, Raghav Goyal, Leonid Sigal [pdf] [supp] [arXiv] [bibtex]

Indoor Panorama Planar 3D Reconstruction via Divide and Conquer Cheng Sun, Chi-Wei Hsiao, Ning-Hsu Wang, Min Sun, Hwann-Tzong Chen [pdf] [supp] [bibtex]

Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation Tong Wu, Junshi Huang, Guangyu Gao, Xiaoming Wei, Xiaolin Wei, Xuan Luo, Chi Harold Liu [pdf] [bibtex]

TextOCR: Towards Large-Scale End-to-End Reasoning for Arbitrary-Shaped Scene Text Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner [pdf] [arXiv] [bibtex]

Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy Zikai Zhang, Bineng Zhong, Shengping Zhang, Zhenjun Tang, Xin Liu, Zhaoxiang Zhang [pdf] [arXiv] [bibtex]

Scaling Local Self-Attention for Parameter Efficient Visual Backbones Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake Hechtman, Jonathon Shlens [pdf] [supp] [arXiv] [bibtex]

Image Inpainting Guided by Coherence Priors of Semantics and Textures Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh [pdf] [arXiv] [bibtex]

Multi-Source Domain Adaptation With Collaborative Learning for Semantic Segmentation Jianzhong He, Xu Jia, Shuaijun Chen, Jianzhuang Liu [pdf] [supp] [arXiv] [bibtex]

Positive-Congruent Training: Towards Regression-Free Model Updates Sijie Yan, Yuanjun Xiong, Kaustav Kundu, Shuo Yang, Siqi Deng, Meng Wang, Wei Xia, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]

FrameExit: Conditional Early Exiting for Efficient Video Recognition Amir Ghodrati, Babak Ehteshami Bejnordi, Amirhossein Habibian [pdf] [supp] [arXiv] [bibtex]

Neighbor2Neighbor: Self-Supervised Denoising From Single Noisy Images Tao Huang, Songjiang Li, Xu Jia, Huchuan Lu, Jianzhuang Liu [pdf] [supp] [arXiv] [bibtex]

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing Tianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Dynamic Weighted Learning for Unsupervised Domain Adaptation Ni Xiao, Lei Zhang [pdf] [arXiv] [bibtex]

Using Shape To Categorize: Low-Shot Learning With an Explicit Shape Bias Stefan Stojanov, Anh Thai, James M. Rehg [pdf] [supp] [arXiv] [bibtex]

Face Forensics in the Wild Tianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen [pdf] [supp] [arXiv] [bibtex]

Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain Honggu Liu, Xiaodan Li, Wenbo Zhou, Yuefeng Chen, Yuan He, Hui Xue, Weiming Zhang, Nenghai Yu [pdf] [supp] [arXiv] [bibtex]

A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection Keshigeyan Chandrasegaran, Ngoc-Trung Tran, Ngai-Man Cheung [pdf] [supp] [arXiv] [bibtex]

Learning Delaunay Surface Elements for Mesh Reconstruction Marie-Julie Rakotosaona, Paul Guerrero, Noam Aigerman, Niloy J. Mitra, Maks Ovsjanikov [pdf] [supp] [arXiv] [bibtex]

FaceSec: A Fine-Grained Robustness Evaluation Framework for Face Recognition Systems Liang Tong, Zhengzhang Chen, Jingchao Ni, Wei Cheng, Dongjin Song, Haifeng Chen, Yevgeniy Vorobeychik [pdf] [supp] [arXiv] [bibtex]

Dynamic Head: Unifying Object Detection Heads With Attentions Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang [pdf] [bibtex]

Riggable 3D Face Reconstruction via In-Network Optimization Ziqian Bai, Zhaopeng Cui, Xiaoming Liu, Ping Tan [pdf] [supp] [arXiv] [bibtex]

One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing Ting-Chun Wang, Arun Mallya, Ming-Yu Liu [pdf] [supp] [arXiv] [bibtex]

S2R-DepthNet: Learning a Generalizable Depth-Specific Structural Representation Xiaotian Chen, Yuwang Wang, Xuejin Chen, Wenjun Zeng [pdf] [supp] [bibtex]

Holistic 3D Human and Scene Mesh Estimation From Single View Images Zhenzhen Weng, Serena Yeung [pdf] [arXiv] [bibtex]

MIST: Multiple Instance Spatial Transformer Baptiste Angles, Yuhe Jin, Simon Kornblith, Andrea Tagliasacchi, Kwang Moo Yi [pdf] [supp] [arXiv] [bibtex]

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation Yisheng He, Haibin Huang, Haoqiang Fan, Qifeng Chen, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Shape From Sky: Polarimetric Normal Recovery Under the Sky Tomoki Ichikawa, Matthew Purri, Ryo Kawahara, Shohei Nobuhara, Kristin Dana, Ko Nishino [pdf] [supp] [bibtex]

Adversarially Adaptive Normalization for Single Domain Generalization Xinjie Fan, Qifei Wang, Junjie Ke, Feng Yang, Boqing Gong, Mingyuan Zhou [pdf] [supp] [arXiv] [bibtex]

Rethinking Channel Dimensions for Efficient Model Design Dongyoon Han, Sangdoo Yun, Byeongho Heo, YoungJoon Yoo [pdf] [supp] [arXiv] [bibtex]

A Self-Boosting Framework for Automated Radiographic Report Generation Zhanyu Wang, Luping Zhou, Lei Wang, Xiu Li [pdf] [bibtex]

RAFT-3D: Scene Flow Using Rigid-Motion Embeddings Zachary Teed, Jia Deng [pdf] [supp] [bibtex]

Orthogonal Over-Parameterized Training Weiyang Liu, Rongmei Lin, Zhen Liu, James M. Rehg, Liam Paull, Li Xiong, Le Song, Adrian Weller [pdf] [supp] [arXiv] [bibtex]

Masksembles for Uncertainty Estimation Nikita Durasov, Timur Bagautdinov, Pierre Baque, Pascal Fua [pdf] [supp] [arXiv] [bibtex]

Network Pruning via Performance Maximization Shangqian Gao, Feihu Huang, Weidong Cai, Heng Huang [pdf] [supp] [bibtex]

Closing the Loop: Joint Rain Generation and Removal via Disentangled Image Translation Yuntong Ye, Yi Chang, Hanyu Zhou, Luxin Yan [pdf] [supp] [arXiv] [bibtex]

ACTION-Net: Multipath Excitation for Action Recognition Zhengwei Wang, Qi She, Aljosa Smolic [pdf] [supp] [bibtex]

Co-Attention for Conditioned Image Matching Olivia Wiles, Sebastien Ehrhardt, Andrew Zisserman [pdf] [supp] [arXiv] [bibtex]

EventZoom: Learning To Denoise and Super Resolve Neuromorphic Events Peiqi Duan, Zihao W. Wang, Xinyu Zhou, Yi Ma, Boxin Shi [pdf] [supp] [bibtex]

Re-Labeling ImageNet: From Single to Multi-Labels, From Global to Localized Labels Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Junsuk Choe, Sanghyuk Chun [pdf] [supp] [arXiv] [bibtex]

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation Xingran Zhou, Bo Zhang, Ting Zhang, Pan Zhang, Jianmin Bao, Dong Chen, Zhongfei Zhang, Fang Wen [pdf] [supp] [arXiv] [bibtex]

SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences Shun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab, Federico Tombari [pdf] [supp] [bibtex]

Interventional Video Grounding With Dual Contrastive Learning Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu [pdf] [supp] [bibtex]

A Fourier-Based Framework for Domain Generalization Qinwei Xu, Ruipeng Zhang, Ya Zhang, Yanfeng Wang, Qi Tian [pdf] [supp] [arXiv] [bibtex]

Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation Gengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang [pdf] [supp] [arXiv] [bibtex]

SRWarp: Generalized Image Super-Resolution under Arbitrary Transformation Sanghyun Son, Kyoung Mu Lee [pdf] [supp] [arXiv] [bibtex]

IQDet: Instance-Wise Quality Distribution Sampling for Object Detection Yuchen Ma, Songtao Liu, Zeming Li, Jian Sun [pdf] [arXiv] [bibtex]

Scan2Cap: Context-Aware Dense Captioning in RGB-D Scans Zhenyu Chen, Ali Gholami, Matthias Niessner, Angel X. Chang [pdf] [supp] [bibtex]

NeuralHumanFVV: Real-Time Neural Volumetric Human Performance Rendering Using RGB Cameras Xin Suo, Yuheng Jiang, Pei Lin, Yingliang Zhang, Minye Wu, Kaiwen Guo, Lan Xu [pdf] [arXiv] [bibtex]

Anti-Aliasing Semantic Reconstruction for Few-Shot Semantic Segmentation Binghao Liu, Yao Ding, Jianbin Jiao, Xiangyang Ji, Qixiang Ye [pdf] [arXiv] [bibtex]

Composing Photos Like a Photographer Chaoyi Hong, Shuaiyuan Du, Ke Xian, Hao Lu, Zhiguo Cao, Weicai Zhong [pdf] [supp] [bibtex]

Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation Ze Cui, Jing Wang, Shangyin Gao, Tiansheng Guo, Yihui Feng, Bo Bai [pdf] [supp] [bibtex]

Optimal Gradient Checkpoint Search for Arbitrary Computation Graphs Jianwei Feng, Dong Huang [pdf] [supp] [arXiv] [bibtex]

NBNet: Noise Basis Learning for Image Denoising With Subspace Projection Shen Cheng, Yuzhi Wang, Haibin Huang, Donghao Liu, Haoqiang Fan, Shuaicheng Liu [pdf] [arXiv] [bibtex]

NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis Pratul P. Srinivasan, Boyang Deng, Xiuming Zhang, Matthew Tancik, Ben Mildenhall, Jonathan T. Barron [pdf] [supp] [arXiv] [bibtex]

How Transferable Are Reasoning Patterns in VQA? Corentin Kervadec, Theo Jaunet, Grigory Antipov, Moez Baccouche, Romain Vuillemot, Christian Wolf [pdf] [supp] [arXiv] [bibtex]

DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping Yanchao Yang, Brian Lai, Stefano Soatto [pdf] [arXiv] [bibtex]

Deep Texture Recognition via Exploiting Cross-Layer Statistical Self-Similarity Zhile Chen, Feng Li, Yuhui Quan, Yong Xu, Hui Ji [pdf] [supp] [bibtex]

Light Field Super-Resolution With Zero-Shot Learning Zhen Cheng, Zhiwei Xiong, Chang Chen, Dong Liu, Zheng-Jun Zha [pdf] [supp] [bibtex]

Spherical Confidence Learning for Face Recognition Shen Li, Jianqing Xu, Xiaqing Xu, Pengcheng Shen, Shaoxin Li, Bryan Hooi [pdf] [supp] [bibtex]

Three Ways To Improve Semantic Segmentation With Self-Supervised Depth Estimation Lukas Hoyer, Dengxin Dai, Yuhua Chen, Adrian Koring, Suman Saha, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Cross-Modal Contrastive Learning for Text-to-Image Generation Han Zhang, Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang [pdf] [supp] [arXiv] [bibtex]

Lifting 2D StyleGAN for 3D-Aware Face Generation Yichun Shi, Divyansh Aggarwal, Anil K. Jain [pdf] [supp] [arXiv] [bibtex]

iMiGUE: An Identity-Free Video Dataset for Micro-Gesture Understanding and Emotion Analysis Xin Liu, Henglin Shi, Haoyu Chen, Zitong Yu, Xiaobai Li, Guoying Zhao [pdf] [supp] [bibtex]

MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection Vibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel [pdf] [supp] [bibtex]

Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food Quin Thames, Arjun Karpur, Wade Norris, Fangting Xia, Liviu Panait, Tobias Weyand, Jack Sim [pdf] [supp] [arXiv] [bibtex]

Extreme Low-Light Environment-Driven Image Denoising Over Permanently Shadowed Lunar Regions With a Physical Noise Model Ben Moseley, Valentin Bickel, Ignacio G. Lopez-Francos, Loveneesh Rana [pdf] [supp] [bibtex]

Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision Zhenzhen Weng, Mehmet Giray Ogut, Shai Limonchik, Serena Yeung [pdf] [supp] [arXiv] [bibtex]

How Privacy-Preserving Are Line Clouds? Recovering Scene Details From 3D Lines Kunal Chelani, Fredrik Kahl, Torsten Sattler [pdf] [supp] [arXiv] [bibtex]

Multi-View 3D Reconstruction of a Texture-Less Smooth Surface of Unknown Generic Reflectance Ziang Cheng, Hongdong Li, Yuta Asano, Yinqiang Zheng, Imari Sato [pdf] [supp] [arXiv] [bibtex]

Rectification-Based Knowledge Retention for Continual Learning Pravendra Singh, Pratik Mazumder, Piyush Rai, Vinay P. Namboodiri [pdf] [supp] [arXiv] [bibtex]

Scale-Aware Automatic Augmentation for Object Detection Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia [pdf] [supp] [arXiv] [bibtex]

Towards Robust Classification Model by Counterfactual and Invariant Data Generation Chun-Hao Chang, George Alexandru Adam, Anna Goldenberg [pdf] [supp] [arXiv] [bibtex]

Fully Convolutional Networks for Panoptic Segmentation Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia [pdf] [arXiv] [bibtex]

Benchmarking Representation Learning for Natural World Image Collections Grant Van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha [pdf] [arXiv] [bibtex]

PGT: A Progressive Method for Training Models on Long Videos Bo Pang, Gao Peng, Yizhuo Li, Cewu Lu [pdf] [arXiv] [bibtex]

Prioritized Architecture Sampling With Monto-Carlo Tree Search Xiu Su, Tao Huang, Yanxi Li, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu [pdf] [supp] [arXiv] [bibtex]

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Rohit Pandey, Cem Keskin, Ruofei Du, Deqing Sun, Sofien Bouaziz, Sean Fanello, Ping Tan, Yinda Zhang [pdf] [supp] [arXiv] [bibtex]

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition Shancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang [pdf] [supp] [arXiv] [bibtex]

Generic Perceptual Loss for Modeling Structured Output Dependencies Yifan Liu, Hao Chen, Yu Chen, Wei Yin, Chunhua Shen [pdf] [supp] [arXiv] [bibtex]

Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion Chulin Xie, Chuxin Wang, Bo Zhang, Hao Yang, Dong Chen, Fang Wen [pdf] [supp] [arXiv] [bibtex]

Neural Architecture Search With Random Labels Xuanyang Zhang, Pengfei Hou, Xiangyu Zhang, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Towards Long-Form Video Understanding Chao-Yuan Wu, Philipp Krahenbuhl [pdf] [supp] [bibtex]

Shape and Material Capture at Home Daniel Lichy, Jiaye Wu, Soumyadip Sengupta, David W. Jacobs [pdf] [supp] [arXiv] [bibtex]

Deep Polarization Imaging for 3D Shape and SVBRDF Acquisition Valentin Deschaintre, Yiming Lin, Abhijeet Ghosh [pdf] [arXiv] [bibtex]

Convolutional Neural Network Pruning With Structural Redundancy Reduction Zi Wang, Chengcheng Li, Xiangyang Wang [pdf] [supp] [arXiv] [bibtex]

T-vMF Similarity for Regularizing Intra-Class Feature Distribution Takumi Kobayashi [pdf] [supp] [bibtex]

Surrogate Gradient Field for Latent Space Manipulation Minjun Li, Yanghua Jin, Huachun Zhu [pdf] [supp] [arXiv] [bibtex]

SCF-Net: Learning Spatial Contextual Features for Large-Scale Point Cloud Segmentation Siqi Fan, Qiulei Dong, Fenghua Zhu, Yisheng Lv, Peijun Ye, Fei-Yue Wang [pdf] [bibtex]

UnsupervisedR&R: Unsupervised Point Cloud Registration via Differentiable Rendering Mohamed El Banani, Luya Gao, Justin Johnson [pdf] [supp] [bibtex]

ZeroScatter: Domain Transfer for Long Distance Imaging and Vision Through Scattering Media Zheng Shi, Ethan Tseng, Mario Bijelic, Werner Ritter, Felix Heide [pdf] [supp] [arXiv] [bibtex]

Defending Multimodal Fusion Models Against Single-Source Adversaries Karren Yang, Wan-Yi Lin, Manash Barman, Filipe Condessa, Zico Kolter [pdf] [supp] [bibtex]

Generalized Domain Adaptation Yu Mitsuzumi, Go Irie, Daiki Ikami, Takashi Shibata [pdf] [supp] [arXiv] [bibtex]

AGORA: Avatars in Geography Optimized for Regression Analysis Priyanka Patel, Chun-Hao P. Huang, Joachim Tesch, David T. Hoffmann, Shashank Tripathi, Michael J. Black [pdf] [supp] [arXiv] [bibtex]

Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation Fenglin Liu, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou [pdf] [bibtex]

Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging Alvaro Parra, Shin-Fang Chng, Tat-Jun Chin, Anders Eriksson, Ian Reid [pdf] [supp] [arXiv] [bibtex]

Extreme Rotation Estimation Using Dense Correlation Volumes Ruojin Cai, Bharath Hariharan, Noah Snavely, Hadar Averbuch-Elor [pdf] [supp] [arXiv] [bibtex]

Capsule Network Is Not More Robust Than Convolutional Network Jindong Gu, Volker Tresp, Han Hu [pdf] [supp] [arXiv] [bibtex]

BASAR: Black-Box Attack on Skeletal Action Recognition Yunfeng Diao, Tianjia Shao, Yong-Liang Yang, Kun Zhou, He Wang [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Learning on 3D Point Clouds by Learning Discrete Generative Models Benjamin Eckart, Wentao Yuan, Chao Liu, Jan Kautz [pdf] [supp] [bibtex]

Iso-Points: Optimizing Neural Implicit Surfaces With Hybrid Representations Wang Yifan, Shihao Wu, Cengiz Oztireli, Olga Sorkine-Hornung [pdf] [supp] [bibtex]

Dense Relation Distillation With Context-Aware Aggregation for Few-Shot Object Detection Hanzhe Hu, Shuai Bai, Aoxue Li, Jinshi Cui, Liwei Wang [pdf] [supp] [arXiv] [bibtex]

End-to-End Human Object Interaction Detection With HOI Transformer Cheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun [pdf] [arXiv] [bibtex]

How Does Topology Influence Gradient Propagation and Model Performance of Deep Networks With DenseNet-Type Skip Connections? Kartikeya Bhardwaj, Guihong Li, Radu Marculescu [pdf] [supp] [arXiv] [bibtex]

Multi-Shot Temporal Event Localization: A Benchmark Xiaolong Liu, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr [pdf] [supp] [arXiv] [bibtex]

We Are More Than Our Joints: Predicting How 3D Bodies Move Yan Zhang, Michael J. Black, Siyu Tang [pdf] [supp] [arXiv] [bibtex]

Spatially-Adaptive Pixelwise Networks for Fast Image Translation Tamar Rott Shaham, Michael Gharbi, Richard Zhang, Eli Shechtman, Tomer Michaeli [pdf] [supp] [arXiv] [bibtex]

PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin [pdf] [supp] [arXiv] [bibtex]

Deep Stable Learning for Out-of-Distribution Generalization Xingxuan Zhang, Peng Cui, Renzhe Xu, Linjun Zhou, Yue He, Zheyan Shen [pdf] [arXiv] [bibtex]

Continual Learning via Bit-Level Information Preserving Yujun Shi, Li Yuan, Yunpeng Chen, Jiashi Feng [pdf] [supp] [arXiv] [bibtex]

Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song [pdf] [arXiv] [bibtex]

Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE Jialun Peng, Dong Liu, Songcen Xu, Houqiang Li [pdf] [supp] [arXiv] [bibtex]

Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge Distillation Mingi Ji, Seungjae Shin, Seunghyun Hwang, Gibeom Park, Il-Chul Moon [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Visibility Learning for Novel View Synthesis Yujiao Shi, Hongdong Li, Xin Yu [pdf] [supp] [arXiv] [bibtex]

End-to-End Human Pose and Mesh Reconstruction with Transformers Kevin Lin, Lijuan Wang, Zicheng Liu [pdf] [supp] [arXiv] [bibtex]

CapsuleRRT: Relationships-Aware Regression Tracking via Capsules Ding Ma, Xiangqian Wu [pdf] [bibtex]

Test-Time Fast Adaptation for Dynamic Scene Deblurring via Meta-Auxiliary Learning Zhixiang Chi, Yang Wang, Yuanhao Yu, Jin Tang [pdf] [supp] [bibtex]

Anycost GANs for Interactive Image Synthesis and Editing Ji Lin, Richard Zhang, Frieder Ganz, Song Han, Jun-Yan Zhu [pdf] [arXiv] [bibtex]

TrafficSim: Learning To Simulate Realistic Multi-Agent Behaviors Simon Suo, Sebastian Regalado, Sergio Casas, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]

Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks Yu Cheng, Bo Wang, Bo Yang, Robby T. Tan [pdf] [supp] [arXiv] [bibtex]

Space-Time Distillation for Video Super-Resolution Zeyu Xiao, Xueyang Fu, Jie Huang, Zhen Cheng, Zhiwei Xiong [pdf] [supp] [bibtex]

Robust Audio-Visual Instance Discrimination Pedro Morgado, Ishan Misra, Nuno Vasconcelos [pdf] [supp] [arXiv] [bibtex]

High-Fidelity and Arbitrary Face Editing Yue Gao, Fangyun Wei, Jianmin Bao, Shuyang Gu, Dong Chen, Fang Wen, Zhouhui Lian [pdf] [supp] [arXiv] [bibtex]

Explicit Knowledge Incorporation for Visual Reasoning Yifeng Zhang, Ming Jiang, Qi Zhao [pdf] [supp] [bibtex]

Progressive Unsupervised Learning for Visual Object Tracking Qiangqiang Wu, Jia Wan, Antoni B. Chan [pdf] [supp] [bibtex]

IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking Shuai Jia, Yibing Song, Chao Ma, Xiaokang Yang [pdf] [arXiv] [bibtex]

Deep Graph Matching Under Quadratic Constraint Quankai Gao, Fudong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia [pdf] [arXiv] [bibtex]

Multi-Label Activity Recognition Using Activity-Specific Features and Activity Correlations Yanyi Zhang, Xinyu Li, Ivan Marsic [pdf] [supp] [arXiv] [bibtex]

Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos Yasamin Jafarian, Hyun Soo Park [pdf] [supp] [arXiv] [bibtex]

Unpaired Image-to-Image Translation via Latent Energy Transport Yang Zhao, Changyou Chen [pdf] [supp] [arXiv] [bibtex]

VLN BERT: A Recurrent Vision-and-Language BERT for Navigation Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould [pdf] [supp] [bibtex]

Content-Aware GAN Compression Yuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Federico Perazzi, Sun-Yuan Kung [pdf] [arXiv] [bibtex]

FBI-Denoiser: Fast Blind Image Denoiser for Poisson-Gaussian Noise Jaeseok Byun, Sungmin Cha, Taesup Moon [pdf] [supp] [bibtex]

Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs Hui-Po Wang, Ning Yu, Mario Fritz [pdf] [supp] [bibtex]

LiDAR R-CNN: An Efficient and Universal 3D Object Detector Zhichao Li, Feng Wang, Naiyan Wang [pdf] [bibtex]

Line Segment Detection Using Transformers Without Edges Yifan Xu, Weijian Xu, David Cheung, Zhuowen Tu [pdf] [arXiv] [bibtex]

Region-Aware Adaptive Instance Normalization for Image Harmonization Jun Ling, Han Xue, Li Song, Rong Xie, Xiao Gu [pdf] [supp] [arXiv] [bibtex]

Learning Tensor Low-Rank Prior for Hyperspectral Image Reconstruction Shipeng Zhang, Lizhi Wang, Lei Zhang, Hua Huang [pdf] [bibtex]

Unsupervised Learning of Depth and Depth-of-Field Effect From Natural Images With Aperture Rendering Generative Adversarial Networks Takuhiro Kaneko [pdf] [bibtex]

Sign-Agnostic Implicit Learning of Surface Self-Similarities for Shape Modeling and Reconstruction From Raw Point Clouds Wenbin Zhao, Jiabao Lei, Yuxin Wen, Jianguo Zhang, Kui Jia [pdf] [arXiv] [bibtex]

Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu [pdf] [supp] [arXiv] [bibtex]

On Learning the Geodesic Path for Incremental Learning Christian Simon, Piotr Koniusz, Mehrtash Harandi [pdf] [supp] [arXiv] [bibtex]

The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang [pdf] [supp] [arXiv] [bibtex]

Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning Mingjie Sun, Jimin Xiao, Eng Gee Lim [pdf] [arXiv] [bibtex]

Simulating Unknown Target Models for Query-Efficient Black-Box Attacks Chen Ma, Li Chen, Jun-Hai Yong [pdf] [supp] [arXiv] [bibtex]

Diffusion Probabilistic Models for 3D Point Cloud Generation Shitong Luo, Wei Hu [pdf] [supp] [arXiv] [bibtex]

Dual Pixel Exploration: Simultaneous Depth Estimation and Image Restoration Liyuan Pan, Shah Chowdhury, Richard Hartley, Miaomiao Liu, Hongguang Zhang, Hongdong Li [pdf] [arXiv] [bibtex]

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi [pdf] [supp] [bibtex]

Spatiotemporal Registration for Event-Based Visual Odometry Daqi Liu, Alvaro Parra, Tat-Jun Chin [pdf] [supp] [arXiv] [bibtex]

Temporal Action Segmentation From Timestamp Supervision Zhe Li, Yazan Abu Farha, Jurgen Gall [pdf] [supp] [arXiv] [bibtex]

Data-Free Model Extraction Jean-Baptiste Truong, Pratyush Maini, Robert J. Walls, Nicolas Papernot [pdf] [supp] [arXiv] [bibtex]

PointAugmenting: Cross-Modal Augmentation for 3D Object Detection Chunwei Wang, Chao Ma, Ming Zhu, Xiaokang Yang [pdf] [supp] [bibtex]

Learning Feature Aggregation for Deep 3D Morphable Models Zhixiang Chen, Tae-Kyun Kim [pdf] [supp] [arXiv] [bibtex]

There Is More Than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking With Sound by Distilling Multimodal Knowledge Francisco Rivera Valverde, Juana Valeria Hurtado, Abhinav Valada [pdf] [supp] [arXiv] [bibtex]

DeRF: Decomposed Radiance Fields Daniel Rebain, Wei Jiang, Soroosh Yazdani, Ke Li, Kwang Moo Yi, Andrea Tagliasacchi [pdf] [supp] [arXiv] [bibtex]

Group-aware Label Transfer for Domain Adaptive Person Re-identification Kecheng Zheng, Wu Liu, Lingxiao He, Tao Mei, Jiebo Luo, Zheng-Jun Zha [pdf] [arXiv] [bibtex]

MR Image Super-Resolution With Squeeze and Excitation Reasoning Attention Network Yulun Zhang, Kai Li, Kunpeng Li, Yun Fu [pdf] [bibtex]

BABEL: Bodies, Action and Behavior With English Labels Abhinanda R. Punnakkal, Arjun Chandrasekaran, Nikos Athanasiou, Alejandra Quiros-Ramirez, Michael J. Black [pdf] [supp] [bibtex]

SMD-Nets: Stereo Mixture Density Networks Fabio Tosi, Yiyi Liao, Carolin Schmitt, Andreas Geiger [pdf] [bibtex]

Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification Qiong Wu, Pingyang Dai, Jie Chen, Chia-Wen Lin, Yongjian Wu, Feiyue Huang, Bineng Zhong, Rongrong Ji [pdf] [bibtex]

Learning Progressive Point Embeddings for 3D Point Cloud Generation Cheng Wen, Baosheng Yu, Dacheng Tao [pdf] [supp] [bibtex]

Learnable Graph Matching: Incorporating Graph Partitioning With Deep Feature Learning for Multiple Object Tracking Jiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang [pdf] [supp] [arXiv] [bibtex]

A Decomposition Model for Stereo Matching Chengtang Yao, Yunde Jia, Huijun Di, Pengxiang Li, Yuwei Wu [pdf] [supp] [arXiv] [bibtex]

RangeIoUDet: Range Image Based Real-Time 3D Object Detector Optimized by Intersection Over Union Zhidong Liang, Zehan Zhang, Ming Zhang, Xian Zhao, Shiliang Pu [pdf] [bibtex]

Domain-Robust VQA With Diverse Datasets and Methods but No Target Labels Mingda Zhang, Tristan Maidment, Ahmad Diab, Adriana Kovashka, Rebecca Hwa [pdf] [arXiv] [bibtex]

(AF)²-S3Net: Attentive Feature Fusion With Adaptive Feature Selection for Sparse Semantic Segmentation Network Ran Cheng, Ryan Razani, Ehsan Taghavi, Enxu Li, Bingbing Liu [pdf] [supp] [bibtex]

Towards Real-World Blind Face Restoration With Generative Facial Prior Xintao Wang, Yu Li, Honglun Zhang, Ying Shan [pdf] [arXiv] [bibtex]

Track To Detect and Segment: An Online Multi-Object Tracker Jialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan [pdf] [arXiv] [bibtex]

Look Before You Speak: Visually Contextualized Utterances Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid [pdf] [supp] [arXiv] [bibtex]

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li [pdf] [arXiv] [bibtex]

Effective Sparsification of Neural Networks With Global Sparsity Constraint Xiao Zhou, Weizhong Zhang, Hang Xu, Tong Zhang [pdf] [supp] [arXiv] [bibtex]

Deep Gaussian Scale Mixture Prior for Spectral Compressive Imaging Tao Huang, Weisheng Dong, Xin Yuan, Jinjian Wu, Guangming Shi [pdf] [supp] [arXiv] [bibtex]

Cross-Domain Gradient Discrepancy Minimization for Unsupervised Domain Adaptation Zhekai Du, Jingjing Li, Hongzu Su, Lei Zhu, Ke Lu [pdf] [bibtex]

DISCO: Dynamic and Invariant Sensitive Channel Obfuscation for Deep Neural Networks Abhishek Singh, Ayush Chopra, Ethan Garza, Emily Zhang, Praneeth Vepakomma, Vivek Sharma, Ramesh Raskar [pdf] [supp] [arXiv] [bibtex]

Training Generative Adversarial Networks in One Stage Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song [pdf] [supp] [arXiv] [bibtex]

Learning To Aggregate and Personalize 3D Face From In-the-Wild Photo Collection Zhenyu Zhang, Yanhao Ge, Renwang Chen, Ying Tai, Yan Yan, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang [pdf] [bibtex]

Leveraging Line-Point Consistence To Preserve Structures for Wide Parallax Image Stitching Qi Jia, ZhengJun Li, Xin Fan, Haotian Zhao, Shiyu Teng, Xinchen Ye, Longin Jan Latecki [pdf] [bibtex]

3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection He Wang, Yezhen Cong, Or Litany, Yue Gao, Leonidas J. Guibas [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Video GANs: Learning for Appearance Consistency and Motion Coherency Sangeek Hyun, Jihwan Kim, Jae-Pil Heo [pdf] [bibtex]

Neural Lumigraph Rendering Petr Kellnhofer, Lars C. Jebe, Andrew Jones, Ryan Spicer, Kari Pulli, Gordon Wetzstein [pdf] [supp] [arXiv] [bibtex]

Robust Multimodal Vehicle Detection in Foggy Weather Using Complementary Lidar and Radar Signals Kun Qian, Shilin Zhu, Xinyu Zhang, Li Erran Li [pdf] [supp] [bibtex]

Stochastic Whitening Batch Normalization Shengdong Zhang, Ehsan Nezhadarya, Homa Fashandi, Jiayi Liu, Darin Graham, Mohak Shah [pdf] [supp] [bibtex]

Self-Guided and Cross-Guided Learning for Few-Shot Segmentation Bingfeng Zhang, Jimin Xiao, Terry Qin [pdf] [supp] [arXiv] [bibtex]

M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training Minheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Dongdong Zhang, Nan Duan [pdf] [arXiv] [bibtex]

Hyperdimensional Computing as a Framework for Systematic Aggregation of Image Descriptors Peer Neubert, Stefan Schubert [pdf] [supp] [arXiv] [bibtex]

Layerwise Optimization by Gradient Decomposition for Continual Learning Shixiang Tang, Dapeng Chen, Jinguo Zhu, Shijie Yu, Wanli Ouyang [pdf] [supp] [arXiv] [bibtex]

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging S. Mahdi H. Miangoleh, Sebastian Dille, Long Mai, Sylvain Paris, Yagiz Aksoy [pdf] [supp] [arXiv] [bibtex]

Blind Deblurring for Saturated Images Liang Chen, Jiawei Zhang, Songnan Lin, Faming Fang, Jimmy S. Ren [pdf] [supp] [bibtex]

Turning Frequency to Resolution: Video Super-Resolution via Event Cameras Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao [pdf] [bibtex]

Time Adaptive Recurrent Neural Network Anil Kag, Venkatesh Saligrama [pdf] [supp] [bibtex]

DeFMO: Deblurring and Shape Recovery of Fast Moving Objects Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Jiri Matas, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]

PISE: Person Image Synthesis and Editing With Decoupled GAN Jinsong Zhang, Kun Li, Yu-Kun Lai, Jingyu Yang [pdf] [supp] [arXiv] [bibtex]

4D Hyperspectral Photoacoustic Data Restoration With Reliability Analysis Weihang Liao, Art Subpa-asa, Yinqiang Zheng, Imari Sato [pdf] [supp] [bibtex]

OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning Spyros Gidaris, Andrei Bursuc, Gilles Puy, Nikos Komodakis, Matthieu Cord, Patrick Perez [pdf] [supp] [bibtex]

Learning-Based Image Registration With Meta-Regularization Ebrahim Al Safadi, Xubo Song [pdf] [bibtex]

A Hyperbolic-to-Hyperbolic Graph Convolutional Network Jindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia [pdf] [supp] [arXiv] [bibtex]

Deep Homography for Efficient Stereo Image Compression Xin Deng, Wenzhe Yang, Ren Yang, Mai Xu, Enpeng Liu, Qianhan Feng, Radu Timofte [pdf] [bibtex]

Point2Skeleton: Learning Skeletal Representations from Point Clouds Cheng Lin, Changjian Li, Yuan Liu, Nenglun Chen, Yi-King Choi, Wenping Wang [pdf] [supp] [arXiv] [bibtex]

Neighborhood Contrastive Learning for Novel Class Discovery Zhun Zhong, Enrico Fini, Subhankar Roy, Zhiming Luo, Elisa Ricci, Nicu Sebe [pdf] [supp] [bibtex]

SimPoE: Simulated Character Control for 3D Human Pose Estimation Ye Yuan, Shih-En Wei, Tomas Simon, Kris Kitani, Jason Saragih [pdf] [supp] [arXiv] [bibtex]

Neural Camera Simulators Hao Ouyang, Zifan Shi, Chenyang Lei, Ka Lung Law, Qifeng Chen [pdf] [supp] [arXiv] [bibtex]

Neighborhood Normalization for Robust Geometric Feature Learning Xingtong Liu, Benjamin D. Killeen, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath [pdf] [supp] [bibtex]

Video Rescaling Networks With Joint Optimization Strategies for Downscaling and Upscaling Yan-Cheng Huang, Yi-Hsin Chen, Cheng-You Lu, Hui-Po Wang, Wen-Hsiao Peng, Ching-Chun Huang [pdf] [supp] [arXiv] [bibtex]

TPCN: Temporal Point Cloud Networks for Motion Forecasting Maosheng Ye, Tongyi Cao, Qifeng Chen [pdf] [arXiv] [bibtex]

TSGCNet: Discriminative Geometric Feature Learning With Two-Stream Graph Convolutional Network for 3D Dental Model Segmentation Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen [pdf] [bibtex]

Meta Batch-Instance Normalization for Generalizable Person Re-Identification Seokeon Choi, Taekyung Kim, Minki Jeong, Hyoungseob Park, Changick Kim [pdf] [supp] [arXiv] [bibtex]

Dictionary-Guided Scene Text Recognition Nguyen Nguyen, Thu Nguyen, Vinh Tran, Minh-Triet Tran, Thanh Duc Ngo, Thien Huu Nguyen, Minh Hoai [pdf] [supp] [bibtex]

Glance and Gaze: Inferring Action-Aware Points for One-Stage Human-Object Interaction Detection Xubin Zhong, Xian Qu, Changxing Ding, Dacheng Tao [pdf] [supp] [arXiv] [bibtex]

Activate or Not: Learning Customized Activation Ningning Ma, Xiangyu Zhang, Ming Liu, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Wide-Baseline Relative Camera Pose Estimation With Directional Learning Kefan Chen, Noah Snavely, Ameesh Makadia [pdf] [supp] [arXiv] [bibtex]

Improving Unsupervised Image Clustering With Robust Learning Sungwon Park, Sungwon Han, Sundong Kim, Danu Kim, Sungkyu Park, Seunghoon Hong, Meeyoung Cha [pdf] [supp] [arXiv] [bibtex]

Neural Surface Maps Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra [pdf] [supp] [arXiv] [bibtex]

Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods Minghan Yang, Dong Xu, Hongyu Chen, Zaiwen Wen, Mengyun Chen [pdf] [supp] [arXiv] [bibtex]

Variational Relational Point Completion Network Liang Pan, Xinyi Chen, Zhongang Cai, Junzhe Zhang, Haiyu Zhao, Shuai Yi, Ziwei Liu [pdf] [supp] [arXiv] [bibtex]

StruMonoNet: Structure-Aware Monocular 3D Prediction Zhenpei Yang, Li Erran Li, Qixing Huang [pdf] [supp] [bibtex]

Learning To Relate Depth and Semantics for Unsupervised Domain Adaptation Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Training Networks in Null Space of Feature Covariance for Continual Learning Shipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu [pdf] [supp] [arXiv] [bibtex]

PiCIE: Unsupervised Semantic Segmentation Using Invariance and Equivariance in Clustering Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan [pdf] [supp] [arXiv] [bibtex]

DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution Tong He, Chunhua Shen, Anton van den Hengel [pdf] [arXiv] [bibtex]

SSLayout360: Semi-Supervised Indoor Layout Estimation From 360° Panorama Phi Vu Tran [pdf] [supp] [bibtex]

SLADE: A Self-Training Framework for Distance Metric Learning Jiali Duan, Yen-Liang Lin, Son Tran, Larry S. Davis, C.-C. Jay Kuo [pdf] [supp] [arXiv] [bibtex]

NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning Hyunho Ha, Joo Ho Lee, Andreas Meuleman, Min H. Kim [pdf] [supp] [bibtex]

SE-SSD: Self-Ensembling Single-Stage Object Detector From Point Cloud Wu Zheng, Weiliang Tang, Li Jiang, Chi-Wing Fu [pdf] [supp] [bibtex]

Where and What? Examining Interpretable Disentangled Representations Xinqi Zhu, Chang Xu, Dacheng Tao [pdf] [supp] [arXiv] [bibtex]

Physically-Aware Generative Network for 3D Shape Modeling Mariem Mezghanni, Malika Boulkenafed, Andre Lieutier, Maks Ovsjanikov [pdf] [supp] [bibtex]

Bilinear Parameterization for Non-Separable Singular Value Penalties Marcus Valtonen Ornhag, Jose Pedro Iglesias, Carl Olsson [pdf] [bibtex]

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild With Pose Annotations Adel Ahmadyan, Liangkai Zhang, Artsiom Ablavatski, Jianing Wei, Matthias Grundmann [pdf] [supp] [arXiv] [bibtex]

Intra-Inter Camera Similarity for Unsupervised Person Re-Identification Shiyu Xuan, Shiliang Zhang [pdf] [arXiv] [bibtex]

Efficient Feature Transformations for Discriminative and Generative Continual Learning Vinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin [pdf] [supp] [arXiv] [bibtex]

Learning a Self-Expressive Network for Subspace Clustering Shangzhi Zhang, Chong You, Rene Vidal, Chun-Guang Li [pdf] [bibtex]

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning Christoph Feichtenhofer, Haoqi Fan, Bo Xiong, Ross Girshick, Kaiming He [pdf] [supp] [arXiv] [bibtex]

Asymmetric Metric Learning for Knowledge Transfer Mateusz Budnik, Yannis Avrithis [pdf] [supp] [arXiv] [bibtex]

Frequency-Aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection Jiaming Li, Hongtao Xie, Jiahong Li, Zhongyuan Wang, Yongdong Zhang [pdf] [supp] [arXiv] [bibtex]

3DCaricShop: A Dataset and a Baseline Method for Single-View 3D Caricature Face Reconstruction Yuda Qiu, Xiaojie Xu, Lingteng Qiu, Yan Pan, Yushuang Wu, Weikai Chen, Xiaoguang Han [pdf] [supp] [arXiv] [bibtex]

OCONet: Image Extrapolation by Object Completion Richard Strong Bowen, Huiwen Chang, Charles Herrmann, Piotr Teterwak, Ce Liu, Ramin Zabih [pdf] [supp] [bibtex]

VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency Ruohan Gao, Kristen Grauman [pdf] [arXiv] [bibtex]

Fair Attribute Classification Through Latent Space De-Biasing Vikram V. Ramaswamy, Sunnie S. Y. Kim, Olga Russakovsky [pdf] [supp] [arXiv] [bibtex]

Correlated Input-Dependent Label Noise in Large-Scale Image Classification Mark Collier, Basil Mustafa, Efi Kokiopoulou, Rodolphe Jenatton, Jesse Berent [pdf] [supp] [arXiv] [bibtex]

Delving Into Localization Errors for Monocular 3D Object Detection Xinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang [pdf] [supp] [arXiv] [bibtex]

Nearest Neighbor Matching for Deep Clustering Zhiyuan Dang, Cheng Deng, Xu Yang, Kun Wei, Heng Huang [pdf] [bibtex]

MOOD: Multi-Level Out-of-Distribution Detection Ziqian Lin, Sreya Dutta Roy, Yixuan Li [pdf] [supp] [arXiv] [bibtex]

Equalization Loss v2: A New Gradient Balance Approach for Long-Tailed Object Detection Jingru Tan, Xin Lu, Gang Zhang, Changqing Yin, Quanquan Li [pdf] [supp] [arXiv] [bibtex]

Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales Yifan Sun, Yuke Zhu, Yuhan Zhang, Pengkun Zheng, Xi Qiu, Chi Zhang, Yichen Wei [pdf] [arXiv] [bibtex]

Primitive Representation Learning for Scene Text Recognition Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao [pdf] [supp] [arXiv] [bibtex]

RPSRNet: End-to-End Trainable Rigid Point Set Registration Network Using Barnes-Hut 2D-Tree Representation Sk Aziz Ali, Kerem Kahraman, Gerd Reis, Didier Stricker [pdf] [supp] [bibtex]

On the Difficulty of Membership Inference Attacks Shahbaz Rezaei, Xin Liu [pdf] [supp] [arXiv] [bibtex]

Neural Geometric Level of Detail: Real-Time Rendering With Implicit 3D Shapes Towaki Takikawa, Joey Litalien, Kangxue Yin, Karsten Kreis, Charles Loop, Derek Nowrouzezahrai, Alec Jacobson, Morgan McGuire, Sanja Fidler [pdf] [supp] [arXiv] [bibtex]

Pareidolia Face Reenactment Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He [pdf] [supp] [arXiv] [bibtex]

ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson [pdf] [supp] [arXiv] [bibtex]

Learning To Segment Rigid Motions From Two Frames Gengshan Yang, Deva Ramanan [pdf] [supp] [arXiv] [bibtex]

Joint Deep Model-Based MR Image and Coil Sensitivity Reconstruction Network (Joint-ICNet) for Fast MRI Yohan Jun, Hyungseob Shin, Taejoon Eo, Dosik Hwang [pdf] [bibtex]

On Feature Normalization and Data Augmentation Boyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, Kilian Q. Weinberger [pdf] [supp] [arXiv] [bibtex]

SelfDoc: Self-Supervised Document Representation Learning Peizhao Li, Jiuxiang Gu, Jason Kuen, Vlad I. Morariu, Handong Zhao, Rajiv Jain, Varun Manjunatha, Hongfu Liu [pdf] [supp] [arXiv] [bibtex]

Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes Zhihang Zhong, Yinqiang Zheng, Imari Sato [pdf] [supp] [arXiv] [bibtex]

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild Jiaxu Miao, Yunchao Wei, Yu Wu, Chen Liang, Guangrui Li, Yi Yang [pdf] [supp] [bibtex]

Multi-Label Learning From Single Positive Labels Elijah Cole, Oisin Mac Aodha, Titouan Lorieul, Pietro Perona, Dan Morris, Nebojsa Jojic [pdf] [bibtex]

Towards Part-Based Understanding of RGB-D Scans Alexey Bokhovkin, Vladislav Ishimtsev, Emil Bogomolov, Denis Zorin, Alexey Artemov, Evgeny Burnaev, Angela Dai [pdf] [supp] [bibtex]

Learning Semantic-Aware Dynamics for Video Prediction Xinzhu Bei, Yanchao Yang, Stefano Soatto [pdf] [arXiv] [bibtex]

Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation Rongjie Li, Songyang Zhang, Bo Wan, Xuming He [pdf] [supp] [arXiv] [bibtex]

Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps Yuk Heo, Yeong Jun Koh, Chang-Su Kim [pdf] [supp] [arXiv] [bibtex]

Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data Ying Shu, Yan Yan, Si Chen, Jing-Hao Xue, Chunhua Shen, Hanzi Wang [pdf] [bibtex]

Decoupled Dynamic Filter Networks Jingkai Zhou, Varun Jampani, Zhixiong Pi, Qiong Liu, Ming-Hsuan Yang [pdf] [supp] [arXiv] [bibtex]

Motion Representations for Articulated Animation Aliaksandr Siarohin, Oliver J. Woodford, Jian Ren, Menglei Chai, Sergey Tulyakov [pdf] [supp] [arXiv] [bibtex]

General Multi-Label Image Classification With Transformers Jack Lanchantin, Tianlu Wang, Vicente Ordonez, Yanjun Qi [pdf] [supp] [arXiv] [bibtex]

On Self-Contact and Human Pose Lea Muller, Ahmed A. A. Osman, Siyu Tang, Chun-Hao P. Huang, Michael J. Black [pdf] [supp] [arXiv] [bibtex]

Center-Based 3D Object Detection and Tracking Tianwei Yin, Xingyi Zhou, Philipp Krahenbuhl [pdf] [supp] [arXiv] [bibtex]

Prototype Augmentation and Self-Supervision for Incremental Learning Fei Zhu, Xu-Yao Zhang, Chuang Wang, Fei Yin, Cheng-Lin Liu [pdf] [bibtex]

CompositeTasking: Understanding Images by Spatial Composition of Tasks Nikola Popovic, Danda Pani Paudel, Thomas Probst, Guolei Sun, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Searching for Fast Model Families on Datacenter Accelerators Sheng Li, Mingxing Tan, Ruoming Pang, Andrew Li, Liqun Cheng, Quoc V. Le, Norman P. Jouppi [pdf] [supp] [arXiv] [bibtex]

Task-Aware Variational Adversarial Active Learning Kwanyoung Kim, Dongwon Park, Kwang In Kim, Se Young Chun [pdf] [supp] [arXiv] [bibtex]

Understanding and Simplifying Perceptual Distances Dan Amir, Yair Weiss [pdf] [supp] [bibtex]

Class-Aware Robust Adversarial Training for Object Detection Pin-Chun Chen, Bo-Han Kung, Jun-Cheng Chen [pdf] [supp] [arXiv] [bibtex]

Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression Yufei Cui, Ziquan Liu, Qiao Li, Antoni B. Chan, Chun Jason Xue [pdf] [arXiv] [bibtex]

Fast Bayesian Uncertainty Estimation and Reduction of Batch Normalized Single Image Super-Resolution Network Aupendu Kar, Prabir Kumar Biswas [pdf] [supp] [arXiv] [bibtex]

Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers Apratim Bhattacharyya, Daniel Olmeda Reino, Mario Fritz, Bernt Schiele [pdf] [supp] [bibtex]

RepVGG: Making VGG-Style ConvNets Great Again Xiaohan Ding, Xiangyu Zhang, Ningning Ma, Jungong Han, Guiguang Ding, Jian Sun [pdf] [arXiv] [bibtex]

Partial Feature Selection and Alignment for Multi-Source Domain Adaptation Yangye Fu, Ming Zhang, Xing Xu, Zuo Cao, Chao Ma, Yanli Ji, Kai Zuo, Huimin Lu [pdf] [bibtex]

Multi-Institutional Collaborations for Improving Deep Learning-Based Magnetic Resonance Image Reconstruction Using Federated Learning Pengfei Guo, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel [pdf] [supp] [arXiv] [bibtex]

UAV-Human: A Large Benchmark for Human Behavior Understanding With Unmanned Aerial Vehicles Tianjiao Li, Jun Liu, Wei Zhang, Yun Ni, Wenqian Wang, Zhiheng Li [pdf] [bibtex]

An Alternative Probabilistic Interpretation of the Huber Loss Gregory P. Meyer [pdf] [supp] [arXiv] [bibtex]

Siamese Natural Language Tracker: Tracking by Natural Language Descriptions With Siamese Trackers Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff [pdf] [arXiv] [bibtex]

Discrimination-Aware Mechanism for Fine-Grained Representation Learning Furong Xu, Meng Wang, Wei Zhang, Yuan Cheng, Wei Chu [pdf] [bibtex]

Rainbow Memory: Continual Learning With a Memory of Diverse Samples Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi [pdf] [supp] [arXiv] [bibtex]

Learning Discriminative Prototypes With Dynamic Time Warping Xiaobin Chang, Frederick Tung, Greg Mori [pdf] [arXiv] [bibtex]

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction Shi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu [pdf] [supp] [arXiv] [bibtex]

Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro [pdf] [supp] [arXiv] [bibtex]

Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-Constrained Optimization Fakai Wang, Kang Zheng, Le Lu, Jing Xiao, Min Wu, Shun Miao [pdf] [supp] [arXiv] [bibtex]

MotionRNN: A Flexible Model for Video Prediction With Spacetime-Varying Motions Haixu Wu, Zhiyu Yao, Jianmin Wang, Mingsheng Long [pdf] [supp] [arXiv] [bibtex]

MOS: Towards Scaling Out-of-Distribution Detection for Large Semantic Space Rui Huang, Yixuan Li [pdf] [supp] [arXiv] [bibtex]

Visual Semantic Role Labeling for Video Understanding Arka Sadhu, Tanmay Gupta, Mark Yatskar, Ram Nevatia, Aniruddha Kembhavi [pdf] [supp] [arXiv] [bibtex]

SwiftNet: Real-Time Video Object Segmentation Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, Song Bai [pdf] [arXiv] [bibtex]

Contrastive Embedding for Generalized Zero-Shot Learning Zongyan Han, Zhenyong Fu, Shuo Chen, Jian Yang [pdf] [supp] [arXiv] [bibtex]

Scale-Localized Abstract Reasoning Yaniv Benny, Niv Pekar, Lior Wolf [pdf] [supp] [arXiv] [bibtex]

Transferable Query Selection for Active Domain Adaptation Bo Fu, Zhangjie Cao, Jianmin Wang, Mingsheng Long [pdf] [supp] [bibtex]

CLCC: Contrastive Learning for Color Constancy Yi-Chen Lo, Chia-Che Chang, Hsuan-Chao Chiu, Yu-Hao Huang, Chia-Ping Chen, Yu-Lin Chang, Kevin Jou [pdf] [bibtex]

Dual Attention Suppression Attack: Generate Adversarial Camouflage in Physical World Jiakai Wang, Aishan Liu, Zixin Yin, Shunchang Liu, Shiyu Tang, Xianglong Liu [pdf] [arXiv] [bibtex]

Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-Balanced Samplings Hao Guo, Song Wang [pdf] [bibtex]

3D Object Detection With Pointformer Xuran Pan, Zhuofan Xia, Shiji Song, Li Erran Li, Gao Huang [pdf] [supp] [arXiv] [bibtex]

Fair Feature Distillation for Visual Recognition Sangwon Jung, Donggyu Lee, Taeeon Park, Taesup Moon [pdf] [supp] [bibtex]

Diversifying Sample Generation for Accurate Data-Free Quantization Xiangguo Zhang, Haotong Qin, Yifu Ding, Ruihao Gong, Qinghua Yan, Renshuai Tao, Yuhang Li, Fengwei Yu, Xianglong Liu [pdf] [supp] [arXiv] [bibtex]

SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor [pdf] [supp] [arXiv] [bibtex]

Inferring CAD Modeling Sequences Using Zone Graphs Xianghao Xu, Wenzhe Peng, Chin-Yi Cheng, Karl D.D. Willis, Daniel Ritchie [pdf] [supp] [arXiv] [bibtex]

Closed-Form Factorization of Latent Semantics in GANs Yujun Shen, Bolei Zhou [pdf] [arXiv] [bibtex]

Weakly-Supervised Physically Unconstrained Gaze Estimation Rakshit Kothari, Shalini De Mello, Umar Iqbal, Wonmin Byeon, Seonwook Park, Jan Kautz [pdf] [supp] [arXiv] [bibtex]

A Circular-Structured Representation for Visual Emotion Distribution Learning Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Xinbo Gao [pdf] [bibtex]

VirTex: Learning Visual Representations From Textual Annotations Karan Desai, Justin Johnson [pdf] [supp] [arXiv] [bibtex]

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution Liying Lu, Wenbo Li, Xin Tao, Jiangbo Lu, Jiaya Jia [pdf] [supp] [bibtex]

Spatiotemporal Contrastive Video Representation Learning Rui Qian, Tianjian Meng, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge Belongie, Yin Cui [pdf] [supp] [arXiv] [bibtex]

Scaled-YOLOv4: Scaling Cross Stage Partial Network Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao [pdf] [supp] [bibtex]

Quantifying Explainers of Graph Neural Networks in Computational Pathology Guillaume Jaume, Pushpak Pati, Behzad Bozorgtabar, Antonio Foncubierta, Anna Maria Anniciello, Florinda Feroce, Tilman Rau, Jean-Philippe Thiran, Maria Gabrani, Orcun Goksel [pdf] [supp] [arXiv] [bibtex]

Knowledge Evolution in Neural Networks Ahmed Taha, Abhinav Shrivastava, Larry S. Davis [pdf] [supp] [arXiv] [bibtex]

Revisiting Knowledge Distillation: An Inheritance and Exploration Framework Zhen Huang, Xu Shen, Jun Xing, Tongliang Liu, Xinmei Tian, Houqiang Li, Bing Deng, Jianqiang Huang, Xian-Sheng Hua [pdf] [bibtex]

Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen [pdf] [arXiv] [bibtex]

SMURF: Self-Teaching Multi-Frame Unsupervised RAFT With Full-Image Warping Austin Stone, Daniel Maurer, Alper Ayvaci, Anelia Angelova, Rico Jonschkowski [pdf] [arXiv] [bibtex]

Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison Shenzhi Wang, Liwei Wu, Lei Cui, Yujun Shen [pdf] [supp] [bibtex]

Single-View 3D Object Reconstruction From Shape Priors in Memory Shuo Yang, Min Xu, Haozhe Xie, Stuart Perry, Jiahao Xia [pdf] [arXiv] [bibtex]

Recognizing Actions in Videos From Unseen Viewpoints AJ Piergiovanni, Michael S. Ryoo [pdf] [supp] [arXiv] [bibtex]

Perceptual Indistinguishability-Net (PI-Net): Facial Image Obfuscation With Manipulable Semantics Jia-Wei Chen, Li-Ju Chen, Chia-Mu Yu, Chun-Shien Lu [pdf] [supp] [arXiv] [bibtex]

To the Point: Efficient 3D Object Detection in the Range Image With Graph Convolution Kernels Yuning Chai, Pei Sun, Jiquan Ngiam, Weiyue Wang, Benjamin Caine, Vijay Vasudevan, Xiao Zhang, Dragomir Anguelov [pdf] [supp] [bibtex]

Coarse-To-Fine Domain Adaptive Semantic Segmentation With Photometric Alignment and Category-Center Regularization Haoyu Ma, Xiangru Lin, Zifeng Wu, Yizhou Yu [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Wasserstein Pseudo-Labeling for Semi-Supervised Image Classification Fariborz Taherkhani, Ali Dabouei, Sobhan Soleymani, Jeremy Dawson, Nasser M. Nasrabadi [pdf] [bibtex]

MeanShift++: Extremely Fast Mode-Seeking With Applications to Segmentation and Object Tracking Jennifer Jang, Heinrich Jiang [pdf] [supp] [bibtex]

PCLs: Geometry-Aware Neural Reconstruction of 3D Pose With Perspective Crop Layers Frank Yu, Mathieu Salzmann, Pascal Fua, Helge Rhodin [pdf] [supp] [arXiv] [bibtex]

Partially View-Aligned Representation Learning With Noise-Robust Contrastive Loss Mouxing Yang, Yunfan Li, Zhenyu Huang, Zitao Liu, Peng Hu, Xi Peng [pdf] [supp] [bibtex]

i3DMM: Deep Implicit 3D Morphable Model of Human Heads Tarun Yenamandra, Ayush Tewari, Florian Bernard, Hans-Peter Seidel, Mohamed Elgharib, Daniel Cremers, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]

Searching by Generating: Flexible and Efficient One-Shot NAS With Architecture Generator Sian-Yao Huang, Wei-Ta Chu [pdf] [supp] [arXiv] [bibtex]

Discovering Interpretable Latent Space Directions of GANs Beyond Binary Attributes Huiting Yang, Liangyu Chai, Qiang Wen, Shuang Zhao, Zixun Sun, Shengfeng He [pdf] [supp] [bibtex]

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis Yinan He, Bei Gan, Siyu Chen, Yichun Zhou, Guojun Yin, Luchuan Song, Lu Sheng, Jing Shao, Ziwei Liu [pdf] [supp] [arXiv] [bibtex]

Blocks-World Cameras Jongho Lee, Mohit Gupta [pdf] [supp] [bibtex]

The Affective Growth of Computer Vision Norman Makoto Su, David J. Crandall [pdf] [bibtex]

Lifelong Person Re-Identification via Adaptive Knowledge Accumulation Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew [pdf] [supp] [arXiv] [bibtex]

Omnimatte: Associating Objects and Their Effects in Video Erika Lu, Forrester Cole, Tali Dekel, Andrew Zisserman, William T. Freeman, Michael Rubinstein [pdf] [arXiv] [bibtex]

Detecting Human-Object Interaction via Fabricated Compositional Learning Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao [pdf] [supp] [arXiv] [bibtex]

Memory-Efficient Network for Large-Scale Video Compressive Sensing Ziheng Cheng, Bo Chen, Guanliang Liu, Hao Zhang, Ruiying Lu, Zhengjue Wang, Xin Yuan [pdf] [supp] [arXiv] [bibtex]

Deep Optimized Priors for 3D Shape Modeling and Reconstruction Mingyue Yang, Yuxin Wen, Weikai Chen, Yongwei Chen, Kui Jia [pdf] [supp] [arXiv] [bibtex]

Affordance Transfer Learning for Human-Object Interaction Detection Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao [pdf] [supp] [arXiv] [bibtex]

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency Zongxin Yang, Xin Yu, Yi Yang [pdf] [supp] [bibtex]

Rethinking Graph Neural Architecture Search From Message-Passing Shaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang [pdf] [arXiv] [bibtex]

Locate Then Segment: A Strong Pipeline for Referring Image Segmentation Ya Jing, Tao Kong, Wei Wang, Liang Wang, Lei Li, Tieniu Tan [pdf] [arXiv] [bibtex]

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah [pdf] [supp] [arXiv] [bibtex]

Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, Daniel Cohen-Or [pdf] [supp] [arXiv] [bibtex]

Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning Shaoxiang Chen, Yu-Gang Jiang [pdf] [supp] [bibtex]

DER: Dynamically Expandable Representation for Class Incremental Learning Shipeng Yan, Jiangwei Xie, Xuming He [pdf] [arXiv] [bibtex]

Fine-Grained Angular Contrastive Learning With Coarse Labels Guy Bukchin, Eli Schwartz, Kate Saenko, Ori Shahar, Rogerio Feris, Raja Giryes, Leonid Karlinsky [pdf] [supp] [arXiv] [bibtex]

Polarimetric Normal Stereo Yoshiki Fukao, Ryo Kawahara, Shohei Nobuhara, Ko Nishino [pdf] [supp] [bibtex]

Manifold Regularized Dynamic Network Pruning Yehui Tang, Yunhe Wang, Yixing Xu, Yiping Deng, Chao Xu, Dacheng Tao, Chang Xu [pdf] [supp] [arXiv] [bibtex]

ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search Lumin Xu, Yingda Guan, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang [pdf] [supp] [arXiv] [bibtex]

Open Domain Generalization with Domain-Augmented Meta-Learning Yang Shu, Zhangjie Cao, Chenyu Wang, Jianmin Wang, Mingsheng Long [pdf] [supp] [arXiv] [bibtex]

DeepTag: An Unsupervised Deep Learning Method for Motion Tracking on Cardiac Tagging Magnetic Resonance Images Meng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, Dimitris Metaxas [pdf] [arXiv] [bibtex]

Learning by Planning: Language-Guided Global Image Editing Jing Shi, Ning Xu, Yihang Xu, Trung Bui, Franck Dernoncourt, Chenliang Xu [pdf] [bibtex]

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation Subhankar Roy, Evgeny Krivosheev, Zhun Zhong, Nicu Sebe, Elisa Ricci [pdf] [supp] [arXiv] [bibtex]

Uncalibrated Neural Inverse Rendering for Photometric Stereo of General Surfaces Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]

Improving the Transferability of Adversarial Samples With Adversarial Transformations Weibin Wu, Yuxin Su, Michael R. Lyu, Irwin King [pdf] [bibtex]

Self-Supervised Learning for Semi-Supervised Temporal Action Proposal Xiang Wang, Shiwei Zhang, Zhiwu Qing, Yuanjie Shao, Changxin Gao, Nong Sang [pdf] [arXiv] [bibtex]

Learning Compositional Representation for 4D Captures With Neural ODE Boyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu [pdf] [supp] [arXiv] [bibtex]

Effective Snapshot Compressive-Spectral Imaging via Deep Denoising and Total Variation Priors Haiquan Qiu, Yao Wang, Deyu Meng [pdf] [supp] [bibtex]

LAFEAT: Piercing Through Adversarial Defenses With Latent Features Yunrui Yu, Xitong Gao, Cheng-Zhong Xu [pdf] [arXiv] [bibtex]

Exploiting Spatial Dimensions of Latent in GAN for Real-Time Image Editing Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh [pdf] [supp] [arXiv] [bibtex]

Bidirectional Projection Network for Cross Dimension Scene Understanding Wenbo Hu, Hengshuang Zhao, Li Jiang, Jiaya Jia, Tien-Tsin Wong [pdf] [arXiv] [bibtex]

Event-Based Synthetic Aperture Imaging With a Hybrid Network Xiang Zhang, Wei Liao, Lei Yu, Wen Yang, Gui-Song Xia [pdf] [supp] [arXiv] [bibtex]

RSG: A Simple but Effective Module for Learning Imbalanced Datasets Jianfeng Wang, Thomas Lukasiewicz, Xiaolin Hu, Jianfei Cai, Zhenghua Xu [pdf] [supp] [bibtex]

Learning Statistical Texture for Semantic Segmentation Lanyun Zhu, Deyi Ji, Shiping Zhu, Weihao Gan, Wei Wu, Junjie Yan [pdf] [arXiv] [bibtex]

Neural Feature Search for RGB-Infrared Person Re-Identification Yehansen Chen, Lin Wan, Zhihang Li, Qianyan Jing, Zongyuan Sun [pdf] [supp] [arXiv] [bibtex]

FP-NAS: Fast Probabilistic Neural Architecture Search Zhicheng Yan, Xiaoliang Dai, Peizhao Zhang, Yuandong Tian, Bichen Wu, Matt Feiszli [pdf] [supp] [bibtex]

Fast Sinkhorn Filters: Using Matrix Scaling for Non-Rigid Shape Correspondence With Functional Maps Gautam Pai, Jing Ren, Simone Melzi, Peter Wonka, Maks Ovsjanikov [pdf] [supp] [bibtex]

Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang [pdf] [supp] [arXiv] [bibtex]

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel Brostow, Michael Firman [pdf] [arXiv] [bibtex]

Distribution-Aware Adaptive Multi-Bit Quantization Sijie Zhao, Tao Yue, Xuemei Hu [pdf] [supp] [bibtex]

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach [pdf] [supp] [arXiv] [bibtex]

Amalgamating Knowledge From Heterogeneous Graph Neural Networks Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao [pdf] [bibtex]

MetaSets: Meta-Learning on Point Sets for Generalizable Representations Chao Huang, Zhangjie Cao, Yunbo Wang, Jianmin Wang, Mingsheng Long [pdf] [supp] [bibtex]

StEP: Style-Based Encoder Pre-Training for Multi-Modal Image Synthesis Moustafa Meshry, Yixuan Ren, Larry S. Davis, Abhinav Shrivastava [pdf] [supp] [arXiv] [bibtex]

Goal-Oriented Gaze Estimation for Zero-Shot Learning Yang Liu, Lei Zhou, Xiao Bai, Yifei Huang, Lin Gu, Jun Zhou, Tatsuya Harada [pdf] [arXiv] [bibtex]

LED2-Net: Monocular 360° Layout Estimation via Differentiable Depth Rendering Fu-En Wang, Yu-Hsuan Yeh, Min Sun, Wei-Chen Chiu, Yi-Hsuan Tsai [pdf] [bibtex]

Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos Mingxing Zhang, Yang Yang, Xinghan Chen, Yanli Ji, Xing Xu, Jingjing Li, Heng Tao Shen [pdf] [bibtex]

DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation Xinyi Wu, Zhenyao Wu, Hao Guo, Lili Ju, Song Wang [pdf] [arXiv] [bibtex]

Dynamic Transfer for Multi-Source Domain Adaptation Yunsheng Li, Lu Yuan, Yinpeng Chen, Pei Wang, Nuno Vasconcelos [pdf] [arXiv] [bibtex]

Semi-Supervised Video Deraining With Dynamical Rain Generator Zongsheng Yue, Jianwen Xie, Qian Zhao, Deyu Meng [pdf] [supp] [arXiv] [bibtex]

See Through Gradients: Image Batch Recovery via GradInversion Hongxu Yin, Arun Mallya, Arash Vahdat, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov [pdf] [supp] [arXiv] [bibtex]

Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang [pdf] [arXiv] [bibtex]

Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences Norman Muller, Yu-Shiang Wong, Niloy J. Mitra, Angela Dai, Matthias Niessner [pdf] [supp] [bibtex]

Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang [pdf] [supp] [arXiv] [bibtex]

AutoFlow: Learning a Better Training Set for Optical Flow Deqing Sun, Daniel Vlasic, Charles Herrmann, Varun Jampani, Michael Krainin, Huiwen Chang, Ramin Zabih, William T. Freeman, Ce Liu [pdf] [supp] [arXiv] [bibtex]

LPSNet: A Lightweight Solution for Fast Panoptic Segmentation Weixiang Hong, Qingpei Guo, Wei Zhang, Jingdong Chen, Wei Chu [pdf] [bibtex]

You See What I Want You To See: Exploring Targeted Black-Box Transferability Attack for Hash-Based Image Retrieval Systems Yanru Xiao, Cong Wang [pdf] [supp] [bibtex]

The Blessings of Unlabeled Background in Untrimmed Videos Yuan Liu, Jingyuan Chen, Zhenfang Chen, Bing Deng, Jianqiang Huang, Hanwang Zhang [pdf] [supp] [arXiv] [bibtex]

Autoregressive Stylized Motion Synthesis With Generative Flow Yu-Hui Wen, Zhipeng Yang, Hongbo Fu, Lin Gao, Yanan Sun, Yong-Jin Liu [pdf] [supp] [bibtex]

Improving Multiple Object Tracking With Single Object Tracking Linyu Zheng, Ming Tang, Yingying Chen, Guibo Zhu, Jinqiao Wang, Hanqing Lu [pdf] [supp] [bibtex]

Memory Oriented Transfer Learning for Semi-Supervised Image Deraining Huaibo Huang, Aijing Yu, Ran He [pdf] [supp] [bibtex]

Instance Localization for Self-Supervised Detection Pretraining Ceyuan Yang, Zhirong Wu, Bolei Zhou, Stephen Lin [pdf] [arXiv] [bibtex]

Adaptive Methods for Real-World Domain Generalization Abhimanyu Dubey, Vignesh Ramanathan, Alex Pentland, Dhruv Mahajan [pdf] [arXiv] [bibtex]

Deep Animation Video Interpolation in the Wild Li Siyao, Shiyu Zhao, Weijiang Yu, Wenxiu Sun, Dimitris Metaxas, Chen Change Loy, Ziwei Liu [pdf] [supp] [arXiv] [bibtex]

Isometric Multi-Shape Matching Maolin Gao, Zorah Lahner, Johan Thunberg, Daniel Cremers, Florian Bernard [pdf] [supp] [arXiv] [bibtex]

Spatially Consistent Representation Learning Byungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim [pdf] [supp] [arXiv] [bibtex]

Semantic Scene Completion via Integrating Instances and Scene In-the-Loop Yingjie Cai, Xuesong Chen, Chao Zhang, Kwan-Yee Lin, Xiaogang Wang, Hongsheng Li [pdf] [supp] [arXiv] [bibtex]

Efficient Deformable Shape Correspondence via Multiscale Spectral Manifold Wavelets Preservation Ling Hu, Qinsong Li, Shengjun Liu, Xinru Liu [pdf] [supp] [bibtex]

TearingNet: Point Cloud Autoencoder To Learn Topology-Friendly Representations Jiahao Pang, Duanshun Li, Dong Tian [pdf] [supp] [arXiv] [bibtex]

Boosting Ensemble Accuracy by Revisiting Ensemble Diversity Metrics Yanzhao Wu, Ling Liu, Zhongwei Xie, Ka-Ho Chow, Wenqi Wei [pdf] [supp] [bibtex]

WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie Zhou [pdf] [arXiv] [bibtex]

RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection Pei Sun, Weiyue Wang, Yuning Chai, Gamaleldin Elsayed, Alex Bewley, Xiao Zhang, Cristian Sminchisescu, Dragomir Anguelov [pdf] [supp] [bibtex]

Labeled From Unlabeled: Exploiting Unlabeled Data for Few-Shot Deep HDR Deghosting K. Ram Prabhakar, Gowtham Senthil, Susmit Agrawal, R. Venkatesh Babu, Rama Krishna Sai S Gorthi [pdf] [bibtex]

Convolutional Dynamic Alignment Networks for Interpretable Classifications Moritz Bohle, Mario Fritz, Bernt Schiele [pdf] [supp] [arXiv] [bibtex]

EDNet: Efficient Disparity Estimation With Cost Volume Combination and Attention-Based Spatial Residual Songyan Zhang, Zhicheng Wang, Qiang Wang, Jinshuo Zhang, Gang Wei, Xiaowen Chu [pdf] [arXiv] [bibtex]

Unsupervised Visual Representation Learning by Tracking Patches in Video Guangting Wang, Yizhou Zhou, Chong Luo, Wenxuan Xie, Wenjun Zeng, Zhiwei Xiong [pdf] [supp] [arXiv] [bibtex]

Wasserstein Contrastive Representation Distillation Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin [pdf] [supp] [arXiv] [bibtex]

Learnable Companding Quantization for Accurate Low-Bit Neural Networks Kohei Yamamoto [pdf] [supp] [arXiv] [bibtex]

FaceInpainter: High Fidelity Face Adaptation to Heterogeneous Domains Jia Li, Zhaoyang Li, Jie Cao, Xingguang Song, Ran He [pdf] [supp] [bibtex]

How Robust Are Randomized Smoothing Based Defenses to Data Poisoning? Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm [pdf] [supp] [arXiv] [bibtex]

Deep Learning in Latent Space for Video Prediction and Compression Bowen Liu, Yu Chen, Shiyu Liu, Hun-Seok Kim [pdf] [supp] [bibtex]

PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang [pdf] [supp] [bibtex]

ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for Semi-Supervised Continual Learning Liyuan Wang, Kuo Yang, Chongxuan Li, Lanqing Hong, Zhenguo Li, Jun Zhu [pdf] [supp] [arXiv] [bibtex]

Dynamic Region-Aware Convolution Jin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun [pdf] [supp] [arXiv] [bibtex]

Explore Image Deblurring via Encoded Blur Kernel Space Phong Tran, Anh Tuan Tran, Quynh Phung, Minh Hoai [pdf] [supp] [bibtex]

BCNet: Searching for Network Width With Bilaterally Coupled Network Xiu Su, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu [pdf] [supp] [arXiv] [bibtex]

Camera Pose Matters: Improving Depth Prediction by Mitigating Pose Distribution Bias Yunhan Zhao, Shu Kong, Charless Fowlkes [pdf] [supp] [arXiv] [bibtex]

Lipstick Ain't Enough: Beyond Color Matching for In-the-Wild Makeup Transfer Thao Nguyen, Anh Tuan Tran, Minh Hoai [pdf] [supp] [bibtex]

Generative Interventions for Causal Learning Chengzhi Mao, Augustine Cha, Amogh Gupta, Hao Wang, Junfeng Yang, Carl Vondrick [pdf] [arXiv] [bibtex]

Graph Stacked Hourglass Networks for 3D Human Pose Estimation Tianhan Xu, Wataru Takano [pdf] [arXiv] [bibtex]

Adaptive Aggregation Networks for Class-Incremental Learning Yaoyao Liu, Bernt Schiele, Qianru Sun [pdf] [supp] [arXiv] [bibtex]

VS-Net: Voting With Segmentation for Visual Localization Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li [pdf] [supp] [bibtex]

Learning To Identify Correct 2D-2D Line Correspondences on Sphere Haoang Li, Kai Chen, Ji Zhao, Jiangliu Wang, Pyojin Kim, Zhe Liu, Yun-Hui Liu [pdf] [bibtex]

Domain-Independent Dominance of Adaptive Methods Pedro Savarese, David McAllester, Sudarshan Babu, Michael Maire [pdf] [supp] [arXiv] [bibtex]

What if We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa [pdf] [supp] [arXiv] [bibtex]

Incremental Learning via Rate Reduction Ziyang Wu, Christina Baek, Chong You, Yi Ma [pdf] [arXiv] [bibtex]

Neural Descent for Visual 3D Human Pose and Shape Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu [pdf] [supp] [arXiv] [bibtex]

HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo [pdf] [supp] [bibtex]

Transitional Adaptation of Pretrained Models for Visual Storytelling Youngjae Yu, Jiwan Chung, Heeseung Yun, Jongseok Kim, Gunhee Kim [pdf] [supp] [bibtex]

Improving Panoptic Segmentation at All Scales Lorenzo Porzi, Samuel Rota Bulo, Peter Kontschieder [pdf] [supp] [arXiv] [bibtex]

Model-Contrastive Federated Learning Qinbin Li, Bingsheng He, Dawn Song [pdf] [supp] [arXiv] [bibtex]

Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification? Ruoxi Jia, Fan Wu, Xuehui Sun, Jiacen Xu, David Dao, Bhavya Kailkhura, Ce Zhang, Bo Li, Dawn Song [pdf] [supp] [arXiv] [bibtex]

Hierarchical Layout-Aware Graph Convolutional Network for Unified Aesthetics Assessment Dongyu She, Yu-Kun Lai, Gaoxiong Yi, Kun Xu [pdf] [supp] [bibtex]

Normalized Avatar Synthesis Using StyleGAN and Perceptual Refinement Huiwen Luo, Koki Nagano, Han-Wei Kung, Qingguo Xu, Zejian Wang, Lingyu Wei, Liwen Hu, Hao Li [pdf] [bibtex]

CT-Net: Complementary Transfering Network for Garment Transfer With Arbitrary Geometric Changes Fan Yang, Guosheng Lin [pdf] [supp] [bibtex]

MetaCorrection: Domain-Aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation Xiaoqing Guo, Chen Yang, Baopu Li, Yixuan Yuan [pdf] [arXiv] [bibtex]

Multi-Stage Progressive Image Restoration Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao [pdf] [supp] [arXiv] [bibtex]

PointNetLK Revisited Xueqian Li, Jhony Kaesemodel Pontes, Simon Lucey [pdf] [supp] [arXiv] [bibtex]

Deep Convolutional Dictionary Learning for Image Denoising Hongyi Zheng, Hongwei Yong, Lei Zhang [pdf] [supp] [bibtex]

Fourier Contour Embedding for Arbitrary-Shaped Text Detection Yiqin Zhu, Jianyong Chen, Lingyu Liang, Zhanghui Kuang, Lianwen Jin, Wayne Zhang [pdf] [arXiv] [bibtex]

TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florencio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo [pdf] [supp] [arXiv] [bibtex]

Seeing Out of the Box: End-to-End Pre-Training for Vision-Language Representation Learning Zhicheng Huang, Zhaoyang Zeng, Yupan Huang, Bei Liu, Dongmei Fu, Jianlong Fu [pdf] [supp] [arXiv] [bibtex]

Quality-Agnostic Image Recognition via Invertible Decoder Insoo Kim, Seungju Han, Ji-won Baek, Seong-Jin Park, Jae-Joon Han, Jinwoo Shin [pdf] [supp] [bibtex]

Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach Yu Chen, Ji Zhao, Laurent Kneip [pdf] [supp] [arXiv] [bibtex]

One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu [pdf] [supp] [arXiv] [bibtex]

Out-of-Distribution Detection Using Union of 1-Dimensional Subspaces Alireza Zaeemzadeh, Niccolo Bisagno, Zeno Sambugaro, Nicola Conci, Nazanin Rahnavard, Mubarak Shah [pdf] [supp] [bibtex]

MP3: A Unified Model To Map, Perceive, Predict and Plan Sergio Casas, Abbas Sadat, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements Qianli Ma, Shunsuke Saito, Jinlong Yang, Siyu Tang, Michael J. Black [pdf] [supp] [arXiv] [bibtex]

Playable Video Generation Willi Menapace, Stephane Lathuiliere, Sergey Tulyakov, Aliaksandr Siarohin, Elisa Ricci [pdf] [arXiv] [bibtex]

AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations From Self-Trained Negative Adversaries Qianjiang Hu, Xiao Wang, Wei Hu, Guo-Jun Qi [pdf] [supp] [arXiv] [bibtex]

Permute, Quantize, and Fine-Tune: Efficient Compression of Neural Networks Julieta Martinez, Jashan Shewakramani, Ting Wei Liu, Ioan Andrei Barsan, Wenyuan Zeng, Raquel Urtasun [pdf] [bibtex]

Mol2Image: Improved Conditional Flow Models for Molecule to Image Synthesis Karren Yang, Samuel Goldman, Wengong Jin, Alex X. Lu, Regina Barzilay, Tommi Jaakkola, Caroline Uhler [pdf] [supp] [bibtex]

Improved Handling of Motion Blur in Online Object Detection Mohamed Sayed, Gabriel Brostow [pdf] [arXiv] [bibtex]

Multimodal Motion Prediction With Stacked Transformers Yicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, Bolei Zhou [pdf] [supp] [arXiv] [bibtex]

The Translucent Patch: A Physical and Universal Attack on Object Detectors Alon Zolfi, Moshe Kravchik, Yuval Elovici, Asaf Shabtai [pdf] [arXiv] [bibtex]

Exploit Visual Dependency Relations for Semantic Segmentation Mingyuan Liu, Dan Schonfeld, Wei Tang [pdf] [bibtex]

Dense Label Encoding for Boundary Discontinuity Free Rotation Detection Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan [pdf] [arXiv] [bibtex]

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On Igor Santesteban, Nils Thuerey, Miguel A. Otaduy, Dan Casas [pdf] [supp] [arXiv] [bibtex]

DexYCB: A Benchmark for Capturing Hand Grasping of Objects Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S. Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, Jan Kautz, Dieter Fox [pdf] [supp] [arXiv] [bibtex]

Prototype Completion With Primitive Knowledge for Few-Shot Learning Baoquan Zhang, Xutao Li, Yunming Ye, Zhichao Huang, Lisai Zhang [pdf] [supp] [arXiv] [bibtex]

High-Quality Stereo Image Restoration From Double Refraction Hakyeong Kim, Andreas Meuleman, Daniel S. Jeon, Min H. Kim [pdf] [supp] [bibtex]

Track, Check, Repeat: An EM Approach to Unsupervised Tracking Adam W. Harley, Yiming Zuo, Jing Wen, Ayush Mangal, Shubhankar Potdar, Ritwick Chaudhry, Katerina Fragkiadaki [pdf] [arXiv] [bibtex]

LayoutTransformer: Scene Layout Generation With Conceptual and Spatial Diversity Cheng-Fu Yang, Wan-Cyuan Fan, Fu-En Yang, Yu-Chiang Frank Wang [pdf] [supp] [bibtex]

Practical Wide-Angle Portraits Correction With Deep Structured Models Jing Tan, Shan Zhao, Pengfei Xiong, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu [pdf] [supp] [arXiv] [bibtex]

CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild Bastian Wandt, Marco Rudolph, Petrissa Zell, Helge Rhodin, Bodo Rosenhahn [pdf] [arXiv] [bibtex]

Pushing It Out of the Way: Interactive Visual Navigation Kuo-Hao Zeng, Luca Weihs, Ali Farhadi, Roozbeh Mottaghi [pdf] [supp] [arXiv] [bibtex]

Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation Liwei Wang, Jing Huang, Yin Li, Kun Xu, Zhengyuan Yang, Dong Yu [pdf] [supp] [arXiv] [bibtex]

EvDistill: Asynchronous Events To End-Task Learning via Bidirectional Reconstruction-Guided Cross-Modal Knowledge Distillation Lin Wang, Yujeong Chae, Sung-Hoon Yoon, Tae-Kyun Kim, Kuk-Jin Yoon [pdf] [bibtex]

LoFTR: Detector-Free Local Feature Matching With Transformers Jiaming Sun, Zehong Shen, Yuang Wang, Hujun Bao, Xiaowei Zhou [pdf] [arXiv] [bibtex]

Combinatorial Learning of Graph Edit Distance via Dynamic Embedding Runzhong Wang, Tianqi Zhang, Tianshu Yu, Junchi Yan, Xiaokang Yang [pdf] [arXiv] [bibtex]

Radar-Camera Pixel Depth Association for Depth Completion Yunfei Long, Daniel Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan [pdf] [supp] [arXiv] [bibtex]

Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Hanqing Zhao, Weiming Zhang, Nenghai Yu [pdf] [supp] [arXiv] [bibtex]

Revisiting Superpixels for Active Learning in Semantic Segmentation With Realistic Annotation Costs Lile Cai, Xun Xu, Jun Hao Liew, Chuan Sheng Foo [pdf] [supp] [bibtex]

IMODAL: Creating Learnable User-Defined Deformation Models Leander Lacroix, Benjamin Charlier, Alain Trouve, Barbara Gris [pdf] [supp] [bibtex]

Fast End-to-End Learning on Protein Surfaces Freyr Sverrisson, Jean Feydy, Bruno E. Correia, Michael M. Bronstein [pdf] [supp] [bibtex]

Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules Aisha Urooj, Hilde Kuehne, Kevin Duarte, Chuang Gan, Niels Lobo, Mubarak Shah [pdf] [supp] [arXiv] [bibtex]

Person Re-Identification Using Heterogeneous Local Graph Attention Networks Zhong Zhang, Haijia Zhang, Shuang Liu [pdf] [bibtex]

Recurrent Multi-View Alignment Network for Unsupervised Surface Registration Wanquan Feng, Juyong Zhang, Hongrui Cai, Haofei Xu, Junhui Hou, Hujun Bao [pdf] [arXiv] [bibtex]

Divide-and-Conquer for Lane-Aware Diverse Trajectory Prediction Sriram Narayanan, Ramin Moslemi, Francesco Pittaluga, Buyu Liu, Manmohan Chandraker [pdf] [supp] [arXiv] [bibtex]

Probabilistic 3D Human Shape and Pose Estimation From Multiple Unconstrained Images in the Wild Akash Sengupta, Ignas Budvytis, Roberto Cipolla [pdf] [supp] [arXiv] [bibtex]

Weakly Supervised Instance Segmentation for Videos With Temporal Mask Consistency Qing Liu, Vignesh Ramanathan, Dhruv Mahajan, Alan Yuille, Zhenheng Yang [pdf] [arXiv] [bibtex]

Exploring Data-Efficient 3D Scene Understanding With Contrastive Scene Contexts Ji Hou, Benjamin Graham, Matthias Niessner, Saining Xie [pdf] [supp] [arXiv] [bibtex]

MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song [pdf] [supp] [arXiv] [bibtex]

Learning To Reconstruct High Speed and High Dynamic Range Videos From Events Yunhao Zou, Yinqiang Zheng, Tsuyoshi Takatani, Ying Fu [pdf] [bibtex]

PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS With Relationship Recovery Tianyi Zhang, Jie Lin, Peng Hu, Bin Zhao, Mohamed M. Sabry Aly [pdf] [supp] [bibtex]

Flow-Guided One-Shot Talking Face Generation With a High-Resolution Audio-Visual Dataset Zhimeng Zhang, Lincheng Li, Yu Ding, Changjie Fan [pdf] [bibtex]

VIGOR: Cross-View Image Geo-Localization Beyond One-to-One Retrieval Sijie Zhu, Taojiannan Yang, Chen Chen [pdf] [supp] [arXiv] [bibtex]

D-NeRF: Neural Radiance Fields for Dynamic Scenes Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer [pdf] [bibtex]

Towards Unified Surgical Skill Assessment Daochang Liu, Qiyue Li, Tingting Jiang, Yizhou Wang, Rulin Miao, Fei Shan, Ziyu Li [pdf] [supp] [arXiv] [bibtex]

Read and Attend: Temporal Localisation in Sign Language Videos Gul Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman [pdf] [supp] [arXiv] [bibtex]

ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation Qiang Zhang, Shenlu Zhao, Yongjiang Luo, Dingwen Zhang, Nianchang Huang, Jungong Han [pdf] [supp] [bibtex]

Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation Ryuhei Hamaguchi, Yasutaka Furukawa, Masaki Onishi, Ken Sakurada [pdf] [supp] [arXiv] [bibtex]

Learning a Facial Expression Embedding Disentangled From Identity Wei Zhang, Xianpeng Ji, Keyu Chen, Yu Ding, Changjie Fan [pdf] [bibtex]

Robust Bayesian Neural Networks by Spectral Expectation Bound Regularization Jiaru Zhang, Yang Hua, Zhengui Xue, Tao Song, Chengyu Zheng, Ruhui Ma, Haibing Guan [pdf] [supp] [bibtex]

Learning Probabilistic Ordinal Embeddings for Uncertainty-Aware Regression Wanhua Li, Xiaoke Huang, Jiwen Lu, Jianjiang Feng, Jie Zhou [pdf] [supp] [arXiv] [bibtex]

StyleMix: Separating Content and Style for Enhanced Data Augmentation Minui Hong, Jinwoo Choi, Gunhee Kim [pdf] [supp] [bibtex]

Kaleido-BERT: Vision-Language Pre-Training on Fashion Domain Mingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao [pdf] [bibtex]

Co-Grounding Networks With Semantic Attention for Referring Expression Comprehension in Videos Sijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang [pdf] [arXiv] [bibtex]

Binary Graph Neural Networks Mehdi Bahri, Gaetan Bahl, Stefanos Zafeiriou [pdf] [supp] [arXiv] [bibtex]

3D CNNs With Adaptive Temporal Feature Resolutions Mohsen Fayyaz, Emad Bahrami, Ali Diba, Mehdi Noroozi, Ehsan Adeli, Luc Van Gool, Jurgen Gall [pdf] [supp] [arXiv] [bibtex]

Space-Time Neural Irradiance Fields for Free-Viewpoint Video Wenqi Xian, Jia-Bin Huang, Johannes Kopf, Changil Kim [pdf] [supp] [arXiv] [bibtex]

AutoDO: Robust AutoAugment for Biased Data With Label Noise via Scalable Probabilistic Implicit Differentiation Denis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa [pdf] [supp] [arXiv] [bibtex]

Multiple Instance Active Learning for Object Detection Tianning Yuan, Fang Wan, Mengying Fu, Jianzhuang Liu, Songcen Xu, Xiangyang Ji, Qixiang Ye [pdf] [arXiv] [bibtex]

Forecasting Irreversible Disease via Progression Learning Botong Wu, Sijie Ren, Jing Li, Xinwei Sun, Shi-Ming Li, Yizhou Wang [pdf] [supp] [arXiv] [bibtex]

Understanding the Robustness of Skeleton-Based Action Recognition Under Adversarial Attack He Wang, Feixiang He, Zhexi Peng, Tianjia Shao, Yong-Liang Yang, Kun Zhou, David Hogg [pdf] [supp] [arXiv] [bibtex]

Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao [pdf] [supp] [arXiv] [bibtex]

Cross-MPI: Cross-Scale Stereo for Image Super-Resolution Using Multiplane Images Yuemei Zhou, Gaochang Wu, Ying Fu, Kun Li, Yebin Liu [pdf] [bibtex]

Neural Cellular Automata Manifold Alejandro Hernandez, Armand Vilalta, Francesc Moreno-Noguer [pdf] [bibtex]

Few-Shot Transformation of Common Actions Into Time and Space Pengwan Yang, Pascal Mettes, Cees G. M. Snoek [pdf] [supp] [arXiv] [bibtex]

MultiLink: Multi-Class Structure Recovery via Agglomerative Clustering and Model Selection Luca Magri, Filippo Leveni, Giacomo Boracchi [pdf] [bibtex]

Meta Pseudo Labels Hieu Pham, Zihang Dai, Qizhe Xie, Quoc V. Le [pdf] [supp] [arXiv] [bibtex]

SGCN: Sparse Graph Convolution Network for Pedestrian Trajectory Prediction Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Mo Zhou, Zhenxing Niu, Gang Hua [pdf] [supp] [arXiv] [bibtex]

Depth Completion Using Plane-Residual Representation Byeong-Uk Lee, Kyunghyun Lee, In So Kweon [pdf] [arXiv] [bibtex]

Learning an Explicit Weighting Scheme for Adapting Complex HSI Noise Xiangyu Rui, Xiangyong Cao, Qi Xie, Zongsheng Yue, Qian Zhao, Deyu Meng [pdf] [bibtex]

Neural Parts: Learning Expressive 3D Shape Abstractions With Invertible Neural Networks Despoina Paschalidou, Angelos Katharopoulos, Andreas Geiger, Sanja Fidler [pdf] [supp] [arXiv] [bibtex]

PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds Yi Wei, Ziyi Wang, Yongming Rao, Jiwen Lu, Jie Zhou [pdf] [supp] [bibtex]

Improving the Efficiency and Robustness of Deepfakes Detection Through Precise Geometric Features Zekun Sun, Yujie Han, Zeyu Hua, Na Ruan, Weijia Jia [pdf] [arXiv] [bibtex]

Sketch2Model: View-Aware 3D Modeling From Single Free-Hand Sketches Song-Hai Zhang, Yuan-Chen Guo, Qing-Wen Gu [pdf] [supp] [arXiv] [bibtex]

CASTing Your Model: Learning To Localize Improves Self-Supervised Representations Ramprasaath R. Selvaraju, Karan Desai, Justin Johnson, Nikhil Naik [pdf] [supp] [arXiv] [bibtex]

Robust Consistent Video Depth Estimation Johannes Kopf, Xuejian Rong, Jia-Bin Huang [pdf] [supp] [arXiv] [bibtex]

LaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of Dynamic Agents ByeoungDo Kim, Seong Hyeon Park, Seokhwan Lee, Elbek Khoshimjonov, Dongsuk Kum, Junsoo Kim, Jeong Soo Kim, Jun Won Choi [pdf] [supp] [arXiv] [bibtex]

NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao [pdf] [arXiv] [bibtex]

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu [pdf] [supp] [arXiv] [bibtex]

Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion Ho Kei Cheng, Yu-Wing Tai, Chi-Keung Tang [pdf] [supp] [arXiv] [bibtex]

A Sliced Wasserstein Loss for Neural Texture Synthesis Eric Heitz, Kenneth Vanhoey, Thomas Chambon, Laurent Belcour [pdf] [arXiv] [bibtex]

Learning Accurate Dense Correspondences and When To Trust Them Prune Truong, Martin Danelljan, Luc Van Gool, Radu Timofte [pdf] [supp] [arXiv] [bibtex]

Learning Better Visual Dialog Agents With Pretrained Visual-Linguistic Representation Tao Tu, Qing Ping, Govindarajan Thattai, Gokhan Tur, Prem Natarajan [pdf] [supp] [arXiv] [bibtex]

Restoring Extremely Dark Images in Real Time Mohit Lamba, Kaushik Mitra [pdf] [supp] [bibtex]

Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images Xinggang Wang, Jiapei Feng, Bin Hu, Qi Ding, Longjin Ran, Xiaoxin Chen, Wenyu Liu [pdf] [arXiv] [bibtex]

Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions Mathew Monfort, SouYoung Jin, Alexander Liu, David Harwath, Rogerio Feris, James Glass, Aude Oliva [pdf] [supp] [arXiv] [bibtex]

Image Restoration for Under-Display Camera Yuqian Zhou, David Ren, Neil Emerton, Sehoon Lim, Timothy Large [pdf] [supp] [arXiv] [bibtex]

Unbiased Mean Teacher for Cross-Domain Object Detection Jinhong Deng, Wen Li, Yuhua Chen, Lixin Duan [pdf] [arXiv] [bibtex]

How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language Amanda Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto [pdf] [supp] [arXiv] [bibtex]

Indoor Lighting Estimation Using an Event Camera Zehao Chen, Qian Zheng, Peisong Niu, Huajin Tang, Gang Pan [pdf] [supp] [bibtex]

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid [pdf] [supp] [arXiv] [bibtex]

Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark Joakim Bruslund Haurum, Thomas B. Moeslund [pdf] [supp] [bibtex]

Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang [pdf] [supp] [bibtex]

Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds Bowen Cheng, Lu Sheng, Shaoshuai Shi, Ming Yang, Dong Xu [pdf] [arXiv] [bibtex]

High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network Jie Liang, Hui Zeng, Lei Zhang [pdf] [supp] [bibtex]

End-to-End Video Instance Segmentation With Transformers Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia [pdf] [arXiv] [bibtex]

VoxelContext-Net: An Octree Based Framework for Point Cloud Compression Zizheng Que, Guo Lu, Dong Xu [pdf] [bibtex]

A Second-Order Approach to Learning With Instance-Dependent Label Noise Zhaowei Zhu, Tongliang Liu, Yang Liu [pdf] [supp] [arXiv] [bibtex]

SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration Sheng Ao, Qingyong Hu, Bo Yang, Andrew Markham, Yulan Guo [pdf] [supp] [arXiv] [bibtex]

FSDR: Frequency Space Domain Randomization for Domain Generalization Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu [pdf] [supp] [arXiv] [bibtex]

DualAST: Dual Style-Learning Networks for Artistic Style Transfer Haibo Chen, Lei Zhao, Zhizhong Wang, Huiming Zhang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu [pdf] [supp] [bibtex]

Learning a Proposal Classifier for Multiple Object Tracking Peng Dai, Renliang Weng, Wongun Choi, Changshui Zhang, Zhangping He, Wei Ding [pdf] [supp] [arXiv] [bibtex]

Multi-Attentional Deepfake Detection Hanqing Zhao, Wenbo Zhou, Dongdong Chen, Tianyi Wei, Weiming Zhang, Nenghai Yu [pdf] [arXiv] [bibtex]

SOLD2: Self-Supervised Occlusion-Aware Line Description and Detection Remi Pautrat, Juan-Ting Lin, Viktor Larsson, Martin R. Oswald, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]

Shared Cross-Modal Trajectory Prediction for Autonomous Driving Chiho Choi, Joon Hee Choi, Jiachen Li, Srikanth Malla [pdf] [supp] [arXiv] [bibtex]

Cycle4Completion: Unpaired Point Cloud Completion Using Cycle Transformation With Missing Region Coding Xin Wen, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu [pdf] [arXiv] [bibtex]

CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation Tao Lu, Limin Wang, Gangshan Wu [pdf] [bibtex]

PLOP: Learning Without Forgetting for Continual Semantic Segmentation Arthur Douillard, Yifu Chen, Arnaud Dapogny, Matthieu Cord [pdf] [supp] [arXiv] [bibtex]

Magic Layouts: Structural Prior for Component Detection in User Interface Designs Dipu Manandhar, Hailin Jin, John Collomosse [pdf] [supp] [bibtex]

MetaAlign: Coordinating Domain Alignment and Classification for Unsupervised Domain Adaptation Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Zhibo Chen [pdf] [supp] [arXiv] [bibtex]

Neural Prototype Trees for Interpretable Fine-Grained Image Recognition Meike Nauta, Ron van Bree, Christin Seifert [pdf] [supp] [arXiv] [bibtex]

Hardness Sampling for Self-Training Based Transductive Zero-Shot Learning Liu Bo, Qiulei Dong, Zhanyi Hu [pdf] [supp] [arXiv] [bibtex]

Hilbert Sinkhorn Divergence for Optimal Transport Qian Li, Zhichao Wang, Gang Li, Jun Pang, Guandong Xu [pdf] [supp] [bibtex]

The Multi-Temporal Urban Development SpaceNet Dataset Adam Van Etten, Daniel Hogan, Jesus Martinez Manso, Jacob Shermeyer, Nicholas Weir, Ryan Lewis [pdf] [bibtex]

FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E. Gonzalez [pdf] [supp] [arXiv] [bibtex]

Intrinsic Image Harmonization Zonghui Guo, Haiyong Zheng, Yufeng Jiang, Zhaorui Gu, Bing Zheng [pdf] [supp] [bibtex]

L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing Guoxing Yang, Nanyi Fei, Mingyu Ding, Guangzhen Liu, Zhiwu Lu, Tao Xiang [pdf] [supp] [bibtex]

IIRC: Incremental Implicitly-Refined Classification Mohamed Abdelsalam, Mojtaba Faramarzi, Shagun Sodhani, Sarath Chandar [pdf] [supp] [arXiv] [bibtex]

Learning To Fuse Asymmetric Feature Maps in Siamese Trackers Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen [pdf] [arXiv] [bibtex]

Generalizing to the Open World: Deep Visual Odometry With Online Adaptation Shunkai Li, Xin Wu, Yingdian Cao, Hongbin Zha [pdf] [arXiv] [bibtex]

PQA: Perceptual Question Answering Yonggang Qi, Kai Zhang, Aneeshan Sain, Yi-Zhe Song [pdf] [supp] [arXiv] [bibtex]

Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink Ranjie Duan, Xiaofeng Mao, A. K. Qin, Yuefeng Chen, Shaokai Ye, Yuan He, Yun Yang [pdf] [arXiv] [bibtex]

Robust Point Cloud Registration Framework Based on Deep Graph Matching Kexue Fu, Shaolei Liu, Xiaoyuan Luo, Manning Wang [pdf] [supp] [arXiv] [bibtex]

Dense Contrastive Learning for Self-Supervised Visual Pre-Training Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li [pdf] [supp] [arXiv] [bibtex]

Birds of a Feather: Capturing Avian Shape Models From Images Yufu Wang, Nikos Kolotouros, Kostas Daniilidis, Marc Badger [pdf] [supp] [arXiv] [bibtex]

Learning Temporal Consistency for Low Light Video Enhancement From Single Images Fan Zhang, Yu Li, Shaodi You, Ying Fu [pdf] [supp] [bibtex]

Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao [pdf] [bibtex]

Inverse Simulation: Reconstructing Dynamic Geometry of Clothed Humans via Optimal Control Jingfan Guo, Jie Li, Rahul Narain, Hyun Soo Park [pdf] [bibtex]

Rotation Equivariant Siamese Networks for Tracking Deepak K. Gupta, Devanshu Arya, Efstratios Gavves [pdf] [supp] [arXiv] [bibtex]

Learning Decision Trees Recurrently Through Communication Stephan Alaniz, Diego Marcos, Bernt Schiele, Zeynep Akata [pdf] [supp] [arXiv] [bibtex]

PatchmatchNet: Learned Multi-View Patchmatch Stereo Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Pablo Speciale, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]

Instance Level Affinity-Based Transfer for Unsupervised Domain Adaptation Astuti Sharma, Tarun Kalluri, Manmohan Chandraker [pdf] [supp] [arXiv] [bibtex]

COMPLETER: Incomplete Multi-View Clustering via Contrastive Prediction Yijie Lin, Yuanbiao Gou, Zitao Liu, Boyun Li, Jiancheng Lv, Xi Peng [pdf] [supp] [bibtex]

Image-to-Image Translation via Hierarchical Style Disentanglement Xinyang Li, Shengchuan Zhang, Jie Hu, Liujuan Cao, Xiaopeng Hong, Xudong Mao, Feiyue Huang, Yongjian Wu, Rongrong Ji [pdf] [supp] [arXiv] [bibtex]

What Can Style Transfer and Paintings Do for Model Robustness? Hubert Lin, Mitchell van Zuijlen, Sylvia C. Pont, Maarten W.A. Wijntjes, Kavita Bala [pdf] [supp] [arXiv] [bibtex]

Taming Transformers for High-Resolution Image Synthesis Patrick Esser, Robin Rombach, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]

Learning the Predictability of the Future Didac Suris, Ruoshi Liu, Carl Vondrick [pdf] [arXiv] [bibtex]

Multiple Instance Captioning: Learning Representations From Histopathology Textbooks and Articles Jevgenij Gamper, Nasir Rajpoot [pdf] [supp] [arXiv] [bibtex]

Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection Bohao Li, Boyu Yang, Chang Liu, Feng Liu, Rongrong Ji, Qixiang Ye [pdf] [bibtex]

Consistent Instance False Positive Improves Fairness in Face Recognition Xingkun Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jilin Li, Feiyue Huang, Yong Li, Zhen Cui [pdf] [bibtex]

Learning Dynamic Network Using a Reuse Gate Function in Semi-Supervised Video Object Segmentation Hyojin Park, Jayeon Yoo, Seohyeong Jeong, Ganesh Venkatesh, Nojun Kwak [pdf] [supp] [arXiv] [bibtex]

RaScaNet: Learning Tiny Models by Raster-Scanning Images Jaehyoung Yoo, Dongwook Lee, Changyong Son, Sangil Jung, ByungIn Yoo, Changkyu Choi, Jae-Joon Han, Bohyung Han [pdf] [supp] [bibtex]

AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala [pdf] [supp] [bibtex]

Exploring intermediate representation for monocular vehicle pose estimation Shichao Li, Zengqiang Yan, Hongyang Li, Kwang-Ting Cheng [pdf] [supp] [arXiv] [bibtex]

Shallow Feature Matters for Weakly Supervised Object Localization Jun Wei, Qin Wang, Zhen Li, Sheng Wang, S. Kevin Zhou, Shuguang Cui [pdf] [bibtex]

Capturing Omni-Range Context for Omnidirectional Segmentation Kailun Yang, Jiaming Zhang, Simon Reiss, Xinxin Hu, Rainer Stiefelhagen [pdf] [arXiv] [bibtex]

PLADE-Net: Towards Pixel-Level Accuracy for Self-Supervised Single-View Depth Estimation With Neural Positional Encoding and Distilled Matting Loss Juan Luis Gonzalez, Munchurl Kim [pdf] [supp] [bibtex]

Reciprocal Landmark Detection and Tracking With Extremely Few Annotations Jianzhe Lin, Ghazal Sahebzamani, Christina Luong, Fatemeh Taheri Dezaki, Mohammad Jafari, Purang Abolmaesumi, Teresa Tsang [pdf] [supp] [arXiv] [bibtex]

Practical Single-Image Super-Resolution Using Look-Up Table Younghyun Jo, Seon Joo Kim [pdf] [supp] [bibtex]

Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning Jinpeng Wang, Yuting Gao, Ke Li, Yiqi Lin, Andy J. Ma, Hao Cheng, Pai Peng, Feiyue Huang, Rongrong Ji, Xing Sun [pdf] [arXiv] [bibtex]

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation Gu Wang, Fabian Manhardt, Federico Tombari, Xiangyang Ji [pdf] [supp] [bibtex]

Point Cloud Upsampling via Disentangled Refinement Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu [pdf] [supp] [bibtex]

Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion Cheng Chi, Qingjie Wang, Tianyu Hao, Peng Guo, Xin Yang [pdf] [bibtex]

A Generalized Loss Function for Crowd Counting and Localization Jia Wan, Ziquan Liu, Antoni B. Chan [pdf] [supp] [bibtex]

Learning Fine-Grained Segmentation of 3D Shapes Without Part Labels Xiaogang Wang, Xun Sun, Xinyu Cao, Kai Xu, Bin Zhou [pdf] [arXiv] [bibtex]

Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification Peixian Hong, Tao Wu, Ancong Wu, Xintong Han, Wei-Shi Zheng [pdf] [supp] [bibtex]

DeepSurfels: Learning Online Appearance Fusion Marko Mihajlovic, Silvan Weder, Marc Pollefeys, Martin R. Oswald [pdf] [supp] [arXiv] [bibtex]

Joint Negative and Positive Learning for Noisy Labels Youngdong Kim, Juseung Yun, Hyounguk Shon, Junmo Kim [pdf] [arXiv] [bibtex]

Generalizing Face Forgery Detection With High-Frequency Features Yuchen Luo, Yong Zhang, Junchi Yan, Wei Liu [pdf] [supp] [arXiv] [bibtex]

The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures Yawei Li, Wen Li, Martin Danelljan, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte [pdf] [supp] [arXiv] [bibtex]

Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor Environments Siyan Dong, Qingnan Fan, He Wang, Ji Shi, Li Yi, Thomas Funkhouser, Baoquan Chen, Leonidas J. Guibas [pdf] [arXiv] [bibtex]

Facial Action Unit Detection With Transformers Geethu Miriam Jacob, Bjorn Stenger [pdf] [bibtex]

Exploiting Aliasing for Manga Restoration Minshan Xie, Menghan Xia, Tien-Tsin Wong [pdf] [supp] [arXiv] [bibtex]

Discovering Hidden Physics Behind Transport Dynamics Peirong Liu, Lin Tian, Yubo Zhang, Stephen Aylward, Yueh Lee, Marc Niethammer [pdf] [supp] [arXiv] [bibtex]

Cross-View Gait Recognition With Deep Universal Linear Embeddings Shaoxiong Zhang, Yunhong Wang, Annan Li [pdf] [bibtex]

Tuning IR-Cut Filter for Illumination-Aware Spectral Reconstruction From RGB Bo Sun, Junchi Yan, Xiao Zhou, Yinqiang Zheng [pdf] [arXiv] [bibtex]

Relative Order Analysis and Optimization for Unsupervised Deep Metric Learning Shichao Kan, Yigang Cen, Yang Li, Vladimir Mladenovic, Zhihai He [pdf] [bibtex]

Anchor-Free Person Search Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao [pdf] [supp] [arXiv] [bibtex]

Are Labels Always Necessary for Classifier Accuracy Evaluation? Weijian Deng, Liang Zheng [pdf] [supp] [arXiv] [bibtex]

Self-Supervised Motion Learning From Static Images Ziyuan Huang, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Rong Jin, Marcelo H. Ang [pdf] [supp] [arXiv] [bibtex]

AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling Dilin Wang, Meng Li, Chengyue Gong, Vikas Chandra [pdf] [supp] [arXiv] [bibtex]

StablePose: Learning 6D Object Poses From Geometrically Stable Patches Yifei Shi, Junwen Huang, Xin Xu, Yifan Zhang, Kai Xu [pdf] [supp] [arXiv] [bibtex]

Towards Evaluating and Training Verifiably Robust Neural Networks Zhaoyang Lyu, Minghao Guo, Tong Wu, Guodong Xu, Kehuan Zhang, Dahua Lin [pdf] [supp] [arXiv] [bibtex]

Interpolation-Based Semi-Supervised Learning for Object Detection Jisoo Jeong, Vikas Verma, Minsung Hyun, Juho Kannala, Nojun Kwak [pdf] [arXiv] [bibtex]

Teachers Do More Than Teach: Compressing Image-to-Image Models Qing Jin, Jian Ren, Oliver J. Woodford, Jiazhuo Wang, Geng Yuan, Yanzhi Wang, Sergey Tulyakov [pdf] [supp] [arXiv] [bibtex]

Seeing in Extra Darkness Using a Deep-Red Flash Jinhui Xiong, Jian Wang, Wolfgang Heidrich, Shree Nayar [pdf] [supp] [bibtex]

PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors Zeyuan Chen, Yangchao Wang, Yang Yang, Dong Liu [pdf] [supp] [bibtex]

3D Spatial Recognition Without Spatially Labeled 3D Zhongzheng Ren, Ishan Misra, Alexander G. Schwing, Rohit Girdhar [pdf] [supp] [arXiv] [bibtex]

Robust Reference-Based Super-Resolution via C2-Matching Yuming Jiang, Kelvin C.K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu [pdf] [supp] [arXiv] [bibtex]

Temporal-Relational CrossTransformers for Few-Shot Action Recognition Toby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen [pdf] [supp] [arXiv] [bibtex]

Understanding Failures of Deep Networks via Robust Feature Extraction Sahil Singla, Besmira Nushi, Shital Shah, Ece Kamar, Eric Horvitz [pdf] [supp] [arXiv] [bibtex]

Relation-aware Instance Refinement for Weakly Supervised Visual Grounding Yongfei Liu, Bo Wan, Lin Ma, Xuming He [pdf] [supp] [arXiv] [bibtex]

Spatially-Invariant Style-Codes Controlled Makeup Transfer Han Deng, Chu Han, Hongmin Cai, Guoqiang Han, Shengfeng He [pdf] [supp] [bibtex]

Adaptive Image Transformer for One-Shot Object Detection Ding-Jie Chen, He-Yen Hsieh, Tyng-Luh Liu [pdf] [bibtex]

Bilateral Grid Learning for Stereo Matching Networks Bin Xu, Yuhua Xu, Xiaoli Yang, Wei Jia, Yulan Guo [pdf] [supp] [arXiv] [bibtex]

A Multi-Task Network for Joint Specular Highlight Detection and Removal Gang Fu, Qing Zhang, Lei Zhu, Ping Li, Chunxia Xiao [pdf] [bibtex]

A Deep Emulator for Secondary Motion of 3D Characters Mianlun Zheng, Yi Zhou, Duygu Ceylan, Jernej Barbic [pdf] [supp] [arXiv] [bibtex]

Omni-Supervised Point Cloud Segmentation via Gradual Receptive Field Component Reasoning Jingyu Gong, Jiachen Xu, Xin Tan, Haichuan Song, Yanyun Qu, Yuan Xie, Lizhuang Ma [pdf] [supp] [arXiv] [bibtex]

All Labels Are Not Created Equal: Enhancing Semi-Supervision via Label Grouping and Co-Training Islam Nassar, Samitha Herath, Ehsan Abbasnejad, Wray Buntine, Gholamreza Haffari [pdf] [supp] [arXiv] [bibtex]

PMP-Net: Point Cloud Completion by Learning Multi-Step Point Moving Paths Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu [pdf] [supp] [bibtex]

Gradient-Based Algorithms for Machine Teaching Pei Wang, Kabir Nagrecha, Nuno Vasconcelos [pdf] [supp] [bibtex]

MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing Zhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan [pdf] [arXiv] [bibtex]

Removing Raindrops and Rain Streaks in One Go Ruijie Quan, Xin Yu, Yuanzhi Liang, Yi Yang [pdf] [supp] [bibtex]

Action Unit Memory Network for Weakly Supervised Temporal Action Localization Wang Luo, Tianzhu Zhang, Wenfei Yang, Jingen Liu, Tao Mei, Feng Wu, Yongdong Zhang [pdf] [arXiv] [bibtex]

IMAGINE: Image Synthesis by Image-Guided Model Inversion Pei Wang, Yijun Li, Krishna Kumar Singh, Jingwan Lu, Nuno Vasconcelos [pdf] [supp] [arXiv] [bibtex]

Neural Scene Graphs for Dynamic Scenes Julian Ost, Fahim Mannan, Nils Thuerey, Julian Knodt, Felix Heide [pdf] [supp] [arXiv] [bibtex]

RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words Xuying Zhang, Xiaoshuai Sun, Yunpeng Luo, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji [pdf] [bibtex]

Time Lens: Event-Based Video Frame Interpolation Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza [pdf] [supp] [bibtex]

FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space Quande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng [pdf] [supp] [arXiv] [bibtex]

Anomaly Detection in Video via Self-Supervised and Multi-Task Learning Mariana-Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah [pdf] [supp] [arXiv] [bibtex]

Multiresolution Knowledge Distillation for Anomaly Detection Mohammadreza Salehi, Niousha Sadjadi, Soroosh Baselizadeh, Mohammad H. Rohban, Hamid R. Rabiee [pdf] [supp] [arXiv] [bibtex]

Joint Learning of 3D Shape Retrieval and Deformation Mikaela Angelina Uy, Vladimir G. Kim, Minhyuk Sung, Noam Aigerman, Siddhartha Chaudhuri, Leonidas J. Guibas [pdf] [supp] [arXiv] [bibtex]

Learning Spatially-Variant MAP Models for Non-Blind Image Deblurring Jiangxin Dong, Stefan Roth, Bernt Schiele [pdf] [supp] [bibtex]

FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions Weian Mao, Zhi Tian, Xinlong Wang, Chunhua Shen [pdf] [arXiv] [bibtex]

BoxInst: High-Performance Instance Segmentation With Box Annotations Zhi Tian, Chunhua Shen, Xinlong Wang, Hao Chen [pdf] [supp] [arXiv] [bibtex]

Modeling Multi-Label Action Dependencies for Temporal Action Localization Praveen Tirupattur, Kevin Duarte, Yogesh S Rawat, Mubarak Shah [pdf] [supp] [arXiv] [bibtex]

HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen [pdf] [supp] [bibtex]

Lite-HRNet: A Lightweight High-Resolution Network Changqian Yu, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, Jingdong Wang [pdf] [bibtex]

Self-Supervised Video Representation Learning by Context and Motion Decoupling Lianghua Huang, Yu Liu, Bin Wang, Pan Pan, Yinghui Xu, Rong Jin [pdf] [arXiv] [bibtex]

ReAgent: Point Cloud Registration Using Imitation and Reinforcement Learning Dominik Bauer, Timothy Patten, Markus Vincze [pdf] [supp] [arXiv] [bibtex]

Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection Wenfei Yang, Tianzhu Zhang, Xiaoyuan Yu, Tian Qi, Yongdong Zhang, Feng Wu [pdf] [bibtex]

Dynamic Probabilistic Graph Convolution for Facial Action Unit Intensity Estimation Tengfei Song, Zijun Cui, Yuru Wang, Wenming Zheng, Qiang Ji [pdf] [supp] [bibtex]

Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need? Malik Boudiaf, Hoel Kervadec, Ziko Imtiaz Masud, Pablo Piantanida, Ismail Ben Ayed, Jose Dolz [pdf] [supp] [bibtex]

Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos Jiawei Liu, Zheng-Jun Zha, Wei Wu, Kecheng Zheng, Qibin Sun [pdf] [arXiv] [bibtex]

SPSG: Self-Supervised Photometric Scene Generation From RGB-D Scans Angela Dai, Yawar Siddiqui, Justus Thies, Julien Valentin, Matthias Niessner [pdf] [supp] [bibtex]

Neural Auto-Exposure for High-Dynamic Range Object Detection Emmanuel Onzon, Fahim Mannan, Felix Heide [pdf] [supp] [bibtex]

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip H.S. Torr, Li Zhang [pdf] [supp] [arXiv] [bibtex]

Interpreting Super-Resolution Networks With Local Attribution Maps Jinjin Gu, Chao Dong [pdf] [supp] [arXiv] [bibtex]

Multi-Target Domain Adaptation With Collaborative Consistency Learning Takashi Isobe, Xu Jia, Shuaijun Chen, Jianzhong He, Yongjie Shi, Jianzhuang Liu, Huchuan Lu, Shengjin Wang [pdf] [arXiv] [bibtex]

Troubleshooting Blind Image Quality Models in the Wild Zhihua Wang, Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma [pdf] [arXiv] [bibtex]

Semantic Palette: Guiding Scene Generation With Class Proportions Guillaume Le Moing, Tuan-Hung Vu, Himalaya Jain, Patrick Perez, Matthieu Cord [pdf] [supp] [arXiv] [bibtex]

Physics-Based Iterative Projection Complex Neural Network for Phase Retrieval in Lensless Microscopy Imaging Feilong Zhang, Xianming Liu, Cheng Guo, Shiyi Lin, Junjun Jiang, Xiangyang Ji [pdf] [supp] [bibtex]

Causal Attention for Vision-Language Tasks Xu Yang, Hanwang Zhang, Guojun Qi, Jianfei Cai [pdf] [supp] [arXiv] [bibtex]

Scene Text Telescope: Text-Focused Scene Image Super-Resolution Jingye Chen, Bin Li, Xiangyang Xue [pdf] [supp] [bibtex]

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering Fanbo Xiang, Zexiang Xu, Milos Hasan, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Hao Su [pdf] [supp] [arXiv] [bibtex]

Improving Calibration for Long-Tailed Recognition Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia [pdf] [supp] [arXiv] [bibtex]

Learning Affinity-Aware Upsampling for Deep Image Matting Yutong Dai, Hao Lu, Chunhua Shen [pdf] [supp] [arXiv] [bibtex]

Improving Multiple Pedestrian Tracking by Track Management and Occlusion Handling Daniel Stadler, Jurgen Beyerer [pdf] [bibtex]

Revamping Cross-Modal Recipe Retrieval With Hierarchical Transformers and Self-Supervised Learning Amaia Salvador, Erhan Gundogdu, Loris Bazzani, Michael Donoser [pdf] [supp] [arXiv] [bibtex]

Geo-FARM: Geodesic Factor Regression Model for Misaligned Pre-Shape Responses in Statistical Shape Analysis Chao Huang, Anuj Srivastava, Rongjie Liu [pdf] [bibtex]

MOST: A Multi-Oriented Scene Text Detector With Localization Refinement Minghang He, Minghui Liao, Zhibo Yang, Humen Zhong, Jun Tang, Wenqing Cheng, Cong Yao, Yongpan Wang, Xiang Bai [pdf] [arXiv] [bibtex]

A Functional Approach to Rotation Equivariant Non-Linearities for Tensor Field Networks Adrien Poulenard, Leonidas J. Guibas [pdf] [supp] [bibtex]

Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms Yuxing Tang, Zhenjie Cao, Yanbo Zhang, Zhicheng Yang, Zongcheng Ji, Yiwei Wang, Mei Han, Jie Ma, Jing Xiao, Peng Chang [pdf] [supp] [bibtex]

Fast and Accurate Model Scaling Piotr Dollar, Mannat Singh, Ross Girshick [pdf] [arXiv] [bibtex]

Real-Time Sphere Sweeping Stereo From Multiview Fisheye Images Andreas Meuleman, Hyeonjoong Jang, Daniel S. Jeon, Min H. Kim [pdf] [supp] [bibtex]

Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework Qiang Zhou, Chaohui Yu, Zhibin Wang, Qi Qian, Hao Li [pdf] [bibtex]

Taskology: Utilizing Task Relations at Scale Yao Lu, Soren Pirk, Jan Dlabal, Anthony Brohan, Ankita Pasad, Zhao Chen, Vincent Casser, Anelia Angelova, Ariel Gordon [pdf] [supp] [arXiv] [bibtex]

Progressive Domain Expansion Network for Single Domain Generalization Lei Li, Ke Gao, Juan Cao, Ziyao Huang, Yepeng Weng, Xiaoyue Mi, Zhengze Yu, Xiaoya Li, Boyang Xia [pdf] [supp] [arXiv] [bibtex]

View-Guided Point Cloud Completion Xuancheng Zhang, Yutong Feng, Siqi Li, Changqing Zou, Hai Wan, Xibin Zhao, Yandong Guo, Yue Gao [pdf] [supp] [arXiv] [bibtex]

Generative Hierarchical Features From Synthesizing Images Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, Bolei Zhou [pdf] [supp] [arXiv] [bibtex]

Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha [pdf] [supp] [arXiv] [bibtex]

Black-Box Explanation of Object Detectors via Saliency Maps Vitali Petsiuk, Rajiv Jain, Varun Manjunatha, Vlad I. Morariu, Ashutosh Mehra, Vicente Ordonez, Kate Saenko [pdf] [supp] [arXiv] [bibtex]

Skip-Convolutions for Efficient Video Processing Amirhossein Habibian, Davide Abati, Taco S. Cohen, Babak Ehteshami Bejnordi [pdf] [supp] [arXiv] [bibtex]

Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation Jiyoung Lee, Soo-Whan Chung, Sunok Kim, Hong-Goo Kang, Kwanghoon Sohn [pdf] [supp] [arXiv] [bibtex]

GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution Kelvin C.K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, Chen Change Loy [pdf] [supp] [arXiv] [bibtex]

Soteria: Provable Defense Against Privacy Leakage in Federated Learning From Representation Perspective Jingwei Sun, Ang Li, Binghui Wang, Huanrui Yang, Hai Li, Yiran Chen [pdf] [supp] [bibtex]

Deep Occlusion-Aware Instance Segmentation With Overlapping BiLayers Lei Ke, Yu-Wing Tai, Chi-Keung Tang [pdf] [supp] [arXiv] [bibtex]

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments From a Single Moving Camera Felix Wimbauer, Nan Yang, Lukas von Stumberg, Niclas Zeller, Daniel Cremers [pdf] [supp] [arXiv] [bibtex]

DAP: Detection-Aware Pre-Training With Weak Supervision Yuanyi Zhong, Jianfeng Wang, Lijuan Wang, Jian Peng, Yu-Xiong Wang, Lei Zhang [pdf] [supp] [arXiv] [bibtex]

Spatial Assembly Networks for Image Representation Learning Yang Li, Shichao Kan, Jianhe Yuan, Wenming Cao, Zhihai He [pdf] [supp] [bibtex]

Linguistic Structures As Weak Supervision for Visual Scene Graph Generation Keren Ye, Adriana Kovashka [pdf] [arXiv] [bibtex]

SKFAC: Training Neural Networks With Faster Kronecker-Factored Approximate Curvature Zedong Tang, Fenlong Jiang, Maoguo Gong, Hao Li, Yue Wu, Fan Yu, Zidong Wang, Min Wang [pdf] [supp] [bibtex]

Global2Local: Efficient Structure Search for Video Action Segmentation Shang-Hua Gao, Qi Han, Zhong-Yu Li, Pai Peng, Liang Wang, Ming-Ming Cheng [pdf] [supp] [arXiv] [bibtex]

Picasso: A CUDA-Based Library for Deep Learning Over 3D Meshes Huan Lei, Naveed Akhtar, Ajmal Mian [pdf] [supp] [arXiv] [bibtex]

DeFlow: Learning Complex Image Degradations From Unpaired Data With Conditional Flows Valentin Wolf, Andreas Lugmayr, Martin Danelljan, Luc Van Gool, Radu Timofte [pdf] [supp] [arXiv] [bibtex]

Student-Teacher Learning From Clean Inputs to Noisy Inputs Guanzhe Hong, Zhiyuan Mao, Xiaojun Lin, Stanley H. Chan [pdf] [supp] [arXiv] [bibtex]

AdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles Jingkang Wang, Ava Pun, James Tu, Sivabalan Manivasagam, Abbas Sadat, Sergio Casas, Mengye Ren, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]

MoViNets: Mobile Video Networks for Efficient Video Recognition Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong [pdf] [supp] [arXiv] [bibtex]

IBRNet: Learning Multi-View Image-Based Rendering Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul P. Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas Funkhouser [pdf] [supp] [arXiv] [bibtex]

SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning Colorado J Reed, Sean Metzger, Aravind Srinivas, Trevor Darrell, Kurt Keutzer [pdf] [supp] [arXiv] [bibtex]

Adversarial Invariant Learning Nanyang Ye, Jingxuan Tang, Huayu Deng, Xiao-Yun Zhou, Qianxiao Li, Zhenguo Li, Guang-Zhong Yang, Zhanxing Zhu [pdf] [supp] [bibtex]

Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks Naoya Takahashi, Yuki Mitsufuji [pdf] [bibtex]

Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection Li Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang [pdf] [supp] [arXiv] [bibtex]

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-Bit Neural Networks via Guided Distribution Calibration Zhiqiang Shen, Zechun Liu, Jie Qin, Lei Huang, Kwang-Ting Cheng, Marios Savvides [pdf] [bibtex]

Learning Optical Flow From Still Images Filippo Aleotti, Matteo Poggi, Stefano Mattoccia [pdf] [supp] [arXiv] [bibtex]

From Shadow Generation To Shadow Removal Zhihao Liu, Hui Yin, Xinyi Wu, Zhenyao Wu, Yang Mi, Song Wang [pdf] [arXiv] [bibtex]

Face Forgery Detection by 3D Decomposition Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li [pdf] [supp] [arXiv] [bibtex]

Unsupervised 3D Shape Completion Through GAN Inversion Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy [pdf] [supp] [arXiv] [bibtex]

Pseudo 3D Auto-Correlation Network for Real Image Denoising Xiaowan Hu, Ruijun Ma, Zhihong Liu, Yuanhao Cai, Xiaole Zhao, Yulun Zhang, Haoqian Wang [pdf] [bibtex]

MaxUp: Lightweight Adversarial Training With Data Augmentation Improves Neural Network Training Chengyue Gong, Tongzheng Ren, Mao Ye, Qiang Liu [pdf] [supp] [bibtex]

Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation Jungbeom Lee, Eunji Kim, Sungroh Yoon [pdf] [supp] [arXiv] [bibtex]

Data-Free Knowledge Distillation for Image Super-Resolution Yiman Zhang, Hanting Chen, Xinghao Chen, Yiping Deng, Chunjing Xu, Yunhe Wang [pdf] [supp] [bibtex]

PluckerNet: Learn To Register 3D Line Reconstructions Liu Liu, Hongdong Li, Haodong Yao, Ruyi Zha [pdf] [bibtex]

Deep Perceptual Preprocessing for Video Coding Aaron Chadha, Yiannis Andreopoulos [pdf] [supp] [bibtex]

Explaining Classifiers Using Adversarial Perturbations on the Perceptual Ball Andrew Elliott, Stephen Law, Chris Russell [pdf] [supp] [arXiv] [bibtex]

DARCNN: Domain Adaptive Region-Based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images Joy Hsu, Wah Chiu, Serena Yeung [pdf] [arXiv] [bibtex]