IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018)
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018 is the premier annual computer vision event, comprising the main conference and several co-located workshops and short courses.
Location: Salt Lake City, Utah
Date: June 18-22, 2018
Main Conference and Exhibition: June 19-21
Workshops and Tutorials: June 18 and 22
With over 3300 main-conference paper submissions and 979 accepted papers, CVPR 2018 offers an exciting program covering a wide variety of state-of-the-art work in the field of computer vision. In addition to the main program, CVPR 2018 includes 21 tutorials, 48 workshops, our annual doctoral consortium, and an industrial exhibition featuring over 115 companies.
CVPR 2018 Papers
Embodied Question Answering
Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
Learning by Asking Questions
Ishan Misra, Ross Girshick, Rob Fergus, Martial Hebert, Abhinav Gupta, Laurens van der Maaten
Finding Tiny Faces in the Wild With Generative Adversarial Network
Yancheng Bai, Yongqiang Zhang, Mingli Ding, Bernard Ghanem
Learning Face Age Progression: A Pyramid Architecture of GANs
Hongyu Yang, Di Huang, Yunhong Wang, Anil K. Jain
PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup
Huiwen Chang, Jingwan Lu, Fisher Yu, Adam Finkelstein
GANerated Hands for Real-Time 3D Hand Tracking From Monocular RGB
Franziska Mueller, Florian Bernard, Oleksandr Sotnychenko, Dushyant Mehta, Srinath Sridhar, Dan Casas, Christian Theobalt
Learning Pose Specific Representations by Predicting Different Views
Georg Poier, David Schinagl, Horst Bischof
Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer
Hao-Shu Fang, Guansong Lu, Xiaolin Fang, Jianwen Xie, Yu-Wing Tai, Cewu Lu
Person Transfer GAN to Bridge Domain Gap for Person Re-Identification
Longhui Wei, Shiliang Zhang, Wen Gao, Qi Tian
Cross-Modal Deep Variational Hand Pose Estimation
Adrian Spurr, Jie Song, Seonwook Park, Otmar Hilliges
Disentangled Person Image Generation
Liqian Ma, Qianru Sun, Stamatios Georgoulis, Luc Van Gool, Bernt Schiele, Mario Fritz
Super-FAN: Integrated Facial Landmark Localization and Super-Resolution of Real-World Low Resolution Faces in Arbitrary Poses With GANs
Adrian Bulat, Georgios Tzimiropoulos
Multistage Adversarial Losses for Pose-Based Human Image Synthesis
Chenyang Si, Wei Wang, Liang Wang, Tieniu Tan
Rotation Averaging and Strong Duality
Anders Eriksson, Carl Olsson, Fredrik Kahl, Tat-Jun Chin
Hybrid Camera Pose Estimation
Federico Camposeco, Andrea Cohen, Marc Pollefeys, Torsten Sattler
A Certifiably Globally Optimal Solution to the Non-Minimal Relative Pose Problem
Jesus Briales, Laurent Kneip, Javier Gonzalez-Jimenez
Single View Stereo Matching
Yue Luo, Jimmy Ren, Mude Lin, Jiahao Pang, Wenxiu Sun, Hongsheng Li, Liang Lin
Fight Ill-Posedness With Ill-Posedness: Single-Shot Variational Depth Super-Resolution From Shading
Bjoern Haefner, Yvain Quéau, Thomas Möllenhoff, Daniel Cremers
Deep Depth Completion of a Single RGB-D Image
Yinda Zhang, Thomas Funkhouser
Multi-View Harmonized Bilinear Network for 3D Object Recognition
Tan Yu, Jingjing Meng, Junsong Yuan
PPFNet: Global Context Aware Local Features for Robust 3D Point Matching
Haowen Deng, Tolga Birdal, Slobodan Ilic
FoldingNet: Point Cloud Auto-Encoder via Deep Grid Deformation
Yaoqing Yang, Chen Feng, Yiru Shen, Dong Tian
A Papier-Mâché Approach to Learning 3D Surface Generation
Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, Mathieu Aubry
LEGO: Learning Edge With Geometry All at Once by Watching Videos
Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia
Five-Point Fundamental Matrix Estimation for Uncalibrated Cameras
Daniel Barath
PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation
Danfei Xu, Dragomir Anguelov, Ashesh Jain
Scalable Dense Non-Rigid Structure-From-Motion: A Grassmannian Perspective
Suryansh Kumar, Anoop Cherian, Yuchao Dai, Hongdong Li
GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition
Yifan Feng, Zizhao Zhang, Xibin Zhao, Rongrong Ji, Yue Gao
Depth and Transient Imaging With Compressive SPAD Array Cameras
Qilin Sun, Xiong Dun, Yifan Peng, Wolfgang Heidrich
GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation
Xiaojuan Qi, Renjie Liao, Zhengzhe Liu, Raquel Urtasun, Jiaya Jia
Real-Time Seamless Single Shot 6D Object Pose Prediction
Bugra Tekin, Sudipta N. Sinha, Pascal Fua
Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene
Shubham Tulsiani, Saurabh Gupta, David F. Fouhey, Alexei A. Efros, Jitendra Malik
Monocular Relative Depth Perception With Web Stereo Data Supervision
Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo
Spline Error Weighting for Robust Visual-Inertial Fusion
Hannes Ovrén, Per-Erik Forssén
Single-Image Depth Estimation Based on Fourier Domain Analysis
Jae-Han Lee, Minhyeok Heo, Kyung-Rae Kim, Chang-Su Kim
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction
Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh Agarwal, Ian Reid
Detect-and-Track: Efficient Pose Estimation in Videos
Rohit Girdhar, Georgia Gkioxari, Lorenzo Torresani, Manohar Paluri, Du Tran
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors
Xuanyi Dong, Shoou-I Yu, Xinshuo Weng, Shih-En Wei, Yi Yang, Yaser Sheikh
Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification
Shuang Li, Slawomir Bak, Peter Carr, Xiaogang Wang
Style Aggregated Network for Facial Landmark Detection
Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang
Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision
Yaojie Liu, Amin Jourabloo, Xiaoming Liu
Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation
Kai Li, Junliang Xing, Chi Su, Weiming Hu, Yundong Zhang, Stephen Maybank
First-Person Hand Action Benchmark With RGB-D Videos and 3D Hand Pose Annotations
Guillermo Garcia-Hernando, Shanxin Yuan, Seungryul Baek, Tae-Kyun Kim
A Pose-Sensitive Embedding for Person Re-Identification With Expanded Cross Neighborhood Re-Ranking
M. Saquib Sarfraz, Arne Schumann, Andreas Eberle, Rainer Stiefelhagen
Disentangling 3D Pose in a Dendritic CNN for Unconstrained 2D Face Alignment
Amit Kumar, Rama Chellappa
A Hierarchical Generative Model for Eye Image Synthesis and Eye Gaze Estimation
Kang Wang, Rui Zhao, Qiang Ji
MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition
Yizhou Zhou, Xiaoyan Sun, Zheng-Jun Zha, Wenjun Zeng
Learning to Estimate 3D Human Pose and Shape From a Single Color Image
Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, Kostas Daniilidis
Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points
Fabien Baradel, Christian Wolf, Julien Mille, Graham W. Taylor
Context-Aware Deep Feature Compression for High-Speed Visual Tracking
Jongwon Choi, Hyung Jin Chang, Tobias Fischer, Sangdoo Yun, Kyuewang Lee, Jiyeoup Jeong, Yiannis Demiris, Jin Young Choi
Correlation Tracking via Joint Discrimination and Reliability Learning
Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang
PhaseNet for Video Frame Interpolation
Simone Meyer, Abdelaziz Djelouah, Brian McWilliams, Alexander Sorkine-Hornung, Markus Gross, Christopher Schroers
The Best of Both Worlds: Combining CNNs and Geometric Constraints for Hierarchical Motion Segmentation
Pia Bideau, Aruni RoyChowdhury, Rakesh R. Menon, Erik Learned-Miller
Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning
Xingping Dong, Jianbing Shen, Wenguan Wang, Yu Liu, Ling Shao, Fatih Porikli
Scale-Transferrable Object Detection
Peng Zhou, Bingbing Ni, Cong Geng, Jianguo Hu, Yi Xu
A Prior-Less Method for Multi-Face Tracking in Unconstrained Videos
Chung-Ching Lin, Ying Hung
End-to-End Flow Correlation Tracking With Spatial-Temporal Attention
Zheng Zhu, Wei Wu, Wei Zou, Junjie Yan
Deep Texture Manifold for Ground Terrain Recognition
Jia Xue, Hang Zhang, Kristin Dana
Learning Superpixels With Segmentation-Aware Affinity Loss
Wei-Chih Tu, Ming-Yu Liu, Varun Jampani, Deqing Sun, Shao-Yi Chien, Ming-Hsuan Yang, Jan Kautz
Interactive Image Segmentation With Latent Diversity
Zhuwen Li, Qifeng Chen, Vladlen Koltun
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, Oliver Wang
Local Descriptors Optimized for Average Precision
Kun He, Yan Lu, Stan Sclaroff
Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform
Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy
Deep Extreme Cut: From Extreme Points to Object Segmentation
Kevis-Kokitsi Maninis, Sergi Caelles, Jordi Pont-Tuset, Luc Van Gool
Learning to Parse Wireframes in Images of Man-Made Environments
Kun Huang, Yifan Wang, Zihan Zhou, Tianjiao Ding, Shenghua Gao, Yi Ma
Occlusion-Aware Rolling Shutter Rectification of 3D Scenes
Subeesh Vasu, Mahesh Mohan M. R., A. N. Rajagopalan
Content-Sensitive Supervoxels via Uniform Tessellations on Video Manifolds
Ran Yi, Yong-Jin Liu, Yu-Kun Lai
Intrinsic Image Transformation via Scale Space Decomposition
Lechao Cheng, Chengyi Zhang, Zicheng Liao
Learned Shape-Tailored Descriptors for Segmentation
Naeemullah Khan, Ganesh Sundaramoorthi
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing
Dan Xu, Wanli Ouyang, Xiaogang Wang, Nicu Sebe
Multi-Image Semantic Matching by Mining Consistent Features
Qianqian Wang, Xiaowei Zhou, Kostas Daniilidis
Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network
He Zhang, Vishal M. Patel
Joint Cuts and Matching of Partitions in One Graph
Tianshu Yu, Junchi Yan, Jieyi Zhao, Baoxin Li
Progressive Attention Guided Recurrent Network for Salient Object Detection
Xiaoning Zhang, Tiantian Wang, Jinqing Qi, Huchuan Lu, Gang Wang
Fast and Accurate Single Image Super-Resolution via Information Distillation Network
Zheng Hui, Xiumei Wang, Xinbo Gao
Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning
Kwan-Yee Lin, Guanxiang Wang
NAG: Network for Adversary Generation
Konda Reddy Mopuri, Utkarsh Ojha, Utsav Garg, R. Venkatesh Babu
Dynamic-Structured Semantic Propagation Network
Xiaodan Liang, Hongfei Zhou, Eric Xing
Cross-Domain Self-Supervised Multi-Task Feature Learning Using Synthetic Imagery
Zhongzheng Ren, Yong Jae Lee
A Two-Step Disentanglement Method
Naama Hadad, Lior Wolf, Moni Shahar
Robust Facial Landmark Detection via a Fully-Convolutional Local-Global Context Network
Daniel Merget, Matthias Rock, Gerhard Rigoll
Decorrelated Batch Normalization
Lei Huang, Dawei Yang, Bo Lang, Jia Deng
Learning to Sketch With Shortcut Cycle Consistency
Jifei Song, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
Towards a Mathematical Understanding of the Difficulty in Learning With Feedforward Neural Networks
Hao Shen
FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis
Yujun Shen, Ping Luo, Junjie Yan, Xiaogang Wang, Xiaoou Tang
A Constrained Deep Neural Network for Ordinal Regression
Yanzhu Liu, Adams Wai Kin Kong, Chi Keong Goh
Modulated Convolutional Networks
Xiaodi Wang, Baochang Zhang, Ce Li, Rongrong Ji, Jungong Han, Xianbin Cao, Jianzhuang Liu
Learning Steerable Filters for Rotation Equivariant CNNs
Maurice Weiler, Fred A. Hamprecht, Martin Storath
Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++
David Acuna, Huan Ling, Amlan Kar, Sanja Fidler
SplineCNN: Fast Geometric Deep Learning With Continuous B-Spline Kernels
Matthias Fey, Jan Eric Lenssen, Frank Weichert, Heinrich Müller
GAGAN: Geometry-Aware Generative Adversarial Networks
Jean Kossaifi, Linh Tran, Yannis Panagakis, Maja Pantic
On the Robustness of Semantic Segmentation Models to Adversarial Attacks
Anurag Arnab, Ondrej Miksik, Philip H.S. Torr
Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence
Tianlu Wang, Kota Yamaguchi, Vicente Ordonez
Super-Resolving Very Low-Resolution Face Images With Supplementary Attributes
Xin Yu, Basura Fernando, Richard Hartley, Fatih Porikli
Frustum PointNets for 3D Object Detection From RGB-D Data
Charles R. Qi, Wei Liu, Chenxia Wu, Hao Su, Leonidas J. Guibas
W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection
Yongqiang Zhang, Yancheng Bai, Mingli Ding, Yongqiang Li, Bernard Ghanem
3D Object Detection With Latent Support Surfaces
Zhile Ren, Erik B. Sudderth
Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization
Peihua Li, Jiangtao Xie, Qilong Wang, Zilin Gao
Recurrent Scene Parsing With Perspective Understanding in the Loop
Shu Kong, Charless C. Fowlkes
Improving Occlusion and Hard Negative Handling for Single-Stage Pedestrian Detectors
Junhyug Noh, Soochan Lee, Beomsu Kim, Gunhee Kim
Learning to Act Properly: Predicting and Explaining Affordances From Images
Ching-Yao Chuang, Jiaman Li, Antonio Torralba, Sanja Fidler
Pointwise Convolutional Neural Networks
Binh-Son Hua, Minh-Khoi Tran, Sai-Kit Yeung
Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification
Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao
A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts
Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal
Tensorize, Factorize and Regularize: Robust Visual Relationship Learning
Seong Jae Hwang, Sathya N. Ravi, Zirui Tao, Hyunwoo J. Kim, Maxwell D. Collins, Vikas Singh
Transductive Unbiased Embedding for Zero-Shot Learning
Jie Song, Chengchao Shen, Yezhou Yang, Yang Liu, Mingli Song
Hierarchical Novelty Detection for Visual Object Recognition
Kibok Lee, Kimin Lee, Kyle Min, Yuting Zhang, Jinwoo Shin, Honglak Lee
Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks
Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang
Learning Rich Features for Image Manipulation Detection
Peng Zhou, Xintong Han, Vlad I. Morariu, Larry S. Davis
Human Semantic Parsing for Person Re-Identification
Mahdi M. Kalayeh, Emrah Basaran, Muhittin Gökmen, Mustafa E. Kamasak, Mubarak Shah
Stacked Latent Attention for Multimodal Reasoning
Haoqi Fan, Jiatong Zhou
R-FCN-3000 at 30fps: Decoupling Detection and Classification
Bharat Singh, Hengduo Li, Abhishek Sharma, Larry S. Davis
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Yuhong Li, Xiaofan Zhang, Deming Chen
Revisiting Knowledge Transfer for Training Object Class Detectors
Jasper Uijlings, Stefan Popov, Vittorio Ferrari
Deep Sparse Coding for Invariant Multimodal Halle Berry Neurons
Edward Kim, Darryl Hannan, Garrett Kenyon
On the Convergence of PatchMatch and Its Variants
Thibaud Ehret, Pablo Arias
Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Yu-Wei Chao, Sudheendra Vijayanarasimhan, Bryan Seybold, David A. Ross, Jia Deng, Rahul Sukthankar
MoNet: Deep Motion Exploitation for Video Object Segmentation
Huaxin Xiao, Jiashi Feng, Guosheng Lin, Yu Liu, Maojun Zhang
Video Representation Learning Using Discriminative Pooling
Jue Wang, Anoop Cherian, Fatih Porikli, Stephen Gould
Recognizing Human Actions as the Evolution of Pose Estimation Maps
Mengyuan Liu, Junsong Yuan
Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding
Dapeng Chen, Hongsheng Li, Tong Xiao, Shuai Yi, Xiaogang Wang
Mask-Guided Contrastive Attention Model for Person Re-Identification
Chunfeng Song, Yan Huang, Wanli Ouyang, Liang Wang
Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning
Yuhua Chen, Jordi Pont-Tuset, Alberto Montes, Luc Van Gool
Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H.S. Torr, Timothy M. Hospedales
COCO-Stuff: Thing and Stuff Classes in Context
Holger Caesar, Jasper Uijlings, Vittorio Ferrari
Image Generation From Scene Graphs
Justin Johnson, Agrim Gupta, Li Fei-Fei
Deep Cauchy Hashing for Hamming Space Retrieval
Yue Cao, Mingsheng Long, Bin Liu, Jianmin Wang
Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks
Dinesh Jayaraman, Kristen Grauman
Multi-Scale Location-Aware Kernel Representation for Object Detection
Hao Wang, Qilong Wang, Mingqi Gao, Peihua Li, Wangmeng Zuo
Clinical Skin Lesion Diagnosis Using Representations Inspired by Dermatologist Criteria
Jufeng Yang, Xiaoxiao Sun, Jie Liang, Paul L. Rosin
Compare and Contrast: Learning Prominent Visual Differences
Steven Chen, Kristen Grauman
Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning
Weifeng Ge, Sibei Yang, Yizhou Yu
HashGAN: Deep Learning to Hash With Pair Conditional Wasserstein GAN
Yue Cao, Bin Liu, Mingsheng Long, Jianmin Wang
Min-Entropy Latent Model for Weakly Supervised Object Detection
Fang Wan, Pengxu Wei, Jianbin Jiao, Zhenjun Han, Qixiang Ye
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Mohit Bansal, Tamara L. Berg
AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks
Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He
Adversarial Complementary Learning for Weakly Supervised Object Localization
Xiaolin Zhang, Yunchao Wei, Jiashi Feng, Yi Yang, Thomas S. Huang
Conditional Generative Adversarial Network for Structured Domain Adaptation
Weixiang Hong, Zhenzhen Wang, Ming Yang, Junsong Yuan
GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints
Fuhai Chen, Rongrong Ji, Xiaoshuai Sun, Yongjian Wu, Jinsong Su
Weakly-Supervised Semantic Segmentation by Iteratively Mining Common Object Features
Xiang Wang, Shaodi You, Xi Li, Huimin Ma
Bootstrapping the Performance of Webly Supervised Semantic Segmentation
Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid
DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection Under Partial Occlusion
Zhishuai Zhang, Cihang Xie, Jianyu Wang, Lingxi Xie, Alan L. Yuille
Geometry-Aware Scene Text Detection With Instance Transformation Network
Fangfang Wang, Liming Zhao, Xi Li, Xinchao Wang, Dacheng Tao
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun, Zhanghui Kuang, Lu Sheng, Wanli Ouyang, Wei Zhang
Motion-Guided Cascaded Refinement Network for Video Object Segmentation
Ping Hu, Gang Wang, Xiangfei Kong, Jason Kuen, Yap-Peng Tan
A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos
Sangho Lee, Jinyoung Sung, Youngjae Yu, Gunhee Kim
Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos
Hsien-Tzu Cheng, Chun-Hung Chao, Jin-Dong Dong, Hao-Kai Wen, Tyng-Luh Liu, Min Sun
Appearance-and-Relation Networks for Video Classification
Limin Wang, Wei Li, Wen Li, Luc Van Gool
Excitation Backprop for RNNs
Sarah Adel Bargal, Andrea Zunino, Donghyun Kim, Jianming Zhang, Vittorio Murino, Stan Sclaroff
One-Shot Action Localization by Learning Sequence Matching Network
Hongtao Yang, Xuming He, Fatih Porikli
Structure Preserving Video Prediction
Jingwei Xu, Bingbing Ni, Zefan Li, Shuo Cheng, Xiaokang Yang
Person Re-Identification With Cascaded Pairwise Convolutions
Yicheng Wang, Zhenzhong Chen, Feng Wu, Gang Wang
On the Importance of Label Quality for Semantic Segmentation
Aleksandar Zlateski, Ronnachai Jaroensri, Prafull Sharma, Frédo Durand
Scalable and Effective Deep CCA via Soft Decorrelation
Xiaobin Chang, Tao Xiang, Timothy M. Hospedales
Duplex Generative Adversarial Network for Unsupervised Domain Adaptation
Lanqing Hu, Meina Kan, Shiguang Shan, Xilin Chen
Edit Probability for Scene Text Recognition
Fan Bai, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Shuigeng Zhou
Global Versus Localized Generative Adversarial Nets
Guo-Jun Qi, Liheng Zhang, Hao Hu, Marzieh Edraki, Jingdong Wang, Xian-Sheng Hua
MoCoGAN: Decomposing Motion and Content for Video Generation
Sergey Tulyakov, Ming-Yu Liu, Xiaodong Yang, Jan Kautz
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan, Wuwei Lin, Xiaolin Fang, Chaoqin Huang, Bolei Zhou, Cewu Lu
Improving Landmark Localization With Semi-Supervised Learning
Sina Honari, Pavlo Molchanov, Stephen Tyree, Pascal Vincent, Christopher Pal, Jan Kautz
Adversarial Data Programming: Using GANs to Relax the Bottleneck of Curated Labeled Data
Arghya Pal, Vineeth N. Balasubramanian
Stochastic Variational Inference With Gradient Linearization
Tobias Plötz, Anne S. Wannenwetsch, Stefan Roth
Multi-Label Zero-Shot Learning With Structured Knowledge Graphs
Chung-Wei Lee, Wei Fang, Chih-Kuan Yeh, Yu-Chiang Frank Wang
MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks
Ariel Gordon, Elad Eban, Ofir Nachum, Bo Chen, Hao Wu, Tien-Ju Yang, Edward Choi
Deep Adversarial Subspace Clustering
Pan Zhou, Yunqing Hou, Jiashi Feng
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection
Keze Wang, Xiaopeng Yan, Dongyu Zhang, Lei Zhang, Liang Lin
Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs
Emanuel Laude, Jan-Hendrik Lange, Jonas Schüpfer, Csaba Domokos, Laura Leal-Taixé, Frank R. Schmidt, Bjoern Andres, Daniel Cremers
Robust Physical-World Attacks on Deep Learning Visual Classification
Kevin Eykholt, Ivan Evtimov, Earlence Fernandes, Bo Li, Amir Rahmati, Chaowei Xiao, Atul Prakash, Tadayoshi Kohno, Dawn Song
Generating a Fusion Image: One's Identity and Another's Shape
DongGyu Joo, Doyeon Kim, Junmo Kim
Learning to Promote Saliency Detectors
Yu Zeng, Huchuan Lu, Lihe Zhang, Mengyang Feng, Ali Borji
Image Super-Resolution via Dual-State Recurrent Networks
Wei Han, Shiyu Chang, Ding Liu, Mo Yu, Michael Witbrock, Thomas S. Huang
Deep Back-Projection Networks for Super-Resolution
Muhammad Haris, Gregory Shakhnarovich, Norimichi Ukita
Focus Manipulation Detection via Photometric Histogram Analysis
Can Chen, Scott McCloskey, Jingyi Yu
Compassionately Conservative Balanced Cuts for Image Segmentation
Nathan D. Cahill, Tyler L. Hayes, Renee T. Meinhold, John F. Hamilton
A High-Quality Denoising Dataset for Smartphone Cameras
Abdelrahman Abdelhamed, Stephen Lin, Michael S. Brown
Context-Aware Synthesis for Video Frame Interpolation
Simon Niklaus, Feng Liu
Salient Object Detection Driven by Fixation Prediction
Wenguan Wang, Jianbing Shen, Xingping Dong, Ali Borji
Enhancing the Spatial Resolution of Stereo Images Using a Parallax Prior
Daniel S. Jeon, Seung-Hwan Baek, Inchang Choi, Min H. Kim
HATS: Histograms of Averaged Time Surfaces for Robust Event-Based Object Classification
Amos Sironi, Manuele Brambilla, Nicolas Bourdis, Xavier Lagorce, Ryad Benosman
A Bi-Directional Message Passing Model for Salient Object Detection
Lu Zhang, Ju Dai, Huchuan Lu, You He, Gang Wang
Matching Pixels Using Co-Occurrence Statistics
Rotal Kat, Roy Jevnisek, Shai Avidan
SeedNet: Automatic Seed Generation With Deep Reinforcement Learning for Robust Interactive Segmentation
Gwangmo Song, Heesoo Myeong, Kyoung Mu Lee
Jerk-Aware Video Acceleration Magnification
Shoichiro Takeda, Kazuki Okami, Dan Mikami, Megumi Isogai, Hideaki Kimata
Defense Against Adversarial Attacks Using High-Level Representation Guided Denoiser
Fangzhou Liao, Ming Liang, Yinpeng Dong, Tianyu Pang, Xiaolin Hu, Jun Zhu
Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal
Jifeng Wang, Xiang Li, Jian Yang
Image Correction via Deep Reciprocating HDR Transformation
Xin Yang, Ke Xu, Yibing Song, Qiang Zhang, Xiaopeng Wei, Rynson W.H. Lau
PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference
Ekta Prashnani, Hong Cai, Yasamin Mostofi, Pradeep Sen
Normalized Cut Loss for Weakly-Supervised CNN Segmentation
Meng Tang, Abdelaziz Djelouah, Federico Perazzi, Yuri Boykov, Christopher Schroers
ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing
Jian Zhang, Bernard Ghanem
Fast End-to-End Trainable Guided Filter
Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang
Disentangling Structure and Aesthetics for Style-Aware Image Completion
Andrew Gilbert, John Collomosse, Hailin Jin, Brian Price
Learning a Discriminative Feature Network for Semantic Segmentation
Changqian Yu, Jingbo Wang, Chao Peng, Changxin Gao, Gang Yu, Nong Sang
Kernelized Subspace Pooling for Deep Local Descriptors
Xing Wei, Yue Zhang, Yihong Gong, Nanning Zheng
pOSE: Pseudo Object Space Error for Initialization-Free Bundle Adjustment
Je Hyeong Hong, Christopher Zach
Deformable Shape Completion With Graph Convolutional Autoencoders
Or Litany, Alex Bronstein, Michael Bronstein, Ameesh Makadia
Learning From Millions of 3D Scans for Large-Scale 3D Face Recognition
Syed Zulqarnain Gilani, Ajmal Mian
CarFusion: Combining Point Tracking and Part Detection for Dynamic 3D Reconstruction of Vehicles
N. Dinesh Reddy, Minh Vo, Srinivasa G. Narasimhan
Deep Material-Aware Cross-Spectral Stereo Matching
Tiancheng Zhi, Bernardo R. Pires, Martial Hebert, Srinivasa G. Narasimhan
Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections
True Price, Johannes L. Schönberger, Zhen Wei, Marc Pollefeys, Jan-Michael Frahm
Matryoshka Networks: Predicting 3D Geometry via Nested Shape Layers
Stephan R. Richter, Stefan Roth
Triplet-Center Loss for Multi-View 3D Object Retrieval
Xinwei He, Yang Zhou, Zhichao Zhou, Song Bai, Xiang Bai
Learning 3D Shape Completion From Laser Scan Data With Weak Supervision
David Stutz, Andreas Geiger
End-to-End Learning of Keypoint Detector and Descriptor for Pose Invariant 3D Matching
Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jan Ernst, Jana Košecká
ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for Visual-Inertial SLAM
Haomin Liu, Mingyu Chen, Guofeng Zhang, Hujun Bao, Yingze Bao
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
Zhichao Yin, Jianping Shi
Radially-Distorted Conjugate Translations
James Pritts, Zuzana Kukelova, Viktor Larsson, Ondřej Chum
Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu, Mingming Gong, Chaohui Wang, Kayhan Batmanghelich, Dacheng Tao
Analytical Modeling of Vanishing Points and Curves in Catadioptric Cameras
Pedro Miraldo, Francisco Eiras, Srikumar Ramalingam
Learning Depth From Monocular Videos Using Direct Methods
Chaoyang Wang, José Miguel Buenaposada, Rui Zhu, Simon Lucey
Salience Guided Depth Calibration for Perceptually Optimized Compressive Light Field 3D Display
Shizheng Wang, Wenjuan Liao, Phil Surman, Zhigang Tu, Yuanjin Zheng, Junsong Yuan
MegaDepth: Learning Single-View Depth Prediction From Internet Photos
Zhengqi Li, Noah Snavely
LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image
Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem
CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation
Konstantinos Batsos, Changjiang Cai, Philippos Mordohai
Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains
Jiahao Pang, Wenxiu Sun, Chengxi Yang, Jimmy Ren, Ruichao Xiao, Jin Zeng, Liang Lin
Exploring Disentangled Feature Representation Beyond Face Identification
Yu Liu, Fangyin Wei, Jing Shao, Lu Sheng, Junjie Yan, Xiaogang Wang
Learning Facial Action Units From Web Images With Scalable Weakly Supervised Clustering
Kaili Zhao, Wen-Sheng Chu, Aleix M. Martinez
Human Pose Estimation With Parsing Induced Learner
Xuecheng Nie, Jiashi Feng, Yiming Zuo, Shuicheng Yan
Multi-Level Factorisation Net for Person Re-Identification
Xiaobin Chang, Timothy M. Hospedales, Tao Xiang
Attention-Aware Compositional Network for Person Re-Identification
Jing Xu, Rui Zhao, Feng Zhu, Huaming Wang, Wanli Ouyang
Look at Boundary: A Boundary-Aware Face Alignment Algorithm
Wayne Wu, Chen Qian, Shuo Yang, Quan Wang, Yici Cai, Qiang Zhou
Demo2Vec: Reasoning Object Affordances From Online Videos
Kuan Fang, Te-Lin Wu, Daniel Yang, Silvio Savarese, Joseph J. Lim
Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes - The Importance of Multiple Scene Constraints
Andrei Zanfir, Elisabeta Marinoiu, Cristian Sminchisescu
3D Human Sensing, Action and Emotion Recognition in Robot Assisted Therapy of Children With Autism
Elisabeta Marinoiu, Mihai Zanfir, Vlad Olaru, Cristian Sminchisescu
Facial Expression Recognition by De-Expression Residue Learning
Huiyuan Yang, Umur Ciftci, Lijun Yin
A Causal And-Or Graph Model for Visibility Fluent Reasoning in Tracking Interacting Objects
Yuanlu Xu, Lei Qin, Xiaobai Liu, Jianwen Xie, Song-Chun Zhu
Weakly Supervised Facial Action Unit Recognition Through Adversarial Training
Guozhu Peng, Shangfei Wang
Non-Linear Temporal Subspace Representations for Activity Recognition
Anoop Cherian, Suvrit Sra, Stephen Gould, Richard Hartley
Towards Pose Invariant Face Recognition in the Wild
Jian Zhao, Yu Cheng, Yan Xu, Lin Xiong, Jianshu Li, Fang Zhao, Karlekar Jayashree, Sugiri Pranata, Shengmei Shen, Junliang Xing, Shuicheng Yan, Jiashi Feng
Unifying Identification and Context Learning for Person Recognition
Qingqiu Huang, Yu Xiong, Dahua Lin
Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation
Xi Peng, Zhiqiang Tang, Fei Yang, Rogerio S. Feris, Dimitris Metaxas
Wing Loss for Robust Facial Landmark Localisation With Convolutional Neural Networks
Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu
Multiple Granularity Group Interaction Prediction
Taiping Yao, Minsi Wang, Bingbing Ni, Huawei Wei, Xiaokang Yang
Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks
Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, Alexandre Alahi
Deep Group-Shuffling Random Walk for Person Re-Identification
Yantao Shen, Hongsheng Li, Tong Xiao, Shuai Yi, Dapeng Chen, Xiaogang Wang
Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-Identification
Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
Harmonious Attention Network for Person Re-Identification
Wei Li, Xiatian Zhu, Shaogang Gong
Real-Time Rotation-Invariant Face Detection With Progressive Calibration Networks
Xuepeng Shi, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen
Deep Regression Forests for Age Estimation
Wei Shen, Yilu Guo, Yan Wang, Kai Zhao, Bo Wang, Alan L. Yuille
Weakly-Supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation
Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji
Memory Based Online Learning of Deep Representations From Video Streams
Federico Pernici, Federico Bartoli, Matteo Bruni, Alberto Del Bimbo
Efficient and Deep Person Re-Identification Using Multi-Level Similarity
Yiluan Guo, Ngai-Man Cheung
Multi-Level Fusion Based 3D Object Detection From Monocular Images
Bin Xu, Zhenzhong Chen
A Perceptual Measure for Deep Single Image Camera Calibration
Yannick Hold-Geoffroy, Kalyan Sunkavalli, Jonathan Eisenmann, Matthew Fisher, Emiliano Gambaretto, Sunil Hadap, Jean-François Lalonde
Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks
Wei Xiong, Wenhan Luo, Lin Ma, Wei Liu, Jiebo Luo
Document Enhancement Using Visibility Detection
Netanel Kligler, Sagi Katz, Ayellet Tal
A Weighted Sparse Sampling and Smoothing Frame Transition Approach for Semantic Fast-Forward First-Person Videos
Michel Silva, Washington Ramos, João Ferreira, Felipe Chamone, Mario Campos, Erickson R. Nascimento
Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation
Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang
Deep Layer Aggregation
Fisher Yu, Dequan Wang, Evan Shelhamer, Trevor Darrell
Convolutional Neural Networks With Alternately Updated Clique
Yibo Yang, Zhisheng Zhong, Tiancheng Shen, Zhouchen Lin
Practical Block-Wise Neural Network Architecture Generation
Zhao Zhong, Junjie Yan, Wei Wu, Jing Shao, Cheng-Lin Liu
xUnit: Learning a Spatial Activation Function for Efficient Image Restoration
Idan Kligvasser, Tamar Rott Shaham, Tomer Michaeli
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning
Ke Yu, Chao Dong, Liang Lin, Chen Change Loy
Deformation Aware Image Compression
Tamar Rott Shaham, Tomer Michaeli
Distributable Consistent Multi-Object Matching
Nan Hu, Qixing Huang, Boris Thibert, Leonidas J. Guibas
Residual Dense Network for Image Super-Resolution
Yulun Zhang, Yapeng Tian, Yu Kong, Bineng Zhong, Yun Fu
Attentive Generative Adversarial Network for Raindrop Removal From a Single Image
Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, Jiaying Liu
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors
Yu Chen, Ying Tai, Xiaoming Liu, Chunhua Shen, Jian Yang
Burst Denoising With Kernel Prediction Networks
Ben Mildenhall, Jonathan T. Barron, Jiawen Chen, Dillon Sharlet, Ren Ng, Robert Carroll
Unsupervised Sparse Dirichlet-Net for Hyperspectral Image Super-Resolution
Ying Qu, Hairong Qi, Chiman Kwan
Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks
Jiawei Zhang, Jinshan Pan, Jimmy Ren, Yibing Song, Linchao Bao, Rynson W.H. Lau, Ming-Hsuan Yang
SPLATNet: Sparse Lattice Networks for Point Cloud Processing
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, Jan Kautz
Surface Networks
Ilya Kostrikov, Zhongshi Jiang, Daniele Panozzo, Denis Zorin, Joan Bruna
Self-Supervised Multi-Level Face Model Learning for Monocular Reconstruction at Over 250 Hz
Ayush Tewari, Michael Zollhöfer, Pablo Garrido, Florian Bernard, Hyeongwoo Kim, Patrick Pérez, Christian Theobalt
CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM
Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison
SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation
Weiyue Wang, Ronald Yu, Qiangui Huang, Ulrich Neumann
PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image
Chen Liu, Jimei Yang, Duygu Ceylan, Ersin Yumer, Yasutaka Furukawa
Deep Parametric Continuous Convolutional Neural Networks
Shenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, Raquel Urtasun
FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis
Nitika Verma, Edmond Boyer, Jakob Verbeek
Image Collection Pop-Up: 3D Reconstruction and Clustering of Rigid and Non-Rigid Categories
Antonio Agudo, Melcior Pijoan, Francesc Moreno-Noguer
Geometry-Aware Learning of Maps for Camera Localization
Samarth Brahmbhatt, Jinwei Gu, Kihwan Kim, James Hays, Jan Kautz
Recurrent Slice Networks for 3D Segmentation of Point Clouds
Qiangui Huang, Weiyue Wang, Ulrich Neumann
Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals
Shanxin Yuan, Guillermo Garcia-Hernando, Björn Stenger, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee, Pavlo Molchanov, Jan Kautz, Sina Honari, Liuhao Ge, Junsong Yuan, Xinghao Chen, Guijin Wang, Fan Yang, Kai Akiyama, Yang Wu, Qingfu Wan, Meysam Madadi, Sergio Escalera, Shile Li, Dongheui Lee, Iason Oikonomidis, Antonis Argyros, Tae-Kyun Kim
SobolevFusion: 3D Reconstruction of Scenes Undergoing Free Non-Rigid Motion
Miroslava Slavcheva, Maximilian Baust, Slobodan Ilic
AdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation
Jogendra Nath Kundu, Phani Krishna Uppala, Anuj Pahuja, R. Venkatesh Babu
Learning to Find Good Correspondences
Kwang Moo Yi, Eduard Trulls, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua
OATM: Occlusion Aware Template Matching by Consensus Set Maximization
Simon Korman, Mark Milam, Stefano Soatto
Deep Learning of Graph Matching
Andrei Zanfir, Cristian Sminchisescu
Unsupervised Discovery of Object Landmarks as Structural Representations
Yuting Zhang, Yijie Guo, Yixin Jin, Yijun Luo, Zhiyuan He, Honglak Lee
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob, Skirmantas Kligys, Bo Chen, Menglong Zhu, Matthew Tang, Andrew Howard, Hartwig Adam, Dmitry Kalenichenko
Lean Multiclass Crowdsourcing
Grant Van Horn, Steve Branson, Scott Loarie, Serge Belongie, Pietro Perona
Partial Transfer Learning With Selective Adversarial Networks
Zhangjie Cao, Mingsheng Long, Jianmin Wang, Michael I. Jordan
Self-Supervised Feature Learning by Learning to Spot Artifacts
Simon Jenni, Paolo Favaro
LDMNet: Low Dimensional Manifold Regularized Neural Networks
Wei Zhu, Qiang Qiu, Jiaji Huang, Robert Calderbank, Guillermo Sapiro, Ingrid Daubechies
CondenseNet: An Efficient DenseNet Using Learned Group Convolutions
Gao Huang, Shichen Liu, Laurens van der Maaten, Kilian Q. Weinberger
Learning Deep Descriptors With Scale-Aware Triplet Networks
Michel Keller, Zetao Chen, Fabiola Maffra, Patrik Schmuck, Margarita Chli
Decoupled Networks
Weiyang Liu, Zhen Liu, Zhiding Yu, Bo Dai, Rongmei Lin, Yisen Wang, James M. Rehg, Le Song
Deep Adversarial Metric Learning
Yueqi Duan, Wenzhao Zheng, Xudong Lin, Jiwen Lu, Jie Zhou
PU-Net: Point Cloud Upsampling Network
Lequan Yu, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
Real-Time Monocular Depth Estimation Using Synthetic Data With Domain Adaptation via Image Style Transfer
Amir Atapour-Abarghouei, Toby P. Breckon
Learning for Disparity Estimation Through Feature Constancy
Zhengfa Liang, Yiliu Feng, Yulan Guo, Hengzhu Liu, Wei Chen, Linbo Qiao, Li Zhou, Jianfeng Zhang
DeepMVS: Learning Multi-View Stereopsis
Po-Han Huang, Kevin Matzen, Johannes Kopf, Narendra Ahuja, Jia-Bin Huang
Self-Calibrating Polarising Radiometric Calibration
Daniel Teo, Boxin Shi, Yinqiang Zheng, Sai-Kit Yeung
Coding Kendall's Shape Trajectories for 3D Action Recognition
Amor Ben Tanfous, Hassen Drira, Boulbaba Ben Amor
Efficient, Sparse Representation of Manifold Distance Matrices for Classical Scaling
Javier S. Turek, Alexander G. Huth
Motion Segmentation by Exploiting Complementary Geometric Models
Xun Xu, Loong Fah Cheong, Zhuwen Li
Estimation of Camera Locations in Highly Corrupted Scenarios: All About That Base, No Shape Trouble
Yunpeng Shi, Gilad Lerman
4D Human Body Correspondences From Panoramic Depth Maps
Zhong Li, Minye Wu, Wangyiteng Zhou, Jingyi Yu
Reconstructing Thin Structures of Manifold Surfaces by Integrating Spatial Curves
Shiwei Li, Yao Yao, Tian Fang, Long Quan
Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction
Shubham Tulsiani, Alexei A. Efros, Jitendra Malik
Probabilistic Plant Modeling via Multi-View Image-to-Image Translation
Takahiro Isokane, Fumio Okura, Ayaka Ide, Yasuyuki Matsushita, Yasushi Yagi
Deep Marching Cubes: Learning Explicit Surface Representations
Yiyi Liao, Simon Donné, Andreas Geiger
Tags2Parts: Discovering Semantic Regions From Shape Tags
Sanjeev Muralikrishnan, Vladimir G. Kim, Siddhartha Chaudhuri
Uncalibrated Photometric Stereo Under Natural Illumination
Zhipeng Mo, Boxin Shi, Feng Lu, Sai-Kit Yeung, Yasuyuki Matsushita
Robust Depth Estimation From Auto Bracketed Images
Sunghoon Im, Hae-Gon Jeon, In So Kweon
Free Supervision From Video Games
Philipp Krähenbühl
Planar Shape Detection at Structural Scales
Hao Fang, Florent Lafarge, Mathieu Desbrun
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
Xingyuan Sun, Jiajun Wu, Xiuming Zhang, Zhoutong Zhang, Chengkai Zhang, Tianfan Xue, Joshua B. Tenenbaum, William T. Freeman
Camera Pose Estimation With Unknown Principal Point
Viktor Larsson, Zuzana Kukelova, Yinqiang Zheng
Inverse Composition Discriminative Optimization for Point Cloud Registration
Jayakorn Vongkulbhisal, Beñat Irastorza Ugalde, Fernando De la Torre, João P. Costeira
SurfConv: Bridging 3D and 2D Convolution for RGBD Images
Hang Chu, Wei-Chiu Ma, Kaustav Kundu, Raquel Urtasun, Sanja Fidler
A Fast Resection-Intersection Method for the Known Rotation Problem
Qianggong Zhang, Tat-Jun Chin, Huu Minh Le
3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
Alexander Grabner, Peter M. Roth, Vincent Lepetit
Structure From Recurrent Motion: From Rigidity to Recurrency
Xiu Li, Hongdong Li, Hanbyul Joo, Yebin Liu, Yaser Sheikh
Learning Patch Reconstructability for Accelerating Multi-View Stereo
Alex Poms, Chenglei Wu, Shoou-I Yu, Yaser Sheikh
Progressively Complementarity-Aware Fusion Network for RGB-D Salient Object Detection
Hao Chen, Youfu Li
Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction
Daeyun Shin, Charless C. Fowlkes, Derek Hoiem
Learning Dual Convolutional Neural Networks for Low-Level Vision
Jinshan Pan, Sifei Liu, Deqing Sun, Jiawei Zhang, Yang Liu, Jimmy Ren, Zechao Li, Jinhui Tang, Huchuan Lu, Yu-Wing Tai, Ming-Hsuan Yang
Defocus Blur Detection via Multi-Stream Bottom-Top-Bottom Fully Convolutional Network
Wenda Zhao, Fan Zhao, Dong Wang, Huchuan Lu
PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection
Nian Liu, Junwei Han, Ming-Hsuan Yang
Curve Reconstruction via the Global Statistics of Natural Curves
Ehud Barnea, Ohad Ben-Shahar
What Do Deep Networks Like to See?
Sebastian Palacio, Joachim Folz, Jörn Hees, Federico Raue, Damian Borth, Andreas Dengel
“Zero-Shot” Super-Resolution Using Deep Internal Learning
Assaf Shocher, Nadav Cohen, Michal Irani
Detect Globally, Refine Locally: A Novel Approach to Saliency Detection
Tiantian Wang, Lihe Zhang, Shuo Wang, Huchuan Lu, Gang Yang, Xiang Ruan, Ali Borji
Beyond the Pixel-Wise Loss for Topology-Aware Delineation
Agata Mosinska, Pablo Márquez-Neila, Mateusz Koziński, Pascal Fua
KIPPI: KInetic Polygonal Partitioning of Images
Jean-Philippe Bauchet, Florent Lafarge
Image Blind Denoising With Generative Adversarial Network Based Noise Modeling
Jingwen Chen, Jiawei Chen, Hongyang Chao, Ming Yang
Multi-Scale Weighted Nuclear Norm Image Restoration
Noam Yair, Tomer Michaeli
MoNet: Moments Embedding Network
Mengran Gou, Fei Xiong, Octavia Camps, Mario Sznaier
Active Fixation Control to Predict Saccade Sequences
Calden Wloka, Iuliia Kotseruba, John K. Tsotsos
Densely Connected Pyramid Dehazing Network
He Zhang, Vishal M. Patel
Universal Denoising Networks: A Novel CNN Architecture for Image Denoising
Stamatios Lefkimmiatis
Learning Convolutional Networks for Content-Weighted Image Compression
Mu Li, Wangmeng Zuo, Shuhang Gu, Debin Zhao, David Zhang
Deep Video Super-Resolution Network Using Dynamic Upsampling Filters Without Explicit Motion Compensation
Younghyun Jo, Seoung Wug Oh, Jaeyeon Kang, Seon Joo Kim
Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos
Jiaying Liu, Wenhan Yang, Shuai Yang, Zongming Guo
Flow Guided Recurrent Neural Encoder for Video Salient Object Detection
Guanbin Li, Yuan Xie, Tianhao Wei, Keze Wang, Liang Lin
Gated Fusion Network for Single Image Dehazing
Wenqi Ren, Lin Ma, Jiawei Zhang, Jinshan Pan, Xiaochun Cao, Wei Liu, Ming-Hsuan Yang
Learning a Single Convolutional Super-Resolution Network for Multiple Degradations
Kai Zhang, Wangmeng Zuo, Lei Zhang
Non-Blind Deblurring: Handling Kernel Uncertainty With CNNs
Subeesh Vasu, Venkatesh Reddy Maligireddy, A. N. Rajagopalan
Boundary Flow: A Siamese Network That Predicts Boundary Motion Without Training on Motion
Peng Lei, Fuxin Li, Sinisa Todorovic
Learning to See in the Dark
Chen Chen, Qifeng Chen, Jia Xu, Vladlen Koltun
BPGrad: Towards Global Optimality in Deep Learning via Branch and Pruning
Ziming Zhang, Yuanwei Wu, Guanghui Wang
Perturbative Neural Networks
Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides
Unsupervised Correlation Analysis
Yedid Hoshen, Lior Wolf
A Biresolution Spectral Framework for Product Quantization
Lopamudra Mukherjee, Sathya N. Ravi, Jiming Peng, Vikas Singh
Domain Adaptive Faster R-CNN for Object Detection in the Wild
Yuhua Chen, Wen Li, Christos Sakaridis, Dengxin Dai, Luc Van Gool
Low-Shot Learning With Large-Scale Diffusion
Matthijs Douze, Arthur Szlam, Bharath Hariharan, Hervé Jégou
Joint Pose and Expression Modeling for Facial Expression Recognition
Feifei Zhang, Tianzhu Zhang, Qirong Mao, Changsheng Xu
Lightweight Probabilistic Deep Networks
Jochen Gast, Stefan Roth
Adversarially Learned One-Class Classifier for Novelty Detection
Mohammad Sabokrou, Mohammad Khalooei, Mahmood Fathy, Ehsan Adeli
Defense Against Universal Adversarial Perturbations
Naveed Akhtar, Jian Liu, Ajmal Mian
Disentangling Factors of Variation by Mixing Them
Qiyang Hu, Attila Szabó, Tiziano Portenier, Paolo Favaro, Matthias Zwicker
Deformable GANs for Pose-Based Human Image Generation
Aliaksandr Siarohin, Enver Sangineto, Stéphane Lathuilière, Nicu Sebe
Hierarchical Recurrent Attention Networks for Structured Online Maps
Namdar Homayounfar, Wei-Chiu Ma, Shrinidhi Kowshika Lakshmikanth, Raquel Urtasun
Sliced Wasserstein Distance for Learning Gaussian Mixture Models
Soheil Kolouri, Gustavo K. Rohde, Heiko Hoffmann
Aligning Infinite-Dimensional Covariance Matrices in Reproducing Kernel Hilbert Spaces for Domain Adaptation
Zhen Zhang, Mianzhi Wang, Yan Huang, Arye Nehorai
CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition
Jedrzej Kozerawski, Matthew Turk
Local and Global Optimization Techniques in Graph-Based Clustering
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
Multi-Task Learning by Maximizing Statistical Dependence
Youssef A. Mejjati, Darren Cosker, Kwang In Kim
Robust Classification With Convolutional Prototype Learning
Hong-Ming Yang, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu
Generative Modeling Using the Sliced Wasserstein Distance
Ishan Deshpande, Ziyu Zhang, Alexander G. Schwing
Learning Time/Memory-Efficient Deep Architectures With Budgeted Super Networks
Tom Véniat, Ludovic Denoyer
Cross-View Image Synthesis Using Conditional GANs
Krishna Regmi, Ali Borji
Sparse, Smart Contours to Represent and Edit Images
Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman
Anticipating Traffic Accidents With Adaptive Loss and Large-Scale Incident DB
Tomoyuki Suzuki, Hirokatsu Kataoka, Yoshimitsu Aoki, Yutaka Satoh
A Minimalist Approach to Type-Agnostic Detection of Quadrics in Point Clouds
Tolga Birdal, Benjamin Busam, Nassir Navab, Slobodan Ilic, Peter Sturm
Facelet-Bank for Fast Portrait Manipulation
Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Xiaoyong Shen, Yangang Ye, Jiaya Jia
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg
3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare
Abhijit Kundu, Yin Li, James M. Rehg
Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting With a Single Convolutional Net
Wenjie Luo, Bin Yang, Raquel Urtasun
An Analysis of Scale Invariance in Object Detection - SNIP
Bharat Singh, Larry S. Davis
Relation Networks for Object Detection
Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei
Zero-Shot Sketch-Image Hashing
Yuming Shen, Li Liu, Fumin Shen, Ling Shao
VizWiz Grand Challenge: Answering Visual Questions From Blind People
Danna Gurari, Qing Li, Abigale J. Stangl, Anhong Guo, Chi Lin, Kristen Grauman, Jiebo Luo, Jeffrey P. Bigham
Divide and Grow: Capturing Huge Diversity in Crowd Images With Incrementally Growing CNN
Deepak Babu Sam, Neeraj N. Sajjan, R. Venkatesh Babu, Mukundhan Srinivasan
Structured Set Matching Networks for One-Shot Part Labeling
Jonghyun Choi, Jayant Krishnamurthy, Aniruddha Kembhavi, Ali Farhadi
Self-Supervised Learning of Geometrically Stable Features Through Probabilistic Introspection
David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi
Link and Code: Fast Indexing With Graphs and Compact Regression Codes
Matthijs Douze, Alexandre Sablayrolles, Hervé Jégou
Textbook Question Answering Under Instructor Guidance With Memory Networks
Juzheng Li, Hang Su, Jun Zhu, Siyu Wang, Bo Zhang
Unsupervised Deep Generative Adversarial Hashing Network
Kamran Ghasedi Dizaji, Feng Zheng, Najmeh Sadoughi, Yanhua Yang, Cheng Deng, Heng Huang
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian Reid, Stephen Gould, Anton van den Hengel
DenseASPP for Semantic Segmentation in Street Scenes
Maoke Yang, Kun Yu, Chi Zhang, Zhiwei Li, Kuiyuan Yang
Efficient Optimization for Rank-Based Loss Functions
Pritish Mohapatra, Michal Rolínek, C.V. Jawahar, Vladimir Kolmogorov, M. Pawan Kumar
Wasserstein Introspective Neural Networks
Kwonjoon Lee, Weijian Xu, Fan Fan, Zhuowen Tu
Taskonomy: Disentangling Task Transfer Learning
Amir R. Zamir, Alexander Sax, William Shen, Leonidas J. Guibas, Jitendra Malik, Silvio Savarese
Maximum Classifier Discrepancy for Unsupervised Domain Adaptation
Kuniaki Saito, Kohei Watanabe, Yoshitaka Ushiku, Tatsuya Harada
Unsupervised Feature Learning via Non-Parametric Instance Discrimination
Zhirong Wu, Yuanjun Xiong, Stella X. Yu, Dahua Lin
Multi-Task Adversarial Network for Disentangled Feature Learning
Yang Liu, Zhaowen Wang, Hailin Jin, Ian Wassell
Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation
Swami Sankaranarayanan, Yogesh Balaji, Arpit Jain, Ser Nam Lim, Rama Chellappa
Empirical Study of the Topology and Geometry of Deep Networks
Alhussein Fawzi, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard, Stefano Soatto
Boosting Domain Adaptation by Discovering Latent Domains
Massimiliano Mancini, Lorenzo Porzi, Samuel Rota Bulò, Barbara Caputo, Elisa Ricci
Shape From Shading Through Shape Evolution
Dawei Yang, Jia Deng
Weakly Supervised Instance Segmentation Using Class Peak Response
Yanzhao Zhou, Yi Zhu, Qixiang Ye, Qiang Qiu, Jianbin Jiao
Collaborative and Adversarial Network for Unsupervised Domain Adaptation
Weichen Zhang, Wanli Ouyang, Wen Li, Dong Xu
Environment Upgrade Reinforcement Learning for Non-Differentiable Multi-Stage Pipelines
Shuqin Xie, Zitian Chen, Chao Xu, Cewu Lu
Teaching Categories to Human Learners With Visual Explanations
Oisin Mac Aodha, Shihan Su, Yuxin Chen, Pietro Perona, Yisong Yue
Density Adaptive Point Set Registration
Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Per-Erik Forssén, Michael Felsberg
Left-Right Comparative Recurrent Model for Stereo Matching
Zequn Jie, Pengfei Wang, Yonggen Ling, Bo Zhao, Yunchao Wei, Jiashi Feng, Wei Liu
Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View
Shuran Song, Andy Zeng, Angel X. Chang, Manolis Savva, Silvio Savarese, Thomas Funkhouser
Polarimetric Dense Monocular SLAM
Luwei Yang, Feitong Tan, Ao Li, Zhaopeng Cui, Yasutaka Furukawa, Ping Tan
A Unifying Contrast Maximization Framework for Event Cameras, With Applications to Motion, Depth, and Optical Flow Estimation
Guillermo Gallego, Henri Rebecq, Davide Scaramuzza
Modeling Facial Geometry Using Compositional VAEs
Timur Bagautdinov, Chenglei Wu, Jason Saragih, Pascal Fua, Yaser Sheikh
Tangent Convolutions for Dense Prediction in 3D
Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, Qian-Yi Zhou
RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials
Despoina Paschalidou, Osman Ulusoy, Carolin Schmitt, Luc Van Gool, Andreas Geiger
Neural 3D Mesh Renderer
Hiroharu Kato, Yoshitaka Ushiku, Tatsuya Harada
Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
Dan Xu, Wei Wang, Hao Tang, Hong Liu, Nicu Sebe, Elisa Ricci
Automatic 3D Indoor Scene Modeling From Single Panorama
Yang Yang, Shi Jin, Ruiyang Liu, Sing Bing Kang, Jingyi Yu
Extreme 3D Face Reconstruction: Seeing Through Occlusions
Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Eran Paz, Yuval Nirkin, Gérard Medioni
Beyond Gröbner Bases: Basis Selection for Minimal Solvers
Viktor Larsson, Magnus Oskarsson, Kalle Astrom, Alge Wallis, Zuzana Kukelova, Tomas Pajdla
Lions and Tigers and Bears: Capturing Non-Rigid, 3D, Articulated Shape From Images
Silvia Zuffi, Angjoo Kanazawa, Michael J. Black
Deep Cocktail Network: Multi-Source Unsupervised Domain Adaptation With Category Shift
Ruijia Xu, Ziliang Chen, Wangmeng Zuo, Junjie Yan, Liang Lin
DOTA: A Large-Scale Dataset for Object Detection in Aerial Images
Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang
Finding Beans in Burgers: Deep Semantic-Visual Embedding With Localization
Martin Engilberge, Louis Chevallier, Patrick Pérez, Matthieu Cord
Feature Super-Resolution: Make Machine See More Clearly
Weimin Tan, Bo Yan, Bahetiyaer Bare
ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information
Rodney LaLonde, Dong Zhang, Mubarak Shah
MaskLab: Instance Segmentation by Refining Object Detection With Semantic and Direction Features
Liang-Chieh Chen, Alexander Hermans, George Papandreou, Florian Schroff, Peng Wang, Hartwig Adam
Hashing as Tie-Aware Learning to Rank
Kun He, Fatih Cakir, Sarah Adel Bargal, Stan Sclaroff
Classification-Driven Dynamic Image Enhancement
Vivek Sharma, Ali Diba, Davy Neven, Michael S. Brown, Luc Van Gool, Rainer Stiefelhagen
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Kan Chen, Jiyang Gao, Ram Nevatia
Who Let the Dogs Out? Modeling Dog Behavior From Visual Data
Kiana Ehsani, Hessam Bagherinezhad, Joseph Redmon, Roozbeh Mottaghi, Ali Farhadi
Pseudo Mask Augmented Object Detection
Xiangyun Zhao, Shuang Liang, Yichen Wei
Dual Skipping Networks
Changmao Cheng, Yanwei Fu, Yu-Gang Jiang, Wei Liu, Wenlian Lu, Jianfeng Feng, Xiangyang Xue
Memory Matching Networks for One-Shot Image Recognition
Qi Cai, Yingwei Pan, Ting Yao, Chenggang Yan, Tao Mei
IQA: Visual Question Answering in Interactive Environments
Daniel Gordon, Aniruddha Kembhavi, Mohammad Rastegari, Joseph Redmon, Dieter Fox, Ali Farhadi
Pose Transferrable Person Re-Identification
Jinxian Liu, Bingbing Ni, Yichao Yan, Peng Zhou, Shuo Cheng, Jianguo Hu
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
Yin Cui, Yang Song, Chen Sun, Andrew Howard, Serge Belongie
Data Distillation: Towards Omni-Supervised Learning
Ilija Radosavovic, Piotr Dollár, Ross Girshick, Georgia Gkioxari, Kaiming He
Object Referring in Videos With Language and Human Gaze
Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool
Feature Selective Networks for Object Detection
Yao Zhai, Jingjing Fu, Yan Lu, Houqiang Li
Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition
Yaming Wang, Vlad I. Morariu, Larry S. Davis
Grounding Referring Expressions in Images by Variational Context
Hanwang Zhang, Yulei Niu, Shih-Fu Chang
Dynamic Graph Generation Network: Generating Relational Knowledge From Diagrams
Daesik Kim, YoungJoon Yoo, Jee-Soo Kim, SangKuk Lee, Nojun Kwak
A Network Architecture for Point Cloud Classification via Automatic Depth Images Generation
Riccardo Roveri, Lukas Rahmann, Cengiz Oztireli, Markus Gross
Towards Dense Object Tracking in a 2D Honeybee Hive
Katarzyna Bozek, Laetitia Hebert, Alexander S. Mikheyev, Greg J. Stephens
Long-Term On-Board Prediction of People in Traffic Scenes Under Uncertainty
Apratim Bhattacharyya, Mario Fritz, Bernt Schiele
Single-Shot Refinement Neural Network for Object Detection
Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li
Video Captioning via Hierarchical Reinforcement Learning
Xin Wang, Wenhu Chen, Jiawei Wu, Yuan-Fang Wang, William Yang Wang
Tips and Tricks for Visual Question Answering: Learnings From the 2017 Challenge
Damien Teney, Peter Anderson, Xiaodong He, Anton van den Hengel
Learning to Segment Every Thing
Ronghang Hu, Piotr Dollár, Kaiming He, Trevor Darrell, Ross Girshick
Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
Chao Li, Cheng Deng, Ning Li, Wei Liu, Xinbo Gao, Dacheng Tao
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries
Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton van den Hengel
Zigzag Learning for Weakly Supervised Object Detection
Xiaopeng Zhang, Jiashi Feng, Hongkai Xiong, Qi Tian
Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification
Wenguan Wang, Yuanlu Xu, Jianbing Shen, Song-Chun Zhu
Generalized Zero-Shot Learning via Synthesized Examples
Vinay Kumar Verma, Gundeep Arora, Ashish Mishra, Piyush Rai
Partially Shared Multi-Task Convolutional Neural Network With Local Constraint for Face Attribute Learning
Jiajiong Cao, Yingming Li, Zhongfei Zhang
SYQ: Learning Symmetric Quantization for Efficient Deep Neural Networks
Julian Faraone, Nicholas Fraser, Michaela Blott, Philip H.W. Leong
DS*: Tighter Lifting-Free Convex Relaxations for Quadratic Matching Problems
Florian Bernard, Christian Theobalt, Michael Moeller
Deep Mutual Learning
Ying Zhang, Tao Xiang, Timothy M. Hospedales, Huchuan Lu
Coupled End-to-End Transfer Learning With Generalized Fisher Information
Shixing Chen, Caojin Zhang, Ming Dong
Residual Parameter Transfer for Deep Domain Adaptation
Artem Rozantsev, Mathieu Salzmann, Pascal Fua
High-Order Tensor Regularization With Application to Attribute Ranking
Kwang In Kim, Juhyun Park, James Tompkin
Learning to Localize Sound Source in Visual Scenes
Arda Senocak, Tae-Hyun Oh, Junsik Kim, Ming-Hsuan Yang, In So Kweon
Dynamic Few-Shot Visual Learning Without Forgetting
Spyros Gidaris, Nikos Komodakis
Two-Step Quantization for Low-Bit Neural Networks
Peisong Wang, Qinghao Hu, Yifan Zhang, Chunjie Zhang, Yang Liu, Jian Cheng
Improved Lossy Image Compression With Priming and Spatially Adaptive Bit Rates for Recurrent Networks
Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh, Troy Chinen, Sung Jin Hwang, Joel Shor, George Toderici
Conditional Probability Models for Deep Image Compression
Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc Van Gool
Deep Diffeomorphic Transformer Networks
Nicki Skafte Detlefsen, Oren Freifeld, Søren Hauberg
The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks
Maxim Berman, Amal Rannen Triki, Matthew B. Blaschko
Generative Adversarial Perturbations
Omid Poursaeed, Isay Katsman, Bicheng Gao, Serge Belongie
Learning Strict Identity Mappings in Deep Residual Networks
Xin Yu, Zhiding Yu, Srikumar Ramalingam
Geometric Robustness of Deep Networks: Analysis and Improvement
Can Kanbak, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard
View Extrapolation of Human Body From a Single Image
Hao Zhu, Hao Su, Peng Wang, Xun Cao, Ruigang Yang
Geometry Aware Constrained Optimization Techniques for Deep Learning
Soumava Kumar Roy, Zakaria Mhammedi, Mehrtash Harandi
PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition
Mikaela Angelina Uy, Gim Hee Lee
An Efficient and Provable Approach for Mixture Proportion Estimation Using Linear Independence Assumption
Xiyu Yu, Tongliang Liu, Mingming Gong, Kayhan Batmanghelich, Dacheng Tao
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Yin Zhou, Oncel Tuzel
Image to Image Translation for Domain Adaptation
Zak Murez, Soheil Kolouri, David Kriegman, Ravi Ramamoorthi, Kyungnam Kim
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen
Im2Struct: Recovering 3D Shape Structure From a Single RGB Image
Chengjie Niu, Jun Li, Kai Xu
Trust Your Model: Light Field Depth Estimation With Inline Occlusion Handling
Hendrik Schilling, Maximilian Diebold, Carsten Rother, Bernd Jähne
Baseline Desensitizing in Translation Averaging
Bingbing Zhuang, Loong-Fah Cheong, Gim Hee Lee
Mining Point Cloud Local Structures by Kernel Correlation and Graph Pooling
Yiru Shen, Chen Feng, Yaoqing Yang, Dong Tian
Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs
Loic Landrieu, Martin Simonovsky
Very Large-Scale Global SfM by Distributed Motion Averaging
Siyu Zhu, Runze Zhang, Lei Zhou, Tianwei Shen, Tian Fang, Ping Tan, Long Quan
ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans
Angela Dai, Daniel Ritchie, Martin Bokeloh, Scott Reed, Jürgen Sturm, Matthias Nießner
Solving the Perspective-2-Point Problem for Flying-Camera Photo Composition
Ziquan Lan, David Hsu, Gim Hee Lee
Reflection Removal for Large-Scale 3D Point Clouds
Jae-Seong Yun, Jae-Young Sim
Attentional ShapeContextNet for Point Cloud Recognition
Saining Xie, Sainan Liu, Zeyu Chen, Zhuowen Tu
Geometry-Aware Deep Network for Single-Image Novel View Synthesis
Miaomiao Liu, Xuming He, Mathieu Salzmann
InverseFaceNet: Deep Monocular Inverse Face Rendering
Hyeongwoo Kim, Michael Zollhöfer, Ayush Tewari, Justus Thies, Christian Richardt, Christian Theobalt
Sparse Photometric 3D Face Reconstruction Guided by Morphable Models
Xuan Cao, Zhang Chen, Anpei Chen, Xin Chen, Shiying Li, Jingyi Yu
Texture Mapping for 3D Reconstruction With RGB-D Sensor
Yanping Fu, Qingan Yan, Long Yang, Jie Liao, Chunxia Xiao
Learning Less Is More - 6D Camera Localization via 3D Surface Regression
Eric Brachmann, Carsten Rother
Feature Mapping for Learning Fast and Accurate 3D Pose Inference From Synthetic Images
Mahdi Rad, Markus Oberweger, Vincent Lepetit
Indoor RGB-D Compass From a Single Line and Plane
Pyojin Kim, Brian Coltin, H. Jin Kim
Geometry-Aware Network for Non-Rigid Shape Prediction From a Single View
Albert Pumarola, Antonio Agudo, Lorenzo Porzi, Alberto Sanfeliu, Vincent Lepetit, Francesc Moreno-Noguer
Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control
Fereshteh Sadeghi, Alexander Toshev, Eric Jang, Sergey Levine
DocUNet: Document Image Unwarping via a Stacked U-Net
Ke Ma, Zhixin Shu, Xue Bai, Jue Wang, Dimitris Samaras
Analysis of Hand Segmentation in the Wild
Aisha Urooj, Ali Borji
RoadTracer: Automatic Extraction of Road Networks From Aerial Images
Favyen Bastani, Songtao He, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Sam Madden, David DeWitt
Alternating-Stereo VINS: Observability Analysis and Performance Evaluation
Mrinal K. Paul, Stergios I. Roumeliotis
Soccer on Your Tabletop
Konstantinos Rematas, Ira Kemelmacher-Shlizerman, Brian Curless, Steve Seitz
EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth From Light Field Images
Changha Shin, Hae-Gon Jeon, Youngjin Yoon, In So Kweon, Seon Joo Kim
A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping
Zhetong Liang, Jun Xu, David Zhang, Zisheng Cao, Lei Zhang
Deeply Learned Filter Response Functions for Hyperspectral Reconstruction
Shijie Nie, Lin Gu, Yinqiang Zheng, Antony Lam, Nobutaka Ono, Imari Sato
CRRN: Multi-Scale Guided Concurrent Reflection Removal Network
Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot
Single Image Reflection Separation With Perceptual Losses
Xuaner Zhang, Ren Ng, Qifeng Chen
A Robust Method for Strong Rolling Shutter Effects Correction Using Lines With Automatic Feature Selection
Yizhen Lao, Omar Ait-Aider
Time-Resolved Light Transport Decomposition for Thermal Photometric Stereo
Kenichiro Tanaka, Nobuhiro Ikeya, Tsuyoshi Takatani, Hiroyuki Kubo, Takuya Funatomi, Yasuhiro Mukaigawa
Efficient Diverse Ensemble for Discriminative Co-Tracking
Kourosh Meshgi, Shigeyuki Oba, Shin Ishii
Rolling Shutter and Radial Distortion Are Features for High Frame Rate Multi-Camera Tracking
Akash Bapat, True Price, Jan-Michael Frahm
A Twofold Siamese Network for Real-Time Object Tracking
Anfeng He, Chong Luo, Xinmei Tian, Wenjun Zeng
Multi-Cue Correlation Filters for Robust Visual Tracking
Ning Wang, Wengang Zhou, Qi Tian, Richang Hong, Meng Wang, Houqiang Li
Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking
Qiang Wang, Zhu Teng, Junliang Xing, Jin Gao, Weiming Hu, Stephen Maybank
SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation
Xiao Wang, Chenglong Li, Bin Luo, Jin Tang
High-Speed Tracking With Multi-Kernel Correlation Filters
Ming Tang, Bin Yu, Fan Zhang, Jinqiao Wang
Occlusion Aware Unsupervised Learning of Optical Flow
Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu
Revisiting Video Saliency: A Large-Scale Benchmark and a New Model
Wenguan Wang, Jianbing Shen, Fang Guo, Ming-Ming Cheng, Ali Borji
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
Feng Li, Cheng Tian, Wangmeng Zuo, Lei Zhang, Ming-Hsuan Yang
Multimodal Visual Concept Learning With Weakly Supervised Techniques
Giorgos Bouritsas, Petros Koutras, Athanasia Zlatintsi, Petros Maragos
Efficient Large-Scale Approximate Nearest Neighbor Search on OpenCL FPGA
Jialiang Zhang, Soroosh Khoram, Jing Li
Learning a Complete Image Indexing Pipeline
Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka, Philip Tran, Ryan Soklaski, Arjun Majumdar
Fooling Vision and Language Models Despite Localization and Attention Mechanism
Xiaojun Xu, Xinyun Chen, Chang Liu, Anna Rohrbach, Trevor Darrell, Dawn Song
Categorizing Concepts With Basic Level for Vision-to-Language
Hanzhang Wang, Hanli Wang, Kaisheng Xu
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Aniruddha Kembhavi
Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation
Jiwoon Ahn, Suha Kwak
From Lifestyle Vlogs to Everyday Interactions
David F. Fouhey, Wei-cheng Kuo, Alexei A. Efros, Jitendra Malik
Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation
Naoto Inoue, Ryosuke Furuta, Toshihiko Yamasaki, Kiyoharu Aizawa
RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews From Unsupervised Viewpoints
Asako Kanezaki, Yasuyuki Matsushita, Yoshifumi Nishida
An End-to-End TextSpotter With Explicit Alignment and Attention
Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun
WILDTRACK: A Multi-Camera HD Dataset for Dense Unscripted Pedestrian Detection
Tatjana Chavdarova, Pierre Baqué, Stéphane Bouquet, Andrii Maksai, Cijo Jose, Timur Bagautdinov, Louis Lettry, Pascal Fua, Luc Van Gool, François Fleuret
Direct Shape Regression Networks for End-to-End Face Alignment
Xin Miao, Xiantong Zhen, Xianglong Liu, Cheng Deng, Vassilis Athitsos, Heng Huang
Natural and Effective Obfuscation by Head Inpainting
Qianru Sun, Liqian Ma, Seong Joon Oh, Luc Van Gool, Bernt Schiele, Mario Fritz
3D Semantic Trajectory Reconstruction From 3D Pixel Continuum
Jae Shin Yoon, Ziwei Li, Hyun Soo Park
Optimizing Filter Size in Convolutional Neural Networks for Facial Action Unit Recognition
Shizhong Han, Zibo Meng, Zhiyuan Li, James O'Reilly, Jie Cai, Xiaofeng Wang, Yan Tong
V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation From a Single Depth Map
Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee
Ring Loss: Convex Feature Normalization for Face Recognition
Yutong Zheng, Dipan K. Pal, Marios Savvides
Adversarially Occluded Samples for Person Re-Identification
Houjing Huang, Dangwei Li, Zhang Zhang, Xiaotang Chen, Kaiqi Huang
Classifier Learning With Prior Probabilities for Facial Action Unit Recognition
Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji
4DFAB: A Large Scale 4D Database for Facial Expression Analysis and Biometric Applications
Shiyang Cheng, Irene Kotsia, Maja Pantic, Stefanos Zafeiriou
Seeing Small Faces From Robust Anchor's Perspective
Chenchen Zhu, Ran Tao, Khoa Luu, Marios Savvides
2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning
Diogo C. Luvizon, David Picard, Hedi Tabia
Dense 3D Regression for Hand Pose Estimation
Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao
Camera Style Adaptation for Person Re-Identification
Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, Yi Yang
PoseTrack: A Benchmark for Human Pose Estimation and Tracking
Mykhaylo Andriluka, Umar Iqbal, Eldar Insafutdinov, Leonid Pishchulin, Anton Milan, Juergen Gall, Bernt Schiele
Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning
Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang
Pose-Robust Face Recognition via Deep Residual Equivariant Mapping
Kaidi Cao, Yu Rong, Cheng Li, Xiaoou Tang, Chen Change Loy
DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation
Jiang Liu, Chenqiang Gao, Deyu Meng, Alexander G. Hauptmann
LSTM Pose Machines
Yue Luo, Jimmy Ren, Zhouxia Wang, Wenxiu Sun, Jinshan Pan, Jianbo Liu, Jiahao Pang, Liang Lin
Disentangling Features in 3D Face Shapes for Joint Face Reconstruction and Recognition
Feng Liu, Ronghang Zhu, Dan Zeng, Qijun Zhao, Xiaoming Liu
Convolutional Sequence to Sequence Model for Human Dynamics
Chen Li, Zhen Zhang, Wee Sun Lee, Gim Hee Lee
Gesture Recognition: Focus on the Hands
Pradyumna Narayana, Ross Beveridge, Bruce A. Draper
Crowd Counting via Adversarial Cross-Scale Consistency Pursuit
Zan Shen, Yi Xu, Bingbing Ni, Minsi Wang, Jianguo Hu, Xiaokang Yang
3D Human Pose Estimation in the Wild by Adversarial Learning
Wei Yang, Wanli Ouyang, Xiaolong Wang, Jimmy Ren, Hongsheng Li, Xiaogang Wang
CosFace: Large Margin Cosine Loss for Deep Face Recognition
Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, Wei Liu
Encoding Crowd Interaction With Deep Neural Network for Pedestrian Trajectory Prediction
Yanyu Xu, Zhixin Piao, Shenghua Gao
Mean-Variance Loss for Deep Age Estimation From a Face
Hongyu Pan, Hu Han, Shiguang Shan, Xilin Chen
Probabilistic Joint Face-Skull Modelling for Facial Reconstruction
Dennis Madsen, Marcel Lüthi, Andreas Schneider, Thomas Vetter
Learning Latent Super-Events to Detect Multiple Activities in Videos
AJ Piergiovanni, Michael S. Ryoo
Temporal Hallucinating for Action Recognition With Few Still Images
Yali Wang, Lei Zhou, Yu Qiao
Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition
Yansong Tang, Yi Tian, Jiwen Lu, Peiyang Li, Jie Zhou
Gaze Prediction in Dynamic 360° Immersive Videos
Yanyu Xu, Yanbing Dong, Junru Wu, Zhengzhong Sun, Zhiru Shi, Jingyi Yu, Shenghua Gao
When Will You Do What? - Anticipating Temporal Occurrences of Activities
Yazan Abu Farha, Alexander Richard, Juergen Gall
Fusing Crowd Density Maps and Visual Object Trackers for People Tracking in Crowd Scenes
Weihong Ren, Di Kang, Yandong Tang, Antoni B. Chan
Dual Attention Matching Network for Context-Aware Feature Sequence Based Person Re-Identification
Jianlou Si, Honggang Zhang, Chun-Guang Li, Jason Kuen, Xiangfei Kong, Alex C. Kot, Gang Wang
Easy Identification From Better Constraints: Multi-Shot Person Re-Identification From Reference Constraints
Jiahuan Zhou, Bing Su, Ying Wu
Crowd Counting With Deep Negative Correlation Learning
Zenglin Shi, Le Zhang, Yun Liu, Xiaofeng Cao, Yangdong Ye, Ming-Ming Cheng, Guoyan Zheng
Human Appearance Transfer
Mihai Zanfir, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu
Domain Generalization With Adversarial Feature Learning
Haoliang Li, Sinno Jialin Pan, Shiqi Wang, Alex C. Kot
Pyramid Stereo Matching Network
Jia-Ren Chang, Yong-Sheng Chen
Event-Based Vision Meets Deep Learning on Steering Prediction for Self-Driving Cars
Ana I. Maqueda, Antonio Loquercio, Guillermo Gallego, Narciso García, Davide Scaramuzza
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu, Wei-Lun Chao, Fei Sha
Good View Hunting: Learning Photo Composition From Dense View Pairs
Zijun Wei, Jianming Zhang, Xiaohui Shen, Zhe Lin, Radomír Měch, Minh Hoai, Dimitris Samaras
CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise
Kuang-Huei Lee, Xiaodong He, Lei Zhang, Linjun Yang
Independently Recurrent Neural Network (IndRNN): Building a Longer and Deeper RNN
Shuai Li, Wanqing Li, Chris Cook, Ce Zhu, Yanbo Gao
Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation
Yaxing Wang, Joost van de Weijer, Luis Herranz
Structured Uncertainty Prediction Networks
Garoe Dorta, Sara Vicente, Lourdes Agapito, Neill D. F. Campbell, Ivor Simpson
Between-Class Learning for Image Classification
Yuji Tokozume, Yoshitaka Ushiku, Tatsuya Harada
Adversarial Feature Augmentation for Unsupervised Domain Adaptation
Riccardo Volpi, Pietro Morerio, Silvio Savarese, Vittorio Murino
Generative Image Inpainting With Contextual Attention
Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang
CSGNet: Neural Shape Parser for Constructive Solid Geometry
Gopal Sharma, Rishabh Goyal, Difan Liu, Evangelos Kalogerakis, Subhransu Maji
Conditional Image-to-Image Translation
Jianxin Lin, Yingce Xia, Tao Qin, Zhibo Chen, Tie-Yan Liu
Continuous Relaxation of MAP Inference: A Nonconvex Perspective
D. Khuê Lê-Huu, Nikos Paragios
Feature Generating Networks for Zero-Shot Learning
Yongqin Xian, Tobias Lorenz, Bernt Schiele, Zeynep Akata
Joint Optimization Framework for Learning With Noisy Labels
Daiki Tanaka, Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
Convolutional Image Captioning
Jyoti Aneja, Aditya Deshpande, Alexander G. Schwing
AON: Towards Arbitrarily-Oriented Text Recognition
Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, Shiliang Pu, Shuigeng Zhou
Wrapped Gaussian Process Regression on Riemannian Manifolds
Anton Mallasto, Aasa Feragen
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning
Chuang Gan, Boqing Gong, Kun Liu, Hao Su, Leonidas J. Guibas
DiverseNet: When One Right Answer Is Not Enough
Michael Firman, Neill D. F. Campbell, Lourdes Agapito, Gabriel J. Brostow
Deep Face Detector Adaptation Without Negative Transfer or Catastrophic Forgetting
Muhammad Abdullah Jamal, Haoxiang Li, Boqing Gong
Analyzing Filters Toward Efficient ConvNet
Takumi Kobayashi
Regularizing Deep Networks by Modeling and Predicting Label Structure
Mohammadreza Mostajabi, Michael Maire, Gregory Shakhnarovich
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle, Brian Price, Scott Cohen, Christopher Kanan
DA-GAN: Instance-Level Image Translation by Deep Attention Generative Adversarial Networks
Shuang Ma, Jianlong Fu, Chang Wen Chen, Tao Mei
Unsupervised Learning of Depth and Ego-Motion From Monocular Video Using 3D Geometric Constraints
Reza Mahjourian, Martin Wicke, Anelia Angelova
FOTS: Fast Oriented Text Spotting With a Unified Network
Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan
Mobile Video Object Detection With Temporally-Aware Feature Maps
Mason Liu, Menglong Zhu
Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network
Fang Zhao, Jianshu Li, Jian Zhao, Jiashi Feng
Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking
Filip Radenović, Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao, Hexiang Hu, Fei Sha
Globally Optimal Inlier Set Maximization for Atlanta Frame Estimation
Kyungdon Joo, Tae-Hyun Oh, In So Kweon, Jean-Charles Bazin
End-to-End Convolutional Semantic Embeddings
Quanzeng You, Zhengyou Zhang, Jiebo Luo
Referring Image Segmentation via Recurrent Refinement Networks
Ruiyu Li, Kaican Li, Yi-Chun Kuo, Michelle Shu, Xiaojuan Qi, Xiaoyong Shen, Jiaya Jia
Two Can Play This Game: Visual Dialog With Discriminative Question Generation and Answering
Unnat Jain, Svetlana Lazebnik, Alexander G. Schwing
Generative Adversarial Learning Towards Fast Weakly Supervised Detection
Yunhan Shen, Rongrong Ji, Shengchuan Zhang, Wangmeng Zuo, Yan Wang
A Deeper Look at Power Normalizations
Piotr Koniusz, Hongguang Zhang, Fatih Porikli
Dimensionality's Blessing: Clustering Images by Underlying Distribution
Wen-Yan Lin, Siying Liu, Jian-Huang Lai, Yasuyuki Matsushita
Eliminating Background-Bias for Robust Person Re-Identification
Maoqing Tian, Shuai Yi, Hongsheng Li, Shihua Li, Xuesen Zhang, Jianping Shi, Junjie Yan, Xiaogang Wang
Learning to Evaluate Image Captioning
Yin Cui, Guandao Yang, Andreas Veit, Xun Huang, Serge Belongie
Single-Shot Object Detection With Enriched Semantics
Zhishuai Zhang, Siyuan Qiao, Cihang Xie, Wei Shen, Bo Wang, Alan L. Yuille
Low-Shot Learning With Imprinted Weights
Hang Qi, Matthew Brown, David G. Lowe
Neural Motifs: Scene Graph Parsing With Global Context
Rowan Zellers, Mark Yatskar, Sam Thomson, Yejin Choi
Variational Autoencoders for Deforming 3D Mesh Models
Qingyang Tan, Lin Gao, Yu-Kun Lai, Shihong Xia
Fast Monte-Carlo Localization on Aerial Vehicles Using Approximate Continuous Belief Representations
Aditya Dhawale, Kumar Shaurya Shankar, Nathan Michael
DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map
Peng Wang, Ruigang Yang, Binbin Cao, Wei Xu, Yuanqing Lin
LiDAR-Video Driving Dataset: Learning Driving Policies Effectively
Yiping Chen, Jingkang Wang, Jonathan Li, Cewu Lu, Zhipeng Luo, Han Xue, Cheng Wang
Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks
Alexander Sage, Eirikur Agustsson, Radu Timofte, Luc Van Gool
Egocentric Basketball Motion Planning From a Single First-Person Image
Gedas Bertasius, Aaron Chan, Jianbo Shi
Human-Centric Indoor Scene Synthesis Using Stochastic Grammar
Siyuan Qi, Yixin Zhu, Siyuan Huang, Chenfanfu Jiang, Song-Chun Zhu
Rotation-Sensitive Regression for Oriented Scene Text Detection
Minghui Liao, Zhen Zhu, Baoguang Shi, Gui-song Xia, Xiang Bai
Separating Self-Expression and Visual Content in Hashtag Supervision
Andreas Veit, Maximilian Nickel, Serge Belongie, Laurens van der Maaten
Distort-and-Recover: Color Enhancement Using Deep Reinforcement Learning
Jongchan Park, Joon-Young Lee, Donggeun Yoo, In So Kweon
Im2Flow: Motion Hallucination From Static Images for Action Recognition
Ruohan Gao, Bo Xiong, Kristen Grauman
Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos
De-An Huang, Shyamal Buch, Lucio Dery, Animesh Garg, Li Fei-Fei, Juan Carlos Niebles
Actor and Action Video Segmentation From a Sentence
Kirill Gavrilyuk, Amir Ghodrati, Zhenyang Li, Cees G. M. Snoek
Egocentric Activity Recognition on a Budget
Rafael Possas, Sheila Pinto Caceres, Fabio Ramos
CNN in MRF: Video Object Segmentation via Inference in a CNN-Based Higher-Order Spatio-Temporal MRF
Linchao Bao, Baoyuan Wu, Wei Liu
Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints
Alexander Richard, Hilde Kuehne, Juergen Gall
Low-Latency Video Semantic Segmentation
Yule Li, Jianping Shi, Dahua Lin
Fine-Grained Video Captioning for Sports Narrative
Huanyu Yu, Shuo Cheng, Bingbing Ni, Minsi Wang, Jian Zhang, Xiaokang Yang
End-to-End Learning of Motion Representation for Video Understanding
Lijie Fan, Wenbing Huang, Chuang Gan, Stefano Ermon, Boqing Gong, Junzhou Huang
Compressed Video Action Recognition
Chao-Yuan Wu, Manzil Zaheer, Hexiang Hu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl
Features for Multi-Target Multi-Camera Tracking and Re-Identification
Ergys Ristani, Carlo Tomasi
AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions
Chunhui Gu, Chen Sun, David A. Ross, Carl Vondrick, Caroline Pantofaru, Yeqing Li, Sudheendra Vijayanarasimhan, George Toderici, Susanna Ricco, Rahul Sukthankar, Cordelia Schmid, Jitendra Malik
Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination
Hazel Doughty, Dima Damen, Walterio Mayol-Cuevas
MX-LSTM: Mixing Tracklets and Vislets to Jointly Forecast Trajectories and Head Poses
Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Alessio Del Bue, Fabio Galasso, Marco Cristani
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen, Takayuki Okatani
FlipDial: A Generative Model for Two-Way Visual Dialogue
Daniela Massiceti, N. Siddharth, Puneet K. Dokania, Philip H. S. Torr
Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning
Qi Wu, Peng Wang, Chunhua Shen, Ian Reid, Anton van den Hengel
Visual Question Generation as Dual Task of Visual Question Answering
Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, Xiaogang Wang, Ming Zhou
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh, Minh N. Do, Alexander G. Schwing
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang, Lu Jiang, Liangliang Cao, Li-Jia Li, Alexander G. Hauptmann
SeGAN: Segmenting and Generating the Invisible
Kiana Ehsani, Roozbeh Mottaghi, Ali Farhadi
Cascade R-CNN: Delving Into High Quality Object Detection
Zhaowei Cai, Nuno Vasconcelos
Learning Semantic Concepts and Order for Image and Sentence Matching
Yan Huang, Qi Wu, Chunfeng Song, Liang Wang
Functional Map of the World
Gordon Christie, Neil Fendley, James Wilson, Ryan Mukherjee
MegDet: A Large Mini-Batch Object Detector
Chao Peng, Tete Xiao, Zeming Li, Yuning Jiang, Xiangyu Zhang, Kai Jia, Gang Yu, Jian Sun
Learning Globally Optimized Object Detector via Policy Gradient
Yongming Rao, Dahua Lin, Jiwen Lu, Jie Zhou
Photographic Text-to-Image Synthesis With a Hierarchically-Nested Adversarial Network
Zizhao Zhang, Yuanpu Xie, Lin Yang
Illuminant Spectra-Based Source Separation Using Flash Photography
Zhuo Hui, Kalyan Sunkavalli, Sunil Hadap, Aswin C. Sankaranarayanan
Trapping Light for Time of Flight
Ruilin Xu, Mohit Gupta, Shree K. Nayar
The Perception-Distortion Tradeoff
Yochai Blau, Tomer Michaeli
Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Faces
Hao Zhou, Jin Sun, Yaser Yacoob, David W. Jacobs
Optimal Structured Light à La Carte
Parsa Mirdehghan, Wenzheng Chen, Kiriakos N. Kutulakos
Tracking Multiple Objects Outside the Line of Sight Using Speckle Imaging
Brandon M. Smith, Matthew O'Toole, Mohit Gupta
Inferring Light Fields From Shadows
Manel Baradad, Vickie Ye, Adam B. Yedidia, Frédo Durand, William T. Freeman, Gregory W. Wornell, Antonio Torralba
Modifying Non-Local Variations Across Multiple Views
Tal Tlusty, Tomer Michaeli, Tali Dekel, Lihi Zelnik-Manor
Robust Video Content Alignment and Compensation for Rain Removal in a CNN Framework
Jie Chen, Cheen-Hau Tan, Junhui Hou, Lap-Pui Chau, He Li
SfSNet: Learning Shape, Reflectance and Illuminance of Faces 'in the Wild'
Soumyadip Sengupta, Angjoo Kanazawa, Carlos D. Castillo, David W. Jacobs
Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs
Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, Yung-Yu Chuang
LIME: Live Intrinsic Material Estimation
Abhimitra Meka, Maxim Maximov, Michael Zollhöfer, Avishek Chatterjee, Hans-Peter Seidel, Christian Richardt, Christian Theobalt
Learning to Detect Features in Texture Images
Linguang Zhang, Szymon Rusinkiewicz
Learning to Extract a Video Sequence From a Single Motion-Blurred Image
Meiguang Jin, Givi Meishvili, Paolo Favaro
Lose the Views: Limited Angle CT Reconstruction via Implicit Sinogram Completion
Rushil Anirudh, Hyojin Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle Champley, Timo Bremer
A Common Framework for Interactive Texture Transfer
Yifang Men, Zhouhui Lian, Yingmin Tang, Jianguo Xiao
AMNet: Memorability Estimation With Attention
Jiri Fajtl, Vasileios Argyriou, Dorothy Monekosso, Paolo Remagnino
Blind Predicting Similar Quality Map for Image Quality Assessment
Da Pan, Ping Shi, Ming Hou, Zefeng Ying, Sizhe Fu, Yuan Zhang
Deep End-to-End Time-of-Flight Imaging
Shuochen Su, Felix Heide, Gordon Wetzstein, Wolfgang Heidrich
Aperture Supervision for Monocular Depth Estimation
Pratul P. Srinivasan, Rahul Garg, Neal Wadhwa, Ren Ng, Jonathan T. Barron
Seeing Temporal Modulation of Lights From Standard Cameras
Naoki Sakakibara, Fumihiko Sakaue, Jun Sato
Statistical Tomography of Microscopic Life
Aviad Levis, Yoav Y. Schechner, Ronen Talmon
Divide and Conquer for Full-Resolution Light Field Deblurring
M. R. Mahesh Mohan, A. N. Rajagopalan
Multispectral Image Intrinsic Decomposition via Subspace Constraint
Qian Huang, Weixin Zhu, Yang Zhao, Linsen Chen, Yao Wang, Tao Yue, Xun Cao
Improving Color Reproduction Accuracy on Cameras
Hakki Can Karaimer, Michael S. Brown
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun, Manohar Paluri
Inferring Shared Attention in Social Scene Videos
Lifeng Fan, Yixin Chen, Ping Wei, Wenguan Wang, Song-Chun Zhu
Making Convolutional Networks Recurrent for Visual Sequence Learning
Xiaodong Yang, Pavlo Molchanov, Jan Kautz
Real-World Anomaly Detection in Surveillance Videos
Waqas Sultani, Chen Chen, Mubarak Shah
Viewpoint-Aware Attentive Multi-View Inference for Vehicle Re-Identification
Yi Zhou, Ling Shao
Efficient Video Object Segmentation via Network Modulation
Linjie Yang, Yanran Wang, Xuehan Xiong, Jianchao Yang, Aggelos K. Katsaggelos
Weakly-Supervised Action Segmentation With Iterative Soft Boundary Assignment
Li Ding, Chenliang Xu
Depth-Aware Stereo Video Retargeting
Bing Li, Chia-Wen Lin, Boxin Shi, Tiejun Huang, Wen Gao, C.-C. Jay Kuo
Instance Embedding Transfer to Unsupervised Video Object Segmentation
Siyang Li, Bryan Seybold, Alexey Vorobyov, Alireza Fathi, Qin Huang, C.-C. Jay Kuo
Future Frame Prediction for Anomaly Detection – A New Baseline
Wen Liu, Weixin Luo, Dongze Lian, Shenghua Gao
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara, Hirokatsu Kataoka, Yutaka Satoh
Dynamic Video Segmentation Network
Yu-Syuan Xu, Tsu-Jui Fu, Hsuan-Kung Yang, Chun-Yi Lee
Recognize Actions by Disentangling Components of Dynamics
Yue Zhao, Yuanjun Xiong, Dahua Lin
Motion-Appearance Co-Memory Networks for Video Question Answering
Jiyang Gao, Runzhou Ge, Kan Chen, Ram Nevatia
Learning to Understand Image Blur
Shanghang Zhang, Xiaohui Shen, Zhe Lin, Radomír Měch, João P. Costeira, José M. F. Moura
Dense Decoder Shortcut Connections for Single-Pass Semantic Segmentation
Piotr Bilinski, Victor Prisacariu
Generative Adversarial Image Synthesis With Decision Tree Latent Controller
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
Learning a Discriminative Prior for Blind Image Deblurring
Lerenhan Li, Jinshan Pan, Wei-Sheng Lai, Changxin Gao, Nong Sang, Ming-Hsuan Yang
Frame-Recurrent Video Super-Resolution
Mehdi S. M. Sajjadi, Raviteja Vemulapalli, Matthew Brown
Discovering Point Lights With Intensity Distance Fields
Edward Zhang, Michael F. Cohen, Brian Curless
Video Rain Streak Removal by Multiscale Convolutional Sparse Coding
Minghan Li, Qi Xie, Qian Zhao, Wei Wei, Shuhang Gu, Jing Tao, Deyu Meng
Stereoscopic Neural Style Transfer
Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua
Multi-Frame Quality Enhancement for Compressed Video
Ren Yang, Mai Xu, Zulin Wang, Tianyi Li
CNN Based Learning Using Reflection and Retinex Models for Intrinsic Image Decomposition
Anil S. Baslamisli, Hoang-An Le, Theo Gevers
Image Restoration by Estimating Frequency Distribution of Local Patches
Jaeyoung Yoo, Sang-ho Lee, Nojun Kwak
Latent RANSAC
Simon Korman, Roee Litman
Two-Stream Convolutional Networks for Dynamic Texture Synthesis
Matthew Tesfaldet, Marcus A. Brubaker, Konstantinos G. Derpanis
Towards Open-Set Identity Preserving Face Synthesis
Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, Gang Hua
A Revised Underwater Image Formation Model
Derya Akkaynak, Tali Treibitz
Graph-Cut RANSAC
Daniel Barath, Jiří Matas
Temporal Deformable Residual Networks for Action Segmentation in Videos
Peng Lei, Sinisa Todorovic
Weakly Supervised Action Localization by Sparse Temporal Pooling Network
Phuc Nguyen, Ting Liu, Gautam Prasad, Bohyung Han
PoseFlow: A Deep Motion Representation for Understanding Human Behaviors in Videos
Dingwen Zhang, Guangyu Guo, Dong Huang, Junwei Han
FFNet: Video Fast-Forwarding via Reinforcement Learning
Shuyue Lan, Rameswar Panda, Qi Zhu, Amit K. Roy-Chowdhury
Multi-Shot Pedestrian Re-Identification via Sequential Decision Making
Jianfu Zhang, Naiyan Wang, Liqing Zhang
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf
Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks
Ping Wei, Yang Liu, Tianmin Shu, Nanning Zheng, Song-Chun Zhu
Fully Convolutional Adaptation Networks for Semantic Segmentation
Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei
Semantic Video Segmentation by Gated Recurrent Flow Propagation
David Nilsson, Cristian Sminchisescu
Interpretable Video Captioning via Trajectory Structured Localization
Xian Wu, Guanbin Li, Qingxing Cao, Qingge Ji, Liang Lin
Deep Hashing via Discrepancy Minimization
Zhixiang Chen, Xin Yuan, Jiwen Lu, Qi Tian, Jie Zhou
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, Jian Sun
Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs
Xiaolong Wang, Yufei Ye, Abhinav Gupta
Referring Relationships
Ranjay Krishna, Ines Chami, Michael Bernstein, Li Fei-Fei
Improving Object Localization With Fitness NMS and Bounded IoU Loss
Lachlan Tychsen-Smith, Lars Petersson
End-to-End Deep Kronecker-Product Matching for Person Re-Identification
Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang
Semantic Visual Localization
Johannes L. Schönberger, Marc Pollefeys, Andreas Geiger, Torsten Sattler
Objects as Context for Detecting Their Semantic Parts
Abel Gonzalez-Garcia, Davide Modolo, Vittorio Ferrari
End-to-End Weakly-Supervised Semantic Alignment
Ignacio Rocco, Relja Arandjelović, Josef Sivic
Dynamic Zoom-In Network for Fast Object Detection in Large Images
Mingfei Gao, Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis
Learning Markov Clustering Networks for Scene Text Detection
Zichuan Liu, Guosheng Lin, Sheng Yang, Jiashi Feng, Weisi Lin, Wang Ling Goh
Deep Reinforcement Learning of Region Proposal Networks for Object Detection
Aleksis Pirinen, Cristian Sminchisescu
Beyond Holistic Object Recognition: Enriching Image Understanding With Part States
Cewu Lu, Hao Su, Yonglu Li, Yongyi Lu, Li Yi, Chi-Keung Tang, Leonidas J. Guibas
Discriminability Objective for Training Descriptive Captions
Ruotian Luo, Brian Price, Scott Cohen, Gregory Shakhnarovich
Visual Question Answering With Memory-Augmented Networks
Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton van den Hengel, Ian Reid
Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships
Yong Liu, Ruiping Wang, Shiguang Shan, Xilin Chen
Occluded Pedestrian Detection Through Guided Attention in CNNs
Shanshan Zhang, Jian Yang, Bernt Schiele
Reward Learning From Narrated Demonstrations
Hsiao-Yu Tung, Adam W. Harley, Liang-Kang Huang, Katerina Fragkiadaki
Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing
Zilong Huang, Xinggang Wang, Jiasi Wang, Wenyu Liu, Jingdong Wang
PoTion: Pose MoTion Representation for Action Recognition
Vasileios Choutas, Philippe Weinzaepfel, Jérôme Revaud, Cordelia Schmid
Bilateral Ordinal Relevance Multi-Instance Regression for Facial Action Unit Intensity Estimation
Yong Zhang, Rui Zhao, Weiming Dong, Bao-Gang Hu, Qiang Ji
Pulling Actions out of Context: Explicit Separation for Effective Combination
Yang Wang, Minh Hoai
Dynamic Feature Learning for Partial Face Recognition
Lingxiao He, Haiqing Li, Qi Zhang, Zhenan Sun
Exploiting Transitivity for Learning Person Re-Identification Models on a Budget
Sourya Roy, Sujoy Paul, Neal E. Young, Amit K. Roy-Chowdhury
Deep Spatial Feature Reconstruction for Partial Person Re-Identification: Alignment-Free Approach
Lingxiao He, Jian Liang, Haiqing Li, Zhenan Sun
Every Smile Is Unique: Landmark-Guided Diverse Smile Generation
Wei Wang, Xavier Alameda-Pineda, Dan Xu, Pascal Fua, Elisa Ricci, Nicu Sebe
UV-GAN: Adversarial Facial UV Map Completion for Pose-Invariant Face Recognition
Jiankang Deng, Shiyang Cheng, Niannan Xue, Yuxiang Zhou, Stefanos Zafeiriou
Cascaded Pyramid Network for Multi-Person Pose Estimation
Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, Jian Sun
A Face-to-Face Neural Conversation Model
Hang Chu, Daiqing Li, Sanja Fidler
End-to-End Recovery of Human Shape and Pose
Angjoo Kanazawa, Michael J. Black, David W. Jacobs, Jitendra Malik
Squeeze-and-Excitation Networks
Jie Hu, Li Shen, Gang Sun
Revisiting Salient Object Detection: Simultaneous Detection, Ranking, and Subitizing of Multiple Salient Objects
Md Amirul Islam, Mahmoud Kalash, Neil D. B. Bruce
Context Encoding for Semantic Segmentation
Hang Zhang, Kristin Dana, Jianping Shi, Zhongyue Zhang, Xiaogang Wang, Ambrish Tyagi, Amit Agrawal
Creating Capsule Wardrobes From Fashion Images
Wei-Lin Hsiao, Kristen Grauman
Webly Supervised Learning Meets Zero-Shot Learning: A Hybrid Approach for Fine-Grained Classification
Li Niu, Ashok Veeraraghavan, Ashutosh Sabharwal
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval With Generative Models
Jiuxiang Gu, Jianfei Cai, Shafiq R. Joty, Li Niu, Gang Wang
Bidirectional Attentive Fusion With Context Gating for Dense Video Captioning
Jingwen Wang, Wenhao Jiang, Lin Ma, Wei Liu, Yong Xu
InLoc: Indoor Visual Localization With Dense Matching and View Synthesis
Hajime Taira, Masatoshi Okutomi, Torsten Sattler, Mircea Cimpoi, Marc Pollefeys, Josef Sivic, Tomas Pajdla, Akihiko Torii
Towards High Performance Video Object Detection
Xizhou Zhu, Jifeng Dai, Lu Yuan, Yichen Wei
Neural Baby Talk
Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh
Few-Shot Image Recognition by Predicting Parameters From Activations
Siyuan Qiao, Chenxi Liu, Wei Shen, Alan L. Yuille
Iterative Visual Reasoning Beyond Convolutions
Xinlei Chen, Li-Jia Li, Li Fei-Fei, Abhinav Gupta
Visual Question Reasoning on General Dependency Tree
Qingxing Cao, Xiaodan Liang, Bailing Li, Guanbin Li, Liang Lin
CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization
Sixing Hu, Mengdan Feng, Rang M. H. Nguyen, Gim Hee Lee
Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation
Yunchao Wei, Huaxin Xiao, Honghui Shi, Zequn Jie, Jiashi Feng, Thomas S. Huang
Low-Shot Learning From Imaginary Data
Yu-Xiong Wang, Ross Girshick, Martial Hebert, Bharath Hariharan
DoubleFusion: Real-Time Capture of Human Performances With Inner Body Shapes From a Single Depth Sensor
Tao Yu, Zerong Zheng, Kaiwen Guo, Jianhui Zhao, Qionghai Dai, Hao Li, Gerard Pons-Moll, Yebin Liu
DensePose: Dense Human Pose Estimation in the Wild
Rıza Alp Güler, Natalia Neverova, Iasonas Kokkinos
Ordinal Depth Supervision for 3D Human Pose Estimation
Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis
Consensus Maximization for Semantic Region Correspondences
Pablo Speciale, Danda P. Paudel, Martin R. Oswald, Hayko Riemenschneider, Luc Van Gool, Marc Pollefeys
Robust Hough Transform Based 3D Reconstruction From Circular Light Fields
Alessandro Vianello, Jens Ackermann, Maximilian Diebold, Bernd Jähne
Alive Caricature From 2D to 3D
Qianyi Wu, Juyong Zhang, Yu-Kun Lai, Jianmin Zheng, Jianfei Cai
Nonlinear 3D Face Morphable Model
Luan Tran, Xiaoming Liu
Through-Wall Human Pose Estimation Using Radio Signals
Mingmin Zhao, Tianhong Li, Mohammad Abu Alsheikh, Yonglong Tian, Hang Zhao, Antonio Torralba, Dina Katabi
What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets
De-An Huang, Vignesh Ramanathan, Dhruv Mahajan, Lorenzo Torresani, Manohar Paluri, Li Fei-Fei, Juan Carlos Niebles
Fast Video Object Segmentation by Reference-Guided Mask Propagation
Seoung Wug Oh, Joon-Young Lee, Kalyan Sunkavalli, Seon Joo Kim
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
Alexander Richard, Hilde Kuehne, Ahsan Iqbal, Juergen Gall
Actor and Observer: Joint Modeling of First and Third-Person Videos
Gunnar A. Sigurdsson, Abhinav Gupta, Cordelia Schmid, Ali Farhadi, Karteek Alahari
HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization
Bin Zhao, Xuelong Li, Xiaoqiang Lu
Fast and Accurate Online Video Object Segmentation via Tracking Parts
Jingchun Cheng, Yi-Hsuan Tsai, Wei-Chih Hung, Shengjin Wang, Ming-Hsuan Yang
Now You Shake Me: Towards Automatic 4D Cinema
Yuhao Zhou, Makarand Tapaswi, Sanja Fidler
Viewpoint-Aware Video Summarization
Atsushi Kanehira, Luc Van Gool, Yoshitaka Ushiku, Tatsuya Harada
Photometric Stereo in Participating Media Considering Shape-Dependent Forward Scatter
Yuki Fujimura, Masaaki Iiyama, Atsushi Hashimoto, Michihiko Minoh
Direction-Aware Spatial Context Features for Shadow Detection
Xiaowei Hu, Lei Zhu, Chi-Wing Fu, Jing Qin, Pheng-Ann Heng
Discriminative Learning of Latent Features for Zero-Shot Recognition
Yan Li, Junge Zhang, Jianguo Zhang, Kaiqi Huang
Learning to Adapt Structured Output Space for Semantic Segmentation
Yi-Hsuan Tsai, Wei-Chih Hung, Samuel Schulter, Kihyuk Sohn, Ming-Hsuan Yang, Manmohan Chandraker
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Alex Kendall, Yarin Gal, Roberto Cipolla
Jointly Localizing and Describing Events for Dense Video Captioning
Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei
Going From Image to Video Saliency: Augmenting Image Salience With Dynamic Attentional Push
Siavash Gorji, James J. Clark
M3: Multimodal Memory Modelling for Video Captioning
Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan
Emotional Attention: A Study of Image Sentiment and Visual Attention
Shaojing Fan, Zhiqi Shen, Ming Jiang, Bryan L. Koenig, Juan Xu, Mohan S. Kankanhalli, Qi Zhao
A Low Power, High Throughput, Fully Event-Based Stereo System
Alexander Andreopoulos, Hirak J. Kashyap, Tapan K. Nayak, Arnon Amir, Myron D. Flickner
VITON: An Image-Based Virtual Try-On Network
Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry S. Davis
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai
Multi-Content GAN for Few-Shot Font Style Transfer
Samaneh Azadi, Matthew Fisher, Vladimir G. Kim, Zhaowen Wang, Eli Shechtman, Trevor Darrell
Audio to Body Dynamics
Eli Shlizerman, Lucio Dery, Hayden Schoen, Ira Kemelmacher-Shlizerman
Weakly Supervised Coupled Networks for Visual Sentiment Analysis
Jufeng Yang, Dongyu She, Yu-Kun Lai, Paul L. Rosin, Ming-Hsuan Yang
Future Person Localization in First-Person Videos
Takuma Yagi, Karttikeya Mangalam, Ryo Yonetani, Yoichi Sato
Preserving Semantic Relations for Zero-Shot Learning
Yashas Annadani, Soma Biswas
Show Me a Story: Towards Coherent Neural Story Illustration
Hareesh Ravi, Lezi Wang, Carlos Muniz, Leonid Sigal, Dimitris Metaxas, Mubbasir Kapadia
Reconstruction Network for Video Captioning
Bairui Wang, Lin Ma, Wei Zhang, Wei Liu
Fast Spectral Ranking for Similarity Search
Ahmet Iscen, Yannis Avrithis, Giorgos Tolias, Teddy Furon, Ondřej Chum
Mining on Manifolds: Metric Learning Without Labels
Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum
PIXOR: Real-Time 3D Object Detection From Point Clouds
Bin Yang, Wenjie Luo, Raquel Urtasun
Leveraging Unlabeled Data for Crowd Counting by Learning to Rank
Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov
Zero-Shot Kernel Learning
Hongguang Zhang, Piotr Koniusz
Differential Attention for Visual Question Answering
Badri Patro, Vinay P. Namboodiri
Learning From Noisy Web Data With Category-Level Supervision
Li Niu, Qingtao Tang, Ashok Veeraraghavan, Ashutosh Sabharwal
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning
Vasili Ramanishka, Yi-Ting Chen, Teruhisa Misu, Kate Saenko
Learning Attribute Representations With Localization for Flexible Fashion Search
Kenan E. Ak, Ashraf A. Kassim, Joo Hwee Lim, Jo Yew Tham
Bidirectional Retrieval Made Simple
Jônatas Wehrmann, Rodrigo C. Barros
Learning Multi-Instance Enriched Image Representations via Non-Greedy Ratio Maximization of the l1-Norm Distances
Kai Liu, Hua Wang, Feiping Nie, Hao Zhang
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su, Chen Zhu, Yinpeng Dong, Dongqi Cai, Yurong Chen, Jianguo Li
Visual Grounding via Accumulated Attention
Chaorui Deng, Qi Wu, Qingyao Wu, Fuyuan Hu, Fan Lyu, Mingkui Tan
Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy
Guanglu Song, Yu Liu, Ming Jiang, Yujie Wang, Junjie Yan, Biao Leng
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
Arun Mallya, Svetlana Lazebnik
Repulsion Loss: Detecting Pedestrians in a Crowd
Xinlong Wang, Tete Xiao, Yuning Jiang, Shuai Shao, Jian Sun, Chunhua Shen
Neural Sign Language Translation
Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Hermann Ney, Richard Bowden
Non-Local Neural Networks
Xiaolong Wang, Ross Girshick, Abhinav Gupta, Kaiming He
LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers
Lorenzo Baraldi, Matthijs Douze, Rita Cucchiara, Hervé Jégou
Optimizing Video Object Detection via a Scale-Time Lattice
Kai Chen, Jiaqi Wang, Shuo Yang, Xingcheng Zhang, Yuanjun Xiong, Chen Change Loy, Dahua Lin
Learning Compressible 360° Video Isomers
Yu-Chuan Su, Kristen Grauman
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long, Chuang Gan, Gerard de Melo, Jiajun Wu, Xiao Liu, Shilei Wen
What Have We Learned From Deep Representations for Action Recognition?
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes, Andrew Zisserman
Controllable Video Generation With Sparse Trajectories
Zekun Hao, Xun Huang, Serge Belongie
Representing and Learning High Dimensional Data With the Optimal Transport Map From a Probabilistic Viewpoint
Serim Park, Matthew Thorpe
CLIP-Q: Deep Network Compression Learning by In-Parallel Pruning-Quantization
Frederick Tung, Greg Mori
Inference in Higher Order MRF-MAP Problems With Small and Large Cliques
Ishant Shanu, Chetan Arora, S.N. Maheshwari
ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes
Yuhua Chen, Wen Li, Luc Van Gool
Eye In-Painting With Exemplar Generative Adversarial Networks
Brian Dolhansky, Cristian Canton Ferrer
ClcNet: Improving the Efficiency of Convolutional Neural Network Using Channel Local Convolutions
Dong-Qing Zhang
Towards Effective Low-Bitwidth Convolutional Neural Networks
Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
Jason Kuen, Xiangfei Kong, Zhe Lin, Gang Wang, Jianxiong Yin, Simon See, Yap-Peng Tan
Face Aging With Identity-Preserved Conditional Generative Adversarial Networks
Zongwei Wang, Xu Tang, Weixin Luo, Shenghua Gao
Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns
Jianming Lv, Weihang Chen, Qing Li, Can Yang
Feature Quantization for Defending Against Distortion of Images
Zhun Sun, Mete Ozay, Yan Zhang, Xing Liu, Takayuki Okatani
Tagging Like Humans: Diverse and Distinct Image Annotation
Baoyuan Wu, Weidong Chen, Peng Sun, Wei Liu, Bernard Ghanem, Siwei Lyu
Re-Weighted Adversarial Adaptation Network for Unsupervised Domain Adaptation
Qingchao Chen, Yang Liu, Zhaowen Wang, Ian Wassell, Kevin Chetty
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Seunghoon Hong, Dingdong Yang, Jongwook Choi, Honglak Lee
Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present
Xinpeng Chen, Lin Ma, Wenhao Jiang, Jian Yao, Wei Liu
Unsupervised Domain Adaptation With Similarity Learning
Pedro O. Pinheiro
Learning Deep Sketch Abstraction
Umar Riaz Muhammad, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
Matching Adversarial Networks
Gellért Máttyus, Raquel Urtasun
SoS-RSC: A Sum-of-Squares Polynomial Approach to Robustifying Subspace Clustering Algorithms
Mario Sznaier, Octavia Camps
Resource Aware Person Re-Identification Across Multiple Resolutions
Yan Wang, Lequn Wang, Yurong You, Xu Zou, Vincent Chen, Serena Li, Gao Huang, Bharath Hariharan, Kilian Q. Weinberger
Learning and Using the Arrow of Time
Donglai Wei, Joseph J. Lim, Andrew Zisserman, William T. Freeman
Neural Style Transfer via Meta Networks
Falong Shen, Shuicheng Yan, Gang Zeng
People, Penguins and Petri Dishes: Adapting Object Counting Models to New Visual Domains and Object Types Without Forgetting
Mark Marsden, Kevin McGuinness, Suzanne Little, Ciara E. Keogh, Noel E. O'Connor
HydraNets: Specialized Dynamic Architectures for Efficient Inference
Ravi Teja Mullapudi, William R. Mark, Noam Shazeer, Kayvon Fatahalian
SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval
Peng Xu, Yongye Huang, Tongtong Yuan, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo
From Source to Target and Back: Symmetric Bi-Directional Adaptive GAN
Paolo Russo, Fabio M. Carlucci, Tatiana Tommasi, Barbara Caputo
OLÉ: Orthogonal Low-Rank Embedding - A Plug and Play Geometric Loss for Deep Learning
José Lezama, Qiang Qiu, Pablo Musé, Guillermo Sapiro
Efficient Parametrization of Multi-Domain Deep Neural Networks
Sylvestre-Alvise Rebuffi, Hakan Bilen, Andrea Vedaldi
Deep Density Clustering of Unconstrained Faces
Wei-An Lin, Jun-Cheng Chen, Carlos D. Castillo, Rama Chellappa
Geometric Multi-Model Fitting With a Convex Relaxation Algorithm
Paul Amayo, Pedro Piniés, Lina M. Paz, Paul Newman
Fast and Robust Estimation for Unit-Norm Constrained Linear Fitting Problems
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
Importance Weighted Adversarial Nets for Partial Domain Adaptation
Jing Zhang, Zewei Ding, Wanqing Li, Philip Ogunbona
Efficient Subpixel Refinement With Symbolic Linear Predictors
Vincent Lui, Jonathon Geeves, Winston Yii, Tom Drummond
Scale-Recurrent Network for Deep Image Deblurring
Xin Tao, Hongyun Gao, Xiaoyong Shen, Jue Wang, Jiaya Jia
DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks
Orest Kupyn, Volodymyr Budzan, Mykola Mykhailych, Dmytro Mishkin, Jiří Matas
A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping
Debang Li, Huikai Wu, Junge Zhang, Kaiqi Huang
Single Image Dehazing via Conditional Generative Adversarial Network
Runde Li, Jinshan Pan, Zechao Li, Jinhui Tang
On the Duality Between Retinex and Image Dehazing
Adrian Galdran, Aitor Alvarez-Gila, Alessandro Bria, Javier Vazquez-Corral, Marcelo Bertalmío
Arbitrary Style Transfer With Deep Feature Reshuffle
Shuyang Gu, Congliang Chen, Jing Liao, Lu Yuan
Nonlocal Low-Rank Tensor Factor Analysis for Image Restoration
Xinyuan Zhang, Xin Yuan, Lawrence Carin
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration
Lu Sheng, Ziyi Lin, Jing Shao, Xiaogang Wang
Missing Slice Recovery for Tensors Using a Low-Rank Model in Embedded Space
Tatsuya Yokota, Burak Erem, Seyhmus Guler, Simon K. Warfield, Hidekata Hontani
Deep Semantic Face Deblurring
Ziyi Shen, Wei-Sheng Lai, Tingfa Xu, Jan Kautz, Ming-Hsuan Yang
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning
Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie Zhou
Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation
Qihang Yu, Lingxi Xie, Yan Wang, Yuyin Zhou, Elliot K. Fishman, Alan L. Yuille
Thoracic Disease Identification and Localization With Limited Supervision
Zhe Li, Chong Wang, Mei Han, Yuan Xue, Wei Wei, Li-Jia Li, Li Fei-Fei
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation
Xiaowei Xu, Qing Lu, Lin Yang, Sharon Hu, Danny Chen, Yu Hu, Yiyu Shi
Visual Feature Attribution Using Wasserstein GANs
Christian F. Baumgartner, Lisa M. Koch, Kerem Can Tezcan, Jia Xi Ang, Ender Konukoglu
Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies
Hanbyul Joo, Tomas Simon, Yaser Sheikh
Augmented Skeleton Space Transfer for Depth-Based Hand Pose Estimation
Seungryul Baek, Kwang In Kim, Tae-Kyun Kim
Synthesizing Images of Humans in Unseen Poses
Guha Balakrishnan, Amy Zhao, Adrian V. Dalca, Frédo Durand, John Guttag
SSNet: Scale Selection Network for Online 3D Action Prediction
Jun Liu, Amir Shahroudy, Gang Wang, Ling-Yu Duan, Alex C. Kot
Detecting and Recognizing Human-Object Interactions
Georgia Gkioxari, Ross Girshick, Piotr Dollár, Kaiming He
Unsupervised Learning and Segmentation of Complex Activities From Video
Fadime Sener, Angela Yao
Unsupervised Training for 3D Morphable Model Regression
Kyle Genova, Forrester Cole, Aaron Maschinot, Aaron Sarna, Daniel Vlasic, William T. Freeman
Video Based Reconstruction of 3D People Models
Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll
Pose-Guided Photorealistic Face Rotation
Yibo Hu, Xiang Wu, Bing Yu, Ran He, Zhenan Sun
Mesoscopic Facial Geometry Inference Using Deep Neural Networks
Loc Huynh, Weikai Chen, Shunsuke Saito, Jun Xing, Koki Nagano, Andrew Jones, Paul Debevec, Hao Li
Hand PointNet: 3D Hand Pose Estimation Using Point Sets
Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan
Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching
Arsha Nagrani, Samuel Albanie, Andrew Zisserman
Learning Monocular 3D Human Pose Estimation From Multi-View Images
Helge Rhodin, Jörg Spörri, Isinsu Katircioglu, Victor Constantin, Frédéric Meyer, Erich Müller, Mathieu Salzmann, Pascal Fua
Separating Style and Content for Generalized Style Transfer
Yexun Zhang, Ya Zhang, Wenbin Cai
TextureGAN: Controlling Deep Image Synthesis With Texture Patches
Wenqi Xian, Patsorn Sangkloy, Varun Agrawal, Amit Raj, Jingwan Lu, Chen Fang, Fisher Yu, James Hays
Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images
Tribhuvanesh Orekondy, Mario Fritz, Bernt Schiele
MapNet: An Allocentric Spatial Memory for Mapping Environments
João F. Henriques, Andrea Vedaldi
Accurate and Diverse Sampling of Sequences Based on a “Best of Many” Sample Objective
Apratim Bhattacharyya, Bernt Schiele, Mario Fritz
VirtualHome: Simulating Household Activities via Programs
Xavier Puig, Kevin Ra, Marko Boben, Jiaman Li, Tingwu Wang, Sanja Fidler, Antonio Torralba
Generate to Adapt: Aligning Domains Using Generative Adversarial Networks
Swami Sankaranarayanan, Yogesh Balaji, Carlos D. Castillo, Rama Chellappa
Multi-Agent Diverse Generative Adversarial Networks
Arnab Ghosh, Viveka Kulharia, Vinay P. Namboodiri, Philip H.S. Torr, Puneet K. Dokania
A PID Controller Approach for Stochastic Optimization of Deep Networks
Wangpeng An, Haoqian Wang, Qingyun Sun, Jun Xu, Qionghai Dai, Lei Zhang
“Learning-Compression” Algorithms for Neural Net Pruning
Miguel Á. Carreira-Perpiñán, Yerlan Idelbayev
Large-Scale Distance Metric Learning With Uncertainty
Qi Qian, Jiasheng Tang, Hao Li, Shenghuo Zhu, Rong Jin
Guide Me: Interacting With Deep Networks
Christian Rupprecht, Iro Laina, Nassir Navab, Gregory D. Hager, Federico Tombari
Art of Singular Vectors and Universal Adversarial Perturbations
Valentin Khrulkov, Ivan Oseledets
Deflecting Adversarial Attacks With Pixel Deflection
Aaditya Prakash, Nick Moran, Solomon Garber, Antonella DiLillo, James Storer
MovieGraphs: Towards Understanding Human-Centric Situations From Videos
Paul Vicol, Makarand Tapaswi, Lluís Castrejón, Sanja Fidler
SemStyle: Learning to Generate Stylised Image Captions Using Unaligned Text
Alexander Mathews, Lexing Xie, Xuming He
Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions
Torsten Sattler, Will Maddern, Carl Toft, Akihiko Torii, Lars Hammarstrand, Erik Stenborg, Daniel Safari, Masatoshi Okutomi, Marc Pollefeys, Josef Sivic, Fredrik Kahl, Tomas Pajdla
iVQA: Inverse Visual Question Answering
Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun
Unsupervised Person Image Synthesis in Arbitrary Poses
Albert Pumarola, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer
Learning Descriptor Networks for 3D Shape Synthesis and Analysis
Jianwen Xie, Zilong Zheng, Ruiqi Gao, Wenguan Wang, Song-Chun Zhu, Ying Nian Wu
Neural Kinematic Networks for Unsupervised Motion Retargetting
Ruben Villegas, Jimei Yang, Duygu Ceylan, Honglak Lee
Group Consistent Similarity Learning via Deep CRF for Person Re-Identification
Dapeng Chen, Dan Xu, Hongsheng Li, Nicu Sebe, Xiaogang Wang
Learning Compositional Visual Concepts With Mutual Consistency
Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Peter C. Doerschuk
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks
Eunwoo Kim, Chanho Ahn, Songhwai Oh
Context Embedding Networks
Kun Ho Kim, Oisin Mac Aodha, Pietro Perona
Iterative Learning With Open-Set Noisy Labels
Yisen Wang, Weiyang Liu, Xingjun Ma, James Bailey, Hongyuan Zha, Le Song, Shu-Tao Xia
Learning Transferable Architectures for Scalable Image Recognition
Barret Zoph, Vijay Vasudevan, Jonathon Shlens, Quoc V. Le
SBNet: Sparse Blocks Network for Fast Inference
Mengye Ren, Andrei Pokrovsky, Bin Yang, Raquel Urtasun
Language-Based Image Editing With Recurrent Attentive Models
Jianbo Chen, Yelong Shen, Jianfeng Gao, Jingjing Liu, Xiaodong Liu
Net2Vec: Quantifying and Explaining How Concepts Are Encoded by Filters in Deep Neural Networks
Ruth Fong, Andrea Vedaldi
End-to-End Dense Video Captioning With Masked Transformer
Luowei Zhou, Yingbo Zhou, Jason J. Corso, Richard Socher, Caiming Xiong
A Neural Multi-Sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan, Boyang Li, Leonid Sigal, Markus Gross
Path Aggregation Network for Instance Segmentation
Shu Liu, Lu Qi, Haifang Qin, Jianping Shi, Jiaya Jia
The iNaturalist Species Classification and Detection Dataset
Grant Van Horn, Oisin Mac Aodha, Yang Song, Yin Cui, Chen Sun, Alex Shepard, Hartwig Adam, Pietro Perona, Serge Belongie
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Marcus Rohrbach
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul Choo
High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro
Semi-Parametric Image Synthesis
Xiaojuan Qi, Qifeng Chen, Jiaya Jia, Vladlen Koltun
BlockDrop: Dynamic Inference Paths in Residual Networks
Zuxuan Wu, Tushar Nagarajan, Abhishek Kumar, Steven Rennie, Larry S. Davis, Kristen Grauman, Rogerio Feris
Interpretable Convolutional Neural Networks
Quanshi Zhang, Ying Nian Wu, Song-Chun Zhu
Deep Cross-Media Knowledge Transfer
Xin Huang, Yuxin Peng
Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie, Jingdong Wang, Ting Zhang, Jianhuang Lai, Richang Hong, Guo-Jun Qi
A Variational U-Net for Conditional Appearance and Shape Generation
Patrick Esser, Ekaterina Sutter, Björn Ommer
Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation
Yen-Cheng Liu, Yu-Ying Yeh, Tzu-Chien Fu, Sheng-De Wang, Wei-Chen Chiu, Yu-Chiang Frank Wang
Learning Deep Structured Active Contours End-to-End
Diego Marcos, Devis Tuia, Benjamin Kellenberger, Lisa Zhang, Min Bai, Renjie Liao, Raquel Urtasun
Deep Learning Under Privileged Information Using Heteroscedastic Dropout
John Lambert, Ozan Sener, Silvio Savarese
Smooth Neighbors on Teacher Graphs for Semi-Supervised Learning
Yucen Luo, Jun Zhu, Mengxi Li, Yong Ren, Bo Zhang
Interpret Neural Networks by Identifying Critical Data Routing Paths
Yulong Wang, Hang Su, Bo Zhang, Xiaolin Hu
Deep Spatio-Temporal Random Fields for Efficient Video Segmentation
Siddhartha Chandra, Camille Couprie, Iasonas Kokkinos
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin, Yoshitaka Ushiku, Tatsuya Harada
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz
Revisiting Deep Intrinsic Image Decompositions
Qingnan Fan, Jiaolong Yang, Gang Hua, Baoquan Chen, David Wipf
Multi-Cell Detection and Classification Using a Generative Convolutional Model
Florence Yellin, Benjamin D. Haeffele, Sophie Roth, René Vidal
Learning Spatial-Aware Regressions for Visual Tracking
Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang
High Performance Visual Tracking With Siamese Region Proposal Network
Bo Li, Junjie Yan, Wei Wu, Zheng Zhu, Xiaolin Hu
LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation
Tak-Wai Hui, Xiaoou Tang, Chen Change Loy
VITAL: VIsual Tracking via Adversarial Learning
Yibing Song, Chao Ma, Xiaohe Wu, Lijun Gong, Linchao Bao, Wangmeng Zuo, Chunhua Shen, Rynson W.H. Lau, Ming-Hsuan Yang
Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation
Huaizu Jiang, Deqing Sun, Varun Jampani, Ming-Hsuan Yang, Erik Learned-Miller, Jan Kautz
Real-World Repetition Estimation by Div, Grad and Curl
Tom F. H. Runia, Cees G. M. Snoek, Arnold W. M. Smeulders
Recurrent Pixel Embedding for Instance Grouping
Shu Kong, Charless C. Fowlkes
Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective
Jing Zhang, Tong Zhang, Yuchao Dai, Mehrtash Harandi, Richard Hartley
Learning Intrinsic Image Decomposition From Watching the World
Zhengqi Li, Noah Snavely
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-Rays
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Ronald M. Summers
Generating Synthetic X-Ray Images of a Person From the Surface Geometry
Brian Teixeira, Vivek Singh, Terrence Chen, Kai Ma, Birgi Tamersoy, Yifan Wu, Elena Balashova, Dorin Comaniciu
Gibson Env: Real-World Perception for Embodied Agents
Fei Xia, Amir R. Zamir, Zhiyang He, Alexander Sax, Jitendra Malik, Silvio Savarese
Reinforcement Cutting-Agent Learning for Video Object Segmentation
Junwei Han, Le Yang, Dingwen Zhang, Xiaojun Chang, Xiaodan Liang
Feature Space Transfer for Data Augmentation
Bo Liu, Xudong Wang, Mandar Dixit, Roland Kwitt, Nuno Vasconcelos
Analytic Expressions for Probabilistic Moments of PL-DNN With Gaussian Input
Adel Bibi, Modar Alfadly, Bernard Ghanem
Detail-Preserving Pooling in Deep Networks
Faraz Saeedan, Nicolas Weber, Michael Goesele, Stefan Roth
Rethinking Feature Distribution for Loss Functions in Image Classification
Weitao Wan, Yuanyi Zhong, Tianpeng Li, Jiansheng Chen
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Bichen Wu, Alvin Wan, Xiangyu Yue, Peter Jin, Sicheng Zhao, Noah Golmant, Amir Gholaminejad, Joseph Gonzalez, Kurt Keutzer
Sketch-a-Classifier: Sketch-Based Photo Classifier Generation
Conghui Hu, Da Li, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
Light Field Intrinsics With a Deep Encoder-Decoder Network
Anna Alperovich, Ole Johannsen, Michael Strecke, Bastian Goldluecke
Learning Generative ConvNets via Multi-Grid Modeling and Sampling
Ruiqi Gao, Yang Lu, Junpei Zhou, Song-Chun Zhu, Ying Nian Wu
Manifold Learning in Quotient Spaces
Éloi Mehr, André Lieutier, Fernando Sanchez Bermudez, Vincent Guitteny, Nicolas Thome, Matthieu Cord
Learning Intelligent Dialogs for Bounding Box Annotation
Ksenia Konyushkova, Jasper Uijlings, Christoph H. Lampert, Vittorio Ferrari
Boosting Adversarial Attacks With Momentum
Yinpeng Dong, Fangzhou Liao, Tianyu Pang, Hang Su, Jun Zhu, Xiaolin Hu, Jianguo Li
NISP: Pruning Networks Using Neuron Importance Score Propagation
Ruichi Yu, Ang Li, Chun-Fu Chen, Jui-Hsin Lai, Vlad I. Morariu, Xintong Han, Mingfei Gao, Ching-Yung Lin, Larry S. Davis
PointGrid: A Deep Network for 3D Shape Understanding
Truc Le, Ye Duan
Tell Me Where to Look: Guided Attention Inference Network
Kunpeng Li, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Yun Fu
3D Semantic Segmentation With Submanifold Sparse Convolutional Networks
Benjamin Graham, Martin Engelcke, Laurens van der Maaten
TOM-Net: Learning Transparent Object Matting From a Single Image
Guanying Chen, Kai Han, Kwan-Yee K. Wong
Translating and Segmenting Multimodal Medical Volumes With Cycle- and Shape-Consistency Generative Adversarial Network
Zizhao Zhang, Lin Yang, Yefeng Zheng
An Unsupervised Learning Model for Deformable Medical Image Registration
Guha Balakrishnan, Amy Zhao, Mert R. Sabuncu, John Guttag, Adrian V. Dalca
Deep Lesion Graphs in the Wild: Relationship Learning and Organization of Significant Radiology Image Findings in a Diverse Large-Scale Lesion Database
Ke Yan, Xiaosong Wang, Le Lu, Ling Zhang, Adam P. Harrison, Mohammadhadi Bagheri, Ronald M. Summers
Learning Distributions of Shape Trajectories From Longitudinal Datasets: A Hierarchical Model on a Manifold of Diffeomorphisms
Alexandre Bône, Olivier Colliot, Stanley Durrleman
CNN Driven Sparse Multi-Level B-Spline Image Registration
Pingge Jiang, James A. Shackleford
Anatomical Priors in Convolutional Networks for Unsupervised Biomedical Segmentation
Adrian V. Dalca, John Guttag, Mert R. Sabuncu
3D Registration of Curves and Surfaces Using Local Differential Information
Carolina Raposo, João P. Barreto
Weakly Supervised Learning of Single-Cell Feature Embeddings
Juan C. Caicedo, Claire McQuin, Allen Goodman, Shantanu Singh, Anne E. Carpenter
Guided Proofreading of Automatic Segmentations for Connectomics
Daniel Haehn, Verena Kaynig, James Tompkin, Jeff W. Lichtman, Hanspeter Pfister
Wide Compression: Tensor Ring Nets
Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet Aggarwal
Improvements to Context Based Self-Supervised Learning
T. Nathan Mundhenk, Daniel Ho, Barry Y. Chen
Learning Structure and Strength of CNN Filters for Small Sample Size Training
Rohit Keshari, Mayank Vatsa, Richa Singh, Afzel Noore
Boosting Self-Supervised Learning via Knowledge Transfer
Mehdi Noroozi, Ananth Vinjimoor, Paolo Favaro, Hamed Pirsiavash
The Power of Ensembles for Active Learning in Image Classification
William H. Beluch, Tim Genewein, Andreas Nürnberger, Jan M. Köhler
Learning Compact Recurrent Neural Networks With Block-Term Tensor Decomposition
Jinmian Ye, Linnan Wang, Guangxi Li, Di Chen, Shandian Zhe, Xinqi Chu, Zenglin Xu
Spatially-Adaptive Filter Units for Deep Neural Networks
Domen Tabernik, Matej Kristan, Aleš Leonardis
SO-Net: Self-Organizing Network for Point Cloud Analysis
Jiaxin Li, Ben M. Chen, Gim Hee Lee
SGAN: An Alternative Training of Generative Adversarial Networks
Tatjana Chavdarova, François Fleuret
SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis
Wengling Chen, James Hays
Explicit Loss-Error-Aware Quantization for Low-Bit Deep Neural Networks
Aojun Zhou, Anbang Yao, Kuan Wang, Yurong Chen
Towards Universal Representation for Unseen Action Recognition
Yi Zhu, Yang Long, Yu Guan, Shawn Newsam, Ling Shao
Deep Image Prior
Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing
Chen-Hsuan Lin, Ersin Yumer, Oliver Wang, Eli Shechtman, Simon Lucey
CartoonGAN: Generative Adversarial Networks for Photo Cartoonization
Yang Chen, Yu-Kun Lai, Yong-Jin Liu