Sessions

Sessions

Oral sessions
1A - Learning for Vision 1September 10, 08:30 AM
78Convolutional Networks with Adaptive Computation GraphsAndreas Veit*, Cornell University; Serge Belongie, Cornell University
128Progressive Neural Architecture SearchChenxi Liu*, Johns Hopkins University; Maxim Neumann, Google; Barret Zoph, Google; Jon Shlens, Google; Wei Hua, Google; Li-Jia Li, Google; Li Fei-Fei, Stanford University; Alan Yuille, Johns Hopkins University; Jonathan Huang, Google; Kevin Murphy, Google
153Diverse Image-to-Image Translation via Disentangled RepresentationsHsin-Ying Lee*, University of California, Merced; Hung-Yu Tseng, University of California, Merced; Maneesh Singh, Verisk Analytics; Jia-Bin Huang, Virginia Tech; Ming-Hsuan Yang, University of California at Merced
691Lifting Layers: Analysis and ApplicationsMichael Moeller*, University of Siegen; Peter Ochs, Saarland University; Tim Meinhardt, Technical University of Munich; Laura Leal-Taixé, TUM
2070Learning with Biased Complementary LabelsXiyu Yu*, The University of Sydney; Tongliang Liu, The University of Sydney; Mingming Gong, University of Pittsburgh; Dacheng Tao, University of Sydney
1B - Computational Photography 1September 10, 01:00 PM
255Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based ModelingHiroaki Santo*, Osaka University; Michael Waechter, Osaka University; Masaki Samejima, Osaka University; Yusuke Sugano, Osaka University; Yasuyuki Matsushita, Osaka University
84Programmable Light CurtainsJian Wang*, Carnegie Mellon University; Joe Bartels, Carnegie Mellon University; William Whittaker, Carnegie Mellon University; Aswin Sankaranarayanan, Carnegie Mellon University; Srinivasa Narasimhan, Carnegie Mellon University
999Learning to Separate Object Sounds by Watching Unlabeled VideoRuohan Gao*, University of Texas at Austin; Rogerio Feris, IBM Research; Kristen Grauman, University of Texas
1717Coded Two-Bucket Cameras for Computer VisionMian Wei, University of Toronto; Navid Navid Sarhangnejad, University of Toronto; Zhengfan Xia, University of Toronto; Nikola Katic, University of Toronto; Roman Genov, University of Toronto; Kyros Kutulakos*, University of Toronto
2455Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone ImageZhengqin Li*, UC San Diego; Manmohan Chandraker, UC San Diego; Sunkavalli Kalyan, Adobe Research
1C - VideoSeptember 10, 02:45 PM
123End-to-End Joint Semantic Segmentation of Actors and Actions in VideoJingwei Ji*, Stanford University; Shyamal Buch, Stanford University; Alvaro Soto, Universidad Catolica de Chile; Juan Carlos Niebles, Stanford University
12Learning-based Video Motion MagnificationTae-Hyun Oh, MIT CSAIL; Ronnachai Jaroensri*, MIT CSAIL; Changil Kim, MIT CSAIL; Mohamed A. Elghareb, Qatar Computing Research Institute; Fredo Durand, MIT; Bill Freeman, MIT; Wojciech Matusik, Adobe
368Massively Parallel Video NetworksViorica Patraucean*, DeepMind; Joao Carreira, DeepMind; Laurent Mazare, DeepMind; Simon Osindero, DeepMind; Andrew Zisserman, University of Oxford
1507DeepWrinkles: Accurate and Realistic Clothing ModelingZorah Laehner, TU Munich; Tony Tung*, Facebook / Oculus Research; Daniel Cremers, TUM
1538Learning Discriminative Video Representations Using Adversarial PerturbationsJue Wang*, ANU; Anoop Cherian, MERL
2A - Humans analysis 1September 11, 08:30 AM
1101Scaling Egocentric Vision: The E-Kitchens DatasetDima Damen*, University of Bristol; Hazel Doughty, University of Bristol; Sanja Fidler, University of Toronto; Antonino Furnari, University of Catania; Evangelos Kazakos, University of Bristol; Giovanni Farinella, University of Catania, Italy; Davide Moltisanti, University of Bristol; Jonathan Munro, University of Bristol; Toby Perrett, University of Bristol; Will Price, University of Bristol; Michael Wray, University of Bristol
1233Unsupervised Person Re-identification by Deep Learning Tracklet AssociationMinxian Li*, Nanjing University and Science and Technology; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
1321Predicting Gaze in Egocentric Video by Learning Task-dependent Attention TransitionYifei Huang*, The University of Tokyo
1458Instance-level Human Parsing via Part Grouping NetworkKe Gong*, SYSU; Xiaodan Liang, Carnegie Mellon University; Yicheng Li, Sun Yat-sen University; Yimin Chen, sensetime; Liang Lin, Sun Yat-sen University
2709Adversarial Geometry-Aware Human Motion PredictionLiangyan Gui*, Carnegie Mellon University; Yu-Xiong Wang, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University
2B – Human Sensing ISeptember 11, 01:00 PM
1456Weakly-supervised 3D Hand Pose Estimation from Monocular RGB ImagesYujun Cai*, Nanyang Technological University; Liuhao Ge, NTU; Jianfei Cai, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
121Audio-Visual Scene Analysis with Self-Supervised Multisensory FeaturesAndrew Owens*, UC Berkeley; Alexei Efros, UC Berkeley
1403Jointly Discovering Visual Objects and Spoken Words from Raw Sensory InputDavid Harwath*, MIT CSAIL; Adria Recasens, Massachusetts Institute of Technology; Dídac Surís, Universitat Politecnica de Catalunya; Galen Chuang, MIT; Antonio Torralba, MIT; James Glass, MIT
1903DeepIM: Deep Iterative Matching for 6D Pose EstimationYi Li*, Tsinghua University; Gu Wang, Tsinghua University; Xiangyang Ji, Tsinghua University; Yu Xiang, University of Michigan; Dieter Fox, University of Washington
2662Implicit 3D Orientation Learning for 6D Object Detection from RGB ImagesMartin Sundermeyer*, German Aerospace Center (DLR); Zoltan Marton, DLR; Maximilian Durner, DLR; Rudolph Triebel, German Aerospace Center (DLR)
2C – Computational Photograpy 2September 11, 02:45 PM
1772Direct Sparse Odometry With Rolling ShutterDavid Schubert*, Technical University of Munich; Vladyslav Usenko, TU Munich; Nikolaus Demmel, TUM; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM
7993D Motion Sensing from 4D Light Field GradientsSizhuo Ma*, University of Wisconsin-Madison; Brandon Smith, University of Wisconsin-Madison; Mohit Gupta, University of Wisconsin-Madison, USA
2150A Style-aware Content Loss for Real-time HD Style TransferArtsiom Sanakoyeu*, Heidelberg University; Dmytro Kotovenko, Heidelberg University; Bjorn Ommer, Heidelberg University
2174Scale-Awareness of Light Field Camera based Visual OdometryNiclas Zeller*, Karlsruhe University of Applied Sciences; Franz Quint, Karlsruhe University of Applied Sciences; Uwe Stilla, Technische Universitaet Muenchen
2225Burst Image Deblurring Using Permutation Invariant Convolutional Neural NetworksMiika Aittala*, MIT; Fredo Durand, MIT
3A – Stereo and reconstructionSeptember 12, 08:30 AM
885MVSNet: Depth Inference for Unstructured Multi-view StereoYao Yao*, The Hong Kong University of Science and Technology; Zixin Luo, HKUST; Shiwei Li, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
184PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D RegistrationYifei Shi, Princeton University; Kai Xu, Princeton University and National University of Defense Technology; Matthias Niessner, Technical University of Munich; Szymon Rusinkiewicz, Princeton University; Thomas Funkhouser*, Princeton, USA
1530Active Stereo Net: End-to-End Self-Supervised Learning for Active Stereo SystemsYinda Zhang*, Princeton University; Sean Fanello, Google; Sameh Khamis, Google; Christoph Rhemann, Google; Julien Valentin, Google; Adarsh Kowdle, Google; Vladimir Tankovich, Google; Shahram Izadi, Google; Thomas Funkhouser, Princeton, USA
1591GAL: Geometric Adversarial Loss for Single-View 3D-Object ReconstructionLi Jiang*, The Chinese University of Hong Kong; Xiaojuan Qi, CUHK; Shaoshuai SHI, The Chinese University of Hong Kong; Jia Jiaya, Chinese University of Hong Kong
1811Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse OdometryNan Yang*, Technical University of Munich; Rui Wang, Technical University of Munich; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM
3B - Human Sensing IISeptember 12, 01:00 PM
366Unsupervised Geometry-Aware Representation for 3D Human Pose EstimationHelge Rhodin*, EPFL; Mathieu Salzmann, EPFL; Pascal Fua, EPFL, Switzerland
1061Dual-Agent Deep Reinforcement Learning for Deformable Face TrackingMinghao Guo, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
2147Deep Autoencoder for Combined Human Pose Estimation and Body Model UpscalingMatthew Trumble*, University of Surrey; Andrew Gilbert, University of Surrey; John Collomosse, Adobe Research; Adrian Hilton, University of Surrey
2189Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density NetworkQi Ye*, Imperial College London; Tae-Kyun Kim, Imperial College London
2463GANimation: Anatomically-aware Facial Animation from a Single ImageAlbert Pumarola*, Institut de Robotica i Informatica Industrial; Antonio Agudo, Institut de Robotica i Informatica Industrial, CSIC-UPC; Aleix Martinez, The Ohio State University; Alberto Sanfeliu, Industrial Robotics Institute; Francesc Moreno, IRI
3C - OptimizationSeptember 12, 04:00 PM
685Deterministic Consensus Maximization with Biconvex ProgrammingZhipeng Cai*, The University of Adelaide; Tat-Jun Chin, University of Adelaide; Huu Le, University of Adelaide; David Suter, University of Adelaide
1479Robust fitting in computer vision: easy or hard?Tat-Jun Chin*, University of Adelaide; Zhipeng Cai, The University of Adelaide; Frank Neumann, The University of Adelaide, School of Computer Science, Faculty of Engineering, Computer and Mathematical Science
1929Highly-Economized Multi-View Binary Compression for Scalable Image ClusteringZheng Zhang*, Harbin Institute of Technology Shenzhen Graduate School; Li Liu, the inception institute of artificial intelligence; Jie Qin, ETH Zurich; Fan Zhu, the inception institute of artificial intelligence ; Fumin Shen, UESTC; Yong Xu, Harbin Institute of Technology Shenzhen Graduate School; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC)
2039Efficient Semantic Scene Completion Network with Spatial Group ConvolutionJiahui Zhang*, Tsinghua University; Hao Zhao, Intel Labs China; Anbang Yao, Intel Labs China; Yurong Chen, Intel Labs China; Hongen Liao, Tsinghua University
2441Asynchronous, Photometric Feature Tracking using Events and FramesDaniel Gehrig, University of Zurich; Henri Rebecq*, University of Zurich; Guillermo Gallego, University of Zurich; Davide Scaramuzza, University of Zurich& ETH Zurich, Switzerland
4A - Learning for Vision 2September 13, 08:30 AM
172Group NormalizationYuxin Wu, Facebook; Kaiming He*, Facebook Inc., USA
2486Deep Expander Networks: Efficient Deep Networks from Graph TheoryAmeya Prabhu*, IIIT Hyderabad; Girish Varma, IIIT Hyderabad; Anoop Namboodiri, IIIT Hyderbad
3056Towards Realistic PredictorsPei Wang*, UC San Diego; Nuno Vasconcelos, UC San Diego
3134Learning SO(3) Equivariant Representations with Spherical CNNsCarlos Esteves*, University of Pennsylvania; Kostas Daniilidis, University of Pennsylvania; Ameesh Makadia, Google Research; Christine Allec-Blanchette, University of Pennsylvania
4B - Matching and RecognitionSeptember 13, 01:00 PM
160CornerNet: Detecting Objects as Paired KeypointsHei Law*, University of Michigan; Jia Deng, University of Michigan
720RelocNet: Continous Metric Learning Relocalisation using Neural NetsVassileios Balntas*, University of Oxford; Victor Prisacariu, University of Oxford; Shuda Li, University of Oxford
897The Contextual Loss for Image Transformation with Non-Aligned DataRoey Mechrez*, Technion; Itamar Talmi, Technion; Lihi Zelnik-Manor, Technion
1118Acquisition of Localization Confidence for Accurate Object DetectionBorui Jiang*, Peking University; Ruixuan Luo, Peking University; Jiayuan Mao, Tsinghua University; Tete Xiao, Peking University; Yuning Jiang, Megvii(Face++) Inc
1276Deep Model-Based 6D Pose Refinement in RGBFabian Manhardt*, TU Munich; Wadim Kehl, Toyota Research Institute; Nassir Navab, Technische Universität München, Germany; Federico Tombari, Technical University of Munich, Germany
4C - Video and attentionSeptember 13, 02:45 PM
2996DeepTAM: Deep Tracking and MappingHuizhong Zhou*, University of Freiburg; Benjamin Ummenhofer, University of Freiburg; Thomas Brox, University of Freiburg
427ContextVP: Fully Context-Aware Video PredictionWonmin Byeon*, NVIDIA; Qin Wang, ETH Zurich; Rupesh Kumar Srivastava, NNAISENSE; Petros Koumoutsakos, ETH Zurich
1846Saliency Benchmarking Made Easy: Separating Models, Maps and MetricsMatthias Kümmerer*, University of Tübingen; Thomas Wallis, University of Tübingen; Matthias Bethge, University of Tübingen
2118Museum Exhibit Identification Challenge for the Supervised Domain Adaptation.Piotr Koniusz*, Data61/CSIRO, ANU; Yusuf Tas, Data61; Hongguang Zhang, Australian National University; Mehrtash Harandi, Monash University; Fatih Porikli, ANU; Rui Zhang, University of Canberra
2343Multi-Attention Multi-Class Constraint for Fine-grained Image RecognitionMing Sun, baidu; Yuchen Yuan, Baidu Inc.; Feng Zhou*, Baidu Research; Errui Ding, Baidu Inc.

Poster sessions
1ASeptember 10, 10:00 AM
2935ECO: Efficient Convolutional Network for Online Video UnderstandingMohammadreza Zolfaghari*, University of Freiburg; kamaljeet singh, University of Freiburg; Thomas Brox, University of Freiburg
831Learning to Anonymize Faces for Privacy Preserving Action DetectionZhongzheng Ren*, University of California, Davis; Yong Jae Lee, University of California, Davis; Michael Ryoo, Indiana University
1893Adversarial Open-World Person Re-IdentificationXiang Li, Sun Yat-sen University; Ancong Wu, Sun Yat-sen University; Jason Wei Shi Zheng*, Sun Yat Sen University
1021Graph R-CNN for Scene Graph GenerationJianwei Yang*, Georgia Institute of Technology; Jiasen Lu, Georgia Institute of Technology; Stefan Lee, Georgia Institute of Technology; Dhruv Batra, Georgia Tech & Facebook AI Research; Devi Parikh, Georgia Tech & Facebook AI Research
2418Contemplating Visual Emotions: Understanding and Overcoming Dataset BiasRameswar Panda*, UC Riverside; Jianming Zhang, Adobe Research; Haoxiang Li, Adobe; Joon-Young Lee, Adobe Research; Xin Lu, Adobe; Amit Roy-Chowdhury , University of California, Riverside, USA
1491Graph Adaptive Knowledge Transfer for Unsupervised Domain AdaptationZhengming Ding*, Northeastern University; Sheng Li, Adobe Research; Ming Shao, University of Massachusetts Dartmouth; YUN FU, Northeastern University
2443Deep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial NetworksSiyeong Lee, Sogang University; Gwon Hwan An, Sogang University; Suk-Ju Kang*, Nil
1057Deep Cross-Modal Projection Learning for Image-Text MatchingYing Zhang*, Dalian University of Technology; Huchuan Lu, Dalian University of Technology
2324Composition Loss for Counting, Density Map Estimation and Localization in Dense CrowdsHaroon Idrees*, Carnegie Mellon University; Muhammad Tayyab, UCF; Kishan Athrey, UCF; Mubarak Shah, University of Central Florida; Dong Zhang, University of Central Florida, USA
774Person Search by Multi-Scale MatchingXu Lan*, Queen Mary University of London; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
2136Efficient 6-DoF Tracking of Handheld Objects from an Egocentric ViewpointRohit Pandey, Google; Pavel Pidlypenskyi, Google; Shuoran Yang, Google; Christine Kaeser-Chen*, Google
2001Deep Video Generation, Prediction and Completion of Human Action SequencesChunyan Bai, Hong Kong University of Science and Technology; Haoye Cai*, Hong Kong University of Science and Technology; Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
748Efficient Uncertainty Estimation for Semantic Segmentation in VideosPo-Yu Huang*, National Tsing Hua University; Wan-Ting Hsu, National Tsing Hua University; Chun-Yueh Chiu, National Tsing Hua University; Tingfan Wu, Umbo Computer Vision; Min Sun, NTHU
2444DeepKSPD: Learning Kernel-matrix-based SPD Representation for Fine-grained Image RecognitionMelih Engin, university of wollongong; Lei Wang*, University of Wollongong, Australia; Luping Zhou, University of Wollongong, Australia; Xinwang Liu, National University of Defense Technology
3085From Face Recognition to Models of Identity: A Bayesian Approach to Learning about Unknown Identities from Unsupervised DataDaniel Castro*, Imperial College London; Sebastian Nowozin, Microsoft Research Cambridge
1288ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object StackingOliver Groth*, Oxford Robotics Insitute; Fabian Fuchs, Oxford Robotics Insitute; Andrea Vedaldi, Oxford University; Ingmar Posner, Oxford
2906Fast and Precise Camera Covariance Computation for Large 3D ReconstructionMichal Polic*, Czech Technical University in Prague; Wolfgang Foerstner, University Bonn; Tomas Pajdla, Czech Technical University in Prague
1305Inner Space Preserving Generative Pose MachineShuangjun Liu, Northeastern University; Sarah Ostadabbas*, Northeastern University
1526CTAP: Complementary Temporal Action Proposal GenerationJiyang Gao*, USC; Kan Chen, University of Southern California, USA; Ram Nevatia, U of Southern California
821Learning to Reenact Faces via Boundary TransferWayne Wu, SenseTime Research; Yunxuan Zhang, sensetime research; Cheng Li*, SenseTime Research; Chen Qian, SenseTime; Chen Change Loy, Chinese University of Hong Kong
623Fast and Accurate Intrinsic Symmetry DetectionRajendra Nagar*, Indian Institute of Technology Gandhinagar; Shanmuganathan Raman, IIT Gandhinagar
66Fictitious GAN: Training GANs with Historical ModelsYin Xia*, Northwestern University; Xu Chen, Northwestern University; Hao Ge, Northwestern University; Ying Wu, Northwestern University; Randall Berry, Northwestern University
1818Audio-Visual Event Localization in Unconstrained VideosYapeng Tian*, University of Rochester; Jing Shi, University of Rochester; Bochen Li, University of Rochester; Zhiyao Duan, Unversity of Rochester; Chenliang Xu, University of Rochester
530Tackling 3D ToF Artifacts Through Learning and the FLAT DatasetQi Guo, Harvard University; Iuri Frosio*, NVIDIA; Orazio Gallo, NVIDIA Research; Todd Zickler, Harvard University; Kautz Jan, NVIDIA
352Self-Calibrating Isometric Non-Rigid Structure-from-Motionshaifali parashar*, CNRS; Adrien Bartoli, Université Clermont Auvergne; Daniel Pizarro, Universidad de Alcala
388Semi-Supervised Deep Learning with MemoryYanbei Chen*, Queen Mary University of London; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
660Question-Guided Hybrid Convolution for Visual Question Answeringgao peng*, Chinese university of hong kong; Hongsheng Li, Chinese University of Hong Kong; Shuang Li, The Chinese University of Hong Kong; Pan Lu, Tsinghua University; Yikang LI, The Chinese University of Hong Kong; Steven Hoi, SMU; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
2193Rolling Shutter Pose and Ego-motion Estimation using Shape-from-TemplateYizhen Lao*, Université Clermont Auvergne; Omar Ait-Aider, Université Clermont Auvergne; Adrien Bartoli, Université Clermont Auvergne
345Semi-Dense 3D Reconstruction with a Stereo Event CameraYi Zhou*, The Australian National University; Guillermo Gallego, University of Zurich; Henri Rebecq, University of Zurich; Laurent Kneip, ShanghaiTech University; HONGDONG LI, Australian National University, Australia; Davide Scaramuzza, University of Zurich& ETH Zurich, Switzerland
2187Local Orthogonal-Group TestingAhmet Iscen*, Czech Technical University; Ondrej Chum, Vision Recognition Group, Czech Technical University in Prague
1400Temporal Relational Reasoning in VideosBolei Zhou*, MIT; Alex Andonian, Massachusetts Institute of Technology; Aude Oliva, MIT; Antonio Torralba, MIT
1558Deep High Dynamic Range Imaging with Large Foreground MotionsShangzhe Wu*, HKUST; Jiarui Xu, Hong Kong University of Science and Technology (HKUST); Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
682Geometric Constrained Joint Lane Segmentation and Lane Boundary DetectionJie Zhang*, Shanghai Jiao Tong University; Yi Xu, Shanghai Jiao Tong University; Bingbing Ni, Shanghai Jiao Tong University; Zhenyu Duan, Shanghai Jiao Tong University
130Attributes as OperatorsTushar Nagarajan*, UT Austin; Kristen Grauman, University of Texas
2397Textual Explanations for Self-Driving VehiclesJinkyu Kim*, UC Berkeley; Anna Rohrbach, UC Berkeley; Trevor Darrell, UC Berkeley; John Canny, UC Berkeley; Zeynep Akata, University of Amsterdam
1917Generative Domain-Migration Hashing for Sketch-to-Image RetrievalJingyi Zhang*, University of Electronic Science and Technology of China; Fumin Shen, UESTC; Li Liu, the inception institute of artificial intelligence; Fan Zhu, the inception institute of artificial intelligence ; Mengyang Yu, ETH Zurich; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC); Luc Van Gool, ETH Zurich
2276Recurrent Fusion Network for Image captioningWenhao Jiang*, Tencent AI Lab; Lin Ma, Tencent AI Lab; Yu-Gang Jiang, Fudan University; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
1314Attention-based Ensemble for Deep Metric LearningWonsik Kim*, Samsung Electronics; Bhavya Goyal, Samsung Electronics; Kunal Chawla, Samsung Electronics; Jungmin Lee, Samsung Electronics; Keunjoo Kwon, Samsung Electronics
1722Egocentric Activity Prediction via Event Modulated AttentionYang Shen*, Shanghai Jiao Tong University; Bingbing Ni, Shanghai Jiao Tong University; Zefan Li, Shanghai Jiao Tong University; Ning Zhuang, Shanghai Jiao Tong University
2788A+D Net: Training a Shadow Detector with Adversarial Shadow AttenuationHieu Le*, Stony Brook University; Tomas F Yago Vicente, Stony Brook University; Vu Nguyen, Stony Brook University; Minh Hoai Nguyen, Stony Brook University; Dimitris Samaras, Stony Brook University
2714Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous DrivingPeiliang LI*, HKUST Robotics Institute; Tong QIN, HKUST Robotics Institute; Shaojie Shen, HKUST
1942End-to-end View Synthesis for Light Field Imaging with Pseudo 4DCNNYunlong Wang*, Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese Academy of Sciences (CASIA) ; Fei Liu, Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese Academy of Sciences (CASIA); Zilei Wang, University of Science and Technology of China; Guangqi Hou, Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese Academy of Sciences (CASIA); Zhenan Sun, Chinese of Academy of Sciences; Tieniu Tan, NLPR, China
1517Robust image stitching using multiple registrationsCharles Herrmann, Cornell; Chen Wang, Google Research; Richard Bowen, Cornell; Mike Krainin, Google; Ce Liu, Google; Bill Freeman, MIT; Ramin Zabih*, Cornell Tech/Google Research
477Fast Multi-fiber Network for Video RecognitionYunpeng Chen*, National University of Singapore; Yannis Kalantidis, Facebook Research, USA; Jianshu Li, NUS; Yan Shuicheng, National University of Singapore; Jiashi Feng, NUS
1920TBN: Convolutional Neural Network with Ternary Inputs and Binary WeightsDiwen Wan*, University of Electronic Science and Technology of China; Fumin Shen, UESTC; Li Liu, the inception institute of artificial intelligence; Fan Zhu, the inception institute of artificial intelligence ; Jie Qin, ETH Zurich; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC)
1442Contextual Based Image Inpainting: Infer, Match and TranslateYuhang Song*, USC; Chao Yang, University of Southern California; Zhe Lin, Adobe Research; Xiaofeng Liu, Carnegie Mellon University; Hao Li, Pinscreen/University of Southern California/USC ICT; Qin Huang, University of Southern California; C.-C. Jay Kuo, USC
414Deep Fundamental Matrix EstimationRene Ranftl*, Intel Labs; Vladlen Koltun, Intel Labs
859Joint Person Segmentation and Identification in Synchronized First- and Third-person VideosMingze Xu*, Indiana University; Chenyou Fan, JD.com; Yuchen Wang, Indiana University; Michael Ryoo, Indiana University; David Crandall, Indiana University
1576Linear Span Network for Object Skeleton DetectionChang Liu*, University of Chinese Academy of Sciences; Wei Ke, University of Chinese Academy of Sciences; Fei Qin, University of Chinese Academy of Sciences; Qixiang Ye, University of Chinese Academy of Sciences, China
472Category-Agnostic Semantic Keypoint Representations in Canonical Object ViewsXingyi Zhou*, The University of Texas at Austin; Arjun Karpur, The University of Texas at Austin; Linjie Luo, Snap Inc; Qixing Huang, The University of Texas at Austin
2365Where are the blobs: Counting by Localization with Point SupervisionIssam Hadj Laradji*, University of British Columbia (UBC); Negar Rostamzadeh, Element AI; Pedro Pinheiro, EPFL; David Vazquez, Element AI; Mark Schmidt, University of British Columbia
787A Hybrid Model for Identity Obfuscation by Face ReplacementQianru Sun*, National University of Singapore; Ayush Tewari, Max Planck Institute for Informatics; Weipeng Xu, MPII; Mario Fritz, Max-Planck-Institut für Informatik; Christian Theobalt, MPI Informatik; Bernt Schiele, MPI
1659Exploring the Limits of Supervised PretrainingDhruv Mahajan, Facebook; Ross Girshick*, Facebook AI Research (FAIR); Vignesh Ramanathan, Facebook; Kaiming He, Facebook Inc., USA; Manohar Paluri, Facebook; Yixuan Li, Facebook Research; Ashwin Bharambe, Facebook; Laurens van der Maaten, Facebook AI Research
434TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the WildMatthias Müller*, King Abdullah University of Science and Technology (KAUST); Adel Bibi, KAUST; Silvio Giancola, KAUST; Salman Al-Subaihi, KAUST; Bernard Ghanem, KAUST
698Unpaired Image Captioning by Language PivotingJiuxiang Gu*, Nanyang Technological University; Shafiq Joty, Nanyang Technological University; Jianfei Cai, Nanyang Technological University; Gang Wang, Alibaba Group
2552Pairwise Relational Networks for Face RecognitionBong-Nam Kang*, POSTECH
1977DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention NetworksWeixuan Chen*, MIT Media Lab; Daniel McDuff, Microsoft Research
2063Semantic Match Consistency for Long-Term Visual LocalizationCarl Toft*, Chalmers; Erik Stenborg, Chalmers University; Lars Hammarstrand, Chalmers university of technology; Lucas Brynte, Chalmers University of Technology; Marc Pollefeys, ETH Zurich; Torsten Sattler, ETH Zurich; Fredrik Kahl, Chalmers
1873Grounding Visual ExplanationsLisa Anne Hendricks*, Uc berkeley; Ronghang Hu, University of California, Berkeley; Trevor Darrell, UC Berkeley; Zeynep Akata, University of Amsterdam
181Cross-Modal Hamming HashingYue Cao, Tsinghua University; Mingsheng Long*, Tsinghua University; Bin Liu, Tsinghua University; Jianmin Wang, Tsinghua University, China
559A Modulation Module for Multi-task Learning with Applications in Image RetrievalXiangyun Zhao*, Northwestern University; Haoxiang Li, Adobe; Xiaohui Shen, Adobe Research; Xiaodan Liang, Carnegie Mellon University; Ying Wu, Northwestern University
1546Open-World Stereo Video Matching with Deep RNNYiran Zhong*, Australian National University; HONGDONG LI, Australian National University, Australia; Yuchao Dai, Northwestern Polytechnical University
648Deblurring Natural Image Using Super-Gaussian FieldsYuhang Liu, Wuhan University; Wenyong Dong*, Wuhan University; Dong Gong, Northwestern Polytechnical University & The University of Adelaide; Lei Zhang, The unversity of Adelaide; Qinfeng Shi, University of Adelaide
2987Diverse and Coherent Paragraph Generation from ImagesMoitreya Chatterjee*, University of Illinois at Urbana Champaign; Alexander Schwing, UIUC
1339Learning Compression from limited unlabeled DataXiangyu He*, Chinese Academy of Sciences; Jian Cheng, Chinese Academy of Sciences, China
198Deep Video Quality Assessor: From Spatio-temporal Visual Sensitivity to A Convolutional Neural Aggregation NetworkWoojae Kim*, Yonsei University; Jongyoo Kim, Yonsei University; Sewoong Ahn, Yonsei University; Jinwoo Kim, Yonsei University; Sanghoon Lee, Yonsei University, Korea
141Product Quantization Network for Fast Image RetrievalTan Yu*, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA; CHEN FANG, Adobe Research, San Jose, CA; Hailin Jin, Adobe Research
474Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph GenerationYikang LI*, The Chinese University of Hong Kong; Bolei Zhou, MIT; Yawen Cui, National University of Defense Technology ; Jianping Shi, Sensetime Group Limited; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong; Wanli Ouyang, CUHK
77C-WSL: Count-guided Weakly Supervised LocalizationMingfei Gao*, University of Maryland; Ang Li, Google DeepMind; Ruichi Yu, University of Maryland, College Park; Vlad Morariu, Adobe Research; Larry Davis, University of Maryland
806The Sound of PixelsHang Zhao*, Massachusetts Institute of Technology; Chuang Gan, MIT; Andrew Rouditchenko, MIT; Carl Vondrick, MIT; Josh McDermott, Massachusetts Institute of Technology; Antonio Torralba, MIT
1392Unsupervised Video Object Segmentation using Motion Saliency-Guided Spatio-Temporal PropagationYuan-Ting Hu*, University of Illinois at Urbana-Champaign; Jia-Bin Huang, Virginia Tech; Alexander Schwing, UIUC
2318Good Line Cutting: towards Accurate Pose Tracking of Line-assisted VO/VSLAMYipu Zhao*, Georgia Institute of Technology; Patricio Vela, Georgia Institute of Technology
75Bi-box Regression for Pedestrian Detection and Occlusion EstimationCHUNLUAN ZHOU*, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
2233Unveiling the Power of Deep TrackingGoutam Bhat*, Linkoping University; Joakim Johnander, Linköping University; Martin Danelljan, Linkoping University; Fahad Shahbaz Khan, Linköping University; Michael Felsberg, Linköping University
2937Multi-Scale Structure-Aware Network for Human Pose Estimation Lipeng Ke*, University of Chinese Academy of Sciences; Ming-Ching Chang, Albany University; Honggang Qi, University of Chinese Academy of Sciences; Siwei Lyu, University at Albany
1008Neural Graph Matching Networks for Fewshot 3D Action RecognitionMichelle Guo*, Stanford University; Edward Chou, Stanford University; De-An Huang, Stanford University; Shuran Song, Princeton; Serena Yeung, Stanford University; Li Fei-Fei, Stanford University
633Objects that SoundRelja Arandjelovi?*, DeepMind; Andrew Zisserman, University of Oxford
1358Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image TranslationChao Wang, Ocean University of China; Haiyong Zheng*, Ocean University of China; Zhibin Yu, Ocean University of China; Ziqiang Zheng, Ocean University of China; Zhaorui Gu, Ocean University of China; Bing Zheng, Ocean University of China
1624SaaS: Speed as a Supervisor for Semi-supervised LearningSafa Cicek*, UCLA; Alhussein Fawzi, UCLA; Stefano Soatto, UCLA
809Adaptive Affinity Field for Semantic SegmentationTsung-Wei Ke, UC Berkeley / ICSI; Jyh-Jing Hwang*, UC Berkeley / ICSI;
Ziwei Liu, UC Berkeley / ICSI; Stella Yu, UC Berkeley / ICSI
8Semi-convolutional Operators for Instance SegmentationSamuel Albanie*, University of Oxford; Andrea Vedaldi, Oxford University; David Novotny, Oxford University; Diane Larlus, Naver Labs Europe
1527Effective Use of Synthetic Data for Urban Scene Semantic SegmentationFatemeh Sadat Saleh*, Australian National University (ANU); Mohammad Sadegh Aliakbarian, Data61; Mathieu Salzmann, EPFL; Lars Petersson, Data61/CSIRO; Jose Manuel Alvarez, Toyota Research Institute
1804Shape correspondences from learnt template-based parametrizationThibault Groueix*, École des ponts ParisTech; Bryan Russell, Adobe Research; Mathew Fisher, Adobe Research; Vladimir Kim, Adobe Research; Mathieu Aubry, École des ponts ParisTech
1459TextSnake: A Flexible Representation for Detecting Text of Arbitrary ShapesShangbang Long, Peking University; Jiaqiang Ruan, Peking University; Wenjie Zhang, Peking University; Xin He*, Megvii; Wenhao Wu, Megvii; Cong Yao, Megvii
1801How good is my GAN?Konstantin Shmelkov*, Inria; Cordelia Schmid, INRIA; Karteek Alahari, Inria
2125Deep Generative Models for Weakly-Supervised Multi-Label ClassificationHong-Min Chu*, National Taiwan University; Chih-Kuan Yeh, Carnegie Mellon University; Yu-Chiang Frank Wang, National Taiwan University
1652Attention-GAN for Object Transfiguration in Wild ImagesXinyuan Chen*, Shanghai Jiao Tong University; Chang Xu, University of Sydney; Xiaokang Yang, Shanghai Jiao Tong University of China; Dacheng Tao, University of Sydney
59Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack LearningChenyang Si*, Institute of Automation, Chinese Academy of Sciences; Ya Jing, Institute of Automation, Chinese Academy of Sciences; wei wang, Institute of Automation Chinese Academy of Sciences; Liang Wang, NLPR, China; Tieniu Tan, NLPR, China
153Diverse Image-to-Image Translation via Disentangled RepresentationsHsin-Ying Lee*, University of California, Merced; Hung-Yu Tseng, University of California, Merced; Maneesh Singh, Verisk Analytics; Jia-Bin Huang, Virginia Tech; Ming-Hsuan Yang, University of California at Merced
78Convolutional Networks with Adaptive Computation GraphsAndreas Veit*, Cornell University; Serge Belongie, Cornell University
1BSeptember 10, 04:00 PM
999Learning to Separate Object Sounds by Watching Unlabeled VideoRuohan Gao*, University of Texas at Austin; Rogerio Feris, IBM Research; Kristen Grauman, University of Texas
12Learning-based Video Motion Magni_x000c_cationTae-Hyun Oh, MIT CSAIL; Ronnachai Jaroensri*, MIT CSAIL; Changil Kim, MIT CSAIL; Mohamed A. Elghareb, Qatar Computing Research Institute; Fredo Durand, MIT; Bill Freeman, MIT; Wojciech Matusik, Adobe
255Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based ModelingHiroaki Santo*, Osaka University; Michael Waechter, Osaka University; Masaki Samejima, Osaka University; Yusuke Sugano, Osaka University; Yasuyuki Matsushita, Osaka University
15Video Object Segmentation with Joint Re-identification and Attention-Aware Mask PropagationXiaoxiao Li*, The Chinese University of Hong Kong; Chen Change Loy, Chinese University of Hong Kong
1717Coded Two-Bucket Cameras for Computer VisionMian Wei, University of Toronto; Navid Navid Sarhangnejad, University of Toronto; Zhengfan Xia, University of Toronto; Nikola Katic, University of Toronto; Roman Genov, University of Toronto; Kyros Kutulakos*, University of Toronto
143Multimodal Unsupervised Image-to-image TranslationXun Huang*, Cornell University; Ming-Yu Liu, NVIDIA; Serge Belongie, Cornell University; Kautz Jan, NVIDIA
2571Learning to Detect and Track Visible and Occluded Body Joints in a Virtual WorldMatteo Fabbri, University of Modena and Reggio Emilia; Fabio Lanzi*, University of Modena and Reggio Emilia; SIMONE CALDERARA, University of Modena and Reggio Emilia, Italy; Andrea Palazzi, University of Modena and Reggio Emilia; ROBERTO VEZZANI, University of Modena and Reggio Emilia, Italy; Rita Cucchiara, Universita Di Modena E Reggio Emilia
1631Local Spectral Graph Convolution for Point Set Feature LearningChu Wang*, McGill University; Babak Samari, McGill University; Kaleem Siddiqi, McGill University
1019Meta-Tracker: Fast and Robust Online Adaptation for Visual Object TrackersEunbyung Park*, UNC-CHAPEL HILL; Alex Berg, University of North Carolina, USA
2058VSO: Visual Semantic OdometryKonstantinos-Nektarios Lianos, Geomagical Labs, Inc; Johannes Schoenberger, ETH Zurich; Marc Pollefeys, ETH Zurich; Torsten Sattler*, ETH Zurich
833Progressive Lifelong Learning by Distillation and RetrospectionSaihui Hou*, University of Science and Technology of China; Xinyu Pan, MMLAB, CUHK; Chen Change Loy, Chinese University of Hong Kong; Dahua Lin, The Chinese University of Hong Kong
2144Spatio-Temporal Channel Correlation Networks for Action ClassificationAli Diba*, KU Leuven; Mohsen Fayyaz, University of Bonn; Vivek Sharma, Karlsruhe Institute of Technology; Mohammad Arzani, Sensifai; Rahman Yousefzadeh, sensifai; Jürgen Gall, University of Bonn; Luc Van Gool, ETH Zurich
1262Long-term Tracking in the Wild: a BenchmarkEfstratios Gavves, University of Amsterdam ; Luca Bertinetto*, University of Oxford; Joao Henriques, University of Oxford; Andrea Vedaldi, Oxford University; Philip Torr, University of Oxford; Ran Tao, University of Amsterdam; Jack Valmadre, Oxford
974Online Detection of Action Start in Untrimmed, Streaming VideosZheng Shou*, Columbia University; Junting Pan, Columbia University ; Jonathan Chan, Columbia University; Kazuyuki Miyazawa, Mitsubishi Electric; Hassan Mansour, Mitsubishi Electric Research Laboratories (MERL); Anthony Vetro, Mitsubishi Electric Research Lab; Xavier Giro-i-Nieto, Universitat Politecnica de Catalunya; Shih-Fu Chang, Columbia University
51Two Stream Pose Transfer Guided by Dense Pose EstimationNatalia Neverova*, Facebook AI Research; Alp Guler, INRIA; Iasonas Kokkinos, Facebook, France
1398Simultaneous 3D Reconstruction for Water Surface and Underwater SceneYiming Qian*, University of Alberta; Yinqiang Zheng, National Institute of Informatics; Minglun Gong, Memorial University; Herb Yang, University of Alberta
3145Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular videoErnesto Brau, CiBO Technologies; Jinyan Guan, UC San Diego; Tanya Jeffries, U. Arizona; Kobus Barnard*, University of Arizona
1092Multi-Scale Context Intertwining for Semantic SegmentationDi Lin*, Shenzhen University; Yuanfeng Ji, Shenzhen University; Dani Lischinski, The Hebrew University of Jerusalem; Danny Cohen-Or, Tel Aviv University; Hui Huang, Shenzhen University
1519Object-centered image stitchingCharles Herrmann, Cornell; Chen Wang, Google Research; Richard Bowen, Cornell; Ramin Zabih*, Cornell Tech/Google Research
580Grassmann Pooling for Fine-Grained Visual ClassificationXing Wei*, Xi'an Jiaotong University; Yihong Gong, Xi'an Jiaotong University; Yue Zhang, Xi'an Jiaotong University; Nanning Zheng, Xi'an Jiaotong University; Jiawei Zhang, City University of Hong Kong
358Diagnosing Error in Temporal Action DetectorsHumam Alwassel*, KAUST; Fabian Caba, KAUST; Victor Escorcia, KAUST; Bernard Ghanem, KAUST
631CGIntrinsics: Better Intrinsic Image Decomposition through Physically-Based RenderingZhengqi Li*, Cornell University; Noah Snavely, -
849A Closed-form Solution to Photorealistic Image StylizationYijun Li*, University of California, Merced; Ming-Yu Liu, NVIDIA; Xueting Li, University of California, Merced; Ming-Hsuan Yang, University of California at Merced; Kautz Jan, NVIDIA
2667Two at Once: Enhancing Learning and Generalization Capacities via IBN-NetXingang Pan*, The Chinese University of Hong Kong; Ping Luo, The Chinese University of Hong Kong; Jianping Shi, Sensetime Group Limited; Xiaoou Tang, The Chinese University of Hong Kong
1071Collaborative Deep Reinforcement Learning for Multi-Object TrackingLiangliang Ren, Tsinghua University; Zifeng Wang, Tsinghua University; Jiwen Lu*, Tsinghua University; Qi Tian , The University of Texas at San Antonio; Jie Zhou, Tsinghua University, China
2110Single Image Highlight Removal with a Sparse and Low-Rank Reflection ModelJie Guo*, Nanjing University; Zuojian Zhou, Nanjing University Of Chinese Medicine; Limin Wang, Nanjing University
1313Hierarchical Relational Networks for Group Activity Recognition and RetrievalMostafa Ibrahim*, Simon Fraser University; Greg Mori, Simon Fraser University
479Towards Human-Level License Plate RecognitionJiafan Zhuang, University of Science and Technology of China; Zilei Wang*, University of Science and Technology of China
1971Stacked Cross Attention for Image-Text MatchingKuang-Huei Lee*, Microsoft AI and Research; Xi Chen, Microsoft AI and Research; Gang Hua, Microsoft Cloud and AI; Houdong Hu, Microsoft AI and Research; Xiaodong He, JD AI Research
2454Deep Discriminative Model for Video ClassificationMohammad Tavakolian*, University of Oulu; Abdenour Hadid, Finland
2921The Mutex Watershed: Efficient, Parameter-Free Image PartitioningSteffen Wolf*, Univertity of Heidelberg; Constantin Pape, University of Heidelberg; Nasim Rahaman, University of Heidelberg; Anna Kreshuk, University of Heidelberg; Ullrich Köthe, University of Heidelberg; Fred Hamprecht, Heidelberg Collaboratory for Image Processing
214Monocular Depth Estimation with Affinity, Vertical Pooling, and Label EnhancementYuKang Gan*, SUN YAT-SEN University; Xiangyu Xu, Tsinghua University; Wenxiu Sun, SenseTime Research; Liang Lin, SenseTime
442Improved Structure from Motion Using Fiducial Marker MatchingJoseph DeGol*, UIUC; Timothy Bretl, University of Illinois at Urbana-Champaign; Derek Hoiem, University of Illinois at Urbana-Champaign
1009Temporal Modular Networks for Retrieving Complex Compositional Activities in VideoBingbin Liu*, Stanford University; Serena Yeung, Stanford University; Edward Chou, Stanford University; De-An Huang, Stanford University; Li Fei-Fei, Stanford University; Juan Carlos Niebles, Stanford University
547Quantized Densely Connected U-Nets for Efficient Landmark LocalizationZhiqiang Tang*, Rutgers; Xi Peng, Rutgers University; Shijie Geng, Rutgers; Shaoting Zhang, University of North Carolina at Charlotte; Lingfei Wu, IBM T. J. Watson Research Center; Dimitris Metaxas, Rutgers
2867Real-to-Virtual Domain Uni_x000c_cation for End-to-End Autonomous DrivingLuona Yang*, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; Eric Xing, Petuum Inc.
2694Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline)Yifan Sun*, Tsinghua University; Liang Zheng, Singapore University of Technology and Design; Yi Yang, University of Technology, Sydney; Qi Tian , The University of Texas at San Antonio; Shengjin Wang, Tsinghua University
3122Fully-Convolutional Point Networks for Large-Scale Point CloudsDario Rethage*, Technical University of Munich, Germany; Johanna Wald, Technical University of Munich; Nassir Navab, TU Munich, Germany; Federico Tombari, Technical University of Munich, Germany
1736Real-Time Hair Rendering using Sequential Adversarial NetworksLingyu Wei*, University of Southern California; Liwen Hu, University of Southern California; Vladimir Kim, Adobe Research; Ersin Yumer, Argo AI; Hao Li, Pinscreen/University of Southern California/USC ICT
875Visual Tracking via Spatially Aligned Correlation Filters Networkmengdan zhang*, Institute of Automation, Chinese Academy of Sciences; qiang wang, Institute of Automation, Chinese Academy of Sciences; Junliang Xing, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Jin Gao, Institute of Automation, Chinese Academy of Sciences; peixi peng, Institute of Automation, Chinese Academy of Sciences; Weiming Hu, Institute of Automation,Chinese Academy of Sciences; Steve Maybank, University of London
27Spatio-temporal Transformer Network for Video RestorationTae Hyun Kim*, Max Planck Institute for Intelligent Systems; Mehdi S. M. Sajjadi, Max Planck Institute for Intelligent Systems; Michael Hirsch, Max Planck Institut for Intelligent Systems ; Bernhard Schölkopf, Max Planck Institute for Intelligent Systems
3062Value-aware Quantization for Training and Inference of Neural NetworksEunhyeok Park, Seoul National University; Sungjoo Yoo*, Seoul National University; Peter Vajda, Facebook
2154Lambda Twist: An Accurate Fast Robust Perspective Three Point (P3P) SolverMikael Persson*, Linköping University;
Klas Nordberg, Linköping University
84Programmable Light CurtainsJian Wang*, Carnegie Mellon University; Joe Bartels, Carnegie Mellon University; William Whittaker, Carnegie Mellon University; Aswin Sankaranarayanan, Carnegie Mellon University; Srinivasa Narasimhan, Carnegie Mellon University
1602Monocular Depth Estimation Using Whole Strip Masking and Reliability-Based RefinementMinhyeok Heo*, Korea University; Jaehan Lee, Korea University; Kyung-Rae Kim, Korea University; Han-Ul Kim, Korea University; Chang-Su Kim, Korea university
2495Task-Aware Image DownscalingHeewon Kim, Seoul National University; Myungsub Choi, Seoul National University; Bee Lim, Seoul National University; Kyoung Mu Lee*, Seoul National University
2712Single Image Scene Refocusing using Conditional Adversarial NetworksParikshit Sakurikar*, IIIT-Hyderabad; Ishit Mehta, IIIT Hyderabad; Vineeth N Balasubramanian, IIT Hyderabad; P. J. Narayanan, IIIT-Hyderabad
1786Model-free Consensus Maximization for Non-Rigid ShapesThomas Probst*, ETH Zurich; Ajad Chhatkuli , ETHZ; Danda Pani Paudel, ETH Zürich; Luc Van Gool, ETH Zurich
1539BSN: Boundary Sensitive Network for Temporal Action Proposal GenerationTianwei Lin, Shanghai Jiao Tong University; Xu Zhao*, Shanghai Jiao Tong University; Haisheng Su, Shanghai Jiao Tong University; Chongjing Wang, China Academy of Information and Communications Technology; Ming Yang, Shanghai Jiao Tong University
2455Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone ImageZhengqin Li*, UC San Diego; Manmohan Chandraker, UC San Diego; Sunkavalli Kalyan, Adobe Research
2325Attentive Semantic Alignment with Offset-Aware Correlation KernelsPaul Hongsuck Seo*, POSTECH; Jongmin Lee, POSTECH; Deunsol Jung, POSTECH; Bohyung Han, Seoul National University; Minsu Cho, POSTECH
158Deeply Learned Compositional Models for Human Pose EstimationWei Tang*, Northwestern University; Pei Yu, Northwestern University; Ying Wu, Northwestern University
1727Real-Time MDNetIlchae Jung*, POSTECH; Jeany Son, POSTECH; Mooyeol Baek, POSTECH; Bohyung Han, Seoul National University
1406Women also Snowboard: Overcoming Bias in Captioning ModelsLisa Anne Hendricks*, UC Berkeley; Kaylee Burns, UC Berkeley; Kate Saenko, Boston University; Trevor Darrell, UC Berkeley; Anna Rohrbach, UC Berkeley
1589Progressive Structure from MotionAlex Locher*, ETH Zürich; Michal Havlena, Vuforia, PTC, Vienna; Luc Van Gool, ETH Zurich
1154Occlusion-aware R-CNN: Detecting Pedestrians in a CrowdShifeng Zhang*, CBSR, NLPR, CASIA; Longyin Wen, GE Global Research; Xiao Bian, GE Global Research; Zhen Lei, NLPR, CASIA, China; Stan Li, National Lab. of Pattern Recognition, China
1267Affinity Derivation and Graph Merge for Instance SegmentationYiding Liu*, University of Science and Technology of China; Siyu Yang, Beihang University; Bin Li, Microsoft Research Asia; Wengang Zhou, University of Science and Technology of China; Ji-Zeng Xu, Microsoft Research Asia; Houqiang Li, University of Science and Technology of China; Yan Lu, Microsoft Research Asia
1138Second-order Democratic AggregationTsung-Yu Lin*, University of Massachusetts Amherst; Subhransu Maji, University of Massachusetts, Amherst; Piotr Koniusz, Data61/CSIRO, ANU
972Improving Sequential Determinantal Point Processes for Supervised Video SummarizationAidean Sharghi*, University of Central Florida; Boqing Gong, Tencent AI Lab; Ali Borji, University of Central Florida; Chengtao Li, MIT; Tianbao Yang, University of Iowa
1235Seeing Deeply and Bidirectionally: A Deep Learning Approach for Single Image Reflection RemovalJie Yang*, University of Adelaide; Dong Gong, Northwestern Polytechnical University & The University of Adelaide; Lingqiao Liu, University of Adelaide; Qinfeng Shi, University of Adelaide
1958Specular-to-Diffuse Translation for Multi-View ReconstructionShihao Wu*, University of Bern; Hui Huang, Shenzhen University; Tiziano Portenier, University of Bern; Matan Sela, Technion - Israel Institute of Technology; Danny Cohen-Or, Tel Aviv University; Ron Kimmel, Technion; Matthias Zwicker, University of Maryland
696SEAL: A Framework Towards Simultaneous Edge Alignment and LearningZhiding Yu*, NVIDIA; Weiyang Liu, Georgia Tech; Yang Zou, Carnegie Mellon University; Chen Feng, Mitsubishi Electric Research Laboratories (MERL); Srikumar Ramalingam, University of Utah; B. V. K. Vijaya Kumar, CMU, USA; Kautz Jan, NVIDIA
1845Question Type Guided Attention in Visual Question AnsweringYang Shi*, University of California, Irvine; Tommaso Furlanello, University of Southern California; Sheng Zha, Amazon Web Services; Anima Anandkumar, Amazon
1316Neural Procedural Reconstruction for Residential BuildingsHuayi Zeng*, Washington University in St.Louis; Jiaye Wu, Washington University in St.Louis; Yasutaka Furukawa, Simon Fraser University
2507Self-Calibration of Cameras with Euclidean Image Plane in Case of Two Views and Known Relative Rotation AngleEvgeniy Martyushev*, South Ural State University
1828Towards Optimal Deep Hashing via Policy GradientXin Yuan, Tsinghua University; Liangliang Ren, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
1632Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask WeightsArun Mallya*, UIUC; Svetlana Lazebnik, UIUC; Dillon Davis, UIUC
1285Generating 3D Faces using Convolutional Mesh AutoencodersAnurag Ranjan*, MPI for Intelligent Systems; Timo Bolkart, Max Planck for Intelligent Systems; Soubhik Sanyal, Max Planck Institute for Intelligent Systems; Michael Black, Max Planck Institute for Intelligent Systems
753ICNet for Real-Time Semantic Segmentation on High-Resolution ImagesHengshuang Zhao, The Chinese University of Hong Kong; Xiaojuan Qi, CUHK; Xiaoyong Shen*, CUHK; Jianping Shi, Sensetime Group Limited; Jia Jiaya, Chinese University of Hong Kong
89Memory Aware Synapses: Learning what (not) to forget Rahaf Aljundi*, KU Leuven; Francesca babiloni, KU Leuven; Mohamed Elhoseiny, Facebook; Marcus Rohrbach, Facebook AI Research; Tinne Tuytelaars, K.U. Leuven
2046Deep Texture and Structure Aware Filtering Network for Image SmoothingKaiyue Lu*, Australian National University & Data61-CSIRO; Shaodi You, Data61-CSIRO, Australia; Nick Barnes, CSIRO(Data61)
2260Linear RGB-D SLAM for Planar EnvironmentsPyojin Kim*, Seoul National University; Brian Coltin, NASA Ames Research Center; Hyoun Jin Kim, Seoul National University
2583DeepJDOT: Deep Joint distribution optimal transport for unsupervised domain adaptationBharath Bhushan Damodaran*, IRISA,Universite de Bretagne-Sud; Benjamin Kellenberger, Wageningen University and Research; Rémi Flamary, Université Côte d’Azur; Devis Tuia, Wageningen University and Research; Nicolas Courty, IRISA, Universite Bretagne-Sud
3004W-TALC: Weakly-supervised Temporal Activity Localization and ClassificationSujoy Paul*, University of California-Riverside; Sourya Roy, University of California, Riverside; Amit Roy-Chowdhury , University of California, Riverside, USA
159Unsupervised Video Object Segmentation with Motion-based Bilateral NetworksSiyang Li*, University of Southern California; Bryan Seybold, Google Inc.; Alexey Vorobyov, Google Inc.; Xuejing Lei, University of Southern California ; C.-C. Jay Kuo, USC
1504Disentangling Factors of Variation with Cycle-Consistent Variational Auto-EncodersAnanya Harsh Jha*, Indraprastha Institute of Information Technology Delhi; Saket Anand, Indraprastha Institute of Information Technology Delhi; Maneesh Singh, Verisk Analytics; VSR Veeravasarapu, Verisk Analytics
2401Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-identificationCheng Wang, Huazhong Univ. of Science and Technology; Qian Zhang, Horizon Robotics; Chang Huang, Horizon Robotics, Inc.; Wenyu Liu, Huazhong University of Science and Technology; Xinggang Wang*, Huazhong Univ. of Science and Technology
112Multi-view to Novel view: Synthesizing Views via Self-Learned ConfidenceShao-Hua Sun*, University of Southern California; Jacob Huh, Carnegie Mellon University; Yuan-Hong Liao, National Tsing Hua University; Ning Zhang, SnapChat; Joseph Lim, USC
828Part-Activated Deep Reinforcement Learning for Action PredictionLei Chen, Tianjin University; Jiwen Lu*, Tsinghua University; Zhanjie Song, Tianjin University; Jie Zhou, Tsinghua University, China
908Online Dictionary Learning for Approximate Archetypal AnalysisJieru Mei, Microsoft Research Asia; Chunyu Wang*, Microsoft Research asia; Wenjun Zeng, Microsoft Research
1871Estimating Depth from RGB and Sparse SensingZhao Chen*, Magic Leap, Inc.; Vijay Badrinarayanan, Magic Leap, Inc.; Gilad Drozdov, Magic Leap, Inc.; Andrew Rabinovich, Magic Leap, Inc.
452Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-TrainingYang Zou*, Carnegie Mellon University; Zhiding Yu, NVIDIA; B. V. K. Vijaya Kumar, CMU, USA; Jinsong Wang, General Motors
482Zoom-Net: Mining Deep Feature Interactions for Visual Relationship RecognitionGuojun Yin, University of Science and Technology of China; Lu Sheng, The Chinese University of Hong Kong; Bin Liu, University of Science and Technology of China; Nenghai Yu, University of Science and Technology of China; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong; Chen Change Loy, Chinese University of Hong Kong; Jing Shao*, The Chinese University of Hong Kong
1421Joint Camera Spectral Sensitivity Selection and Hyperspectral Image RecoveryYing Fu*, Beijing Institute of Technology; Tao Zhang, Beijing Institute of Technology; Yinqiang Zheng, National Institute of Informatics; debing zhang, DeepGlint; Hua Huang, Beijing Institute of Technology
963Compositing-aware Image SearchHengshuang Zhao*, The Chinese University of Hong Kong; Xiaohui Shen, Adobe Research; Zhe Lin, Adobe Research; Sunkavalli Kalyan, Adobe Research; Brian Price, Adobe; Jia Jiaya, Chinese University of Hong Kong
2717Zero-shot keyword search for visual speech recognition in-the-wildThemos Stafylakis*, University of Nottingham; Georgios Tzimiropoulos, University of Nottingham
123End-to-End Joint Semantic Segmentation of Actors and Actions in VideoJingwei Ji*, Stanford University; Shyamal Buch, Stanford University; Alvaro Soto, Universidad Catolica de Chile; Juan Carlos Niebles, Stanford University
1538Learning Discriminative Video Representations Using Adversarial PerturbationsJue Wang*, ANU; Anoop Cherian, MERL
1507DeepWrinkles: Accurate and Realistic Clothing ModelingZorah Laehner, TU Munich; Tony Tung*, Facebook / Oculus Research; Daniel Cremers, TUM
368Massively Parallel Video NetworksViorica Patraucean*, DeepMind; Joao Carreira, DeepMind; Laurent Mazare, DeepMind; Simon Osindero, DeepMind; Andrew Zisserman, University of Oxford
2ASeptember 11, 10:00 AM
1233Unsupervised Person Re-identification by Deep Learning Tracklet AssociationMinxian Li*, Nanjing University and Science and Technology; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
1458Instance-level Human Parsing via Part Grouping NetworkKe Gong*, SYSU; Xiaodan Liang, Carnegie Mellon University; Yicheng Li, Sun Yat-sen University; Yimin Chen, sensetime; Liang Lin, Sun Yat-sen University
1101Scaling Egocentric Vision: The E-Kitchens DatasetDima Damen*, University of Bristol; Hazel Doughty, University of Bristol; Sanja Fidler, University of Toronto; Antonino Furnari, University of Catania; Evangelos Kazakos, University of Bristol; Giovanni Farinella, University of Catania, Italy; Davide Moltisanti, University of Bristol; Jonathan Munro, University of Bristol; Toby Perrett, University of Bristol; Will Price, University of Bristol; Michael Wray, University of Bristol
1321Predicting Gaze in Egocentric Video by Learning Task-dependent Attention TransitionYifei Huang*, The University of Tokyo
2645Beyond local reasoning for stereo confidence estimation with deep learningFabio Tosi, University of Bologna; Matteo Poggi*, University of Bologna; Antonio Benincasa, University of Bologna; Stefano Mattoccia, University of Bologna
711DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture ModelStéphane Lathuiliere, INRIA; Pablo Mesejo-Santiago, University of Granada; Xavier Alameda-Pineda*, INRIA; Radu Horaud, INRIA
495Into the Twilight Zone: Depth Estimation using Joint Structure-Stereo OptimizationAashish Sharma*, National University of Singapore; Loong Fah Cheong, NUS
486Generalized Loss-Sensitive Adversarial Learning with Manifold MarginsMarzieh Edraki*, University of Central Florida; Guo-Jun Qi, University of Central Florida
626Adversarial Open Set Domain AdaptationKuniaki Saito*, The University of Tokyo; Shohei Yamamoto, The University of Tokyo; Yoshitaka Ushiku, The University of Tokyo; Tatsuya Harada, The University of Tokyo
1002Connecting Gaze, Scene and AttentionEunji Chong*, Georgia Institute of Technology; Nataniel Ruiz, Georgia Institute of Technology; Richard Wang, Georgia Institute of Technology; Yun Zhang, Georgia Institute of Technology
2156Multi-modal Cycle-consistent Generalized Zero-Shot LearningRAFAEL FELIX*, The University of Adelaide; Vijay Kumar B G, University of Adelaide; Ian Reid, University of Adelaide, Australia; Gustavo Carneiro, University of Adelaide
1823Understanding Degeneracies and Ambiguities in Attribute TransferAttila Szabo*, University of Bern; Qiyang Hu, University of Bern; Tiziano Portenier, University of Bern; Matthias Zwicker, University of Maryland; Paolo Favaro, Bern University, Switzerland
2703Start, Follow, Read: End-to-End Full Page Handwriting RecognitionCurtis Wigington*, Brigham Young University; Chris Tensmeyer, Brigham Young University; Brian Davis, Brigham Young University; Bill Barrett, Brigham Young University; Brian Price, Adobe; Scott Cohen, Adobe Research
855Rethinking the Form of Latent States in Image CaptioningBo Dai*, the Chinese University of Hong Kong; Deming Ye, Tsinghua University; Dahua Lin, The Chinese University of Hong Kong
2849ConvNets and ImageNet Beyond Accuracy: Understanding Mistakes and Uncovering BiasesPierre Stock*, Facebook AI Research; Moustapha Cisse, Facebook AI Research
2028Deep Shape MatchingFilip Radenovic*, Visual Recognition Group, CTU Prague; Giorgos Tolias, Vision Recognition Group, Czech Technical University in Prague; Ondrej Chum, Vision Recognition Group, Czech Technical University in Prague
225Neural Stereoscopic Image Style TransferXinyu Gong*, University of Electronic Science and Technology of China; Haozhi Huang, Tencent AI Lab; Lin Ma, Tencent AI Lab; Fumin Shen, UESTC; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
1641Semi-supervised FusedGAN for Conditional Image GenerationNavaneeth Bodla*, University of Maryland; Gang Hua, Microsoft Cloud and AI; Rama Chellappa, University of Maryland
2845Affine Correspondences between Central Cameras for Rapid Relative Pose EstimationIván Eichhardt*, MTA SZTAKI; Mitya Csetverikov, MTA SZTAKI & ELTE
2347Bi-directional Feature Pyramid Network with Recursive Attention Residual Modules For Shadow DetectionLei Zhu*, The Chinese University of Hong Kong; Zijun Deng, South China University of Technology; Xiaowei Hu, The Chinese University of Hong Kong; Chi-Wing Fu, The Chinese University of Hong Kong; Xuemiao Xu, South China University of Technology; Jing Qin, The Hong Kong Polytechnic University; Pheng-Ann Heng, The Chinese Univsersity of Hong Kong
2603Joint Learning of Intrinsic Images and Semantic SegmentationAnil Baslamisli*, University of Amsterdam; Thomas Tiel Groenestege, University of Amsterdam; Partha Das, University of Amsterdam; Hoang-An Le, University of Amsterdam; Sezer Karaoglu, University of Amsterdam; Theo Gevers, University of Amsterdam
2093Visual Reasoning with a Multi-hop FiLM GeneratorFlorian Strub*, University of Lille; Mathieu Seurin, University of Lille; Ethan Perez, Rice University; Harm De Vries, Montreal Institute for Learning Algorithms; Jeremie Mary, Criteo; Philippe Preux, INRIA; Aaron Courville, MILA, Université de Montréal; Olivier Pietquin, GoogleBrain
1333View-graph Selection Framework for SfMRajvi Shah*, IIIT Hyderabad; Visesh Chari, INRIA; P. J. Narayanan, IIIT-Hyderabad
607Fine-grained Video Categorization with Redundancy Reduction AttentionChen Zhu, University of Maryland; Xiao Tan, Baidu Inc.; Feng Zhou, Baidu Inc.; Xiao Liu, Baidu Research; Kaiyu Yue*, Baidu Inc.; Errui Ding, Baidu Inc.; Yi Ma, UC Berkeley
508Space-time Knowledge for Unpaired Image-to-Image TranslationAayush Bansal*, Carnegie Mellon University; Shugao Ma, Facebook / Occulus; Deva Ramanan, Carnegie Mellon University; Yaser Sheikh, CMU
2922Integral Human Pose RegressionXiao Sun*, Microsoft Research Asia; Bin Xiao, MSR Asia; Fangyin Wei, Peking University; Shuang Liang, Tongji University; Yichen Wei, MSR Asia
2612Recurrent Tubelet Proposal and Recognition Networks for Action DetectionDong Li, University of Science and Technology of China; Zhaofan Qiu, University of Science and Technology of China; Qi Dai, Microsoft Research; Ting Yao*, Microsoft Research; Tao Mei, JD.com
2980Learning to Predict Crisp EdgeRuoxi Deng*, Central South University; Chunhua Shen, University of Adelaide; Shengjun Liu, Central South University; Huibing Wang, Dalian University of Technology; Xinru Liu, Central South University
3090Open Set Learning with Counterfactual ImagesLawrence Neal*, Oregon State University; Matthew Olson, Oregon State University; Xiaoli Fern, Oregon State University; Weng-Keen Wong, Oregon State University; Fuxin Li, Oregon State University
733Estimating the Success of Unsupervised Image to Image TranslationLior Wolf, Tel Aviv University, Israel; Sagie Benaim*, Tel Aviv University; Tomer Galanti, Tel Aviv University
812Joint Map and Symmetry SynchronizationQixing Huang*, The University of Texas at Austin; Xiangru Huang, University of Texas at Austin; Zhenxiao Liang, Tsinghua University; Yifan Sun, The University of Texas at Austin
2340Single Image Water Hazard Detection using FCN with Reflection Attention UnitsXiaofeng Han, Nanjing University of Science and Technology; Chuong Nguyen*, CSIRO Data61; Shaodi You, Data61-CSIRO, Australia; Jianfeng Lu, Nanjing University of Science and Technology
2739Realtime Time Synchronized Event-based StereoAlex Zhu*, University of Pennsylvania; Yibo Chen, University of Pennsylvania; Kostas Daniilidis, University of Pennsylvania
2528Transferring GANs: generating images from limited datayaxing wang*, Computer Vision Center; Chenshen Wu, Computer Vision Center; Luis Herranz, Computer Vision Center (Ph.D.); Joost van de Weijer, Computer Vision Center; Abel Gonzalez-Garcia, Computer Vision Center; BOGDAN RADUCANU, Computer Version Center, Edifici
2512To learn image super-resolution, use a GAN to learn how to do image degradation firstAdrian Bulat*, University of Nottingham; Jing Yang, University of Nottingham; Georgios Tzimiropoulos, University of Nottingham
1220Unsupervised CNN-based co-saliency detection with graphical optimizationKuang-Jui Hsu*, Academia Sinica; Chung-Chi Tsai, Texas A&M University; Yen-Yu Lin, Academia Sinica; Xiaoning Qian, Texas A&M University; Yung-Yu Chuang, National Taiwan University
2453Fast Light Field Reconstruction With Deep Coarse-To-Fine Modeling of Spatial-Angular CluesHenry W. F. Yeung, the University of Sydney; Junhui Hou*, City University of Hong Kong, Hong Kong; Jie Chen, Nanyang Technological University; Yuk Ying Chung, the University of Sydney; Xiaoming Chen, University of Science and Technology of China
1066Unified Perceptual Parsing for Scene UnderstandingTete Xiao*, Peking University; Yingcheng Liu, Peking University; Yuning Jiang, Megvii(Face++) Inc; Bolei Zhou, MIT; Jian Sun, Megvii, Face++
2690PARN: Pyramidal Affine Regression Networks for Dense Semantic Correspondence EstimationSangryul Jeon*, Yonsei university; Seungryung Kim, Yonsei University; Dongbo Min, Ewha Womans University; Kwanghoon Sohn , Yonsei Univ.
3071Structural Consistency and Controllability for Diverse ColorizationSafa Messaoud*, University of Illinois at Urbana Champaign; Alexander Schwing, UIUC; David Forsyth, Univeristy of Illinois at Urbana-Champaign
971Online Multi-Object Tracking with Dual Matching Attention NetworksJi Zhu, Shanghai Jiao Tong University; Hua Yang*, Shanghai Jiao Tong University; Nian Liu, Northwestern Polytechnical University; Minyoung Kim, Perceptive Automata; Wenjun Zhang, Shanghai Jiao Tong University; Ming-Hsuan Yang, University of California at Merced
949MaskConnect: Connectivity Learning by Gradient DescentKarim Ahmed*, Dartmouth College; Lorenzo Torresani, Dartmouth College
2518FloorNet: A Unified Framework for Floorplan Reconstruction from 3D ScansChen Liu*, Washington University in St. Louis; Jiaye Wu, Washington University in St.Louis; Yasutaka Furukawa, Simon Fraser University
2992Image Manipulation with Perceptual DiscriminatorsDiana Sungatullina*, Skolkovo Institute of Science and Technology; Egor Zakharov, Skolkovo Institute of Science and Technology; Dmitry Ulyanov, Skolkovo Institute of Science and Technology; Victor Lempitsky, Skoltech
372Transductive Centroid Projection for Semi-supervised Large-scale RecognitionYu Liu*, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong; Guanglu Song, Sensetime; Jing Shao, Sensetime
2036Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based LossesZheng Dang*, Xi'an Jiaotong University; Kwang Moo Yi, University of Victoria; Yinlin Hu, EPFL; Fei Wang, Xi'an Jiaotong University; Pascal Fua, EPFL, Switzerland; Mathieu Salzmann, EPFL
2651Self-supervised Knowledge Distillation Using Singular Value DecompositionSEUNG HYUN LEE, Inha University; Daeha Kim, Inha University ; Byung Cheol Song*, Inha University
88Snap Angle Prediction for 360$^{\circ}$ PanoramasBo Xiong*, University of Texas at Austin; Kristen Grauman, University of Texas
2551Saliency Preservation in Low-Resolution Grayscale ImagesShivanthan Yohanandan*, RMIT University; Adrian Dyer, RMIT University; Dacheng Tao, University of Sydney; Andy Song, RMIT University
1463PPF-FoldNet: Unsupervised Learning of Rotation Invariant 3D Local DescriptorsTolga Birdal*, TU Munich; Haowen Deng, Technical University of Munich; Slobodan Ilic, Siemens AG
2488BusterNet: Detecting Copy-Move Image Forgery with Source/Target LocalizationRex Yue Wu*, USC ISI; Wael Abd-Almageed, Information Sciences Institute; Prem Natarajan, USC ISI
1554Double JPEG Detection in Mixed JPEG Quality Factors using Deep Convolutional Neural NetworkJin-Seok Park*, Korea Advanced Institute of Science and Technology (KAIST); Donghyeon Cho, KAIST; Wonhyuk Ahn, KAIST; Heung-Kyu Lee, Korea Advanced Institute of Science and Technology (KAIST)
164Unsupervised holistic image generation from key local patchesDonghoon Lee*, Seoul National University; Sangdoo Yun, Clova AI Research, NAVER Corp.; Sungjoon Choi, Seoul National University; Hwiyeon Yoo, Seoul National University; Ming-Hsuan Yang, University of California at Merced; Songhwai Oh, Seoul National University
2331CrossNet: An End-to-end Reference-based Super Resolution Network using Cross-scale WarpingHaitian Zheng, HKUST; Mengqi Ji, HKUST; Haoqian Wang, Tsinghua University; Yebin Liu*, Tsinghua University; Lu Fang, Tsinghua University
1284DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene AdaptationZuxuan Wu*, UMD; Xintong Han, University of Maryland, USA; Yen-Liang Lin, GE Global Research ; Gokhan Uzunbas, Avitas Systems-GE Venture; Tom Goldstein, University of Maryland, College Park; Ser-Nam Lim, GE Global Research; Larry Davis, University of Maryland
1401YouTube-VOS: Sequence-to-Sequence Video Object SegmentationNing Xu*, Adobe Research; Linjie Yang, Snap Research; Dingcheng Yue, UIUC; Jianchao Yang, Snap; Brian Price, Adobe; Jimei Yang, Adobe; Scott Cohen, Adobe Research; Yuchen Fan, Image Formation and Processing (IFP) Group, University of Illinois at Urbana-Champaign; Yuchen Liang, UIUC; Thomas Huang, University of Illinois at Urbana Champaign
1385Selfie Video StabilizationJiyang Yu*, University of California San Diego; Ravi Ramamoorthi, University of California San Diego
1033Videos as Space-Time Region GraphsXiaolong Wang*, CMU; Abhinav Gupta, CMU
755Parallel Feature Pyramid Network for Object DetectionSeung-Wook Kim*, Korea University; Hyong-Keun Kook, Korea University; Jee-Young Sun, Korea University; Mun-Cheon Kang, Korea University; Sung-Jea Ko, Korea University
699Goal-Oriented Visual Question Generation via Intermediate RewardsJunjie Zhang, University of Technology, Sydney; Qi Wu*, University of Adelaide; Chunhua Shen, University of Adelaide; Jian Zhang, UTS; Jianfeng Lu, Nanjing University of Science and Technology; Anton Van Den Hengel, University of Adelaide
2710WildDash - Creating Hazard-Aware BenchmarksOliver Zendel*, AIT Austrian Institute of Technology; Katrin Honauer, Heidelberg University; Markus Murschitz, AIT Austrian Institute of Technology; Daniel Steininger, AIT Austrian Institute of Technology; Gustavo Fernandez, n/a
1864Reinforced Temporal Attention and Split-Rate Transfer for Depth-Based Person Re-identificationNikolaos Karianakis*, Microsoft; Zicheng Liu, Microsoft; Yinpeng Chen, Microsoft; Stefano Soatto, UCLA
186DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Network ConsistencyYuliang Zou*, Virginia Tech; Zelun Luo, Stanford University; Jia-Bin Huang, Virginia Tech
850Generating Multimodal Human Dynamics with a Transformation based RepresentationXinchen Yan*, University of Michigan; Akash Rastogi, UM; Ruben Villegas, University of Michigan; Eli Shechtman, Adobe Research, US; Sunkavalli Kalyan, Adobe Research; Sunil Hadap, Adobe; Ersin Yumer, Argo AI; Honglak Lee, UM
1159Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field EstimationZhaoyang Lv*, GEORGIA TECH; Kihwan Kim, NVIDIA; Alejandro Troccoli, NVIDIA; Deqing Sun, NVIDIA; Kautz Jan, NVIDIA; James Rehg, Georgia Institute of Technology
2139Learning Visual Question Answering by Bootstrapping Hard AttentionMateusz Malinowski*, DeepMind; Carl Doersch, DeepMind; Adam Santoro, DeepMind; Peter Battaglia, DeepMind
2456Image Reassembly Combining Deep Learning and Shortest Path ProblemMarie-Morgane Paumard*, ETIS; David Picard, ETIS/LIP6; Hedi Tabia, France
2898RESOUND: Towards Action Recognition without Representation BiasYingwei Li*, UCSD; Nuno Vasconcelos, UC San Diego; Yi Li, University of California San Diego
2271Key-Word-Aware Network for Referring Expression Image SegmentationHengcan Shi*, University of Electronic Science and Technology of China; Hongliang Li, University of Electronic Science and Technology of China; Fanman Meng, University of Electronic Science and Technology of China; Qingbo Wu, University of Electronic Science and Technology of China
1245Mutual Learning to Adapt for Joint Human Parsing and Pose EstimationXuecheng Nie*, NUS; Jiashi Feng, NUS; Shuicheng Yan, Qihoo/360
2795Simple Baselines for Human Pose Estimation and TrackingBin Xiao*, MSR Asia; Haiping Wu, MSR Asia; Yichen Wei, MSR Asia
1802Pose Partition Networks for Multi-Person Pose EstimationXuecheng Nie*, NUS; Jiashi Feng, NUS; Junliang Xing, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Shuicheng Yan, Qihoo/360
1638Wasserstein Divergence For GANsJiqing Wu*, ETH Zurich; Zhiwu Huang, ETH Zurich; Janine Thoma, ETH Zurich; Dinesh Acharya, ETH Zurich; Luc Van Gool, ETH Zurich
2292A Segmentation-aware Deep Fusion Network for Compressed Sensing MRIZhiwen Fan, Xiamen University; Liyan Sun, Xiamen University; Xinghao Ding*, Xiamen University; Yue Huang, Xiamen University; Congbo Cai, Xiamen University; John Paisley, Columbia University
2575Deep Metric Learning with Hierarchical Triplet LossWeifeng Ge*, The University of Hong Kong
2720Generative Adversarial Network with Spatial Attention for Face Attribute EditingGang Zhang*, Institute of Computing Technology, CAS; Meina Kan, Institute of Computing Technology, Chinese Academy of Sciences; Shiguang Shan, Chinese Academy of Sciences; Xilin Chen, China
2553Proxy Clouds for Live RGB-D Stream Processing and ConsolidationAdrien Kaiser*, Telecom ParisTech; Jose Alonso Ybanez Zepeda, Ayotle SAS; Tamy Boubekeur, Paris Telecom
1132Synthetically Supervised Feature Learning for Scene Text RecognitionYang Liu*, University of Cambridge; Zhaowen Wang, Adobe Research; Hailin Jin, Adobe Research; Ian Wassell, University of Cambridge
1963Scale Aggregation Network for Accurate and Efficient Crowd CountingXinkun Cao*, Beijing University of Posts and Telecommunications; Zhipeng Wang, School of Communication and Information Engineering, Beijing University of Posts and Telecommunications; Yanyun Zhao, Beijing Univiersity of Posts and Telecommunications; Fei Su, Beijing University of Posts and Telecommunications
2704PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalitiesLan Wang, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Chenqiang Gao*, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Luyu Yang, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Yue Zhao, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Wangmeng Zuo, Harbin Institute of Technology, China; Deyu Meng, Xi'an Jiaotong University
2775OmniDepth: Dense Depth Estimation for Indoors Spherical Panoramas.NIKOLAOS ZIOULIS*, CERTH / CENTRE FOR RESEARCH AND TECHNOLOGY HELLAS; Antonis Karakottas, CERTH / CENTRE FOR RESEARCH AND TECHNOLOGY HELLAS; Dimitrios Zarpalas, CERTH / CENTRE FOR RESEARCH AND TECHNOLOGY HELLAS; Petros Daras, ITI-CERTH, Greece
917Hashing with Binary Matrix PursuitFatih Cakir*, Boston University; Kun He, Boston University; Stan Sclaroff, Boston University
1142Probabilistic Video Generation using Holistic Attribute ControlJiawei He*, Simon Fraser University; Andreas Lehrmann, Facebook; Joe Marino, California Institute of Technology; Greg Mori, Simon Fraser University; Leonid Sigal, University of British Columbia
860Transductive Semi-Supervised Deep Learning using Min-Max FeaturesWeiwei Shi*, Xi'an Jiaotong University; Yihong Gong, Xi'an Jiaotong University; Chris Ding, UNIVERSITY OF TEXAS AT ARLINGTON; Zhiheng Ma, Xi'an Jiaotong University; Xiaoyu Tao, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University.; Nanning Zheng, Xi'an Jiaotong University
697Deep Feature Pyramid Reconfiguration for Object DetectionTao Kong*, Tsinghua; Fuchun Sun, Tsinghua; Wenbing Huang, Tencent AI Lab; ? ??, ????
2924Quadtree Convolutional Neural NetworksPradeep Kumar Jayaraman*, Nanyang Technological University; Jianhan Mei, Nanyang Technological University; Jianfei Cai, Nanyang Technological University; Jianmin Zheng, Nanyang Technological University
2307Correcting the Triplet Selection Bias for Triplet LossBaosheng Yu*, The University of Sydney; Tongliang Liu, The University of Sydney; Mingming Gong, CMU & U Pitt; Changxing Ding, South China University of Technology; Dacheng Tao, University of Sydney
2709Adversarial Geometry-Aware Human Motion PredictionLiangyan Gui*, Carnegie Mellon University; Yu-Xiong Wang, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University
2BSeptember 11, 04:00 PM
7993D Motion Sensing from 4D Light Field GradientsSizhuo Ma*, University of Wisconsin-Madison; Brandon Smith, University of Wisconsin-Madison; Mohit Gupta, University of Wisconsin-Madison, USA
1573A Trilateral Weighted Sparse Coding Scheme for Real-World Image DenoisingXU JUN, The Hong Kong Polytechnic University; Lei Zhang*, Hong Kong Polytechnic University, Hong Kong, China; D. Zhang, The Hong Kong Polytechnic University
1096Saliency Detection in 360$^\circ$ VideosZiheng Zhang, Shanghaitech University; Yanyu Xu*, Shanghaitech University; Shenghua Gao, Shanghaitech University; Jingyi Yu, Shanghai Tech University
155Learning to Blend PhotosWei-Chih Hung*, University of California, Merced; Jianming Zhang, Adobe Research; Xiaohui Shen, Adobe Research; Zhe Lin, Adobe Research; Joon-Young Lee, Adobe Research; Ming-Hsuan Yang, University of California at Merced
518Escaping from Collapsing Modes in a Constrained SpaceChieh Lin, National Tsing Hua University; Chia-Che Chang, National Tsing Hua University; Che-Rung Lee, National Tsing Hua University; Hwann-Tzong Chen*, National Tsing Hua University
1994Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in ScenesFangneng Zhan, Nanyang Technological University; Shijian Lu*, Nanyang Technological University; Chuhui Xue, Nanyang Technological University
647Layer-structured 3D Scene Inference via View SynthesisShubham Tulsiani*, UC Berkeley; Richard Tucker, Google; Noah Snavely, -
1306Perturbation Robust Representations of Topological Persistence DiagramsAnirudh Som*, Arizona State University; Kowshik Thopalli, Arizona State University; Karthikeyan Natesan Ramamurthy, IBM Research; Vinay Venkataraman, Arizona State University; Ankita Shukla, Indraprastha Institute of Information Technology - Delhi; Pavan Turaga, Arizona State University
612Analyzing Clothing Layer Deformation Statistics of 3D Human MotionsJinlong YANG*, Inria; Jean-Sebastien Franco, INRIA; Franck Hétroy-Wheeler, University of Strasbourg; Stefanie Wuhrer, Inria
2052Neural Nonlinear least Squares with Application to Dense Tracking and MappingRonald Clark*, Imperial College London; Michael Bloesch, Imperial; Jan Czarnowski, Imperial College London; Andrew Davison, Imperial College London; Stefan Leutenegger, Imperial College London
197Propagating LSTM: 3D Pose Estimation based on Joint InterdependencyKyoungoh Lee*, Yonsei University; Inwoong Lee, Yonsei University; Sanghoon Lee, Yonsei University, Korea
1410Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image DehazingDong Yang, Xi'an Jiaotong University; JIAN SUN*, Xi'an Jiaotong University
2126Attend and Rectify: a gated attention mechanism for fine-grained recoveryPau Rodriguez Lopez*, Computer Vision Center, Universitat Autonoma de Barcelona; Guillem Cucurull, Computer Vision Center, Universitat Autonoma de Barcelona; Josep Gonfaus, Computer Vision Center; Jordi Gonzalez, UA Barcelona; Xavier Roca, Computer Vision Center, Universitat Autonoma de Barcelona
918Learning to Capture Light Fields through A Coded Aperture CameraYasutaka Inagaki*, Nagoya University; Yuto Kobayashi, Nagoya University; Keita Takahashi, Nagoya University; Toshiaki Fujii, Nagoya University; Hajime Nagahara, Osaka University
1478AMC: Automated Model Compression and Acceleration with Reinforcement LearningYihui He, Xi'an Jiaotong University; Ji Lin, Tsinghua University; Song Han*, MIT
2102Extreme Network Compression via Filter Group ApproximationBo Peng*, Hikvision Research Institute; Wenming Tan, Hikvision Research Institute; Zheyang Li, Hikvision Research Institute; Shun Zhang, Hikvision Research Institute; Di Xie, Hikvision Research Institute; Shiliang Pu, Hikvision Research Institute
2251Retrospective Encoders for Video SummarizationKe Zhang*, USC; Kristen Grauman, University of Texas; Fei Sha, USC
2142Optimized Quantization for Highly Accurate and Compact DNNsDongqing Zhang, Microsoft Research; Jiaolong Yang*, Microsoft Research Asia (MSRA); Dongqiangzi Ye, Microsoft Research; Gang Hua, Microsoft Cloud and AI
2836Universal Sketch Perceptual GroupingKe LI*, Queen Mary University of London; Kaiyue Pang, Queen Mary University of London; Jifei Song, Queen Mary, University of London; Yi-Zhe Song, Queen Mary University of London; Tao Xiang, Queen Mary, University of London, UK; Timothy Hospedales, Edinburgh University; Honggang Zhang, Beijing University of Posts and Telecommunications
1377Uncertainty Estimates and Multi-Hypotheses Networks for Optical FlowEddy Ilg*, University of Freiburg; Özgün Çiçek, University of Freiburg; Silvio Galesso, University of Freiburg; Aaron Klein, Universität Freiburg; Osama Makansi, University of Freiburg; Frank Hutter, University of Freiburg; Thomas Brox, University of Freiburg
1565Learning 3D Keypoint Descriptors for Non-Rigid Shape MatchingHanyu Wang, NLPR, Institute of Automation, Chinese Academy of Sciences; Jianwei Guo*, NLPR, Institute of Automation, Chinese Academy of Sciences; Yan Dong-Ming, NLPR, CASIA; Weize Quan, NLPR, Institute of Automation, Chinese Academy of Sciences; Xiaopeng Zhang, Institute of Automation, Chinese Academy of Sciences
1077A Joint Sequence Fusion Model for Video Question Answering and RetrievalYoungjae Yu, Seoul National University Vision and Learning Lab; Jongseok Kim, Seoul National University Vision and Learning Lab; Gunhee Kim*, Seoul National University
336Deformable Pose Traversal Convolution for 3D Action and Gesture RecognitionJunwu Weng*, Nanyang Technological University; Mengyuan Liu, Nanyang Technological University; Xudong Jiang, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
1964Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary DataYabin Zhang, South China University of Technology; Tang Hui, South China University of Technology; Kui Jia*, South China University of Technology
3169Stereo relative pose from line and point feature tripletsAlexander Vakhitov*, Skoltech; Victor Lempitsky, Skoltech; Yinqiang Zheng, National Institute of Informatics
22Convolutional Block Attention ModuleSanghyun Woo*, KAIST; Jongchan Park, KAIST; Joon-Young Lee, Adobe Research; In So Kweon, KAIST
817EC-Net: an Edge-aware Point set Consolidation NetworkLequan Yu*, The Chinese University of Hong Kong; Xianzhi Li, The Chinese University of Hong Kong; Chi-Wing Fu, The Chinese University of Hong Kong; Danny Cohen-Or, Tel Aviv University; Pheng-Ann Heng, The Chinese Univsersity of Hong Kong
2267Video Compression through Image InterpolationChao-Yuan Wu*, UT Austin; Nayan Singhal, UT Austin; Philipp Kraehenbuehl, UT Austin
2225Burst Image Deblurring Using Permutation Invariant Convolutional Neural NetworksMiika Aittala*, MIT; Fredo Durand, MIT
347HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised LearningThomas Robert*, LIP6 / Sorbonne Universite; Nicolas Thome, CNAM, Paris; Matthieu Cord, Sorbonne University
2757Structure-from-Motion-Aware PatchMatch for Adaptive Optical Flow EstimationDaniel Maurer*, University of Stuttgart; Nico Marniok, Universität Konstanz; Bastian Goldluecke, University of Konstanz; Andrés Bruhn, University of Stuttgart
2426Joint & Progressive Learning from High-Dimensional Data for Multi-Label ClassificationDanfeng Hong*, Technical University of Munich (TUM); German Aerospace Center (DLR); Naoto Yokoya, RIKEN Center for Advanced Intelligence Project (AIP); Jian Xu, German Aerospace Center (DLR); Xiaoxiang Zhu, DLR&TUM
1416SDC-Net: Video prediction using spatially-displaced convolutionFitsum Reda*, NVIDIA; Guilin Liu, NVIDIA; Kevin Shih, NVIDIA; Robert Kirby, Nvidia; Jon Barker, Nvidia; David Tarjan, Nvidia; Andrew Tao, NVIDIA; Bryan Catanzaro, NVIDIA
1516Encoder-Decoder with Atrous Separable Convolution for Semantic Image SegmentationLiang-Chieh Chen*, Google Inc.; Yukun Zhu, Google Inc.; George Papandreou, Google; Florian Schroff, Google Inc.; Hartwig Adam, Google
1173VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual QuestionsQing Li*, University of Science and Technology of China; Qingyi Tao, Nanyang Techonological University; Shafiq Joty, Nanyang Technological University; Jianfei Cai, Nanyang Technological University; Jiebo Luo, U. Rochester
642Image Super-Resolution Using Very Deep Residual Channel Attention NetworksYulun Zhang*, Northeastern University; Kunpeng Li, Northeastern University; kai li, northeastern university; Lichen Wang, Northeastern University; Bineng Zhong, Huaqiao University; YUN FU, Northeastern University
2930Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery DataTian Feng, University of New South Wales; Quang-Trung Truong, SUTD; Thanh Nguyen*, Deakin University, Australia; Jing Yu Koh, SUTD; Lap-Fai Yu, UMass Boston; Sai-Kit Yeung, Singapore University of Technology and Design; Alexander Binder, Singapore University of Technology and Design
1959Clustering Convolutional Kernels to Compress Deep Neural NetworksSanghyun Son, Seoul National University; Seungjun Nah, Seoul National University; Kyoung Mu Lee*, Seoul National University
129Explainable Neural Computation via Stack Neural Module NetworksRonghang Hu*, University of California, Berkeley; Jacob Andreas, UC Berkeley; Trevor Darrell, UC Berkeley; Kate Saenko, Boston University
2947Quaternion Convolutional Neural NetworksXuanyu Zhu*, Shanghai Jiao Tong University; Yi Xu, Shanghai Jiao Tong University; Hongteng Xu, Duke University; Changjian Chen, Shanghai Jiao Tong University
1140Lip Movements Generation at a GlanceLele Chen*, University of Rochester; Zhiheng Li, WuHan University; Ross Maddox, University of Rochester; Zhiyao Duan, Unversity of Rochester; Chenliang Xu, University of Rochester
1877Toward Scale-Invariance and Position-Sensitive Object Proposal NetworksHsueh-Fu Lu, Umbo Computer Vision; Ping-Lin Chang*, Umbo Computer Vision; Xiaofei Du, Umbo Computer Vision
2254Constraints Matter in Deep Neural Network CompressionChangan Chen, Simon Fraser University; Fred Tung*, Simon Fraser University; Naveen Vedula, Simon Fraser University; Greg Mori, Simon Fraser University
2122MRF Optimization with Separable Convex Prior on Partially Ordered LabelsCsaba Domokos*, Technical University of Munich; Frank Schmidt, BCAI; Daniel Cremers, TUM
157Switchable Temporal Propagation NetworkSifei Liu*, NVIDIA; Ming-Hsuan Yang, University of California at Merced; Guangyu Zhong, Dalian University of Technology; Jinwei Gu, Nvidia; Shalini De Mello, NVIDIA Research; Kautz Jan, NVIDIA; Varun Jampani, Nvidia Research
1457T2Net: Synthetic-to-Realistic Translation for Solving Single-Image Depth Estimation TasksChuanxia Zheng*, Nanyang Technological University; Tat-Jen Cham, Nanyang Technological University; Jianfei Cai, Nanyang Technological University
2115ArticulatedFusion: Real-time Reconstruction of Motion, Geometry and Segmentation Using a Single Depth CameraChao Li*, The University of Texas at Dallas; Zheheng Zhao, The University of Texas at Dallas; Xiaohu Guo, The University of Texas at Dallas
1601NNEval: Neural Network based Evaluation Metric for Image CaptioningNaeha Sharif*, University of Western Australia; Lyndon White, University of Western Australia; Mohammed Bennamoun, University of Western Australia; Syed Afaq Ali Shah, Department of Computer Science and Software Engineering, The University of Western Australia
978Coreset-Based Convolutional Neural Network CompressionAbhimanyu Dubey*, Massachusetts Institute of Technology; Moitreya Chatterjee, University of Illinois at Urbana Champaign; Ramesh Raskar, Massachusetts Institute of Technology; Narendra Ahuja, University of Illinois at Urbana-Champaign, USA
1645Context Refinement for Object DetectionZhe Chen*, University of Sydney; Shaoli Huang, University of Sydney; Dacheng Tao, University of Sydney
658Real-time ‘Actor-Critic’ TrackingBoyu Chen*, Dalian University of Technology; Dong Wang, Dalian University of Technology; Peixia Li, Dalian University of Technology; Huchuan Lu, Dalian University of Technology
1814Partial Adversarial Domain AdaptationZhangjie Cao, Tsinghua University; Lijia Ma, Tsinghua University; Mingsheng Long*, Tsinghua University; Jianmin Wang, Tsinghua University, China
1134Localization Recall Precision (LRP): A New Performance Metric for Object DetectionKemal Oksuz*, Middle East Technical University; Bar?? Can Çam, Roketsan; Emre Akbas, Middle East Technical University; Sinan Kalkan, Middle East Technical University
1389Improving Embedding Generalization via Scalable Neighborhood Component AnalysisZhirong Wu*, UC Berkeley; Alexei Efros, UC Berkeley; Stella Yu, UC Berkeley / ICSI
587Leveraging Motion Priors in Videos for Improving Human SegmentationYu-Ting Chen*, NTHU; Wen-Yen Chang, NTHU; Hai-Lun Lu, NTHU; Tingfan Wu, Umbo Computer Vision; Min Sun, NTHU
618Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image DerainingXia Li*, Peking University Shenzhen Graduate School; Jianlong Wu, Peking University; Zhouchen Lin, Peking University; Hong Liu, Peking University Shenzhen Graduate School; Hongbin Zha, Peking University, China
1246Statistically-motivated Second-order PoolingKaicheng Yu*, EPFL; Mathieu Salzmann, EPFL
1359SegStereo: Exploiting Semantic Information for Disparity EstimationGuorun Yang*, Tsinghua University; Hengshuang Zhao, The Chinese University of Hong Kong; Jianping Shi, Sensetime Group Limited; Jia Jiaya, Chinese University of Hong Kong
1169Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature AggregationTao Song, Hikvision Research Institute; Leiyu Sun, Hikvision Research Institute; Di Xie*, Hikvision Research Institute; Haiming Sun, Hikvision Research Institute; Shiliang Pu, Hikvision Research Institute
2438Object Detection with an Aligned Spatial-Temporal MemoryFanyi Xiao*, University of California Davis; Yong Jae Lee, University of California, Davis
933Learning to Drive with 360° Surround-View Cameras and a MapSimon Hecker*, ETH Zurich; Dengxin Dai, ETH Zurich; Luc Van Gool, ETH Zurich
435Monocular Scene Parsing and Reconstruction using 3D Holistic Scene GrammarSiyuan Huang*, UCLA; Siyuan Qi, UCLA; Yixin Zhu, UCLA; Yinxue Xiao, University of California, Los Angeles; Yuanlu Xu, University of California, Los Angeles; Song-Chun Zhu, UCLA
2458Coded Illumination and Imaging for Fluorescence Based ClassificationYuta Asano, Tokyo Institute of Technology; Misaki Meguro, Tokyo Institute of Technology; Chao Wang, Kyushu Institute of Technology; Antony Lam*, Saitama University; Yinqiang Zheng, National Institute of Informatics; Takahiro Okabe, Kyushu Institute of Technology; Imari Sato, National Institute of Informatics
1770Modality Distillation with Multiple Stream Networks for Action RecognitionNuno Garcia, IIT; Pietro Morerio*, IIT; Vittorio Murino, Istituto Italiano di Tecnologia
1636VideoMatch: Matching based Video Object SegmentationYuan-Ting Hu*, University of Illinois at Urbana-Champaign; Jia-Bin Huang, Virginia Tech; Alexander Schwing, UIUC
801Superpixel Sampling NetworksVarun Jampani*, Nvidia Research; Deqing Sun, NVIDIA; Ming-Yu Liu, NVIDIA; Ming-Hsuan Yang, University of California at Merced; Kautz Jan, NVIDIA
717Deep Bilinear Learning for RGB-D Action RecognitionHU Jian-Fang, Sun Yat-sen University; Jason Wei Shi Zheng*, Sun Yat Sen University; Pan Jiahui, Sun Yat-sen University; Jian-Huang Lai, Sun Yat-sen University; Jianguo Zhang, University of Dundee
1925Multi-object Tracking with Neural Gating using bilinear LSTMsChanho Kim*, Georgia Tech; Fuxin Li, Oregon State University; James Rehg, Georgia Institute of Technology
1772Direct Sparse Odometry With Rolling ShutterDavid Schubert*, Technical University of Munich; Vladyslav Usenko, TU Munich; Nikolaus Demmel, TUM; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM
1423Person Search via A Mask-guided Two-stream CNN ModelDi Chen*, Nanjing University of Science and Techonology; Shanshan Zhang, Max Planck Institute for Informatics; Wanli Ouyang, CUHK; Jian Yang, Nanjing University of Science and Technology; Ying Tai, Tencent
2869Imagine This! Scripts to Compositions to VideosTanmay Gupta*, UIUC; Dustin Schwenk, Allen Institute for Artificial Intelligence; Ali Farhadi, University of Washington; Derek Hoiem, University of Illinois at Urbana-Champaign; Aniruddha Kembhavi, Allen Institute for Artificial Intelligence
196Multiresolution Tree Networks for Point Cloud ProcesingMatheus Gadelha*, University of Massachusetts Amherst; Subhransu Maji, University of Massachusetts, Amherst; Rui Wang, U Massachusetts
2007Quantization Mimic: Towards Very Tiny CNN for Object DetectionYi Wei*, Tsinghua University; Xinyu Pan, MMLAB, CUHK; Hongwei Qin, SenseTime; Junjie Yan, Sensetime; Wanli Ouyang, CUHK
2513Multi-scale Residual Network for Image Super-ResolutionJuncheng Li, East China Normal University; Faming Fang*, East China Normal University; Kangfu Mei, Jiangxi Normal University; Guixu Zhang, East China Normal University
24BodyNet: Volumetric Inference of 3D Human Body ShapesGul Varol*, INRIA; Duygu Ceylan, Adobe Research; Bryan Russell, Adobe Research; Jimei Yang, Adobe; Ersin Yumer, Argo AI; Ivan Laptev, INRIA Paris; Cordelia Schmid, INRIA
8533D Recurrent Neural Networks with Context Fusion for Point Cloud Semantic SegmentationXiaoqing Ye*, SIMIT; Jiamao Li, SIMIT; Hexiao Huang, Shanghai Opening University; Xiaolin Zhang, SIMIT
378Robust Anchor Embedding for Unsupervised Video Re-Identification in the WildMang YE*, Hong Kong Baptist University; Xiangyuan Lan, Department of Computer Science, Hong Kong Baptist University; PongChi Yuen, Department of Computer Science, Hong Kong Baptist University
804Towards Robust Neural Networks via Random Self-ensembleXuanqing Liu, UC Davis Department of Computer Science; Minhao Cheng, University of California, Davis; Huan Zhang, UC Davis; Cho-Jui Hsieh*, UC Davis Department of Computer Science and Statistics
1708SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional FiltersYifan Xu, Tsinghua University; Tianqi Fan, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Mingye Xu, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Long Zeng, Tsinghua University; Yu Qiao*, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
1195CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-drivingXiaodan Liang*, Carnegie Mellon University; Tairui Wang, Petuum Inc; Luona Yang, Carnegie Mellon University; Eric Xing, Petuum Inc.
1379Normalized Blind DeconvolutionMeiguang Jin*, University of Bern; Stefan Roth, TU Darmstadt; Paolo Favaro, Bern University, Switzerland
2298Few-Shot Human Motion Prediction via Meta-LearningLiangyan Gui*, Carnegie Mellon University; Yu-Xiong Wang, Carnegie Mellon University; Deva Ramanan, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University
60Learning to Segment via Cut-and-PasteTal Remez*, Tel-Aviv University; Matthew Brown, Google; Jonathan Huang, Google
1456Weakly-supervised 3D Hand Pose Estimation from Monocular RGB ImagesYujun Cai*, Nanyang Technological University; Liuhao Ge, NTU; Jianfei Cai, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
1903DeepIM: Deep Iterative Matching for 6D Pose EstimationYi Li*, Tsinghua University; Gu Wang, Tsinghua University; Xiangyang Ji, Tsinghua University; Yu Xiang, University of Michigan; Dieter Fox, University of Washington
1403Jointly Discovering Visual Objects and Spoken Words from Raw Sensory InputDavid Harwath*, MIT CSAIL; Adria Recasens, Massachusetts Institute of Technology; Dídac Surís, Universitat Politecnica de Catalunya; Galen Chuang, MIT; Antonio Torralba, MIT; James Glass, MIT
2150A Style-aware Content Loss for Real-time HD Style TransferArtsiom Sanakoyeu*, Heidelberg University; Dmytro Kotovenko, Heidelberg University; Bjorn Ommer, Heidelberg University
2662Implicit 3D Orientation Learning for 6D Object Detection from RGB ImagesMartin Sundermeyer*, German Aerospace Center (DLR); Zoltan Marton, DLR; Maximilian Durner, DLR; Rudolph Triebel, German Aerospace Center (DLR)
2174Scale-Awareness of Light Field Camera based Visual OdometryNiclas Zeller*, Karlsruhe University of Applied Sciences; Franz Quint, Karlsruhe University of Applied Sciences; Uwe Stilla, Technische Universitaet Muenchen
121Audio-Visual Scene Analysis with Self-Supervised Multisensory FeaturesAndrew Owens*, UC Berkeley; Alexei Efros, UC Berkeley
3ASeptember 12, 10:00 AM
2104Efficient Sliding Window Computation for NN-Based Template MatchingLior Talker*, Haifa University; Yael Moses, IDC, Israel; Ilan Shimshoni, University of Haifa
1530Active Stereo Net: End-to-End Self-Supervised Learning for Active Stereo SystemsYinda Zhang*, Princeton University; Sean Fanello, Google; Sameh Khamis, Google; Christoph Rhemann, Google; Julien Valentin, Google; Adarsh Kowdle, Google; Vladimir Tankovich, Google; Shahram Izadi, Google; Thomas Funkhouser, Princeton, USA
1591GAL: Geometric Adversarial Loss for Single-View 3D-Object ReconstructionLi Jiang*, The Chinese University of Hong Kong; Xiaojuan Qi, CUHK; Shaoshuai SHI, The Chinese University of Hong Kong; Jia Jiaya, Chinese University of Hong Kong
926Learning to Reconstruct High-quality 3D Shapes with Cascaded Fully Convolutional NetworksYan-Pei Cao*, Tsinghua University; Zheng-Ning Liu, Tsinghua University; Zheng-Fei Kuang, Tsinghua University; Shi-Min Hu, Tsinghua University
1069Deep Reinforcement Learning with Iterative Shift for Visual TrackingLiangliang Ren, Tsinghua University; Xin Yuan, Tsinghua University; Jiwen Lu*, Tsinghua University; Ming Yang, Horizon Robotics; Jie Zhou, Tsinghua University, China
2333CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of MapsPaul Hongsuck Seo*, POSTECH; Tobias Weyand, Google Inc.; Jack Sim, Google LLC; Bohyung Han, Seoul National University
1341Bayesian Instance Segmentation in Open Set WorldTrung Pham*, NVIDIA; Vijay Kumar B G, University of Adelaide; Thanh-Toan Do, The University of Adelaide; Gustavo Carneiro, University of Adelaide; Ian Reid, University of Adelaide, Australia
1682Characterizing Adversarial Examples Based on Spatial Consistency Information for Semantic SegmentationChaowei Xiao, University of Michigan, Ann Arbor; Ruizhi Deng, Simon Fraser University; Bo Li*, University of Illinois at Urbana–Champaign and UC Berkeley; Fisher Yu, UC Berkeley;
Mingyan Liu, University of Michigan, Ann Arbor; Dawn Song, UC Berkeley
1388CubeNet: Equivariance to 3D Rotation and TranslationDaniel Worrall*, UCL; Gabriel Brostow, University College London
23133D Face Reconstruction from Light Field Images: A Model-free ApproachMingtao Feng, Hunan Unversity; Syed Zulqarnain Gilani*, The University of Western Australia; Yaonan Wang, Hunan University; Ajmal Mian, University of Western Australia
1553stagNet: An Attentive Semantic RNN for Group Activity RecognitionMengshi Qi*, Beihang University; Jie Qin, ETH Zurich; Annan Li, Beijing University of Aeronautics and Astronautics; Yunhong Wang, State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China; Jiebo Luo, U. Rochester; Luc Van Gool, ETH Zurich
3047Supervising the new with the old: learning SFM from SFMMaria Klodt*, University of Oxford; Andrea Vedaldi, Oxford University
376PSANet: Point-wise Spatial Attention Network for Scene ParsingHengshuang Zhao*, The Chinese University of Hong Kong; Yi ZHANG, The Chinese University of Hong Kong; Shu Liu, CUHK; Jianping Shi, Sensetime Group Limited; Chen Change Loy, Chinese University of Hong Kong; Dahua Lin, The Chinese University of Hong Kong; Jia Jiaya, Chinese University of Hong Kong
2229FishEyeRecNet: A Multi-Context Collaborative Deep Network for Fisheye Image Recti_x000c_cationXiaoqing Yin*, University of Sydney; Xinchao Wang, Stevens Institute of Technology; Jun Yu, HDU; Maojun Zhang, National University of Defense Technology, China; Pascal Fua, EPFL, Switzerland; Dacheng Tao, University of Sydney
1583ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face AttributesTaihong Xiao*, Peking University; Jiapeng Hong, Peking University; Jinwen Ma, Peking University
2728Deep Bilevel LearningSimon Jenni*, Universität Bern; Paolo Favaro, Bern University, Switzerland
2131ADVIO: An Authentic Dataset for Visual-Inertial OdometrySantiago Cortes, Aalto University; Arno Solin*, Aalto University; Esa Rahtu, Tampere University of Technology; Juho Kannala, Aalto University, Finland
2434D2S: Densely Segmented Supermarket DatasetPatrick Follmann*, MVTec Software GmbH; Tobias Böttger, MVTec Software GmbH; Philipp Härtinger, MVTec Software GmbH; Rebecca König, MVTec Software GmbH
1317PyramidBox: A Context-assisted Single Shot Face DetectorXu Tang, Baidu; Daniel Du*, Baidu; Zeqiang He, Baidu; jingtuo liu, baidu
499Structured Siamese Network for Real-Time Visual TrackingYunhua Zhang, Dalian University of Technology; Lijun Wang, Dalian University of Technology; Dong Wang, Dalian University of Technology; Mengyang Feng, Dalian University of Technology; Huchuan Lu*, Dalian University of Technology; Jinqing Qi, Dalian University of Technology
1088Probabilistic Signed Distance Function for On-the-fly Scene ReconstructionWei Dong*, Peking University; Qiuyuan Wang, Peking University; Xin Wang, Peking University; Hongbin Zha, Peking University, China
13563D Vehicle Trajectory Reconstruction in Monocular Video Data Using Environment Structure ConstraintsSebastian Bullinger*, Fraunhofer IOSB; Christoph Bodensteiner, Fraunhofer IOSB; Michael Arens, Fraunhofer IOSB; Rainer Stiefelhagen, Karlsruhe Institute of Technology
249Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial NetworksMinjun Li*, Fudan University; Haozhi Huang, Tencent AI Lab; Lin Ma, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab; Yu-Gang Jiang, Fudan University
1027Pose-Normalized Image Generation for Person Re-identificationXuelin Qian, Fudan University; Yanwei Fu*, Fudan Univ.; Tao Xiang, Queen Mary, University of London, UK; Wenxuan Wang, Fudan University; Jie Qiu, Nara Institute of Science and Technology; Yang Wu, Nara Institute of Science and Technology; Yu-Gang Jiang, Fudan University; Xiangyang Xue, Fudan University
1944Action Anticipation with RBF Kernelized Feature Mapping RNNYuge Shi*, Australian National University; Basura Fernando, Australian National University; RICHARD HARTLEY, Australian National University, Australia
48Rendering Portraitures from Monocular Camera and BeyondXiangyu Xu*, Tsinghua University; Deqing Sun, NVIDIA; Sifei Liu, NVIDIA; Wenqi Ren, Institute of Information Engineering, Chinese Academy of Sciences; Yu-Jin Zhang, Tsinghua University; Ming-Hsuan Yang, University of California at Merced; Jian Sun, Megvii, Face++
1511Recovering 3D Planes from a Single Image via Convolutional Neural NetworksFengting Yang*, Pennsylvania State University ; Zihan Zhou, Penn State University
1201The Devil of Face Recognition is in the NoiseLiren Chen*, Sensetime Group Limited; Fei Wang, SenseTime; Cheng Li, SenseTime Research; Shiyao Huang, SenseTime Co Ltd; Yanjie Chen, sensetime; Chen Qian, SenseTime; Chen Change Loy, Chinese University of Hong Kong
22073DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene SegmentationAngela Dai*, Stanford University; Matthias Niessner, Technical University of Munich
2781Joint optimization for compressive video sensing and reconstruction under hardware constraintsMichitaka Yoshida*, Kyushu University; Akihiko Torii, Tokyo Institute of Technology, Japan; Masatoshi Okutomi, Tokyo Institute of Technology; Kenta Endo, Hamamatsu Photonics K. K.; Yukinobu Sugiyama, Hamamatsu Photonics K. K.; Hajime Nagahara, Osaka University
844Consensus-Driven Propagation in Massive Unlabeled Data for Face RecognitionXiaohang Zhan*, The Chinese University of Hong Kong; Ziwei Liu, The Chinese University of Hong Kong; Junjie Yan, Sensetime Group Limited; Dahua Lin, The Chinese University of Hong Kong; Chen Change Loy, Chinese University of Hong Kong
2642Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving CameraTimo von Marcard*, University of Hannover; Roberto Henschel, Leibniz University of Hannover; Michael Black, Max Planck Institute for Intelligent Systems; Bodo Rosenhahn, Leibniz University Hannover; Gerard Pons-Moll, MPII, Germany
883Predicting Future Instance Segmentations by Forecasting Convolutional FeaturesPauline Luc*, Facebook AI Research; Camille Couprie, Facebook; yann lecun, Facebook; Jakob Verbeek, INRIA
30PS-FCN: A Flexible Learning Framework for Photometric StereoGuanying Chen*, The University of Hong Kong; Kai Han, University of Oxford; Kwan-Yee Wong, The University of Hong Kong
2053Unsupervised Class-Specific DeblurringNimisha T M*, Indian Institute of Technology Madras; Sunil Kumar, Indian Institute of Technology Madras; Rajagopalan Ambasamudram, Indian Institute of Technology Madras
324Face Super-resolution Guided by Facial Component HeatmapsXin Yu*, Australian National University; Basura Fernando, Australian National University; Bernard Ghanem, KAUST; Fatih Porikli, ANU; RICHARD HARTLEY, Australian National University, Australia
1988A Contrario Horizon-First Vanishing Point Detection Using Second-Order Grouping LawsGilles Simon*, Université de Lorraine; Antoine Fond, Université de Lorraine; Marie-Odile Berger, INRIA
1742Fast, Accurate, and, Lightweight Super-Resolution with Cascading Residual NetworkNamhyuk Ahn, Ajou University; Byungkon Kang, Ajou University; Kyung-Ah Sohn*, Ajou University
202Face Recognition with Contrastive ConvolutionChunrui Han*, ICT, Chinese Academy of Sciences, China; Shiguang Shan, Chinese Academy of Sciences; Meina Kan, ICT, CAS; Shuzhe Wu, Chinese Academy of Sciences; xilin chen, ICT, Chinese Academy of Sciences, China
2813Deforming Autoencoders: Unsupervised Disentangling of Shape and AppearanceZhixin Shu*, Stony Brook University; Mihir Sahasrabudhe, CentraleSupelec; Alp Guler, INRIA; Dimitris Samaras, Stony Brook University; Nikos Paragios, Therapanacea; Iasonas Kokkinos , UCL
1867NetAdapt: Platform-Aware Neural Network Adaptation for Mobile ApplicationsTien-Ju Yang*, Massachusetts Institute of Technology; Andrew Howard, Google; Bo Chen, Google; Xiao Zhang, Google; Alec Go, Google; Vivienne Sze, Massachusetts Institute of Technology; Hartwig Adam, Google
1817ExFuse: Enhancing Feature Fusion for Semantic SegmentationZhenli Zhang*, Fudan University; Xiangyu Zhang, Megvii Inc; Chao Peng, Megvii(Face++) Inc; Jian Sun, Megvii, Face++
1104AugGAN: Cross Domain Adaptation with GAN-based Data AugmentationSheng-Wei Huang, National Tsing Hua University; Che-Tsung Lin*, National Tsing Hua University; Shu-Ping Chen, National Tsing Hua University; Yen-Yi Wu, NTHU CS; Po-Hao Hsu, National Tsing Hua University; Shang-Hong Lai , National Tsing Hua University
2272LAPCSR:A Deep Laplacian Pyramid Generative Adversarial Network for Scalable Compressive Sensing ReconstructionKai Xu*, Arizona State University; Zhikang Zhang, Arizona State University; Fengbo Ren, Arizona State University
2570U-PC: Unsupervised Planogram ComplianceArchan Ray, University of Massachusetts Amherst; Nishant Kumar, SMART-FM; Avishek Shaw*, Tata Consultancy Services Limited; Dipti Prasad Mukherjee, ISI, Kolkata
1157Seeing Tree Structure from VibrationTianfan Xue, MIT; Jiajun Wu*, MIT; Zhoutong Zhang, MIT; Chengkai Zhang, MIT; Joshua Tenenbaum, MIT; Bill Freeman, MIT
944A Dataset of Flash and Ambient Illumination Pairs from the CrowdYagiz Aksoy*, ETH Zurich; Changil Kim, MIT CSAIL; Petr Kellnhofer, MIT; Sylvain Paris, Adobe Research; Mohamed A. Elghareb, Qatar Computing Research Institute; Marc Pollefeys, ETH Zurich; Wojciech Matusik, MIT
404Compressing the Input for CNNs with the First-Order Scattering TransformEdouard Oyallon*, CentraleSupélec; Eugene Belilovsky, Inria Galen / KU Leuven; Sergey Zagoruyko, Inria; Michal Valko, Inria
188Distractor-aware Siamese Networks for Visual Object TrackingZheng Zhu*, CASIA; Qiang Wang, University of Chinese Academy of Sciences; Bo Li, sensetime; Wu Wei, Sensetime; Junjie Yan, Sensetime Group Limited
2329"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention"Tianlang Chen*, University of Rochester; Zhongping Zhang, University of Rochester; Quanzeng You, Microsoft; CHEN FANG, Adobe Research, San Jose, CA; Zhaowen Wang, Adobe Research; Hailin Jin, Adobe Research; Jiebo Luo, U. Rochester
3155Constrained Optimization Based Low-Rank Approximation of Deep Neural NetworksChong Li*, University of Washington; C.J. Richard Shi, University of Washington
2162Extending Layered Models to 3D MotionDong Lao, KAUST; Ganesh Sundaramoorthi*, Kaust
2889ExplainGAN: Model Explanation via Decision Boundary Crossing TransformationsNathan Silberman*, Butterfly Network; Pouya Samangouei, Butterfly Network; Liam Nakagawa, Butterfly Network; Ardavan Saeedi, Butterfly Network Inc
222Adding Attentiveness to the Neurons in Recurrent Neural NetworksPengfei Zhang, Xi'an Jiaotong University; Jianru Xue, Xi'an Jiaotong University; Cuiling Lan*, Microsoft Research; Wenjun Zeng, Microsoft Research; Zhanning Gao, Xi'an Jiaotong University; Nanning Zheng, Xi'an Jiaotong University
2342ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic SegmentationSachin Mehta*, University of Washington; Mohammad Rastegari, Allen Institute for Artificial Intelligence; Anat Caspi, University of Washington; Linda Shapiro, University of Washington; Hannaneh Hajishirzi, University of Washington
562Learning Human-Object Interactions by Graph Parsing Neural NetworksSiyuan Qi*, UCLA; Wenguan Wang, Beijing Institute of Technology; Baoxiong Jia, UCLA; Jianbing Shen, Beijing Institute of Technology; Song-Chun Zhu, UCLA
1355PESTO: 6D Object Pose Estimation BenchmarkTomas Hodan*, Czech Technical University in Prague; Frank Michel, Technical University Dresden; Eric Brachmann, TU Dresden; Wadim Kehl, Toyota Research Institute; Anders Buch, University of Southern Denmark; Dirk Kraft, Syddansk Universitet; Bertram Drost, MVTec Software GmbH; Joel Vidal, National Taiwan University of Science and Technology; Stephan Ihrke , Fraunhofer ivi ; Xenophon Zabulis, FORTH; Caner Sahin, Imperial College London; Fabian Manhardt, TU Munich; Federico Tombari, Technical University of Munich, Germany; Tae-Kyun Kim, Imperial College London; Jiri Matas, CMP CTU FEE; Carsten Rother, University of Heidelberg
138RCAA: Relational Context-Aware Agents for Person SearchXiaojun Chang*, Carnegie Mellon University; Po-Yao Huang, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; Yi Yang, UTS; Alexander Hauptmann, Carnegie Mellon University
453DetNet: Design Backbone for Object DetectionZeming Li*, Tsinghua University;Megvii Inc; Chao Peng, Megvii(Face++) Inc; Gang Yu, Face++; Yangdong Deng, Tsinghua University; Xiangyu Zhang, Megvii Inc; Jian Sun, Megvii, Face++
703Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial OdometryYonggen Ling*, Tencent AI Lab; Linchao Bao, Tencent AI Lab; Zequn Jie, Tencent AI Lab; Fengming Zhu, Tencent AI Lab; Ziyang Li, Tencent AI Lab; Shanmin Tang, Tencent AI Lab; YongSheng Liu, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
1417Exploiting temporal information for 3D human pose estimationMir Rayat Imtiaz Hossain*, University of British Columbia; Jim Little, University of British Columbia, Canada
777Joint Representation and Truncated Inference Learning for Correlation Filter based TrackingYingjie Yao, Harbin Institute of technology; Xiaohe Wu, Harbin Institute of technology; Lei Zhang, University of Pittsburgh; Shiguang Shan, Chinese Academy of Sciences; Wangmeng Zuo*, Harbin Institute of Technology, China
94Learning to Zoom: a Saliency-Based Sampling Layer for Neural NetworksAdria Recasens*, Massachusetts Institute of Technology; Petr Kellnhofer, MIT; Simon Stent, Toyota Research Institute; Wojciech Matusik, MIT; Antonio Torralba, MIT
2942Does Haze Removal Help Image Classification?Yanting Pei*, Beijing Jiaotong University; Yaping Huang, Beijing Jiaotong University; Qi Zou, Beijing Jiaotong University; Yuhang Lu, University of South Carolina; Song Wang, University of South Carolina
247Learning Local Descriptors by Integrating Geometry ConstraintsZixin Luo*, HKUST; Tianwei Shen, HKUST; Lei Zhou, HKUST; Siyu Zhu, HKUST; Runze Zhang, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
395Repeatability Is Not Enough: Learning Affine Regions via DiscriminabilityDmytro Mishkin*, Czech Technical University in Prague; Filip Radenovic, Visual Recognition Group, CTU Prague; Jiri Matas, CMP CTU FEE
597Macro-Micro Adversarial Network for Human ParsingYawei Luo*, University of Technology Sydney; Zhedong Zheng, University of Technology Sydney; Liang Zheng, University of Technology Sydney; Yi Yang, UTS
1570Learning Class Prototypes via Structure Alignment for Zero-Shot RecognitionHuajie Jiang, ICT, CAS; Ruiping Wang*, ICT, CAS; Shiguang Shan, Chinese Academy of Sciences; Xilin Chen, China
743SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional ImagesBenjamin Coors*, MPI Intelligent Systems, Bosch; Alexandru Condurache, Bosch; Andreas Geiger, MPI-IS and University of Tuebingen
3075A dataset and architecture for visual reasoning with a working memoryGuangyu Robert Yang*, Columbia University; Igor Ganichev, Google Brain; Xiao-Jing Wang, New York University; Jon Shlens, Google; David Sussillo, Google Brain
896Flow-Grounded Spatial-Temporal Video Prediction from Still ImagesYijun Li*, University of California, Merced; CHEN FANG, Adobe Research, San Jose, CA; Jimei Yang, Adobe; Zhaowen Wang, Adobe Research; Xin Lu, Adobe; Ming-Hsuan Yang, University of California at Merced
2068The Unmanned Aerial Vehicle Benchmark: Object Detection and TrackingDawei Du*, University of Chinese Academy of Sciences; Yuankai Qi, Harbin Institute of Technology; Hongyang Yu, Harbin Institute of Technology; Yifang Yang, University of Chinese Academy of Sciences; Kaiwen Duan, University of Chinese Academy of Sciences; guorong Li, CAS; Weigang Zhang, Harbin Institute of Technology, Weihai; Qingming Huang, University of Chinese Academy of Sciences; Qi Tian , The University of Texas at San Antonio
657Selective Zero-Shot Classification with Augmented AttributesJie Song, College of Computer Science and Technology, Zhejiang University; Chengchao Shen, Zhejiang University; Jie Lei, Zhejiang University; An-Xiang Zeng, Alibaba; Kairi Ou, Alibaba; Dacheng Tao, University of Sydney; Mingli Song*, Zhejiang University
357Action Search: Spotting Actions in Videos and Its Application to Temporal Action LocalizationHumam Alwassel*, KAUST; Fabian Caba, KAUST; Bernard Ghanem, KAUST
729A Principled Approach to Hard Triplet Generation via Adversarial NetsYiru Zhao*, Shanghai Jiao Tong University; Zhongming Jin, Alibaba Group; Guo-Jun Qi, University of Central Florida; Hongtao Lu, Shanghai Jiao Tong University; Xian-Sheng Hua, Alibaba Group
1656Pose Guided Human Video GenerationCeyuan Yang*, SenseTime Group Limited; Zhe Wang, Sensetime Group Limited; Xinge Zhu, Sensetime Group Limited; Chen Huang, Carnegie Mellon University; Jianping Shi, Sensetime Group Limited; Dahua Lin, The Chinese University of Hong Kong
770Deep Directional Statistics: Pose Estimation with Uncertainty QuantificationSergey Prokudin*, Max Planck Institute for Intelligent Systems; Sebastian Nowozin, Microsoft Research Cambridge; Peter Gehler, Amazon
1039Learning 3D Human Pose from Structure and MotionRishabh Dabral*, IIT Bombay; Anurag Mundhada, IIT Bombay; Abhishek Sharma, Gobasco AI Labs
231Learning Dynamic Memory Networks for Object TrackingTianyu Yang*, City University of Hong Kong; Antoni Chan, City University of Hong Kong, Hong, Kong
409Faces as Lighting Probes via Unsupervised Deep Highlight ExtractionRenjiao Yi*, Simon Fraser University; Chenyang Zhu, Simon Fraser University; Ping Tan, Simon Fraser University; Stephen Lin, Microsoft Research
1572CurriculumNet: Learning from Large-Scale Web Images without Human AnnotationSheng Guo*, Malong Technologies; Weilin Huang, Malong Technologies; Haozhi Zhang, Malong Technologies
1741Joint Task-Recursive Learning for Semantic Segmentation and Depth EstimationZhenyu Zhang*, Nanjing University of Sci & Tech; Zhen Cui, Nanjing University of Science and Technology; Zequn Jie, Tencent AI Lab; Xiang Li, NJUST; Chunyan Xu, Nanjing University of Science and Technology; Jian Yang, Nanjing University of Science and Technology
555HybridFusion: Real-Time Performance Capture Using a Single Depth Sensor and Sparse IMUsZerong Zheng*, Tsinghua University; Tao Yu, Beihang University; Hao Li, Pinscreen/University of Southern California/USC ICT; Kaiwen Guo, Google Inc.; Qionghai Dai, Tsinghua University; Lu Fang, Tsinghua University; Yebin Liu, Tsinghua University
501Associating Inter-Image Salient Instances for Weakly Supervised Semantic SegmentationRuochen Fan*, Tsinghua University; Qibin Hou, Nankai University; Ming-Ming Cheng, Nankai University; Gang Yu, Face++; Ralph Martin, Cardiff University; Shimin Hu, Tsinghua University
42Ask, Acquire and Attack: Data-free UAP generation using Class impressionsKonda Reddy Mopuri*, Indian Institute of Science, Bangalore; Phani Krishna Uppala, Indian Institute of Science; Venkatesh Babu RADHAKRISHNAN, Indian Institute of Science
133A Scalable Exemplar-based Subspace Clustering Algorithm for Class-Imbalanced DataChong You*, Johns Hopkins University; Chi Li, Johns Hopkins University; Daniel Robinson, Johns Hopkins University; Rene Vidal, Johns Hopkins University
309Find and Focus: Retrieve and Localize Video Events with Natural Language QueriesDian SHAO*, The Chinese University of Hong Kong; Yu Xiong, The Chinese University of HK; Yue Zhao, The Chinese University of Hong Kong; Qingqiu Huang, CUHK; Yu Qiao, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Dahua Lin, The Chinese University of Hong Kong
1117Graininess-Aware Deep Feature Learning for Pedestrian DetectionChunze Lin, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
885MVSNet: Depth Inference for Unstructured Multi-view StereoYao Yao*, The Hong Kong University of Science and Technology; Zixin Luo, HKUST; Shiwei Li, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
184PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D RegistrationYifei Shi, Princeton University; Kai Xu, Princeton University and National University of Defense Technology; Matthias Niessner, Technical University of Munich; Szymon Rusinkiewicz, Princeton University; Thomas Funkhouser*, Princeton, USA
1811Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse OdometryNan Yang*, Technical University of Munich; Rui Wang, Technical University of Munich; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM
3BSeptember 12, 02:30 PM
2463GANimation: Anatomically-aware Facial Animation from a Single ImageAlbert Pumarola*, Institut de Robotica i Informatica Industrial; Antonio Agudo, Institut de Robotica i Informatica Industrial, CSIC-UPC; Aleix Martinez, The Ohio State University; Alberto Sanfeliu, Industrial Robotics Institute; Francesc Moreno, IRI
366Unsupervised Geometry-Aware Representation for 3D Human Pose EstimationHelge Rhodin*, EPFL; Mathieu Salzmann, EPFL; Pascal Fua, EPFL, Switzerland
2039Efficient Semantic Scene Completion Network with Spatial Group ConvolutionJiahui Zhang*, Tsinghua University; Hao Zhao, Intel Labs China; Anbang Yao, Intel Labs China; Yurong Chen, Intel Labs China; Hongen Liao, Tsinghua University
2147Deep Autoencoder for Combined Human Pose Estimation and Body Model UpscalingMatthew Trumble*, University of Surrey; Andrew Gilbert, University of Surrey; John Collomosse, Adobe Research; Adrian Hilton, University of Surrey
1929Highly-Economized Multi-View Binary Compression for Scalable Image ClusteringZheng Zhang*, Harbin Institute of Technology Shenzhen Graduate School; Li Liu, the inception institute of artificial intelligence; Jie Qin, ETH Zurich; Fan Zhu, the inception institute of artificial intelligence ; Fumin Shen, UESTC; Yong Xu, Harbin Institute of Technology Shenzhen Graduate School; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC)
2441Asynchronous, Photometric Feature Tracking using Events and FramesDaniel Gehrig, University of Zurich; Henri Rebecq*, University of Zurich; Guillermo Gallego, University of Zurich; Davide Scaramuzza, University of Zurich& ETH Zurich, Switzerland
685Deterministic Consensus Maximization with Biconvex ProgrammingZhipeng Cai*, The University of Adelaide; Tat-Jun Chin, University of Adelaide; Huu Le, University of Adelaide; David Suter, University of Adelaide
148Depth-aware CNN for RGB-D SegmentationWeiyue Wang*, USC; Ulrich Neumann, USC
2096Object Detection in Video with Spatiotemporal Sampling NetworksGedas Bertasius*, University of Pennsylvania; Lorenzo Torresani, Dartmouth College; Jianbo Shi, University of Pennsylvania
955Dependency-aware Attention Control for Unconstrained Face Recognition with Image SetsXiaofeng Liu*, Carnegie Mellon University; B. V. K. Vijaya Kumar, CMU, USA; Chao Yang, University of Southern California; Qingming Tang, TTIC; Jane You, The Hong Kong Polytechnic University
2840License Plate Detection and Recognition in Unconstrained ScenariosSérgio Silva*, UFRGS; Claudio Jung, UFRGS
1740Revisiting the Inverted Indices for Billion-Scale Approximate Nearest NeighborsDmitry Baranchuk*, MSU / Yandex; Artem Babenko, MIPT/Yandex; Yury Malkov, NTechLab
659Zero-Annotation Object Detection with Web Knowledge TransferQingyi Tao*, Nanyang Techonological University; Hao Yang, NTU; Jianfei Cai, Nanyang Technological University
441Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable ModelBaris Gecer*, Imperial College London; Binod Bhattarai, Imperial College London; Josef Kittler, University of Surrey, UK; Tae-Kyun Kim, Imperial College London
3124Improving Shape Deformation in Unsupervised Image-to-Image TranslationAaron Gokaslan*, Brown University; Vivek Ramanujan, Brown University; Daniel Ritchie, Brown University; Kwang In Kim, University of Bath; James Tompkin, Brown University
97K-convexity shape priors for segmentationHossam Isack*, UWO; Lena Gorelick, University of Western Ontario; Karin nG, University of Western Ontario; Olga Veksler, University of Western Ontario; Yuri Boykov, University of Waterloo
2546Visual Question Generation for Class Acquisition of Unknown ObjectsKohei Uehara*, The University of Tokyo; Antonio Tejero-de-Pablos, The University of Tokyo; Yoshitaka Ushiku, The University of Tokyo; Tatsuya Harada, The University of Tokyo
1970Sampling Algebraic Varieties for Robust Camera AutocalibrationDanda Pani Paudel*, ETH Zürich; Luc Van Gool, ETH Zurich
142Hand Pose Estimation via Latent 2.5D Heatmap RegressionUmar Iqbal*, University of Bonn; Pavlo Molchanov, NVIDIA; Thomas Breuel, NVIDIA; Jürgen Gall, University of Bonn; Kautz Jan, NVIDIA
468Single-view Hair Reconstruction using Convolutional NetworkYi Zhou*, University of Southern California; Jun Xing, Institute for Creative Technologies, USC; Liwen Hu, University of Southern California; Weikai Chen, USC Institute for Creative Technology; Hao Li, Pinscreen/University of Southern California/USC ICT; Han-Wei Kung, University of California, Santa Barbara
318Super-Identity Convolutional Neural Network for Face HallucinationKaipeng Zhang*, National Taiwan University; ZHANPENG ZHANG, SenseTime Group Limited; Chia-Wen Cheng, UT Austin; Winston Hsu, National Taiwan University; Yu Qiao, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
707Receptive Field Block Net for Accurate and Fast Object DetectionSongtao Liu, BUAA; Di Huang*, Beihang University, China; Yunhong Wang, State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China
1443Interpretable Intuitive Physics ModelTian Ye*, Carnegie Mellon University; Xiaolong Wang, CMU; James Davidson, Google; Abhinav Gupta, CMU
1020Variable Ring Light Imaging: Capturing Transient Subsurface Scattering with An Ordinary CameraKo Nishino*, Kyoto University; Art Subpa-asa, Tokyo Institute of Technology; Yuta Asano, Tokyo Institute of Technology; Mihoko Shimano, National Institute of Informatics; Imari Sato, National Institute of Informatics
2520Facial Dynamics Interpreter Network: What are the Important Relations between Local Dynamics for Facial Trait Estimation?Seong Tae Kim*, KAIST; Yong Man Ro, KAIST
2487Text2Colors: Guiding Image Colorization through Text-Driven Palette GenerationWonwoong Cho, Korea University; Hyojin Bahng, Korea University; David Park, Korea University; Seungjoo Yoo, Korea University; Ziming Wu, Hong Kong University of Science and Technology; Xiaojuan MA, Hong Kong University of Science and Technology; Jaegul Choo*, Korea University
1737Sparsely Aggregated Convolutional NetworksLigeng Zhu*, Simon Fraser University; Ruizhi Deng, Simon Fraser University; Michael Maire, Toyota Technological Institute at Chicago; Zhiwei Deng, Simon Fraser University; Greg Mori, Simon Fraser University; Ping Tan, Simon Fraser University
1365Deep Attention Neural Tensor Network for Visual Question AnsweringYalong Bai*, Harbin Institute of Technology; Jianlong Fu, Microsoft Research; Tao Mei, JD.com
1863Diverse feature visualizations reveal invariances in early layers of deep neural networksSantiago Cadena*, University of Tübingen; Marissa Weis, University of Tübingen; Leon A. Gatys, University of Tuebingen; Matthias Bethge, University of Tübingen; Alexander Ecker, University of Tübingen
2317Sidekick Policy Learning for Active Visual ExplorationSanthosh Kumar Ramakrishnan*, University of Texas at Austin; Kristen Grauman, University of Texas
936DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural ArchitecturesJin-Dong Dong*, National Tsing-Hua University; An-Chieh Cheng, National Tsing-Hua University; Da-Cheng Juan, Google; Wei Wei, Google; Min Sun, NTHU
99Pixel2Mesh: Generating 3D Mesh Models from Single RGB ImagesNanyang Wang, Fudan University; Yinda Zhang*, Princeton University; Zhuwen Li, Intel Labs; Yanwei Fu, Fudan Univ.; Wei Liu, Tencent AI Lab; Yu-Gang Jiang, Fudan University
1875End-to-End Incremental LearningFrancisco M. Castro*, University of Málaga; Manuel J. Marín-Jiménez, University of Córdoba; Nicolás Guil, University of Málaga; Cordelia Schmid, INRIA; Karteek Alahari, Inria
178CAR-Net: Clairvoyant Attentive Recurrent NetworkAmir Sadeghian*, Stanford; Maxime Voisin, Stanford University; Ferdinand Legros, Stanford University; Ricky Vesel, Race Optimal; Alexandre Alahi, EPFL; Silvio Savarese, Stanford University
1236Learning Data Terms for Image DeblurringJiangxin Dong*, Dalian University of Technology; Jinshan Pan, Dalian University of Technology; Deqing Sun, NVIDIA; Zhixun Su, Dalian University of Technology; Ming-Hsuan Yang, University of California at Merced
116Image Inpainting for Irregular Holes Using Partial ConvolutionsGuilin Liu*, NVIDIA; Fitsum Reda, NVIDIA; Kevin Shih, NVIDIA; Ting-Chun Wang, NVIDIA; Andrew Tao, NVIDIA; Bryan Catanzaro, NVIDIA
1506SRDA: Generating Instance Segmentation Annotation Via Scanning, Reasoning And Domain AdaptionWenqiang Xu, Shanghai Jiaotong University; Yonglu Li, Shanghai Jiao Tong University; Jun Lv, SJTU; Cewu Lu*, Shanghai Jiao Tong Univercity
2067Learning Priors for Semantic 3D ReconstructionIan Cherabier*, ETH Zurich; Johannes Schoenberger, ETH Zurich; Martin R. Oswald, ETH Zurich; Marc Pollefeys, ETH Zurich; Andreas Geiger, MPI-IS and University of Tuebingen
526Integrating Egocentric Videos in Top-view Surveillance Videos: Joint Identification and Temporal AlignmentShervin Ardeshir*, University of Central Florida; Ali Borji, University of Central Florida
61Deep Boosting for Image DenoisingChang Chen, University of Science and Technology of China; Zhiwei Xiong*, University of Science and Technology of China; Xinmei Tian, USTC; Feng Wu, University of Science and Technology of China
2726Descending, lifting or smoothing: Secrets of robust cost optimizationChristopher Zach*, Toshiba Research; Guillaume Bourmaud, University of Bordeaux
757MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual NetworkMuhammed Kocabas*, Middle East Technical University; Salih Karagoz, Middle East Technical University; Emre Akbas, Middle East Technical University
779TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object DetectionYunchao Wei*, UIUC; Zhiqiang Shen, UIUC; Honghui Shi, UIUC; Bowen Cheng, UIUC; Jinjun Xiong, IBM Thomas J. Watson Research Center; Jiashi Feng, NUS; Thomas Huang, UIUC
2294End-to-End Deep Structured Models for Drawing CrosswalksJustin Liang*, Uber ATG; Raquel Urtasun, Uber ATG
2515Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation SearchYinlong Liu, Fudan University; Wang Chen*, Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Digital Medical Research Center, Fudan University; Zhijian Song, Fudan University; Manning Wang, Fudan University
1058Large Scale Urban Scene Modeling from MVS MeshesLingjie Zhu, University of Chinese Academy of Sciences; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Shuhan Shen*, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Zhanyi Hu, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
1172Sub-GAN: An Unsupervised Generative Model via SubspacesJie Liang, Nankai University; Jufeng Yang*, Nankai University ; Hsin-Ying Lee, University of California, Merced; Kai Wang, Nankai University; Ming-Hsuan Yang, University of California at Merced
1194Pseudo Pyramid Deeper Bidirectional ConvLSTM for Video Saliency DetectionHongmei Song, Beijing Institute of Technology; Sanyuan Zhao*, Beijing Institute of Technology ; Jianbing Shen, Beijing Institute of Technology; Kin-Man Lam, The Hong Kong Polytechnic University
1643Practical Black-box Attacks on Deep Neural Networks using Efficient Query MechanismsArjun Nitin Bhagoji*, Princeton University; Warren he, University of California, Berkeley; Bo Li, University of Illinois at Urbana–Champaign; Dawn Song, UC Berkeley
1126Learning 3D Shape Priors for Shape Completion and ReconstructionJiajun Wu*, MIT; Chengkai Zhang, MIT; Xiuming Zhang, MIT; Zhoutong Zhang, MIT; Joshua Tenenbaum, MIT; Bill Freeman, MIT
1280Comparator NetworksWeidi Xie*, University of Oxford; Li Shen, University of Oxford; Andrew Zisserman, University of Oxford
1394Improving Fine-Grained Visual Classification using Pairwise ConfusionAbhimanyu Dubey*, Massachusetts Institute of Technology; Otkrist Gupta, MIT; Pei Guo, Brigham Young University; Ryan Farrell, Brigham Young University; Ramesh Raskar, Massachusetts Institute of Technology; Nikhil Naik, MIT
533Visual-Inertial Object Detection and MappingXiaohan Fei*, UCLA; Stefano Soatto, UCLA
2263Learning Region Features for Object DetectionJiayuan Gu, Peking University; Han Hu, Microsoft Research Asia; Liwei Wang, Peking University; Yichen Wei, MSR Asia; Jifeng Dai*, Microsoft Research Asia
2582Efficient Dense Point Cloud Object Reconstruction using Deformation Vector FieldsKejie Li*, University of Adelaide; Trung Pham, NVIDIA; Huangying Zhan, The University of Adelaide; Ian Reid, University of Adelaide, Australia
314Evaluating Capability of Deep Neural Networks for Image Classification via Information PlaneHao Cheng*, Shanghaitech University; Dongze Lian, Shanghaitech University; Shenghua Gao, Shanghaitech University; Yanlin Geng, Shanghaitech University
1372Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship FeaturesXU YANG*, NTU; Hanwang Zhang, Nanyang Technological University; Jianfei Cai, Nanyang Technological University
1277Zero-Shot Deep Domain AdaptationKuan-Chuan Peng*, siemens corporation; Ziyan Wu, Siemens Corporation; Jan Ernst, Siemens Corporation
1164Deep Imbalanced Attribute Classification using Visual Attention AggregationNikolaos Sarafianos*, University of Houston; Xiang Xu, University of Houston; Ioannis Kakadiaris, University of Houston
910Video Object Segmentation by Learning Location-Sensitive EmbeddingsHai Ci, Peking University; Chunyu Wang*, Microsoft Research asia; Yizhou Wang, PKU
1505Deep Multi-Task Learning to Recognise Subtle Facial Expressions of Mental StatesGuosheng Hu*, AnyVision; Li Liu, the inception institute of artificial intelligence; Yang Yuan, AnyVision; Zehao Yu, Xiamen University; Yang Hua, Queen's University Belfast; Zhihong Zhang, Xiamen University; Fumin Shen, UESTC; Ling Shao, Inception Institute of Artificial Intelligence; Timothy Hospedales, Edinburgh University; Neil Robertson, Queen's University Belfast; Yongxin Yang, University of Edinburgh
1210Where Will They Go? Predicting Fine-Grained Adversarial Multi-Agent Motion using Conditional Variational AutoencodersPanna Felsen*, University of California Berkeley; Patrick Lucey, STATS; Sujoy Ganguly, STATS
2145Video Summarization Using Fully Convolutional Sequence NetworksMrigank Rochan*, University of Manitoba; Linwei Ye, University of Manitoba; Yang Wang, University of Manitoba
2189Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density NetworkQi Ye*, Imperial College London; Tae-Kyun Kim, Imperial College London
2070Learning with Biased Complementary LabelsXiyu Yu*, The University of Sydney; Tongliang Liu, The University of Sydney; Mingming Gong, University of Pittsburgh; Dacheng Tao, University of Sydney
2665ConceptMask: Large-Scale Segmentation from Semantic ConceptsYufei Wang*, Facebook; Zhe Lin, Adobe Research; Xiaohui Shen, Adobe Research; Scott Cohen, Adobe Research; Jianming Zhang, Adobe Research
1899Conditional Image-Text Embedding NetworksBryan Plummer*, Boston University; Paige Kordas, University of Illinois at Urbana Champaign; Hadi Kiapour, eBay; Shuai Zheng, eBay; Robinson Piramuthu, eBay Inc.; Svetlana Lazebnik, UIUC
2832Geolocation Estimation of Photos using a Hierarchical Model and Scene ClassificationEric Müller-Budack*, Leibniz Information Centre of Science and Technology (TIB); Kader Pustu-Iren, Leibniz Information Center of Science and Technology (TIB); Ralph Ewerth, Leibniz Information Center of Science and Technology (TIB)
691Lifting Layers: Analysis and ApplicationsMichael Moeller*, University of Siegen; Peter Ochs, Saarland University; Tim Meinhardt, Technical University of Munich; Laura Leal-Taixé, TUM
128Progressive Neural Architecture SearchChenxi Liu*, Johns Hopkins University; Maxim Neumann, Google; Barret Zoph, Google; Jon Shlens, Google; Wei Hua, Google; Li-Jia Li, Google; Li Fei-Fei, Stanford University; Alan Yuille, Johns Hopkins University; Jonathan Huang, Google; Kevin Murphy, Google
507Learning Deep Representations with Probabilistic Knowledge TransferNikolaos Passalis*, Aristotle University of Thessaloniki; Anastasios Tefas, Aristotle University of Thessaloniki
1479Robust fitting in computer vision: easy or hard?Tat-Jun Chin*, University of Adelaide; Zhipeng Cai, The University of Adelaide; Frank Neumann, The University of Adelaide, School of Computer Science, Faculty of Engineering, Computer and Mathematical Science
1061Dual-Agent Deep Reinforcement Learning for Deformable Face TrackingMinghao Guo, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
3CSeptember 12, 05:15 PM
536Zero-Shot Object DetectionAnkan Bansal*, University of Maryland; Karan Sikka, SRI International; Gaurav Sharma, NEC Labs America; Rama Chellappa, University of Maryland; Ajay Divakaran, SRI, USA
2158ForestHash: Semantic Hashing With Shallow Random Forests and Tiny Convolutional NetworksQiang Qiu*, Duke University; Jose Lezama, Universidad de la Republica, Uruguay; Alex Bronstein, Tel Aviv University, Israel; Guillermo Sapiro, Duke University
326ML-LocNet: Improving Object Localization with Multi-view Learning NetworkXiaopeng Zhang*, National University of Singapore; Jiashi Feng, NUS
2108MPLP++: Fast, Parallel Dual Block-Coordinate Ascent for Dense Graphical ModelsSiddharth Tourani*, Visual Learning Lab, HCI, Uni-Heidelberg; Alexander Shekhovtsov, Czech Technical University in Prague, Czech Republic; Carsten Rother, University of Heidelberg; Bogdan Savchynskyy, Heidelberg University
2152A Zero-Shot Framework for Sketch based Image RetrievalSasikiran Yelamarthi , IIT Madras; Shiva Krishna Reddy M, Indian Institute of Technology Madras; Ashish Mishra*, IIT Madras; Anurag Mittal, Indian Institute of Technology Madras
1542In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person VisionYin Li*, CMU; Miao Liu, Georgia Tech; James Rehg, Georgia Institute of Technology
866SAN: Learning Relationship between Convolutional Features for Multi-Scale Object DetectionYongHyun Kim*, POSTECH
1881A Systematic DNN Weight Pruning Framework using Alternating Direction Method of MultipliersTianyun Zhang*, Syracuse University; Shaokai Ye, Syracuse University; Kaiqi Zhang, Syracuse University; Yanzhi Wang, Syracuse University; Makan Fardad, Syracuse Universtiy; Wujie Wen, Florida International University
639Iterative Crowd CountingViresh Ranjan*, Stony Brook University; Hieu Le, Stony Brook University; Minh Hoai Nguyen, Stony Brook University
2538A Dataset for Lane Instance Segmentation in Urban EnvironmentsBrook Roberts, Five AI Ltd.; Sebastian Kaltwang*, Five AI Ltd.; Sina Samangooei, Five AI Ltd.; Mark Pender-Bare, Five AI Ltd.; Konstantinos Tertikas, Five AI Ltd.; John Redford, Five AI Ltd.
2698Out-of-Distribution Detection Using an Ensemble of Self Supervised Leave-out ClassifiersNataraj Jammalamadaka*, Intel Labs;
Xia Zhu, Intel Labs;
Dipankar Das, Intel Labs;
Bharat Kaul, Intel Labs;
Theodore Willke, Intel Labs
1179Penalizing Top Performers: Conservative Loss for Semantic Segmentation AdaptationXinge Zhu*, Sensetime Group Limited; Hui Zhou, Sensetime Group Limited.; Ceyuan Yang, SenseTime Group Limited; Jianping Shi, Sensetime Group Limited; Dahua Lin, The Chinese University of Hong Kong
1429Compound Memory Networks for Few-shot Video ClassificationLinchao Zhu*, University of Technology, Sydney; Yi Yang, UTS
2314Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question AnsweringMedhini Narasimhan*, University of Illinois at Urbana-Champaign ; Alexander Schwing, UIUC
1806Interpretable Basis Decomposition for Visual ExplanationAntonio Torralba, MIT; Bolei Zhou*, MIT; David Bau, MIT; Yiyou Sun, Harvard
1839How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video SummarizationYandong Li*, University of Central Florida; Boqing Gong, Tencent AI Lab; Tianbao Yang, University of Iowa; Liqiang Wang, University of Central Florida
646Dividing and Aggregating Network for Multi-view Action RecognitionDongang Wang*, The University of Sydney; Wanli Ouyang, CUHK; Wen Li, ETHZ; Dong Xu, University of Sydney
1217Shape Reconstruction Using Volume Sweeping and Learned PhotoconsistencyVincent Leroy*, INRIA Grenoble Rhône-Alpes; Edmond Boyer, Inria; Jean-Sebastien Franco, INRIA
2000RT-GENE: Real-Time Eye Gaze Estimation in Natural EnvironmentsTobias Fischer*, Imperial College London; Hyung Jin Chang, University of Birmingham; Yiannis Demiris, Imperial College London
1369Pairwise Body-Part Attention for Recognizing Human-Object InteractionsHaoshu Fang, SJTU; Jinkun Cao, Shanghai Jiao Tong University; Yu-Wing Tai, Tencent YouTu; Cewu Lu*, Shanghai Jiao Tong Univercity
2074Motion Feature Network: Fixed Motion Filter for Action RecognitionMyunggi Lee, Seoul National University; Seung Eui Lee, Seoul National University; Sung Joon Son, Seoul National University; Gyutae Park, Seoul National University; Nojun Kwak*, Seoul National University
356Reverse Attention for Salient Object DetectionShuhan Chen*, Yangzhou University; Xiuli Tan, Yangzhou University; Ben Wang, Yangzhou University; Xuelong Hu, Yangzhou University
1608Dynamic Sampling Convolutional Neural NetworksJialin Wu*, UT Austin; Dai Li, Tsinghua University; Yu Yang, Tsinghua University; Chandrajit Bajaj, University of Texas, Austin; Xiangyang Ji, Tsinghua University
1582DDRNet: Depth Map Denoising and Refinement for Consumer Depth Cameras Using Cascaded CNNsShi Yan, Tsinghua University; Chenglei Wu, Oculus Research; Lizheng Wang, Tsinghua University; Liang An, Tsinghua University; Feng Xu, Tsinghua University; Kaiwen Guo, Google Inc.; Yebin Liu*, Tsinghua University
632Stereo Computation for a Single Mixture ImageYiran Zhong, Australian National University; Yuchao Dai*, Northwestern Polytechnical University; HONGDONG LI, Australian National University, Australia
977Volumetric performance capture from minimal camera viewpointsAndrew Gilbert*, University of Surrey; Marco Volino, University of Surrey; John Collomosse, Adobe Research; Adrian Hilton, University of Surrey
586Liquid Pouring Monitoring via Rich Sensory InputsTz-Ying Wu*, National Tsing Hua University; Juan-Ting Lin, National Tsing Hua University; Tsun-Hsuang Wang, National Tsing Hua University; Chan-Wei Hu, National Tsing Hua University; Juan Carlos Niebles, Stanford University; Min Sun, NTHU
856Move Forward and Tell: A Progressive Generator of Video DescriptionsYilei Xiong*, The Chinese University of Hong Kong; Bo Dai, the Chinese University of Hong Kong; Dahua Lin, The Chinese University of Hong Kong
1684DYAN: A Dynamical Atoms-Based Network for Video PredictionWenqian Liu*, Northeastern University; Abhishek Sharma, Northeastern University ; Octavia Camps, Northeastern University; Mario Sznaier, Northeastern University
2027Deep Structure Inference Network for Facial Action Unit RecognitionCiprian Corneanu*, Universitat de Barcelona; Meysam Madadi, CVC; Sergio Escalera, Computer Vision Center (UAB) & University of Barcelona,
1300Physical Primitive DecompositionZhijian Liu, Shanghai Jiao Tong University; Jiajun Wu*, MIT; Bill Freeman, MIT; Joshua Tenenbaum, MIT
106Boosted Attention: Leveraging Human Attention for Image CaptioningShi Chen*, University of Minnesota; Qi Zhao, University of Minnesota
3015Is Robustness the Cost of Accuracy? -- Lessons Learned from 18 Deep Image ClassifiersDong Su*, IBM Research T.J. Watson Center; Huan Zhang, UC Davis; Hongge Chen, MIT; Jinfeng Yi, JD AI Research; Pin-Yu Chen, IBM Research; Yupeng Gao, IBM Research AI
1116Dynamic Multimodal Instance Segmentation guided by natural language queriesEdgar Margffoy-Tuay*, Universidad de los Andes; Emilio Botero, Universidad de los Andes; Juan Pérez, Universidad de los Andes; PABLO ARBELÁEZ, Universidad de los Andes
780Hierarchy of Alternating Specialists for Scene RecognitionHyo Jin Kim*, University of North Carolina at Chapel Hill; Jan-Michael Frahm, UNC-Chapel Hill
3133SwapNet: Garment Transfer in Single View ImagesAmit Raj*, Georgia Institute of Technology; Patsorn Sangkloy, Georgia Institute of Technology; Huiwen Chang, Princeton University; Jingwan Lu, Adobe Research ; Duygu Ceylan, Adobe Research; James Hays, Georgia Institute of Technology, USA
346What do I Annotate Next? An Empirical Study of Active Learning for Action LocalizationFabian Caba*, KAUST; Joon-Young Lee, Adobe Research; Hailin Jin, Adobe Research; Bernard Ghanem, KAUST
1391Combining 3D Model Contour Energy and Keypoints for Object TrackingBogdan Bugaev*, Saint Petersburg Academic University; Anton Kryshchenko, Saint Petersburg Academic University; Roman Belov, KeenTools
1150AGIL: Learning Attention from Human for Visuomotor TasksRuohan Zhang*, University of Texas at Austin; Zhuode Liu, Google Inc.; Luxin Zhang, Peking University; Jake Whritner, University of Texas at Austin; Karl Muller, University of Texas at Austin; Mary Hayhoe, University of Texas at Austin; Dana Ballard, University of Texas at Austin
1644PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding ModelGeorge Papandreou*, Google; Tyler Zhu, Google; Liang-Chieh Chen, Google Inc.; Spyros Gidaris, Ecole des Ponts ParisTech; Jonathan Tompson, Google; Kevin Murphy, Google
2768Accelerating Dynamic Programs via Nested Benders Decomposition with Application to Multi-Person Pose EstimationShaofei Wang*, Baidu Inc.; Alexander Ihler, UC Irvine; Konrad Kording, Northwestern; Julian Yarkony, Experian Data Lab
38Separating Reflection and Transmission Images in the WildPatrick Wieschollek*, University of Tuebingen; Orazio Gallo, NVIDIA Research; Jinwei Gu, Nvidia; Kautz Jan, NVIDIA
601Point-to-Point Regression PointNet for 3D Hand Pose EstimationLiuhao Ge*, NTU; Zhou Ren, Snap Research, USA, ; Junsong Yuan, State University of New York at Buffalo, USA
80Summarizing First-Person Videos from Third Persons' Points of ViewHSUAN-I HO*, National Taiwan University; Wei-Chen Chiu, National Chiao Tung University; Yu-Chiang Frank Wang, National Taiwan University
649Learning Category-Specific Mesh Reconstruction from Image CollectionsAngjoo Kanazawa*, UC Berkeley; Shubham Tulsiani, UC Berkeley; Alexei Efros, UC Berkeley; Jitendra Malik, University of California at Berkley
959StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth PredictionSameh Khamis*, Google; Sean Fanello, Google; Christoph Rhemann, Google; Julien Valentin, Google; Adarsh Kowdle, Google; Shahram Izadi, Google
333Visual Question Answering as a Meta Learning TaskDamien Teney*, The Unversity of Adelaide; Anton van den Hengel, The University of Adelaide
2132SRFeat: Single Image Super Resolution with Feature DiscriminationSeong-Jin Park*, POSTECH; Hyeongseok Son, POSTECH; Sunghyun Cho, DGIST; Ki-Sang Hong, POSTECH; Seungyong Lee, POSTECH
32Deep Factorised Inverse-SketchingKaiyue Pang*, Queen Mary University of London; Da Li, QMUL; Jifei Song, Queen Mary, University of London; Yi-Zhe Song, Queen Mary University of London; Tao Xiang, Queen Mary, University of London, UK; Timothy Hospedales, Edinburgh University
2691Multimodal image alignment through a multiscale chain of neural networks with application to remote sensingArmand Zampieri, Inria Sophia-Antipolis; Guillaume Charpiat, INRIA; Nicolas Girard, Inria Sophia-Antipolis; Yuliya Tarabalka*, Inria Sophia-Antipolis
1470Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language AssociationDapeng Chen*, The Chinese University of HongKong; Hongsheng Li, Chinese University of Hong Kong; Xihui Liu, The Chinese University of Hong Kong; Jing Shao, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
494Robust Optical Flow Estimation in Rainy ScenesRuoteng Li*, National University of Singapore; Robby Tan, Yale-NUS College, Singapore; Loong Fah Cheong, NUS
1730Image Generation from Sketch Constraint Using Contextual GANYongyi Lu*, HKUST; Shangzhe Wu, HKUST; Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
1997Accurate Scene Text Detection through Border Semantics Awareness and BootstrappingChuhui Xue, Nanyang Technological University; Shijian Lu*, Nanyang Technological University; Fangneng Zhan, Nanyang Technological University
26CNN-PS: CNN-based Photometric Stereo for General Non-Convex SurfacesSatoshi Ikehata*, National Institute of Informatics
115Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose EstimationMarkus Oberweger*, TU Graz; Mahdi Rad, TU Graz; Vincent Lepetit, TU Graz
2197Recognition in Terra IncognitaSara Beery*, Caltech; Grant van Horn, Caltech; Pietro Perona, Caltech
1595Super-Resolution and Sparse View CT ReconstructionGuangming Zang, KAUST; Ramzi Idoughi, KAUST; Mohamed Aly, KAUST; Peter Wonka, KAUST; Wolfgang Heidrich*, KAUST
2157Modeling Visual Context is Key to Augmenting Object Detection DatasetsNIKITA DVORNIK*, INRIA; Julien Mairal, INRIA; Cordelia Schmid, INRIA
2949Occlusions, Motion and Depth Boundaries with a Generic Network for Optical Flow, Disparity, or Scene Flow Estimation Eddy Ilg*, University of Freiburg; Tonmoy Saikia, University of Freiburg; Margret Keuper, University of Mannheim; Thomas Brox, University of Freiburg
1533Unsupervised Domain Adaptation for 3D Keypoint Estimation via View ConsistencyXingyi Zhou, The University of Texas at Austin; Arjun Karpur, The University of Texas at Austin; Chuang Gan, MIT; Linjie Luo, Snap Inc; Qixing Huang*, The University of Texas at Austin
2602Improving DNN Robustness to Adversarial Attacks using Jacobian RegularizationDaniel Jakubovitz*, Tel Aviv University; Raja Giryes, Tel Aviv University
982A Framework for Evaluating 6-DOF Object TrackersMathieu Garon, Université Laval; Denis Laurendeau, Laval University; Jean-Francois Lalonde*, Université Laval
67Self-Supervised Relative Depth Learning for Urban Scene UnderstandingHuaizu Jiang*, UMass Amherst; Erik Learned-Miller, University of Massachusetts, Amherst; Gustav Larsson, University of Chicago; Michael Maire, Toyota Technological Institute at Chicago; Greg Shakhnarovich, Toyota Technological Institute at Chicago
538Actor-centric Relation Network Chen Sun*, Google; Abhinav Shrivastava, UMD / Google; Carl Vondrick, MIT; Kevin Murphy, Google; Rahul Sukthankar, Google; Cordelia Schmid, Google
2932Self-produced Guidance for Weakly-supervised Object LocalizationXiaolin Zhang*, University of Technology Sydney; Yunchao Wei, UIUC; Guoliang Kang, UTS; Yi Yang, UTS; Thomas Huang, UIUC
1980Attribute-Guided Face Generation Using Conditional CycleGANYongyi Lu*, HKUST; Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
469Neural Network EncapsulationHongyang Li*, Chinese University of Hong Kong; Bo Dai, the Chinese University of Hong Kong; Wanli Ouyang, CUHK; Xiaoyang Guo, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
1282Deep Regionlets for Object DetectionHongyu Xu*, University of Maryland; Xutao Lv, Intellifusion; Xiaoyu Wang, -; Zhou Ren, Snap Inc.; Navaneeth Bodla, University of Maryland; Rama Chellappa, University of Maryland
752Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation MaximizationGuoliang Kang*, UTS; Liang Zheng, Singapore University of Technology and Design; Yan Yan, UTS; Yi Yang, UTS
122Fighting Fake News: Image Splice Detection via Learned Self-ConsistencyJacob Huh*, Carnegie Mellon University; Andrew Liu, University of California, Berkeley; Andrew Owens, UC Berkeley; Alexei Efros, UC Berkeley
890Learning Monocular Depth by Distilling Cross-domain Stereo NetworksXiaoyang Guo*, The Chinese University of Hong Kong; Hongsheng Li, Chinese University of Hong Kong; Shuai Yi, The Chinese University of Hong Kong; Jimmy Ren, Sensetime Research; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
940 Riemannian Walk for Incremental Learning: Understanding Forgetting and IntransigenceArslan Chaudhry*, University of Oxford; Puneet Dokania, University of Oxford; Thalaiyasingam Ajanthan, University of Oxford; Philip Torr, University of Oxford
640Weakly Supervised Region Proposal Network and Object DetectionPeng Tang*, Huazhong University of Science and Technology; Xinggang Wang, Huazhong Univ. of Science and Technology; Angtian Wang, Huazhong University of Science and Technology ; Yongluan Yan, Huazhong University of Science and Technology ; Wenyu Liu, Huazhong University of Science and Technology; Junzhou Huang, Tencent AI Lab; Alan Yuille, Johns Hopkins University
4ASeptember 13, 10:00 AM
1593Viewpoint Estimation - Insights & ModelGilad Divon, Technion; Ayellet Tal*, Technion
3056Towards Realistic PredictorsPei Wang*, UC San Diego; Nuno Vasconcelos, UC San Diego
172Group NormalizationYuxin Wu, Facebook; Kaiming He*, Facebook Inc., USA
2486Deep Expander Networks: Efficient Deep Networks from Graph TheoryAmeya Prabhu*, IIIT Hyderabad; Girish Varma, IIIT Hyderabad; Anoop Namboodiri, IIIT Hyderbad
3134Learning SO(3) Equivariant Representations with Spherical CNNsCarlos Esteves*, University of Pennsylvania; Kostas Daniilidis, University of Pennsylvania; Ameesh Makadia, Google Research; Christine Allec-Blanchette, University of Pennsylvania
1248Video Re-localization via Cross Gated Bilinear MatchingYang Feng*, University of Rochester; Lin Ma, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab; Jiebo Luo, U. Rochester
2466A Deeply-initialized Coarse-to-fine Ensemble of Regression Trees for Face AlignmentRoberto Valle*, Universidad Politécnica de Madrid; José Buenaposada, Universidad Rey Juan Carlos; Antonio Valdés, Universidad Complutense de Madrid; Luis Baumela, Universidad Politecnica de Madrid
2465Deep Kalman Filtering Network for Video Compression Artifact ReductionGuo Lu*, Shanghai Jiao Tong University; Wanli Ouyang, CUHK; Dong Xu, University of Sydney; Xiaoyun Zhang, Shanghai Jiao Tong University; Zhiyong Gao, Shanghai Jiao Tong University; Ming Ting Sun, -
2884Exploring Visual Relationship for Image CaptioningTing Yao*, Microsoft Research; Yingwei Pan, University of Science and Technology of China; Yehao Li, Sun Yat-Sen University; Tao Mei, JD.com
2253Sequential Clique Optimization for Video Object SegmentationYeong Jun Koh*, Korea University; Young-Yoon Lee, Samsung; Chang-Su Kim, Korea university
621Spatial Pyramid Calibration for Image ClassificationYan Wang, Shanghai Jiao Tong University; Lingxi Xie*, JHU; Siyuan Qiao, Johns Hopkins University; Ya Zhang, Cooperative Medianet Innovation Center, Shang hai Jiao Tong University; Wenjun Zhang, Shanghai Jiao Tong University; Alan Yuille, Johns Hopkins University
124Visual Text CorrectionAmir Mazaheri*, University of Central Florida; Mubarak Shah, University of Central Florida
1216X-ray Computed Tomography Through ScatterAdam Geva*, Technion; Yoav Y. Schechner, Technion; Jonathan Chernyak, Technion; Rajiv Gupta, MGH Harvard
1407Graph Distillation for Action Detection with Privileged Information in RGB-D VideosZelun Luo*, Stanford University; Lu Jiang, Google; Jun-Ting Hsieh, Stanford University; Juan Carlos Niebles, Stanford University; Li Fei-Fei, Stanford University
1396Modular Generative Adversarial NetworksBo Zhao*, University of British Columbia; Bo Chang, University of British Columbia; Zequn Jie, Tencent AI Lab; Leonid Sigal, University of British Columbia
1131R2P2: A ReparameteRized Pushforward Policy for Diverse, Precise Generative Path ForecastingNicholas Rhinehart*, CMU; Kris Kitani, CMU; Paul Vernaza, NEC Labs America
1337DFT-based Transformation Invariant Pooling Layer for Visual ClassificationJongbin Ryu*, Hanyang University; Ming-Hsuan Yang, University of California at Merced; Jongwoo Lim, Hanyang University
930X2Face: A network for controlling face generation by using images, audio, and pose codesOlivia Wiles*, University of Oxford; A Koepke, University of Oxford; Andrew Zisserman, University of Oxford
1543Compositional Learning of Human Object InteractionsKeizo Kato, CMU; Yin Li*, CMU; Abhinav Gupta, CMU
1952Learning to Navigate for Fine-grained ClassificationZe Yang*, Peking University; Tiange Luo, Peking University; Dong Wang, Peking University; Zhiqiang Hu, Peking University; Jun Gao, Peking University; Liwei Wang, Peking University
1166Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T TrackingChenglong Li, Anhui University; Chengli Zhu, Anhui University; Yan Huang, Institute of Automation, Chinese Academy of Sciences; Jin Tang, Anhui University; Liang Wang*, NLPR, China
1363Light-weight CNN Architecture Design for Fast InferenceNingning Ma*, Tsinghua; Xiangyu Zhang, Megvii Inc; Hai-Tao Zheng, Tsinghua University; Jian Sun, Megvii, Face++
671Fully Motion-Aware Network for Video Object DetectionShiyao Wang*, Tsinghua University; Yucong Zhou, Beihang University; Junjie Yan, Sensetime Group Limited
1168Shift-Net: Image Inpainting via Deep Feature RearrangementZhaoyi Yan, Harbin Institute of Technology; Xiaoming Li, Harbin Institute of Technology; Mu LI, The Hong Kong Polytechnic University; Wangmeng Zuo*, Harbin Institute of Technology, China; Shiguang Shan, Chinese Academy of Sciences
628Choose Your Neuron: Incorporating Domain Knowledge through Neuron ImportanceRamprasaath Ramasamy Selvaraju*, Virginia Tech; Prithvijit Chattopadhyay, Georgia Institute of Technology; Mohamed Elhoseiny, Facebook; Tilak Sharma, Facebook; Dhruv Batra, Georgia Tech & Facebook AI Research; Devi Parikh, Georgia Tech & Facebook AI Research; Stefan Lee, Georgia Institute of Technology
2176Joint 3D tracking of a deformable object in interaction with a handAggeliki Tsoli*, FORTH; Antonis Argyros, CSD-UOC and ICS-FORTH
723Interpolating Convolutional Neural Networks Using Batch NormalizationGratianus Wesley Putra Data*, University of Oxford; Kirjon Ngu, University of Oxford; David Murray, University of Oxford; Victor Prisacariu, University of Oxford
399Learning Warped Guidance for Blind Face RestorationXiaoming Li, Harbin Institute of Technology; Ming Liu, Harbin Institute of Technology; Yuting Ye, Harbin Institute of Technology; Wangmeng Zuo*, Harbin Institute of Technology, China; Liang Lin, Sun Yat-sen University; Ruigang Yang, University of Kentucky, USA
2071Separable Cross-Domain TranslationYedid Hoshen*, Facebook AI Research (FAIR); Lior Wolf, Tel Aviv University, Israel
1672Task-driven Webpage SaliencyQuanlong Zheng*, City University of HongKong; Jianbo Jiao, City University of Hong Kong; Ying Cao, City University of Hong Kong; Rynson Lau, City University of Hong Kong
1357Appearance-Based Gaze Estimation via Evaluation-Guided Asymmetric RegressionYihua Cheng, Beihang University; Feng Lu*, U. Tokyo; Xucong Zhang, Max Planck Institute for Informatics and Saarland University
1913Pivot Correlational Neural Network for Multimodal Video CategorizationSunghun Kang*, KAIST; Junyeong Kim, KAIST; Hyunsoo Choi, SAMSUNG ELECTRONICS CO.,LTD; Sungjin Kim, SAMSUNG ELECTRONICS CO.,LTD; Chang D. Yoo, KAIST
1188Interactive Boundary Prediction for Object SelectionHoang Le, Portland State University; Long Mai*, Adobe Research; Brian Price, Adobe; Scott Cohen, Adobe Research; Hailin Jin, Adobe Research; Feng Liu, Portland State University
2723Scenes-Objects-Actions: A Multi-Task, Multi-Label Video DatasetHeng Wang*, Facebook Inc; Lorenzo Torresani, Dartmouth College; Matt Feiszli, Facebook Research; Manohar Paluri, Facebook; Du Tran, Facebook; Jamie Ray, Facebook Research; Yufei Wang, Facebook
2075Transferable Adversarial PerturbationsBruce Hou*, Tencent; Wen Zhou, Tencent
1106Incremental Non-Rigid Structure-from-Motion with Unknown Focal LengthThomas Probst, ETH Zurich; Danda Pani Paudel*, ETH Zürich; Ajad Chhatkuli , ETHZ; Luc Van Gool, ETH Zurich
2083Semantically Aware Urban 3D Reconstruction with Plane-Based RegularizationThomas Holzmann*, Graz University of Technology; Michael Maurer, Graz University of Technology; Friedrich Fraundorfer, Graz University of Technology; Horst Bischof, Graz University of Technology
1520Learning to Dodge A Bulletshi jin*, ShanghaiTech University; Jinwei Ye, Louisiana State University; Yu Ji, Plex-VR; RUIYANG LIU, ShanghaiTech University; Jingyi Yu, Shanghai Tech University
825Training Binary Weight Networks via Semi-Binary DecompositionQinghao Hu*, Institute of Automation, Chinese Academy of Sciences; Gang Li, Institute of Automation, Chinese Academy of Sciences; Peisong Wang, Institute of Automation, Chinese Academy of Sciences; yifan zhang, Institute of Automation,Chinese Academy of Sciences; Jian Cheng, Chinese Academy of Sciences, China
9Learnable PINs: Cross-Modal Embeddings for Person IdentitySamuel Albanie*, University of Oxford; Arsha Nagrani, Oxford University ; Andrew Zisserman, University of Oxford
732Toward Characteristic-Preserving Image-based Virtual Try-On NetworkBochao Wang, Sun Yet-sen University; Huabin Zheng, Sun Yat-Sen University; Xiaodan Liang*, Carnegie Mellon University; Yimin Chen, sensetime; Liang Lin, Sun Yat-sen University
1855Deep Feature Factorization For Unsupervised Concept DiscoveryEdo Collins*, EPFL; Radhakrishna Achanta, EPFL; Sabine Süsstrunk, EPFL
319SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial NetworkYongqiang Zhang*, Harbin institute of Technology/KAUST; Yancheng Bai, KAUST/ISCAS; Mingli Ding, Harbin institute of Technology; Bernard Ghanem, KAUST
2856Human Motion Analysis with Deep Metric LearningHUSEYIN COSKUN*, Technical University of Munich; David Joseph Tan, CAMP, TU Munich; Sailesh Conjeti, Technical University of Munich; Nassir Navab, TU Munich, Germany; Federico Tombari, Technical University of Munich, Germany
1912Dist-GAN: An Improved GAN using Distance ConstraintsNgoc-Trung Tran*, Singapore University of Technology and Design; Tuan Anh Bui, Singapore University of Technology and Design; Ngai-Man Cheung, Singapore University of Technology and Design
528Cross-Modal and Hierarchical Modeling of Video and TextBowen Zhang*, University of Southern California; Hexiang Hu, University of Southern California; Fei Sha, USC
1767Deep Image Demosaicking using a Cascade of Convolutional Residual Denoising NetworksFilippos Kokkinos*, Skolkovo Institute of Science and Technology; Stamatis Lefkimmiatis, Skolkovo Institute of Science and Technology
1370Deep Clustering for Unsupervised Learning of Visual FeaturesMathilde Caron*, Facebook Artificial Intelligence Research; Piotr Bojanowski, Facebook; Armand Joulin, Facebook AI Research; Matthijs Douze, Facebook AI Research
219Domain Adaptation through Synthesis for Unsupervised Person Re-identificationSlawomir Bak*, Argo AI; Jean-Francois Lalonde, Université Laval; Pete Carr, Argo AI
327Facial Expression Recognition with Inconsistently Annotated DatasetsJiabei Zeng*, Institute of Computing Technology, Chinese Academy on Sciences; Shiguang Shan, Chinese Academy of Sciences; Chen Xilin, Institute of Computing Technology, Chinese Academy of Sciences
2959Single Shot Scene Text RetrievalLluis Gomez*, Universitat Autónoma de Barcelona; Andres Mafla, Computer Vision Center; Marçal Rossinyol, Universitat Autónoma de Barcelona; Dimosthenis Karatzas, Computer Vision Centre
2550DeepVS: A Deep Learning Based Video Saliency Prediction ApproachLai Jiang, BUAA; Mai Xu*, BUAA; Minglang Qiao, BUAA; Zulin Wang, BUAA
177Generalizing A Person Retrieval Model Hetero- and HomogeneouslyZhun Zhong*, Xiamen University; Liang Zheng, Singapore University of Technology and Design; Shaozi Li, Xiamen University, China; Yi Yang, University of Technology, Sydney
1853A New Large Scale Dynamic Texture Dataset with Application to ConvNet UnderstandingIsma Hadji*, York University; Rick Wildes, York University
751Deep Cross-modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-based 3D Shape RetrievalJiaxin Chen, New York University Abu Dhabi; Yi Fang*, New York University
455BiSeNet: Bilateral Segmentation Network for Real-time Semantic SegmentationChangqian Yu*, Huazhong University of Science and Technology; Jingbo Wang, Peking University; Chao Peng, Megvii(Face++) Inc; Changxin Gao, Huazhong University of Science and Technology; Gang Yu, Face++; Nong Sang, School of Automation, Huazhong University of Science and Technology
407Face De-spoofingYaojie Liu*, Michigan State University; Amin Jourabloo, Michigan State University; Xiaoming Liu, Michigan State University
390Towards End-to-End License Plate Detection and Recognition: A Large Dataset and BaselineZhenbo Xu*, University of Science and Technology in China; Wei Yang, University of Science and Technology in China; Ajin Meng, University of Science and Technology in China; Nanxue Lu, University of Science and Technology in China; Huan Huang, Xingtai Financial Holdings Group Co., Ltd.
537Self-supervised Tracking by ColorizationCarl Vondrick*, MIT; Abhinav Shrivastava, UMD / Google; Alireza Fathi, Google; Sergio Guadarrama, Google; Kevin Murphy, Google
487Pose Proposal NetworksTaiki Sekii*, Konica Minolta, inc.
111Incremental Multi-graph Matching via Diversity and Randomness based Graph ClusteringTianshu Yu*, Arizona State University; Junchi Yan, Shanghai Jiao Tong University; baoxin Li, Arizona State University; Wei Liu, Tencent AI Lab
1503Single Image Intrinsic Decomposition Without a Single Intrinsic ImageWei-Chiu Ma*, MIT; Hang Chu, University of Toronto; Bolei Zhou, MIT; Raquel Urtasun, University of Toronto; Antonio Torralba, MIT
596Triplet Loss with Theoretical Analysis in Siamese Network for Real-Time Object TrackingXingping Dong, Beijing Institute of Technology; Jianbing Shen*, Beijing Institute of Technology
578Learning to Learn Parameterized Image OperatorsQingnan Fan, Shandong University; Dongdong Chen*, university of science and technology of china; Lu Yuan, Microsoft Research Asia; Gang Hua, Microsoft Cloud and AI; Nenghai Yu, University of Science and Technology of China; Baoquan Chen, Shandong University
2248HBE: Hand Branch Ensemble network for real time 3D hand pose estimationYidan Zhou, Dalian University of Technology; Jian Lu, Laboratory of Advanced Design and Intelligent Computing, Dalian University; Kuo Du, Dalian University of Technology; Xiangbo Lin*, Dalian University of Technology; Yi Sun, Dalian University of Technology; Xiaohong Ma, Dalian University of Technology
721Generative Semantic Manipulation with Mask-Contrasting GANXiaodan Liang*, Carnegie Mellon University
1093Learning to Fuse Proposals from Multiple Scanline Optimizations in Semi-Global MatchingJohannes Schoenberger*, ETH Zurich; Sudipta Sinha, Microsoft Research; Marc Pollefeys, ETH Zurich
493Less is More: Picking Informative Frames for Video CaptioningYangyu Chen*, University of Chinese Academy of Sciences; Shuhui Wang, vipl,ict,Chinese academic of science; Weigang Zhang, Harbin Institute of Technology, Weihai; Qingming Huang, University of Chinese Academy of Sciences, China
1083Deep Pictorial Gaze EstimationSeonwook Park*, ETH Zurich; Adrian Spurr, ETH Zurich; Otmar Hilliges, ETH Zurich
544SkipNet: Learning Dynamic Execution in Residual NetworksXin Wang*, UC Berkeley; Fisher Yu, UC Berkeley; Zi-Yi Dou, Nanjing University; Trevor Darrell, UC Berkeley; Joseph Gonzalez, UC Berkeley
1323Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesPengyuan Lyu*, Huazhong University of Science and Technology; Minghui Liao, Huazhong University of Science and Technology; Cong Yao, Megvii; Wenhao Wu, Megvii; Xiang Bai, Huazhong University of Science and Technology
1054Deep Adaptive Attention for Joint Facial Action Unit Detection and Face AlignmentZhiwen Shao*, Shanghai Jiao Tong University; Zhilei Liu, Tianjin University; Jianfei Cai, Nanyang Technological University; Lizhuang Ma, Shanghai Jiao Tong University
934Semantic Scene Understanding under Dense Fog with Synthetic and Real DataChristos Sakaridis*, ETH Zurich; Dengxin Dai, ETH Zurich; Simon Hecker, ETH Zurich; Luc Van Gool, ETH Zurich
800RIDI: Robust IMU Double IntegrationHang Yan*, Washington University in St. Louis; Qi Shan, Zillow Group; Yasutaka Furukawa, Simon Fraser University
1427Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web PriorSijia Cai*, The Hong Kong Polytechnic University; Wangmeng Zuo, Harbin Institute of Technology; Larry Davis, University of Maryland; Lei Zhang, Hong Kong Polytechnic University, Hong Kong, China
615Transferring Common-Sense Knowledge for Object DetectionKrishna Kumar Singh*, University of California Davis; Santosh Divvala, Allen AI; Ali Farhadi, University of Washington; Yong Jae Lee, University of California, Davis
549Person Search in Videos with One Portrait Through Visual and Temporal LinksQingqiu Huang*, CUHK; Wentao Liu, Sensetime; Dahua Lin, The Chinese University of Hong Kong
1156Eliminating the Dreaded Blind Spot: Adapting 3D Object Detection and Monocular Depth Estimation to 360° Panoramic ImageryGregoire Payen de La Garanderie*, Durham University; Toby Breckon, Durham University; Amir Atapour-Abarghouei, Durham University
2990Folded Recurrent Neural Networks for Future Video PredictionMarc Oliu*, Universitat Oberta de Catalunya; Javier Selva, Universitat de Barcelona; Sergio Escalera, Computer Vision Center (UAB) & University of Barcelona,
1880Deep Regression Tracking with Shrinkage LossXiankai Lu, Shanghai Jiao Tong University; Chao Ma*, University of Adelaide; Bingbing Ni, Shanghai Jiao Tong University; Xiaokang Yang, Shanghai Jiao Tong University of China; Ian Reid, University of Adelaide, Australia; Ming-Hsuan Yang, University of California at Merced
353Stroke Controllable Fast Style Transfer with Adaptive Receptive FieldsYongcheng Jing, Zhejiang University; Yang Liu, Zhejiang University; Yezhou Yang, Arizona State University; Zunlei Feng, Zhejiang University; Yizhou Yu, The University of Hong Kong; Dacheng Tao, University of Sydney; Mingli Song*, Zhejiang University
1938Part-Aligned Bilinear Representations for Person Re-IdentificationYumin Suh, Seoul National University; Jingdong Wang, Microsoft Research; Kyoung Mu Lee*, Seoul National University
2289Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression NetworkYao Feng*, Shanghai Jiao Tong University; Fan Wu, CloudWalk Technology; Xiao-Hu Shao, Chongqing Institute of Green and Intelligent Technology,Chinese Academy of Sciences; Yan-Feng Wang, Shanghai Jiao Tong University; Xi Zhou, CloudWalk Technology
2718Learning Efficient Single-stage Pedestrian Detection by Asymptotic Localization FittingWei Liu*, National University of Defense Technology; Shengcai Liao, NLPR, Chinese Academy of Sciences, China; Weidong Hu, National University of Defence Technology; Xuezhi Liang, Center for Biometrics and Security Research & National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences; Xiao Chen, National University of Defense Technology
412Unsupervised Hard-Negative Mining from Videos for Object DetectionSouYoung Jin*, UMASS Amherst; Huaizu Jiang, UMass Amherst; Aruni RoyChowdhury, University of Massachusetts, Amherst; Ashish Singh, UMASS Amherst; Aditya Prasad, UMASS Amherst; Deep Chakraborty, UMASS Amherst; Erik Learned-Miller, University of Massachusetts, Amherst
842Focus, Segment and Erase: An Efficient Network for Multi-Label Brain Tumor SegmentationXuan Chen*, NUS; Jun Hao Liew, NUS; Wei Xiong, A*STAR Institute for Infocomm Research, Singapore; Chee-Kong Chui, NUS; Sim-Heng Ong, NUS
87Maximum Margin Metric Learning Over Discriminative Nullspace for Person Re-identificationT M Feroz Ali*, Indian Institute of Technology Bombay, Mumbai; Subhasis Chaudhuri, Indian Institute of Technology Bombay
2406Efficient Relative Attribute Learning using Graph Neural NetworksZihang Meng*, University of Wisconsin Madison; Nagesh Adluru , WISC; Vikas Singh, University of Wisconsin-Madison USA
49Object Level Visual Reasoning in VideosFabien Baradel, LIRIS; Natalia Neverova*, Facebook AI Research; Christian Wolf, INSA Lyon, France; Julien Mille, INSA Centre Val de Loire; Greg Mori, Simon Fraser University
4BSeptember 13, 04:00 PM
1276Deep Model-Based 6D Pose Refinement in RGBFabian Manhardt*, TU Munich; Wadim Kehl, Toyota Research Institute; Nassir Navab, Technische Universität München, Germany; Federico Tombari, Technical University of Munich, Germany
427ContextVP: Fully Context-Aware Video PredictionWonmin Byeon*, NVIDIA; Qin Wang, ETH Zurich; Rupesh Kumar Srivastava, NNAISENSE; Petros Koumoutsakos, ETH Zurich
160CornerNet: Detecting Objects as Paired KeypointsHei Law*, University of Michigan; Jia Deng, University of Michigan
720RelocNet: Continous Metric Learning Relocalisation using Neural NetsVassileios Balntas*, University of Oxford; Victor Prisacariu, University of Oxford; Shuda Li, University of Oxford
2118Museum Exhibit Identification Challenge for the Supervised Domain Adaptation.Piotr Koniusz*, Data61/CSIRO, ANU; Yusuf Tas, Data61; Hongguang Zhang, Australian National University; Mehrtash Harandi, Monash University; Fatih Porikli, ANU; Rui Zhang, University of Canberra
1118Acquisition of Localization Confidence for Accurate Object DetectionBorui Jiang*, Peking University; Ruixuan Luo, Peking University; Jiayuan Mao, Tsinghua University; Tete Xiao, Peking University; Yuning Jiang, Megvii(Face++) Inc
897The Contextual Loss for Image Transformation with Non-Aligned DataRoey Mechrez*, Technion; Itamar Talmi, Technion; Lihi Zelnik-Manor, Technion
1846Saliency Benchmarking Made Easy: Separating Models, Maps and MetricsMatthias Kümmerer*, University of Tübingen; Thomas Wallis, University of Tübingen; Matthias Bethge, University of Tübingen
2343Multi-Attention Multi-Class Constraint for Fine-grained Image RecognitionMing Sun, baidu; Yuchen Yuan, Baidu Inc.; Feng Zhou*, Baidu Research; Errui Ding, Baidu Inc.
1455Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language NavigationXin Wang*, University of California, Santa Barbara; Wenhan Xiong, University of California, Santa Barbara; Hongmin Wang, University of California, Santa Barbara; William Wang, UC Santa Barbara
1813HandMap: Robust Hand Pose Estimation via Intermediate Dense Guidance Map SupervisionXiaokun Wu*, University of Bath; Daniel Finnegan, University of Bath; Eamonn O'Neill, University of Bath; Yongliang Yang, University of Bath
2234LSQ++: lower runtime and higher recall in multi-codebook quantizationJulieta Martinez*, University of British Columbia; Shobhit Zakhmi, University of British Columbia; Holger Hoos, University of British Columbia; Jim Little, University of British Columbia, Canada
1067Multimodal Dual Attention Memory for Video Story Question AnsweringKyungmin Kim*, Seoul National University; Seong-Ho Choi, Seoul National University; Jin-Hwa Kim, Seoul National University; Byoung-Tak Zhang, Seoul National University
2500Hierarchical Bilinear Pooling for Fine-Grained Visual RecognitionChaojian Yu*, Huazhong University of Science and Technology; Qi Zheng, Huazhong University of Science and Technology; Xinyi Zhao, Huazhong University of Science and Technology; Peng Zhang, Huazhong University of Science and Technology; Xinge YOU, School of Electronic Information and Communications,Huazhong University of Science and Technology
2393Dense Semantic and Topological Correspondence of 3D Faces without LandmarksZhenfeng Fan*, Chinese Academy of Sciences; hu xiyuan, The Chinese academy of science; chen chen, The Chinese academy of science; peng silong, The Chinese academy of science
152Real-Time Blind Video Temporal ConsistencyWei-Sheng Lai*, University of California, Merced; Jia-Bin Huang, Virginia Tech; Oliver Wang, Adobe Systems Inc; Eli Shechtman, Adobe Research, US; Ersin Yumer, Argo AI; Ming-Hsuan Yang, University of California at Merced
1518Depth Estimation via Affinity Learned with Convolutional Spatial Propagation NetworkXinjing Cheng, Baidu; Peng Wang*, Baidu USA LLC; Ruigang Yang, University of Kentucky, USA
1408Hierarchical Metric Learning and Matching for 2D and 3D Geometric CorrespondencesMohammed Fathy, University of Maryland College Park; Quoc-Huy Tran*, NEC Labs; Zeeshan Zia, Microsoft; Paul Vernaza, NEC Labs America; Manmohan Chandraker, NEC Labs America
1425GridFace: Face Rectification via Learning Local Homography TransformationsErjin Zhou*, Megvii Research
539Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video ClassificationSaining Xie*, UCSD; Chen Sun, Google; Jonathan Huang, Google; Zhuowen Tu, UC San Diego; Kevin Murphy, Google
1074Deep Variational Metric LearningXudong Lin, Tsinghua University; Yueqi Duan, Tsinghua University; Qiyuan Dong, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
1777Multi-Class Model Fitting by Energy Minimization and Mode-SeekingDániel Baráth*, MTA SZTAKI, CMP Prague; Jiri Matas, CMP CTU FEE
85A Unified Framework for Single-View 3D Reconstruction with Limited Pose SupervisionGuandao Yang*, Cornell University; Yin Cui, Cornell University; Bharath Hariharan, Cornell University
2100Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out CodesYang He*, MPI Informatics; Bernt Schiele, MPI; Mario Fritz, Max-Planck-Institut für Informatik
1260Orthogonal Deep Features Decomposition for Age-Invariant Face Recognitionyitong wang, Tencent AI Lab; dihong gong, Tencent AI Lab; zheng zhou, Tencent AI Lab; xing ji, Tencent AI Lab; Hao Wang, Tencent AI Lab; Zhifeng Li*, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
1055HiDDeN: Hiding Data with Deep NetworksJiren Zhu*, Stanford University; Russell Kaplan, Stanford University; Justin Johnson, Stanford University; Li Fei-Fei, Stanford University
893Learning and Matching Multi-View Descriptors for Registration of Point CloudsLei Zhou*, HKUST; Siyu Zhu, HKUST; Zixin Luo, HKUST; Tianwei Shen, HKUST; Runze Zhang, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
946Deep Burst DenoisingClement Godard*, University College London; Kevin Matzen, Facebook; Matt Uyttendaele, Facebook
413On Offline Evaluation of Vision-based Driving ModelsFelipe Codevilla, UAB; Antonio Lopez, CVC & UAB; Vladlen Koltun, Intel Labs; Alexey Dosovitskiy*, Intel Labs
2764Distortion-Aware Convolutional Filters for Dense Prediction in Panoramic ImagesKeisuke Tateno*, Technical University Munich; Nassir Navab, TU Munich, Germany; Federico Tombari, Technical University of Munich, Germany
200Salient Objects in Clutter: Bringing Salient Object Detection to the ForegroundDeng-Ping Fan, Nankai University; Jiang-Jiang Liu, Nankai University; Shanghua Gao, Nankai University; Qibin Hou, Nankai University; Ming-Ming Cheng*, Nankai University; Ali Borji, University of Central Florida
2910Randomized Ensemble EmbeddingsHong Xuan*, The George Washington University; Robert Pless, George Washington University
448Conditional Prior Networks for Optical FlowYanchao Yang*, UCLA; Stefano Soatto, UCLA
2675Adaptively Transforming Graph MatchingFudong Wang, Wuhan University; Nan Xue, Wuhan University; yi-peng Zhang, Syracuse University; Xiang Bai, Huazhong University of Science and Technology; Gui-Song Xia*, Wuhan University
1512Learning 3D shapes as multi-layered height maps using 2D convolutional neural networksKripasindhu Sarkar*, University of Kaiserslautern; Basavaraj Hampiholi, University of Kaiserslautern; Kiran Varanasi, German Research Center for Artificial Intelligence; Didier Stricker, DFKI
951ISNN - Impact Sound Neural Network for Material and Geometry ClassificationAuston Sterling*, UNC Chapel Hill; Justin Wilson, UNC Chapel Hill; Sam Lowe, UNC Chapel Hill; Ming Lin, UNC Chapel Hill
429Visual Psychophysics for Making Face Recognition Algorithms More ExplainableBrandon RichardWebster*, University of Notre Dame; So Yon Kwon, Perceptive Automata; Samuel Anthony, Perceptive Automata; Christopher Clarizio, University of Notre Dame; Walter Scheirer, University of Notre Dame
564Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled DataXihui Liu*, The Chinese University of Hong Kong; Hongsheng Li, Chinese University of Hong Kong; Jing Shao, The Chinese University of Hong Kong; Dapeng Chen, The Chinese University of HongKong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
1026Using LIP to Gloss Over Faces in Single-Stage Face Detection NetworksSiqi Yang*, UQ ITEE; Arnold Wiliem, University of Queensland; Shaokang Chen, University of Queensland; Brian Lovell, University of Queensland
557Variational Wasserstein ClusteringLiang Mi*, Arizona State University; wen zhang, ASU; Xianfeng GU, Stony Brook University; Yalin Wang, Arizona State University
1422ADVISE: Symbolism and External Knowledge for Decoding AdvertisementsKeren Ye*, University of Pittsburgh; Adriana Kovashka, University of Pittsburgh
95Weakly- and Semi-Supervised, Non-Overlapping Instance Segmentation of Things and StuffAnurag Arnab*, University of Oxford; Philip Torr, University of Oxford; Qizhu Li, University of Oxford
1330Broadcasting Convolutional Network for Visual Relational ReasoningSimyung Chang, Seoul National University; John Yang, Seoul National University; Seonguk Park, Seoul National University; Nojun Kwak*, Seoul National University
1848A Unified Framework for Multi-View Multi-Class Object Pose EstimationChi Li*, Johns Hopkins University; Jin Bai, Johns Hopkins University; Gregory D. Hager, The Johns Hopkins University
1162Fast and Accurate Point Cloud Registration using Trees of Gaussian MixturesBenjamin Eckart*, NVIDIA; Kihwan Kim, NVIDIA; Kautz Jan, NVIDIA
704Teaching Machines to Understand Baseball Games: Large Scale Baseball Video Database for Multiple Video Understanding TasksMinho Shim, Yonsei University; KYUNGMIN KIM, Yonsei University; Young Hwi Kim, Yonsei University; Seon Joo Kim*, Yonsei Univ.
2430Using Object Information for Spotting TextShitala Prasad*, NTU Singapore; Wai-Kin Adams Kong, Nanyang Technological University
1023Deep Domain Generalization via Conditional Invariant Adversarial NetworksYa Li, USTC; Xinmei Tian, USTC; Mingming Gong, CMU & U Pitt; Yajing Liu*, USTC; Tongliang Liu, The University of Sydney; Kun Zhang, Carnegie Mellon University; Dacheng Tao, University of Sydney
1983On the Solvability of Viewing GraphsMatthew Trager*, INRIA; Brian Osserman, UC Davis; Jean Ponce, Inria
2084Learning Type-Aware Embeddings for Fashion CompatibilityMariya Vasileva*, University of Illinois at Urbana-Champaign; Bryan Plummer, Boston University; Krishna Dusad, University of Illinois at Urbana-Champaign; Shreya Rajpal, University of Illinois at Urbana-Champaign; David Forsyth, Univeristy of Illinois at Urbana-Champaign; Ranjitha Kumar, UIUC: CS
150Visual Coreference Resolution in Visual Dialog using Neural Module NetworksSatwik Kottur*, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University; Devi Parikh, Georgia Tech & Facebook AI Research; Dhruv Batra, Georgia Tech & Facebook AI Research; Marcus Rohrbach, Facebook AI Research
1710Hard-Aware Point-to-Set Deep Metric for Person Re-identificationRui Yu*, Huazhong University of Science and Technology; Zhiyong Dou, Huazhong University of Science and Technology; Song Bai, HUST; ZHAO-XIANG ZHANG, Chinese Academy of Sciences, China; Yongchao Xu, HUST; Xiang Bai, Huazhong University of Science and Technology
244Gray box adversarial trainingVivek B S*, Indian Institute of Science; Konda Reddy Mopuri, Indian Institute of Science, Bangalore; Venkatesh Babu RADHAKRISHNAN, Indian Institute of Science
1667Exploiting Vector Fields for Geometric Rectification of Distorted Document ImagesGaofeng Meng*, Chinese Academy of Sciences; Yuanqi Su, Xi'an Jiaotong University; Ying Wu, Northwestern University; SHIMING XIANG, Chinese Academy of Sciences, China; Chunhong Pan, Institute of Automation, Chinese Academy of Sciences
781Revisiting RCNN: On Awakening the Classification Power of Faster RCNNYunchao Wei*, UIUC; Bowen Cheng, UIUC; Honghui Shi, UIUC; Rogerio Feris, IBM Research; Jinjun Xiong, IBM Thomas J. Watson Research Center; Thomas Huang, UIUC
2996DeepTAM: Deep Tracking and MappingHuizhong Zhou*, University of Freiburg; Benjamin Ummenhofer, University of Freiburg; Thomas Brox, University of Freiburg
2281On Regularized Losses for Weakly-supervised CNN SegmentationMeng Tang*, University of Waterloo; Ismail Ben Ayed, ETS; Federico Perazzi, Disney Research; Abdelaziz Djelouah, Disney Research; Christopher Schroers, Disney Research; Yuri Boykov, University of Waterloo
1531ShapeCodes: Self-Supervised Feature Learning by Lifting Views to ViewgridsDinesh Jayaraman*, UC Berkeley; Ruohan Gao, University of Texas at Austin; Kristen Grauman, University of Texas
2213A Minimal Closed-Form Solution for Multi-Perspective Pose Estimation using Points and LinesPedro Miraldo*, Instituto Superior Técnico, Lisboa; Tiago Dias, Institute for systems and robotics; Srikumar Ramalingam, University of Utah
2045Interaction-aware Spatio-temporal Pyramid Attention Networks for Action ClassificationYang Du, NLPR; Chunfeng Yuan*, NLPR; Weiming Hu, Institute of Automation,Chinese Academy of Sciences
2639Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot StudyZhenyu Wu, Texas A&M University; Zhangyang Wang*, Texas A&M University; Zhaowen Wang, Adobe Research; Hailin Jin, Adobe Research
1449Polarimetric Three-View GeometryLixiong Chen, National Institute of Informatics; Yinqiang Zheng*, National Institute of Informatics; Art Subpa-asa, Tokyo Institute of Technology; Imari Sato, National Institute of Informatics
727SketchyScene: Richly-Annotated Scene SketchesChangqing Zou*, University of Maryland (UMD); Qian Yu, Queen Mary University of London; Ruofei Du, UMD; Haoran Mo, sun yat sen university; Yi-Zhe Song, Queen Mary University of London; Tao Xiang, Queen Mary, University of London, UK; Chengying Gao, sun yat sen university; Baoquan Chen, Shandong University; Hao Zhang, SFU
1214Bi-Real Net: Enhancing the Performance of 1-bit CNNs with Improved Representational Capability and Advanced Training Algorithmzechun liu*, HKUST; Baoyuan Wu, Tencent AI Lab; Wenhan Luo, Tencent AI Lab; Xin Yang, Huazhong University of Science and Technology; Wei Liu, Tencent AI Lab; Kwang-Ting Cheng, Hong Kong University of Science and Technology
2683Deep Continuous Fusion for Multi-Sensor 3D Object DetectionMing Liang*, Uber; Shenlong Wang, Uber ATG, University of Toronto; Bin Yang, Uber ATG, University of Toronto; Raquel Urtasun, Uber ATG
1854Focus on the Hard Things: Dynamic Task Prioritization for Multitask LearningMichelle Guo*, Stanford University; Albert Haque, Stanford University; De-An Huang, Stanford University; Serena Yeung, Stanford University; Li Fei-Fei, Stanford University
2632Domain transfer through deep activation matchingHaoshuo Huang*, Tsinghua University; Qixing Huang, The University of Texas at Austin; Philipp Kraehenbuehl, UT Austin
1948Joint Blind Motion Deblurring and Depth Estimation of Light FieldDongwoo Lee, Seoul Ntional University; Haesol Park, Seoul National University; In Kyu Park, Inha University; Kyoung Mu Lee*, Seoul National University
1376Learning to Look around Objects for Top-View Representations of Outdoor ScenesSamuel Schulter*, NEC Labs; Menghua Zhai, University of Kentucky; Nathan Jacobs, University of Kentucky; Manmohan Chandraker, NEC Labs America
1972Data-Driven Sparse Structure Selection for Deep Neural NetworksZehao Huang*, TuSimple; Naiyan Wang, TuSimple
2120Reconstruction-based Pairwise Depth Dataset for Depth Image Enhancement Using CNNJunho Jeon, POSTECH; Seungyong Lee*, POSTECH
1515A Geometric Perspective on Structured Light CodingMohit Gupta*, University of Wisconsin-Madison, USA ; Nikhil Nakhate, University of Wisconsin-Madison
30173D Ego-Pose Estimation via Imitation LearningYe Yuan*, Carnegie Mellon University; Kris Kitani, CMU
2759Unsupervised Learning of Multi-Frame Optical Flow with OcclusionsJoel Janai*, Max Planck Institute for Intelligent Systems; Fatma Güney, University of Oxford; Anurag Ranjan, MPI for Intelligent Systems; Michael Black, Max Planck Institute for Intelligent Systems; Andreas Geiger, MPI-IS and University of Tuebingen
31Dynamic Conditional Networks for Few-Shot LearningFang Zhao, National University of Singapore; Jian Zhao*, National University of Singapore; Yan Shuicheng, National University of Singapore; Jiashi Feng, NUS
10173DFeat-Net: Weakly Supervised Local 3D Features for Rigid Point Cloud RegistrationZi Jian Yew*, National University of Singapore; Gim Hee Lee, National University of SIngapore
672Learning to Forecast and Refine Residual Motion for Image-to-Video GenerationLong Zhao*, Rutgers University; Xi Peng, Rutgers University; Yu Tian, Rutgers; Mubbasir Kapadia, Rutgers; Dimitris Metaxas, Rutgers
775Learn-to-Score: Efficient 3D Scene Exploration by Predicting View UtilityBenjamin Hepp*, ETH Zurich; Debadeepta Dey, Microsoft; Sudipta Sinha, Microsoft Research; Ashish Kapoor, Microsoft; Neel Joshi, -; Otmar Hilliges, ETH Zurich
126Deep Co-Training for Semi-Supervised Image RecognitionSiyuan Qiao*, Johns Hopkins University; Wei Shen, Shanghai University; Zhishuai Zhang, Johns Hopkins University; Bo Wang, Hikvision Research Institue; Alan Yuille, Johns Hopkins University
1013Attention-aware Deep Adversarial Hashing for Cross Modal RetrievalXi Zhang, Sun Yat-Sen University; Hanjiang Lai*, Sun Yat-Sen university; Jiashi Feng, NUS
2452Remote Photoplethysmography Correspondence Feature for 3D Mask Face Presentation Attack DetectionSiqi Liu*, Department of Computer Science, Hong Kong Baptist University; Xiangyuan Lan, Department of Computer Science, Hong Kong Baptist University; PongChi Yuen, Department of Computer Science, Hong Kong Baptist University
822Semi-Supervised Generative Adversarial Hashing for Image RetrievalGuan'an Wang*, Chinese Academy of Sciences; Qinghao Hu, Chinese Academy of Sciences; Jian Cheng, Chinese Academy of Sciences, China; Zengguang Hou, Chinese Academy of Sciences
1331Improving Spatiotemporal Self-Supervision by Deep Reinforcement LearningUta Büchler*, Heidelberg University; Biagio Brattoli, Heidelberg University; Bjorn Ommer, Heidelberg University
1625AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed VideosZheng Shou*, Columbia University; Hang Gao, Columbia University; Lei Zhang, Microsoft Research; Kazuyuki Miyazawa, Mitsubishi Electric; Shih-Fu Chang, Columbia University
937Revisiting Autofocus for Smartphone CamerasAbdullah Abuolaim*, York University; Abhijith Punnappurath, York University; Michael Brown, York University
598Contour Knowledge Transfer for Salient Object DetectionXin Li, UESTC; Fan Yang*, UESTC; Hong Cheng, UESTC; Wei Liu, Digital Media Technology Key Laboratory of Sichuan Province, UESTC; Dinggang Shen, UNC
1990Deep Volumetric Video From Very Sparse Multi-View Performance CaptureZeng Huang*, University of Southern California; Tianye Li, University of Southern California; Weikai Chen, USC Institute for Creative Technology; Yajie Zhao, USC Institute for Creative Technology ; Jun Xing, Institute for Creative Technologies, USC; Chloe LeGendre, USC Institute for Creative Technology ; Linjie Luo, Snap Inc; Chongyang Ma, Snap Inc.; Hao Li, Pinscreen/University of Southern California/USC ICT
892Person Re-identification with Deep Similarity-Guided Graph Neural NetworkYantao Shen*, The Chinese University of Hong Kong; Hongsheng Li, Chinese University of Hong Kong; Shuai Yi, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
1415Deep Component Analysis via Alternating Direction Neural NetworksCalvin Murdock*, Carnegie Mellon University; MingFang Chang, Carnegie Mellon University; Simon Lucey, CMU
2741Understanding Perceptual and Conceptual Fluency at a Large ScaleMeredith Hu*, Cornell University; Ali Borji, University of Central Florida
71Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven LossJianbo Jiao*, City University of Hong Kong; Ying Cao, City University of Hong Kong; Yibing Song, Tencent AI Lab; Rynson Lau, City University of Hong Kong