Sessions

Sessions

Oral Sessions

Oral session 1A

1ALearning for Vision 1
Monday, September 10
Oral session 8:30 AM - 9:45 AMAndrea Vedaldi, Oxford
Timothy Hospedales, University of Edinburgh
O-1A-01Convolutional Networks with Adaptive Computation GraphsAndreas Veit*, Cornell University; Serge Belongie, Cornell University
O-1A-02Progressive Neural Architecture SearchChenxi Liu*, Johns Hopkins University; Maxim Neumann, Google; Barret Zoph, Google; Jon Shlens, Google; Wei Hua, Google; Li-Jia Li, Google; Li Fei-Fei, Stanford University; Alan Yuille, Johns Hopkins University; Jonathan Huang, Google; Kevin Murphy, Google
O-1A-03Diverse Image-to-Image Translation via Disentangled RepresentationsHsin-Ying Lee*, University of California, Merced; Hung-Yu Tseng, University of California, Merced; Maneesh Singh, Verisk Analytics; Jia-Bin Huang, Virginia Tech; Ming-Hsuan Yang, University of California at Merced
O-1A-04Lifting Layers: Analysis and ApplicationsMichael Moeller*, University of Siegen; Peter Ochs, Saarland University; Tim Meinhardt, Technical University of Munich; Laura Leal-Taixé, TUM
O-1A-05Learning with Biased Complementary LabelsXiyu Yu*, The University of Sydney; Tongliang Liu, The University of Sydney; Mingming Gong, University of Pittsburgh; Dacheng Tao, University of Sydney

Oral session 1B
1BComputational Photography 1
Monday, September 10
Oral session 1:00 PM - 2:15 PMJan-Michael Frahm, University of North Carolina at Chapel Hill
Gabriel Brostow, University College London
O-1B-01Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based ModelingHiroaki Santo*, Osaka University; Michael Waechter, Osaka University; Masaki Samejima, Osaka University; Yusuke Sugano, Osaka University; Yasuyuki Matsushita, Osaka University
O-1B-02Programmable Light CurtainsJian Wang*, Carnegie Mellon University; Joe Bartels, Carnegie Mellon University; William Whittaker, Carnegie Mellon University; Aswin Sankaranarayanan, Carnegie Mellon University; Srinivasa Narasimhan, Carnegie Mellon University
O-1B-03Learning to Separate Object Sounds by Watching Unlabeled VideoRuohan Gao*, University of Texas at Austin; Rogerio Feris, IBM Research; Kristen Grauman, University of Texas
O-1B-04Coded Two-Bucket Cameras for Computer VisionMian Wei, University of Toronto; Navid Navid Sarhangnejad, University of Toronto; Zhengfan Xia, University of Toronto; Nikola Katic, University of Toronto; Roman Genov, University of Toronto; Kyros Kutulakos*, University of Toronto
O-1B-05Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone ImageZhengqin Li*, UC San Diego; Manmohan Chandraker, UC San Diego; Sunkavalli Kalyan, Adobe Research

Oral session 1C
1CVideo
Monday, September 10
Oral session 2:45 PM - 4:00 PMIvan Laptev, INRIA
Thomas Brox, University of Freiburg
O-1C-01End-to-End Joint Semantic Segmentation of Actors and Actions in VideoJingwei Ji*, Stanford University; Shyamal Buch, Stanford University; Alvaro Soto, Universidad Catolica de Chile; Juan Carlos Niebles, Stanford University
O-1C-02Learning-based Video Motion MagnificationTae-Hyun Oh, MIT CSAIL; Ronnachai Jaroensri*, MIT CSAIL; Changil Kim, MIT CSAIL; Mohamed A. Elghareb, Qatar Computing Research Institute; Fredo Durand, MIT; Bill Freeman, MIT; Wojciech Matusik, MIT CSAIL
O-1C-03Massively Parallel Video NetworksViorica Patraucean*, DeepMind; Joao Carreira, DeepMind; Laurent Mazare, DeepMind; Simon Osindero, DeepMind; Andrew Zisserman, University of Oxford
O-1C-04DeepWrinkles: Accurate and Realistic Clothing ModelingZorah Laehner, TU Munich; Tony Tung*, Facebook / Oculus Research; Daniel Cremers, TUM
O-1C-05Learning Discriminative Video Representations Using Adversarial PerturbationsJue Wang*, ANU; Anoop Cherian, MERL

Oral session 2A
2AHumans analysis 1
Tuesday, September 11
Oral session 8:30 AM - 9:45 AMKris Kitani, Carnegie Mellon University
Tinne Tuytelaars, KU Leuven
O-2A-01Scaling Egocentric Vision: The E-Kitchens DatasetDima Damen*, University of Bristol; Hazel Doughty, University of Bristol; Sanja Fidler, University of Toronto; Antonino Furnari, University of Catania; Evangelos Kazakos, University of Bristol; Giovanni Farinella, University of Catania, Italy; Davide Moltisanti, University of Bristol; Jonathan Munro, University of Bristol; Toby Perrett, University of Bristol; Will Price, University of Bristol; Michael Wray, University of Bristol
O-2A-02Unsupervised Person Re-identification by Deep Learning Tracklet AssociationMinxian Li*, Nanjing University and Science and Technology; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
O-2A-03Predicting Gaze in Egocentric Video by Learning Task-dependent Attention TransitionYifei Huang*, The University of Tokyo; Minjie Cai, Hunan University, The University of Tokyo; Zhenqiang Li, The University of Tokyo; Yoichi Sato,The University of Tokyo
O-2A-04Instance-level Human Parsing via Part Grouping NetworkKe Gong*, SYSU; Xiaodan Liang, Carnegie Mellon University; Yicheng Li, Sun Yat-sen University; Yimin Chen, sensetime; Liang Lin, Sun Yat-sen University
O-2A-05Adversarial Geometry-Aware Human Motion PredictionLiangyan Gui*, Carnegie Mellon University; Yu-Xiong Wang, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University

Oral session 2B
2BHuman Sensing I
Tuesday, September 11
Oral session 1:00 PM - 2:15 PMMykhaylo Andriluka, Max Planck Insititute
Pascal Fua, EPFL
O-2B-01Weakly-supervised 3D Hand Pose Estimation from Monocular RGB ImagesYujun Cai*, Nanyang Technological University; Liuhao Ge, NTU; Jianfei Cai, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
O-2B-02Audio-Visual Scene Analysis with Self-Supervised Multisensory FeaturesAndrew Owens*, UC Berkeley; Alexei Efros, UC Berkeley
O-2B-03Jointly Discovering Visual Objects and Spoken Words from Raw Sensory InputDavid Harwath*, MIT CSAIL; Adria Recasens, Massachusetts Institute of Technology; Dídac Surís, Universitat Politecnica de Catalunya; Galen Chuang, MIT; Antonio Torralba, MIT; James Glass, MIT
O-2B-04DeepIM: Deep Iterative Matching for 6D Pose EstimationYi Li*, Tsinghua University; Gu Wang, Tsinghua University; Xiangyang Ji, Tsinghua University; Yu Xiang, University of Michigan; Dieter Fox, University of Washington
O-2B-05Implicit 3D Orientation Learning for 6D Object Detection from RGB ImagesMartin Sundermeyer*, German Aerospace Center (DLR); Zoltan Marton, DLR; Maximilian Durner, DLR; Rudolph Triebel, German Aerospace Center (DLR)

Oral session 2C
2CComputational Photograpy 2
Tuesday, September 11
Oral session 2:45 PM - 4:00 PMKyros Kutulakos, University of Toronto
Kalyan Sunkavalli, Adobe Research
O-2C-01Direct Sparse Odometry With Rolling ShutterDavid Schubert*, Technical University of Munich; Vladyslav Usenko, TU Munich; Nikolaus Demmel, TUM; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM
O-2C-023D Motion Sensing from 4D Light Field GradientsSizhuo Ma*, University of Wisconsin-Madison; Brandon Smith, University of Wisconsin-Madison; Mohit Gupta, University of Wisconsin-Madison, USA
O-2C-03A Style-aware Content Loss for Real-time HD Style TransferArtsiom Sanakoyeu*, Heidelberg University; Dmytro Kotovenko, Heidelberg University; Bjorn Ommer, Heidelberg University
O-2C-04Scale-Awareness of Light Field Camera based Visual OdometryNiclas Zeller*, Karlsruhe University of Applied Sciences; Franz Quint, Karlsruhe University of Applied Sciences; Uwe Stilla, Technische Universitaet Muenchen
O-2C-05Burst Image Deblurring Using Permutation Invariant Convolutional Neural NetworksMiika Aittala*, MIT; Fredo Durand, MIT

Oral session 3A
3AStereo and reconstruction
Wednesday, September 12
Oral session 8:30 AM - 9:45 AMNoah Snavely, Cornell University
Andreas Geiger, University of Tübingen
O-3A-01MVSNet: Depth Inference for Unstructured Multi-view StereoYao Yao*, The Hong Kong University of Science and Technology; Zixin Luo, HKUST; Shiwei Li, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
O-3A-02PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D RegistrationYifei Shi, Princeton University; Kai Xu, Princeton University and National University of Defense Technology; Matthias Niessner, Technical University of Munich; Szymon Rusinkiewicz, Princeton University; Thomas Funkhouser*, Princeton, USA
O-3A-03Active Stereo Net: End-to-End Self-Supervised Learning for Active Stereo SystemsYinda Zhang*, Princeton University; Sean Fanello, Google; Sameh Khamis, Google; Christoph Rhemann, Google; Julien Valentin, Google; Adarsh Kowdle, Google; Vladimir Tankovich, Google; Shahram Izadi, Google; Thomas Funkhouser, Princeton, USA
O-3A-04GAL: Geometric Adversarial Loss for Single-View 3D-Object ReconstructionLi Jiang*, The Chinese University of Hong Kong; Xiaojuan Qi, CUHK; Shaoshuai SHI, The Chinese University of Hong Kong; Jia Jiaya, Chinese University of Hong Kong
O-3A-05Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse OdometryNan Yang*, Technical University of Munich; Rui Wang, Technical University of Munich; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM

Oral session 3B
3BHuman Sensing II
Wednesday, September 12
Oral session 1:00 PM - 2:15 PMGerard-Pons Moll, Max Planck Institute
Juergen Gall, University of Bonn
O-3B-01Unsupervised Geometry-Aware Representation for 3D Human Pose EstimationHelge Rhodin*, EPFL; Mathieu Salzmann, EPFL; Pascal Fua, EPFL, Switzerland
O-3B-02Dual-Agent Deep Reinforcement Learning for Deformable Face TrackingMinghao Guo, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
O-3B-03Deep Autoencoder for Combined Human Pose Estimation and Body Model UpscalingMatthew Trumble*, University of Surrey; Andrew Gilbert, University of Surrey; John Collomosse, Adobe Research; Adrian Hilton, University of Surrey
O-3B-04Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density NetworkQi Ye*, Imperial College London; Tae-Kyun Kim, Imperial College London
O-3B-05GANimation: Anatomically-aware Facial Animation from a Single ImageAlbert Pumarola*, Institut de Robotica i Informatica Industrial; Antonio Agudo, Institut de Robotica i Informatica Industrial, CSIC-UPC; Aleix Martinez, The Ohio State University; Alberto Sanfeliu, Industrial Robotics Institute; Francesc Moreno, IRI

Oral session 3C
3COptimization
Wednesday, September 12
Oral session 4:00 PM - 5:15 PMVincent Lepetit, University of Bordeaux
Vladlen Koltun, Intel
O-3C-01Deterministic Consensus Maximization with Biconvex ProgrammingZhipeng Cai*, The University of Adelaide; Tat-Jun Chin, University of Adelaide; Huu Le, University of Adelaide; David Suter, University of Adelaide
O-3C-02Robust fitting in computer vision: easy or hard?Tat-Jun Chin*, University of Adelaide; Zhipeng Cai, The University of Adelaide; Frank Neumann, The University of Adelaide, School of Computer Science, Faculty of Engineering, Computer and Mathematical Science
O-3C-03Highly-Economized Multi-View Binary Compression for Scalable Image ClusteringZheng Zhang*, Harbin Institute of Technology Shenzhen Graduate School; Li Liu, the inception institute of artificial intelligence; Jie Qin, ETH Zurich; Fan Zhu, the inception institute of artificial intelligence ; Fumin Shen, UESTC; Yong Xu, Harbin Institute of Technology Shenzhen Graduate School; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC)
O-3C-04Efficient Semantic Scene Completion Network with Spatial Group ConvolutionJiahui Zhang*, Tsinghua University; Hao Zhao, Intel Labs China; Anbang Yao, Intel Labs China; Yurong Chen, Intel Labs China; Hongen Liao, Tsinghua University
O-3C-05Asynchronous, Photometric Feature Tracking using Events and FramesDaniel Gehrig, University of Zurich; Henri Rebecq*, University of Zurich; Guillermo Gallego, University of Zurich; Davide Scaramuzza, University of Zurich& ETH Zurich, Switzerland

Oral session 4A
4ALearning for Vision 2
Thursday, September 13
Oral session 8:30 AM - 9:30 AMKyoung Mu Lee, Seoul National University
Michael Felsberg, Linköping University
O-4A-01Group NormalizationYuxin Wu, Facebook; Kaiming He*, Facebook Inc., USA
O-4A-02Deep Expander Networks: Efficient Deep Networks from Graph TheoryAmeya Prabhu*, IIIT Hyderabad; Girish Varma, IIIT Hyderabad; Anoop Namboodiri, IIIT Hyderbad
O-4A-03Towards Realistic PredictorsPei Wang*, UC San Diego; Nuno Vasconcelos, UC San Diego
O-4A-04Learning SO(3) Equivariant Representations with Spherical CNNsCarlos Esteves*, University of Pennsylvania; Kostas Daniilidis, University of Pennsylvania; Ameesh Makadia, Google Research; Christine Allec-Blanchette, University of Pennsylvania

Oral session 4B
4BMatching and Recognition
Thursday, September 13
Oral session 1:00 PM - 2:15 PMRoss Girshick, Facebook
Philipp Kraehenbuehl, University of Texas at Austin
O-4B-01CornerNet: Detecting Objects as Paired KeypointsHei Law*, University of Michigan; Jia Deng, University of Michigan
O-4B-02RelocNet: Continous Metric Learning Relocalisation using Neural NetsVassileios Balntas*, University of Oxford; Victor Prisacariu, University of Oxford; Shuda Li, University of Oxford
O-4B-03The Contextual Loss for Image Transformation with Non-Aligned DataRoey Mechrez*, Technion; Itamar Talmi, Technion; Lihi Zelnik-Manor, Technion
O-4B-04Acquisition of Localization Confidence for Accurate Object DetectionBorui Jiang*, Peking University; Ruixuan Luo, Peking University; Jiayuan Mao, Tsinghua University; Tete Xiao, Peking University; Yuning Jiang, Megvii(Face++) Inc
O-4B-05Deep Model-Based 6D Pose Refinement in RGBFabian Manhardt*, TU Munich; Wadim Kehl, Toyota Research Institute; Nassir Navab, Technische Universität München, Germany; Federico Tombari, Technical University of Munich, Germany

Oral session 4C
4CVideo and attention
Thursday, September 13
Oral session 2:45 PM - 4:00 PMHedvig Kjellström, KTH
Lihi Zelnik Manor, Technion
O-4C-01DeepTAM: Deep Tracking and MappingHuizhong Zhou*, University of Freiburg; Benjamin Ummenhofer, University of Freiburg; Thomas Brox, University of Freiburg
O-4C-02ContextVP: Fully Context-Aware Video PredictionWonmin Byeon*, NVIDIA; Qin Wang, ETH Zurich; Rupesh Kumar Srivastava, NNAISENSE; Petros Koumoutsakos, ETH Zurich
O-4C-03Saliency Benchmarking Made Easy: Separating Models, Maps and MetricsMatthias Kümmerer*, University of Tübingen; Thomas Wallis, University of Tübingen; Matthias Bethge, University of Tübingen
O-4C-04Museum Exhibit Identification Challenge for the Supervised Domain Adaptation.Piotr Koniusz*, Data61/CSIRO, ANU; Yusuf Tas, Data61; Hongguang Zhang, Australian National University; Mehrtash Harandi, Monash University; Fatih Porikli, ANU; Rui Zhang, University of Canberra
O-4C-05Multi-Attention Multi-Class Constraint for Fine-grained Image RecognitionMing Sun, baidu; Yuchen Yuan, Baidu Inc.; Feng Zhou*, Baidu Research; Errui Ding, Baidu Inc.

Poster Sessions

Poster session 1A

1AMonday, September 10Poster Session 10:00 AM - 12:00 PM
P-1A-01ECO: Efficient Convolutional Network for Online Video UnderstandingMohammadreza Zolfaghari*, University of Freiburg; kamaljeet singh, University of Freiburg; Thomas Brox, University of Freiburg
P-1A-02Learning to Anonymize Faces for Privacy Preserving Action DetectionZhongzheng Ren*, University of California, Davis; Yong Jae Lee, University of California, Davis; Michael Ryoo, Indiana University
P-1A-03Adversarial Open-World Person Re-IdentificationXiang Li, Sun Yat-sen University; Ancong Wu, Sun Yat-sen University; Jason Wei Shi Zheng*, Sun Yat Sen University
P-1A-04Graph R-CNN for Scene Graph GenerationJianwei Yang*, Georgia Institute of Technology; Jiasen Lu, Georgia Institute of Technology; Stefan Lee, Georgia Institute of Technology; Dhruv Batra, Georgia Tech & Facebook AI Research; Devi Parikh, Georgia Tech & Facebook AI Research
P-1A-05Contemplating Visual Emotions: Understanding and Overcoming Dataset BiasRameswar Panda*, UC Riverside; Jianming Zhang, Adobe Research; Haoxiang Li, Adobe; Joon-Young Lee, Adobe Research; Xin Lu, Adobe; Amit Roy-Chowdhury , University of California, Riverside, USA
P-1A-06Graph Adaptive Knowledge Transfer for Unsupervised Domain AdaptationZhengming Ding*, Northeastern University; Sheng Li, Adobe Research; Ming Shao, University of Massachusetts Dartmouth; YUN FU, Northeastern University
P-1A-07Deep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial NetworksSiyeong Lee, Sogang University; Gwon Hwan An, Sogang University; Suk-Ju Kang*, Nil
P-1A-08Deep Cross-Modal Projection Learning for Image-Text MatchingYing Zhang*, Dalian University of Technology; Huchuan Lu, Dalian University of Technology
P-1A-09Composition Loss for Counting, Density Map Estimation and Localization in Dense CrowdsHaroon Idrees*, Carnegie Mellon University; Muhammad Tayyab, UCF; Kishan Athrey, UCF; Mubarak Shah, University of Central Florida; Dong Zhang, University of Central Florida, USA
P-1A-10Person Search by Multi-Scale MatchingXu Lan*, Queen Mary University of London; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
P-1A-11Efficient 6-DoF Tracking of Handheld Objects from an Egocentric ViewpointRohit Pandey, Google; Pavel Pidlypenskyi, Google; Shuoran Yang, Google; Christine Kaeser-Chen*, Google
P-1A-12Deep Video Generation, Prediction and Completion of Human Action SequencesChunyan Bai, Hong Kong University of Science and Technology; Haoye Cai*, Hong Kong University of Science and Technology; Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
P-1A-13Efficient Uncertainty Estimation for Semantic Segmentation in VideosPo-Yu Huang*, National Tsing Hua University; Wan-Ting Hsu, National Tsing Hua University; Chun-Yueh Chiu, National Tsing Hua University; Tingfan Wu, Umbo Computer Vision; Min Sun, NTHU
P-1A-14DeepKSPD: Learning Kernel-matrix-based SPD Representation for Fine-grained Image RecognitionMelih Engin, university of wollongong; Lei Wang*, University of Wollongong, Australia; Luping Zhou, University of Wollongong, Australia; Xinwang Liu, National University of Defense Technology
P-1A-15From Face Recognition to Models of Identity: A Bayesian Approach to Learning about Unknown Identities from Unsupervised DataDaniel Castro*, Imperial College London; Sebastian Nowozin, Microsoft Research Cambridge
P-1A-16ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object StackingOliver Groth*, Oxford Robotics Insitute; Fabian Fuchs, Oxford Robotics Insitute; Andrea Vedaldi, Oxford University; Ingmar Posner, Oxford
P-1A-17Fast and Precise Camera Covariance Computation for Large 3D ReconstructionMichal Polic*, Czech Technical University in Prague; Wolfgang Foerstner, University Bonn; Tomas Pajdla, Czech Technical University in Prague
P-1A-18Inner Space Preserving Generative Pose MachineShuangjun Liu, Northeastern University; Sarah Ostadabbas*, Northeastern University
P-1A-19CTAP: Complementary Temporal Action Proposal GenerationJiyang Gao*, USC; Kan Chen, University of Southern California, USA; Ram Nevatia, U of Southern California
P-1A-20Learning to Reenact Faces via Boundary TransferWayne Wu, SenseTime Research; Yunxuan Zhang, sensetime research; Cheng Li*, SenseTime Research; Chen Qian, SenseTime; Chen Change Loy, Chinese University of Hong Kong
P-1A-21Fast and Accurate Intrinsic Symmetry DetectionRajendra Nagar*, Indian Institute of Technology Gandhinagar; Shanmuganathan Raman, IIT Gandhinagar
P-1A-22Fictitious GAN: Training GANs with Historical ModelsYin Xia*, Northwestern University; Xu Chen, Northwestern University; Hao Ge, Northwestern University; Ying Wu, Northwestern University; Randall Berry, Northwestern University
P-1A-23Audio-Visual Event Localization in Unconstrained VideosYapeng Tian*, University of Rochester; Jing Shi, University of Rochester; Bochen Li, University of Rochester; Zhiyao Duan, Unversity of Rochester; Chenliang Xu, University of Rochester
P-1A-24Tackling 3D ToF Artifacts Through Learning and the FLAT DatasetQi Guo, Harvard University; Iuri Frosio*, NVIDIA; Orazio Gallo, NVIDIA Research; Todd Zickler, Harvard University; Kautz Jan, NVIDIA
P-1A-25Self-Calibrating Isometric Non-Rigid Structure-from-Motionshaifali parashar*, CNRS; Adrien Bartoli, Université Clermont Auvergne; Daniel Pizarro, Universidad de Alcala
P-1A-26Semi-Supervised Deep Learning with MemoryYanbei Chen*, Queen Mary University of London; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
P-1A-27Question-Guided Hybrid Convolution for Visual Question Answeringgao peng*, Chinese university of hong kong; Hongsheng Li, Chinese University of Hong Kong; Shuang Li, The Chinese University of Hong Kong; Pan Lu, Tsinghua University; Yikang LI, The Chinese University of Hong Kong; Steven Hoi, SMU; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
P-1A-28Rolling Shutter Pose and Ego-motion Estimation using Shape-from-TemplateYizhen Lao*, Université Clermont Auvergne; Omar Ait-Aider, Université Clermont Auvergne; Adrien Bartoli, Université Clermont Auvergne
P-1A-29Semi-Dense 3D Reconstruction with a Stereo Event CameraYi Zhou*, The Australian National University; Guillermo Gallego, University of Zurich; Henri Rebecq, University of Zurich; Laurent Kneip, ShanghaiTech University; HONGDONG LI, Australian National University, Australia; Davide Scaramuzza, University of Zurich& ETH Zurich, Switzerland
P-1A-30Local Orthogonal-Group TestingAhmet Iscen*, Czech Technical University; Ondrej Chum, Vision Recognition Group, Czech Technical University in Prague
P-1A-31Temporal Relational Reasoning in VideosBolei Zhou*, MIT; Alex Andonian, Massachusetts Institute of Technology; Aude Oliva, MIT; Antonio Torralba, MIT
P-1A-32Deep High Dynamic Range Imaging with Large Foreground MotionsShangzhe Wu*, HKUST; Jiarui Xu, Hong Kong University of Science and Technology (HKUST); Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
P-1A-33Geometric Constrained Joint Lane Segmentation and Lane Boundary DetectionJie Zhang*, Shanghai Jiao Tong University; Yi Xu, Shanghai Jiao Tong University; Bingbing Ni, Shanghai Jiao Tong University; Zhenyu Duan, Shanghai Jiao Tong University
P-1A-34Attributes as OperatorsTushar Nagarajan*, UT Austin; Kristen Grauman, University of Texas
P-1A-35Textual Explanations for Self-Driving VehiclesJinkyu Kim*, UC Berkeley; Anna Rohrbach, UC Berkeley; Trevor Darrell, UC Berkeley; John Canny, UC Berkeley; Zeynep Akata, University of Amsterdam
P-1A-36Generative Domain-Migration Hashing for Sketch-to-Image RetrievalJingyi Zhang*, University of Electronic Science and Technology of China; Fumin Shen, UESTC; Li Liu, the inception institute of artificial intelligence; Fan Zhu, the inception institute of artificial intelligence ; Mengyang Yu, ETH Zurich; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC); Luc Van Gool, ETH Zurich
P-1A-37Recurrent Fusion Network for Image captioningWenhao Jiang*, Tencent AI Lab; Lin Ma, Tencent AI Lab; Yu-Gang Jiang, Fudan University; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
P-1A-38Attention-based Ensemble for Deep Metric LearningWonsik Kim*, Samsung Electronics; Bhavya Goyal, Samsung Electronics; Kunal Chawla, Samsung Electronics; Jungmin Lee, Samsung Electronics; Keunjoo Kwon, Samsung Electronics
P-1A-39Egocentric Activity Prediction via Event Modulated AttentionYang Shen*, Shanghai Jiao Tong University; Bingbing Ni, Shanghai Jiao Tong University; Zefan Li, Shanghai Jiao Tong University; Ning Zhuang, Shanghai Jiao Tong University
P-1A-40A+D Net: Training a Shadow Detector with Adversarial Shadow AttenuationHieu Le*, Stony Brook University; Tomas F Yago Vicente, Stony Brook University; Vu Nguyen, Stony Brook University; Minh Hoai Nguyen, Stony Brook University; Dimitris Samaras, Stony Brook University
P-1A-41Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous DrivingPeiliang LI*, HKUST Robotics Institute; Tong QIN, HKUST Robotics Institute; Shaojie Shen, HKUST
P-1A-42End-to-end View Synthesis for Light Field Imaging with Pseudo 4DCNNYunlong Wang*, Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese Academy of Sciences (CASIA) ; Fei Liu, Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese Academy of Sciences (CASIA); Zilei Wang, University of Science and Technology of China; Guangqi Hou, Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese Academy of Sciences (CASIA); Zhenan Sun, Chinese of Academy of Sciences; Tieniu Tan, NLPR, China
P-1A-43Robust image stitching using multiple registrationsCharles Herrmann, Cornell; Chen Wang, Google Research; Richard Bowen, Cornell; Mike Krainin, Google; Ce Liu, Google; Bill Freeman, MIT; Ramin Zabih*, Cornell Tech/Google Research
P-1A-44Fast Multi-fiber Network for Video RecognitionYunpeng Chen*, National University of Singapore; Yannis Kalantidis, Facebook Research, USA; Jianshu Li, NUS; Yan Shuicheng, National University of Singapore; Jiashi Feng, NUS
P-1A-45TBN: Convolutional Neural Network with Ternary Inputs and Binary WeightsDiwen Wan*, University of Electronic Science and Technology of China; Fumin Shen, UESTC; Li Liu, the inception institute of artificial intelligence; Fan Zhu, the inception institute of artificial intelligence ; Jie Qin, ETH Zurich; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC)
P-1A-46Contextual Based Image Inpainting: Infer, Match and TranslateYuhang Song*, USC; Chao Yang, University of Southern California; Zhe Lin, Adobe Research; Xiaofeng Liu, Carnegie Mellon University; Hao Li, Pinscreen/University of Southern California/USC ICT; Qin Huang, University of Southern California; C.-C. Jay Kuo, USC
P-1A-47Deep Fundamental Matrix EstimationRene Ranftl*, Intel Labs; Vladlen Koltun, Intel Labs
P-1A-48Joint Person Segmentation and Identification in Synchronized First- and Third-person VideosMingze Xu*, Indiana University; Chenyou Fan, JD.com; Yuchen Wang, Indiana University; Michael Ryoo, Indiana University; David Crandall, Indiana University
P-1A-49Linear Span Network for Object Skeleton DetectionChang Liu*, University of Chinese Academy of Sciences; Wei Ke, University of Chinese Academy of Sciences; Fei Qin, University of Chinese Academy of Sciences; Qixiang Ye, University of Chinese Academy of Sciences, China
P-1A-50Category-Agnostic Semantic Keypoint Representations in Canonical Object ViewsXingyi Zhou*, The University of Texas at Austin; Arjun Karpur, The University of Texas at Austin; Linjie Luo, Snap Inc; Qixing Huang, The University of Texas at Austin
P-1A-51Where are the blobs: Counting by Localization with Point SupervisionIssam Hadj Laradji*, University of British Columbia (UBC); Negar Rostamzadeh, Element AI; Pedro Pinheiro, EPFL; David Vazquez, Element AI; Mark Schmidt, University of British Columbia
P-1A-52A Hybrid Model for Identity Obfuscation by Face ReplacementQianru Sun*, National University of Singapore; Ayush Tewari, Max Planck Institute for Informatics; Weipeng Xu, MPII; Mario Fritz, Max-Planck-Institut für Informatik; Christian Theobalt, MPI Informatik; Bernt Schiele, MPI
P-1A-53Exploring the Limits of Supervised PretrainingDhruv Mahajan, Facebook; Ross Girshick*, Facebook AI Research (FAIR); Vignesh Ramanathan, Facebook; Kaiming He, Facebook Inc., USA; Manohar Paluri, Facebook; Yixuan Li, Facebook Research; Ashwin Bharambe, Facebook; Laurens van der Maaten, Facebook AI Research
P-1A-54TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the WildMatthias Müller*, King Abdullah University of Science and Technology (KAUST); Adel Bibi, KAUST; Silvio Giancola, KAUST; Salman Al-Subaihi, KAUST; Bernard Ghanem, KAUST
P-1A-55Unpaired Image Captioning by Language PivotingJiuxiang Gu*, Nanyang Technological University; Shafiq Joty, Nanyang Technological University; Jianfei Cai, Nanyang Technological University; Gang Wang, Alibaba Group
P-1A-56Pairwise Relational Networks for Face RecognitionBong-Nam Kang*, POSTECH
P-1A-57DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention NetworksWeixuan Chen*, MIT Media Lab; Daniel McDuff, Microsoft Research
P-1A-58Semantic Match Consistency for Long-Term Visual LocalizationCarl Toft*, Chalmers; Erik Stenborg, Chalmers University; Lars Hammarstrand, Chalmers university of technology; Lucas Brynte, Chalmers University of Technology; Marc Pollefeys, ETH Zurich; Torsten Sattler, ETH Zurich; Fredrik Kahl, Chalmers
P-1A-59Grounding Visual ExplanationsLisa Anne Hendricks*, Uc berkeley; Ronghang Hu, University of California, Berkeley; Trevor Darrell, UC Berkeley; Zeynep Akata, University of Amsterdam
P-1A-60Cross-Modal Hamming HashingYue Cao, Tsinghua University; Mingsheng Long*, Tsinghua University; Bin Liu, Tsinghua University; Jianmin Wang, Tsinghua University, China
P-1A-61A Modulation Module for Multi-task Learning with Applications in Image RetrievalXiangyun Zhao*, Northwestern University; Haoxiang Li, Adobe; Xiaohui Shen, Adobe Research; Xiaodan Liang, Carnegie Mellon University; Ying Wu, Northwestern University
P-1A-62Open-World Stereo Video Matching with Deep RNNYiran Zhong*, Australian National University; HONGDONG LI, Australian National University, Australia; Yuchao Dai, Northwestern Polytechnical University
P-1A-63Deblurring Natural Image Using Super-Gaussian FieldsYuhang Liu, Wuhan University; Wenyong Dong*, Wuhan University; Dong Gong, Northwestern Polytechnical University & The University of Adelaide; Lei Zhang, The unversity of Adelaide; Qinfeng Shi, University of Adelaide
P-1A-64Diverse and Coherent Paragraph Generation from ImagesMoitreya Chatterjee*, University of Illinois at Urbana Champaign; Alexander Schwing, UIUC
P-1A-65Learning Compression from limited unlabeled DataXiangyu He*, Chinese Academy of Sciences; Jian Cheng, Chinese Academy of Sciences, China
P-1A-66Deep Video Quality Assessor: From Spatio-temporal Visual Sensitivity to A Convolutional Neural Aggregation NetworkWoojae Kim*, Yonsei University; Jongyoo Kim, Yonsei University; Sewoong Ahn, Yonsei University; Jinwoo Kim, Yonsei University; Sanghoon Lee, Yonsei University, Korea
P-1A-67Product Quantization Network for Fast Image RetrievalTan Yu*, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA; CHEN FANG, Adobe Research, San Jose, CA; Hailin Jin, Adobe Research
P-1A-68Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph GenerationYikang LI*, The Chinese University of Hong Kong; Bolei Zhou, MIT; Yawen Cui, National University of Defense Technology ; Jianping Shi, Sensetime Group Limited; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong; Wanli Ouyang, CUHK
P-1A-69C-WSL: Count-guided Weakly Supervised LocalizationMingfei Gao*, University of Maryland; Ang Li, Google DeepMind; Ruichi Yu, University of Maryland, College Park; Vlad Morariu, Adobe Research; Larry Davis, University of Maryland
P-1A-70The Sound of PixelsHang Zhao*, Massachusetts Institute of Technology; Chuang Gan, MIT; Andrew Rouditchenko, MIT; Carl Vondrick, MIT; Josh McDermott, Massachusetts Institute of Technology; Antonio Torralba, MIT
P-1A-71Unsupervised Video Object Segmentation using Motion Saliency-Guided Spatio-Temporal PropagationYuan-Ting Hu*, University of Illinois at Urbana-Champaign; Jia-Bin Huang, Virginia Tech; Alexander Schwing, UIUC
P-1A-72Good Line Cutting: towards Accurate Pose Tracking of Line-assisted VO/VSLAMYipu Zhao*, Georgia Institute of Technology; Patricio Vela, Georgia Institute of Technology
P-1A-73Bi-box Regression for Pedestrian Detection and Occlusion EstimationCHUNLUAN ZHOU*, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
P-1A-74Unveiling the Power of Deep TrackingGoutam Bhat*, Linkoping University; Joakim Johnander, Linköping University; Martin Danelljan, Linkoping University; Fahad Shahbaz Khan, Linköping University; Michael Felsberg, Linköping University
P-1A-75Multi-Scale Structure-Aware Network for Human Pose Estimation Lipeng Ke*, University of Chinese Academy of Sciences; Ming-Ching Chang, Albany University; Honggang Qi, University of Chinese Academy of Sciences; Siwei Lyu, University at Albany
P-1A-76Neural Graph Matching Networks for Fewshot 3D Action RecognitionMichelle Guo*, Stanford University; Edward Chou, Stanford University; De-An Huang, Stanford University; Shuran Song, Princeton; Serena Yeung, Stanford University; Li Fei-Fei, Stanford University
P-1A-77Objects that SoundRelja Arandjelovi?*, DeepMind; Andrew Zisserman, University of Oxford
P-1A-78Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image TranslationChao Wang, Ocean University of China; Haiyong Zheng*, Ocean University of China; Zhibin Yu, Ocean University of China; Ziqiang Zheng, Ocean University of China; Zhaorui Gu, Ocean University of China; Bing Zheng, Ocean University of China
P-1A-79SaaS: Speed as a Supervisor for Semi-supervised LearningSafa Cicek*, UCLA; Alhussein Fawzi, UCLA; Stefano Soatto, UCLA
P-1A-80Adaptive Affinity Field for Semantic SegmentationTsung-Wei Ke, UC Berkeley / ICSI; Jyh-Jing Hwang*, UC Berkeley / ICSI;
Ziwei Liu, UC Berkeley / ICSI; Stella Yu, UC Berkeley / ICSI
P-1A-81Semi-convolutional Operators for Instance SegmentationSamuel Albanie*, University of Oxford; Andrea Vedaldi, Oxford University; David Novotny, Oxford University; Diane Larlus, Naver Labs Europe
P-1A-82Effective Use of Synthetic Data for Urban Scene Semantic SegmentationFatemeh Sadat Saleh*, Australian National University (ANU); Mohammad Sadegh Aliakbarian, Data61; Mathieu Salzmann, EPFL; Lars Petersson, Data61/CSIRO; Jose Manuel Alvarez, Toyota Research Institute
P-1A-83Shape correspondences from learnt template-based parametrizationThibault Groueix*, École des ponts ParisTech; Bryan Russell, Adobe Research; Mathew Fisher, Adobe Research; Vladimir Kim, Adobe Research; Mathieu Aubry, École des ponts ParisTech
P-1A-84TextSnake: A Flexible Representation for Detecting Text of Arbitrary ShapesShangbang Long, Peking University; Jiaqiang Ruan, Peking University; Wenjie Zhang, Peking University; Xin He*, Megvii; Wenhao Wu, Megvii; Cong Yao, Megvii
P-1A-85How good is my GAN?Konstantin Shmelkov*, Inria; Cordelia Schmid, INRIA; Karteek Alahari, Inria
P-1A-86Deep Generative Models for Weakly-Supervised Multi-Label ClassificationHong-Min Chu*, National Taiwan University; Chih-Kuan Yeh, Carnegie Mellon University; Yu-Chiang Frank Wang, National Taiwan University
P-1A-87Attention-GAN for Object Transfiguration in Wild ImagesXinyuan Chen*, Shanghai Jiao Tong University; Chang Xu, University of Sydney; Xiaokang Yang, Shanghai Jiao Tong University of China; Dacheng Tao, University of Sydney
P-1A-88Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack LearningChenyang Si*, Institute of Automation, Chinese Academy of Sciences; Ya Jing, Institute of Automation, Chinese Academy of Sciences; wei wang, Institute of Automation Chinese Academy of Sciences; Liang Wang, NLPR, China; Tieniu Tan, NLPR, China
P-1A-89Diverse Image-to-Image Translation via Disentangled RepresentationsHsin-Ying Lee*, University of California, Merced; Hung-Yu Tseng, University of California, Merced; Maneesh Singh, Verisk Analytics; Jia-Bin Huang, Virginia Tech; Ming-Hsuan Yang, University of California at Merced
P-1A-90Convolutional Networks with Adaptive Computation GraphsAndreas Veit*, Cornell University; Serge Belongie, Cornell University

Poster session 1B
1BMonday, September 10Poster Session 04:00 PM - 06:00 PM
P-1B-01Learning to Separate Object Sounds by Watching Unlabeled VideoRuohan Gao*, University of Texas at Austin; Rogerio Feris, IBM Research; Kristen Grauman, University of Texas
P-1B-02Learning-based Video Motion MagnificationTae-Hyun Oh, MIT CSAIL; Ronnachai Jaroensri*, MIT CSAIL; Changil Kim, MIT CSAIL; Mohamed A. Elghareb, Qatar Computing Research Institute; Fredo Durand, MIT; Bill Freeman, MIT; Wojciech Matusik, MIT CSAIL
P-1B-03Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based ModelingHiroaki Santo*, Osaka University; Michael Waechter, Osaka University; Masaki Samejima, Osaka University; Yusuke Sugano, Osaka University; Yasuyuki Matsushita, Osaka University
P-1B-04Video Object Segmentation with Joint Re-identification and Attention-Aware Mask PropagationXiaoxiao Li*, The Chinese University of Hong Kong; Chen Change Loy, Chinese University of Hong Kong
P-1B-05Coded Two-Bucket Cameras for Computer VisionMian Wei, University of Toronto; Navid Navid Sarhangnejad, University of Toronto; Zhengfan Xia, University of Toronto; Nikola Katic, University of Toronto; Roman Genov, University of Toronto; Kyros Kutulakos*, University of Toronto
P-1B-06Multimodal Unsupervised Image-to-image TranslationXun Huang*, Cornell University; Ming-Yu Liu, NVIDIA; Serge Belongie, Cornell University; Kautz Jan, NVIDIA
P-1B-07Learning to Detect and Track Visible and Occluded Body Joints in a Virtual WorldMatteo Fabbri, University of Modena and Reggio Emilia; Fabio Lanzi*, University of Modena and Reggio Emilia; SIMONE CALDERARA, University of Modena and Reggio Emilia, Italy; Andrea Palazzi, University of Modena and Reggio Emilia; ROBERTO VEZZANI, University of Modena and Reggio Emilia, Italy; Rita Cucchiara, Universita Di Modena E Reggio Emilia
P-1B-08Local Spectral Graph Convolution for Point Set Feature LearningChu Wang*, McGill University; Babak Samari, McGill University; Kaleem Siddiqi, McGill University
P-1B-09Meta-Tracker: Fast and Robust Online Adaptation for Visual Object TrackersEunbyung Park*, UNC-CHAPEL HILL; Alex Berg, University of North Carolina, USA
P-1B-10VSO: Visual Semantic OdometryKonstantinos-Nektarios Lianos, Geomagical Labs, Inc; Johannes Schoenberger, ETH Zurich; Marc Pollefeys, ETH Zurich; Torsten Sattler*, ETH Zurich
P-1B-11Progressive Lifelong Learning by Distillation and RetrospectionSaihui Hou*, University of Science and Technology of China; Xinyu Pan, MMLAB, CUHK; Chen Change Loy, Chinese University of Hong Kong; Dahua Lin, The Chinese University of Hong Kong
P-1B-12Spatio-Temporal Channel Correlation Networks for Action ClassificationAli Diba*, KU Leuven; Mohsen Fayyaz, University of Bonn; Vivek Sharma, Karlsruhe Institute of Technology; Mohammad Arzani, Sensifai; Rahman Yousefzadeh, sensifai; Jürgen Gall, University of Bonn; Luc Van Gool, ETH Zurich
P-1B-13Long-term Tracking in the Wild: a BenchmarkEfstratios Gavves, University of Amsterdam ; Luca Bertinetto*, University of Oxford; Joao Henriques, University of Oxford; Andrea Vedaldi, Oxford University; Philip Torr, University of Oxford; Ran Tao, University of Amsterdam; Jack Valmadre, Oxford
P-1B-14Online Detection of Action Start in Untrimmed, Streaming VideosZheng Shou*, Columbia University; Junting Pan, Columbia University ; Jonathan Chan, Columbia University; Kazuyuki Miyazawa, Mitsubishi Electric; Hassan Mansour, Mitsubishi Electric Research Laboratories (MERL); Anthony Vetro, Mitsubishi Electric Research Lab; Xavier Giro-i-Nieto, Universitat Politecnica de Catalunya; Shih-Fu Chang, Columbia University
P-1B-15Dense Pose TransferNatalia Neverova*, Facebook AI Research; Alp Guler, INRIA; Iasonas Kokkinos, Facebook, France
P-1B-16Simultaneous 3D Reconstruction for Water Surface and Underwater SceneYiming Qian*, University of Alberta; Yinqiang Zheng, National Institute of Informatics; Minglun Gong, Memorial University; Herb Yang, University of Alberta
P-1B-17Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular videoErnesto Brau, CiBO Technologies; Jinyan Guan, UC San Diego; Tanya Jeffries, U. Arizona; Kobus Barnard*, University of Arizona
P-1B-18Multi-Scale Context Intertwining for Semantic SegmentationDi Lin*, Shenzhen University; Yuanfeng Ji, Shenzhen University; Dani Lischinski, The Hebrew University of Jerusalem; Danny Cohen-Or, Tel Aviv University; Hui Huang, Shenzhen University
P-1B-19Object-centered image stitchingCharles Herrmann, Cornell; Chen Wang, Google Research; Richard Bowen, Cornell; Ramin Zabih*, Cornell Tech/Google Research
P-1B-20Grassmann Pooling for Fine-Grained Visual ClassificationXing Wei*, Xi'an Jiaotong University; Yihong Gong, Xi'an Jiaotong University; Yue Zhang, Xi'an Jiaotong University; Nanning Zheng, Xi'an Jiaotong University; Jiawei Zhang, City University of Hong Kong
P-1B-21Diagnosing Error in Temporal Action DetectorsHumam Alwassel*, KAUST; Fabian Caba, KAUST; Victor Escorcia, KAUST; Bernard Ghanem, KAUST
P-1B-22CGIntrinsics: Better Intrinsic Image Decomposition through Physically-Based RenderingZhengqi Li*, Cornell University; Noah Snavely, -
P-1B-23A Closed-form Solution to Photorealistic Image StylizationYijun Li*, University of California, Merced; Ming-Yu Liu, NVIDIA; Xueting Li, University of California, Merced; Ming-Hsuan Yang, University of California at Merced; Kautz Jan, NVIDIA
P-1B-24Two at Once: Enhancing Learning and Generalization Capacities via IBN-NetXingang Pan*, The Chinese University of Hong Kong; Ping Luo, The Chinese University of Hong Kong; Jianping Shi, Sensetime Group Limited; Xiaoou Tang, The Chinese University of Hong Kong
P-1B-25Collaborative Deep Reinforcement Learning for Multi-Object TrackingLiangliang Ren, Tsinghua University; Zifeng Wang, Tsinghua University; Jiwen Lu*, Tsinghua University; Qi Tian , The University of Texas at San Antonio; Jie Zhou, Tsinghua University, China
P-1B-26Single Image Highlight Removal with a Sparse and Low-Rank Reflection ModelJie Guo*, Nanjing University; Zuojian Zhou, Nanjing University Of Chinese Medicine; Limin Wang, Nanjing University
P-1B-27Hierarchical Relational Networks for Group Activity Recognition and RetrievalMostafa Ibrahim*, Simon Fraser University; Greg Mori, Simon Fraser University
P-1B-28Towards Human-Level License Plate RecognitionJiafan Zhuang, University of Science and Technology of China; Zilei Wang*, University of Science and Technology of China
P-1B-29Stacked Cross Attention for Image-Text MatchingKuang-Huei Lee*, Microsoft AI and Research; Xi Chen, Microsoft AI and Research; Gang Hua, Microsoft Cloud and AI; Houdong Hu, Microsoft AI and Research; Xiaodong He, JD AI Research
P-1B-30Deep Discriminative Model for Video ClassificationMohammad Tavakolian*, University of Oulu; Abdenour Hadid, Finland
P-1B-31The Mutex Watershed: Efficient, Parameter-Free Image PartitioningSteffen Wolf*, Univertity of Heidelberg; Constantin Pape, University of Heidelberg; Nasim Rahaman, University of Heidelberg; Anna Kreshuk, University of Heidelberg; Ullrich Köthe, University of Heidelberg; Fred Hamprecht, Heidelberg Collaboratory for Image Processing
P-1B-32Monocular Depth Estimation with Affinity, Vertical Pooling, and Label EnhancementYuKang Gan*, SUN YAT-SEN University; Xiangyu Xu, Tsinghua University; Wenxiu Sun, SenseTime Research; Liang Lin, SenseTime
P-1B-33Improved Structure from Motion Using Fiducial Marker MatchingJoseph DeGol*, UIUC; Timothy Bretl, University of Illinois at Urbana-Champaign; Derek Hoiem, University of Illinois at Urbana-Champaign
P-1B-34Temporal Modular Networks for Retrieving Complex Compositional Activities in VideoBingbin Liu*, Stanford University; Serena Yeung, Stanford University; Edward Chou, Stanford University; De-An Huang, Stanford University; Li Fei-Fei, Stanford University; Juan Carlos Niebles, Stanford University
P-1B-35Quantized Densely Connected U-Nets for Efficient Landmark LocalizationZhiqiang Tang*, Rutgers; Xi Peng, Rutgers University; Shijie Geng, Rutgers; Shaoting Zhang, University of North Carolina at Charlotte; Lingfei Wu, IBM T. J. Watson Research Center; Dimitris Metaxas, Rutgers
P-1B-36Real-to-Virtual Domain Uni_x000c_cation for End-to-End Autonomous DrivingLuona Yang*, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; Eric Xing, Petuum Inc.
P-1B-37Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline)Yifan Sun*, Tsinghua University; Liang Zheng, Singapore University of Technology and Design; Yi Yang, University of Technology, Sydney; Qi Tian , The University of Texas at San Antonio; Shengjin Wang, Tsinghua University
P-1B-38Fully-Convolutional Point Networks for Large-Scale Point CloudsDario Rethage*, Technical University of Munich, Germany; Johanna Wald, Technical University of Munich; Nassir Navab, TU Munich, Germany; Federico Tombari, Technical University of Munich, Germany
P-1B-39Real-Time Hair Rendering using Sequential Adversarial NetworksLingyu Wei*, University of Southern California; Liwen Hu, University of Southern California; Vladimir Kim, Adobe Research; Ersin Yumer, Argo AI; Hao Li, Pinscreen/University of Southern California/USC ICT
P-1B-40Visual Tracking via Spatially Aligned Correlation Filters Networkmengdan zhang*, Institute of Automation, Chinese Academy of Sciences; qiang wang, Institute of Automation, Chinese Academy of Sciences; Junliang Xing, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Jin Gao, Institute of Automation, Chinese Academy of Sciences; peixi peng, Institute of Automation, Chinese Academy of Sciences; Weiming Hu, Institute of Automation,Chinese Academy of Sciences; Steve Maybank, University of London
P-1B-41Spatio-temporal Transformer Network for Video RestorationTae Hyun Kim*, Max Planck Institute for Intelligent Systems; Mehdi S. M. Sajjadi, Max Planck Institute for Intelligent Systems; Michael Hirsch, Max Planck Institut for Intelligent Systems ; Bernhard Schölkopf, Max Planck Institute for Intelligent Systems
P-1B-42Value-aware Quantization for Training and Inference of Neural NetworksEunhyeok Park, Seoul National University; Sungjoo Yoo*, Seoul National University; Peter Vajda, Facebook
P-1B-43Lambda Twist: An Accurate Fast Robust Perspective Three Point (P3P) SolverMikael Persson*, Linköping University;
Klas Nordberg, Linköping University
P-1B-44Programmable Light CurtainsJian Wang*, Carnegie Mellon University; Joe Bartels, Carnegie Mellon University; William Whittaker, Carnegie Mellon University; Aswin Sankaranarayanan, Carnegie Mellon University; Srinivasa Narasimhan, Carnegie Mellon University
P-1B-45Monocular Depth Estimation Using Whole Strip Masking and Reliability-Based RefinementMinhyeok Heo*, Korea University; Jaehan Lee, Korea University; Kyung-Rae Kim, Korea University; Han-Ul Kim, Korea University; Chang-Su Kim, Korea university
P-1B-46Task-Aware Image DownscalingHeewon Kim, Seoul National University; Myungsub Choi, Seoul National University; Bee Lim, Seoul National University; Kyoung Mu Lee*, Seoul National University
P-1B-47Single Image Scene Refocusing using Conditional Adversarial NetworksParikshit Sakurikar*, IIIT-Hyderabad; Ishit Mehta, IIIT Hyderabad; Vineeth N Balasubramanian, IIT Hyderabad; P. J. Narayanan, IIIT-Hyderabad
P-1B-48Model-free Consensus Maximization for Non-Rigid ShapesThomas Probst*, ETH Zurich; Ajad Chhatkuli , ETHZ; Danda Pani Paudel, ETH Zürich; Luc Van Gool, ETH Zurich
P-1B-49BSN: Boundary Sensitive Network for Temporal Action Proposal GenerationTianwei Lin, Shanghai Jiao Tong University; Xu Zhao*, Shanghai Jiao Tong University; Haisheng Su, Shanghai Jiao Tong University; Chongjing Wang, China Academy of Information and Communications Technology; Ming Yang, Shanghai Jiao Tong University
P-1B-50Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone ImageZhengqin Li*, UC San Diego; Manmohan Chandraker, UC San Diego; Sunkavalli Kalyan, Adobe Research
P-1B-51Attentive Semantic Alignment with Offset-Aware Correlation KernelsPaul Hongsuck Seo*, POSTECH; Jongmin Lee, POSTECH; Deunsol Jung, POSTECH; Bohyung Han, Seoul National University; Minsu Cho, POSTECH
P-1B-52Deeply Learned Compositional Models for Human Pose EstimationWei Tang*, Northwestern University; Pei Yu, Northwestern University; Ying Wu, Northwestern University
P-1B-53Real-Time MDNetIlchae Jung*, POSTECH; Jeany Son, POSTECH; Mooyeol Baek, POSTECH; Bohyung Han, Seoul National University
P-1B-54Women also Snowboard: Overcoming Bias in Captioning ModelsLisa Anne Hendricks*, UC Berkeley; Kaylee Burns, UC Berkeley; Kate Saenko, Boston University; Trevor Darrell, UC Berkeley; Anna Rohrbach, UC Berkeley
P-1B-55Progressive Structure from MotionAlex Locher*, ETH Zürich; Michal Havlena, Vuforia, PTC, Vienna; Luc Van Gool, ETH Zurich
P-1B-56Occlusion-aware R-CNN: Detecting Pedestrians in a CrowdShifeng Zhang*, CBSR, NLPR, CASIA; Longyin Wen, GE Global Research; Xiao Bian, GE Global Research; Zhen Lei, NLPR, CASIA, China; Stan Li, National Lab. of Pattern Recognition, China
P-1B-57Affinity Derivation and Graph Merge for Instance SegmentationYiding Liu*, University of Science and Technology of China; Siyu Yang, Beihang University; Bin Li, Microsoft Research Asia; Wengang Zhou, University of Science and Technology of China; Ji-Zeng Xu, Microsoft Research Asia; Houqiang Li, University of Science and Technology of China; Yan Lu, Microsoft Research Asia
P-1B-58Second-order Democratic AggregationTsung-Yu Lin*, University of Massachusetts Amherst; Subhransu Maji, University of Massachusetts, Amherst; Piotr Koniusz, Data61/CSIRO, ANU
P-1B-59Improving Sequential Determinantal Point Processes for Supervised Video SummarizationAidean Sharghi*, University of Central Florida; Boqing Gong, Tencent AI Lab; Ali Borji, University of Central Florida; Chengtao Li, MIT; Tianbao Yang, University of Iowa
P-1B-60Seeing Deeply and Bidirectionally: A Deep Learning Approach for Single Image Reflection RemovalJie Yang*, University of Adelaide; Dong Gong, Northwestern Polytechnical University & The University of Adelaide; Lingqiao Liu, University of Adelaide; Qinfeng Shi, University of Adelaide
P-1B-61Specular-to-Diffuse Translation for Multi-View ReconstructionShihao Wu*, University of Bern; Hui Huang, Shenzhen University; Tiziano Portenier, University of Bern; Matan Sela, Technion - Israel Institute of Technology; Danny Cohen-Or, Tel Aviv University; Ron Kimmel, Technion; Matthias Zwicker, University of Maryland
P-1B-62SEAL: A Framework Towards Simultaneous Edge Alignment and LearningZhiding Yu*, NVIDIA; Weiyang Liu, Georgia Tech; Yang Zou, Carnegie Mellon University; Chen Feng, Mitsubishi Electric Research Laboratories (MERL); Srikumar Ramalingam, University of Utah; B. V. K. Vijaya Kumar, CMU, USA; Kautz Jan, NVIDIA
P-1B-63Question Type Guided Attention in Visual Question AnsweringYang Shi*, University of California, Irvine; Tommaso Furlanello, University of Southern California; Sheng Zha, Amazon Web Services; Anima Anandkumar, Amazon
P-1B-64Neural Procedural Reconstruction for Residential BuildingsHuayi Zeng*, Washington University in St.Louis; Jiaye Wu, Washington University in St.Louis; Yasutaka Furukawa, Simon Fraser University
P-1B-65Self-Calibration of Cameras with Euclidean Image Plane in Case of Two Views and Known Relative Rotation AngleEvgeniy Martyushev*, South Ural State University
P-1B-66Towards Optimal Deep Hashing via Policy GradientXin Yuan, Tsinghua University; Liangliang Ren, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
P-1B-67Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask WeightsArun Mallya*, UIUC; Svetlana Lazebnik, UIUC; Dillon Davis, UIUC
P-1B-68Generating 3D Faces using Convolutional Mesh AutoencodersAnurag Ranjan*, MPI for Intelligent Systems; Timo Bolkart, Max Planck for Intelligent Systems; Soubhik Sanyal, Max Planck Institute for Intelligent Systems; Michael Black, Max Planck Institute for Intelligent Systems
P-1B-69ICNet for Real-Time Semantic Segmentation on High-Resolution ImagesHengshuang Zhao, The Chinese University of Hong Kong; Xiaojuan Qi, CUHK; Xiaoyong Shen*, CUHK; Jianping Shi, Sensetime Group Limited; Jia Jiaya, Chinese University of Hong Kong
P-1B-70Memory Aware Synapses: Learning what (not) to forget Rahaf Aljundi*, KU Leuven; Francesca babiloni, KU Leuven; Mohamed Elhoseiny, Facebook; Marcus Rohrbach, Facebook AI Research; Tinne Tuytelaars, K.U. Leuven
P-1B-71Deep Texture and Structure Aware Filtering Network for Image SmoothingKaiyue Lu*, Australian National University & Data61-CSIRO; Shaodi You, Data61-CSIRO, Australia; Nick Barnes, CSIRO(Data61)
P-1B-72Linear RGB-D SLAM for Planar EnvironmentsPyojin Kim*, Seoul National University; Brian Coltin, NASA Ames Research Center; Hyoun Jin Kim, Seoul National University
P-1B-73DeepJDOT: Deep Joint distribution optimal transport for unsupervised domain adaptationBharath Bhushan Damodaran*, IRISA,Universite de Bretagne-Sud; Benjamin Kellenberger, Wageningen University and Research; Rémi Flamary, Université Côte d’Azur; Devis Tuia, Wageningen University and Research; Nicolas Courty, IRISA, Universite Bretagne-Sud
P-1B-74W-TALC: Weakly-supervised Temporal Activity Localization and ClassificationSujoy Paul*, University of California-Riverside; Sourya Roy, University of California, Riverside; Amit Roy-Chowdhury , University of California, Riverside, USA
P-1B-75Unsupervised Video Object Segmentation with Motion-based Bilateral NetworksSiyang Li*, University of Southern California; Bryan Seybold, Google Inc.; Alexey Vorobyov, Google Inc.; Xuejing Lei, University of Southern California ; C.-C. Jay Kuo, USC
P-1B-76Disentangling Factors of Variation with Cycle-Consistent Variational Auto-EncodersAnanya Harsh Jha*, Indraprastha Institute of Information Technology Delhi; Saket Anand, Indraprastha Institute of Information Technology Delhi; Maneesh Singh, Verisk Analytics; VSR Veeravasarapu, Verisk Analytics
P-1B-77Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-identificationCheng Wang, Huazhong Univ. of Science and Technology; Qian Zhang, Horizon Robotics; Chang Huang, Horizon Robotics, Inc.; Wenyu Liu, Huazhong University of Science and Technology; Xinggang Wang*, Huazhong Univ. of Science and Technology
P-1B-78Multi-view to Novel view: Synthesizing Views via Self-Learned ConfidenceShao-Hua Sun*, University of Southern California; Jacob Huh, Carnegie Mellon University; Yuan-Hong Liao, National Tsing Hua University; Ning Zhang, SnapChat; Joseph Lim, USC
P-1B-79Part-Activated Deep Reinforcement Learning for Action PredictionLei Chen, Tianjin University; Jiwen Lu*, Tsinghua University; Zhanjie Song, Tianjin University; Jie Zhou, Tsinghua University, China
P-1B-80Online Dictionary Learning for Approximate Archetypal AnalysisJieru Mei, Microsoft Research Asia; Chunyu Wang*, Microsoft Research asia; Wenjun Zeng, Microsoft Research
P-1B-81Estimating Depth from RGB and Sparse SensingZhao Chen*, Magic Leap, Inc.; Vijay Badrinarayanan, Magic Leap, Inc.; Gilad Drozdov, Magic Leap, Inc.; Andrew Rabinovich, Magic Leap, Inc.
P-1B-82Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-TrainingYang Zou*, Carnegie Mellon University; Zhiding Yu, NVIDIA; B. V. K. Vijaya Kumar, CMU, USA; Jinsong Wang, General Motors
P-1B-83Zoom-Net: Mining Deep Feature Interactions for Visual Relationship RecognitionGuojun Yin, University of Science and Technology of China; Lu Sheng, The Chinese University of Hong Kong; Bin Liu, University of Science and Technology of China; Nenghai Yu, University of Science and Technology of China; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong; Chen Change Loy, Chinese University of Hong Kong; Jing Shao*, The Chinese University of Hong Kong
P-1B-84Joint Camera Spectral Sensitivity Selection and Hyperspectral Image RecoveryYing Fu*, Beijing Institute of Technology; Tao Zhang, Beijing Institute of Technology; Yinqiang Zheng, National Institute of Informatics; debing zhang, DeepGlint; Hua Huang, Beijing Institute of Technology
P-1B-85Compositing-aware Image SearchHengshuang Zhao*, The Chinese University of Hong Kong; Xiaohui Shen, Adobe Research; Zhe Lin, Adobe Research; Sunkavalli Kalyan, Adobe Research; Brian Price, Adobe; Jia Jiaya, Chinese University of Hong Kong
P-1B-86Zero-shot keyword search for visual speech recognition in-the-wildThemos Stafylakis*, University of Nottingham; Georgios Tzimiropoulos, University of Nottingham
P-1B-87End-to-End Joint Semantic Segmentation of Actors and Actions in VideoJingwei Ji*, Stanford University; Shyamal Buch, Stanford University; Alvaro Soto, Universidad Catolica de Chile; Juan Carlos Niebles, Stanford University
P-1B-88Learning Discriminative Video Representations Using Adversarial PerturbationsJue Wang*, ANU; Anoop Cherian, MERL
P-1B-89DeepWrinkles: Accurate and Realistic Clothing ModelingZorah Laehner, TU Munich; Tony Tung*, Facebook / Oculus Research; Daniel Cremers, TUM
P-1B-90Massively Parallel Video NetworksViorica Patraucean*, DeepMind; Joao Carreira, DeepMind; Laurent Mazare, DeepMind; Simon Osindero, DeepMind; Andrew Zisserman, University of Oxford

Poster session 2A
2ATuesday, September 11Poster session 10:00 AM - 12:00 PM
P-2A-01Unsupervised Person Re-identification by Deep Learning Tracklet AssociationMinxian Li*, Nanjing University and Science and Technology; Xiatian Zhu, Queen Mary University, London, UK; Shaogang Gong, Queen Mary University of London
P-2A-02Instance-level Human Parsing via Part Grouping NetworkKe Gong*, SYSU; Xiaodan Liang, Carnegie Mellon University; Yicheng Li, Sun Yat-sen University; Yimin Chen, sensetime; Liang Lin, Sun Yat-sen University
P-2A-03Scaling Egocentric Vision: The E-Kitchens DatasetDima Damen*, University of Bristol; Hazel Doughty, University of Bristol; Sanja Fidler, University of Toronto; Antonino Furnari, University of Catania; Evangelos Kazakos, University of Bristol; Giovanni Farinella, University of Catania, Italy; Davide Moltisanti, University of Bristol; Jonathan Munro, University of Bristol; Toby Perrett, University of Bristol; Will Price, University of Bristol; Michael Wray, University of Bristol
P-2A-04Predicting Gaze in Egocentric Video by Learning Task-dependent Attention TransitionYifei Huang*, The University of Tokyo; Minjie Cai, Hunan University, The University of Tokyo; Zhenqiang Li, The University of Tokyo; Yoichi Sato,The University of Tokyo
P-2A-05Beyond local reasoning for stereo confidence estimation with deep learningFabio Tosi, University of Bologna; Matteo Poggi*, University of Bologna; Antonio Benincasa, University of Bologna; Stefano Mattoccia, University of Bologna
P-2A-06DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture ModelStéphane Lathuiliere, INRIA; Pablo Mesejo-Santiago, University of Granada; Xavier Alameda-Pineda*, INRIA; Radu Horaud, INRIA
P-2A-07Into the Twilight Zone: Depth Estimation using Joint Structure-Stereo OptimizationAashish Sharma*, National University of Singapore; Loong Fah Cheong, NUS
P-2A-08Generalized Loss-Sensitive Adversarial Learning with Manifold MarginsMarzieh Edraki*, University of Central Florida; Guo-Jun Qi, University of Central Florida
P-2A-09Adversarial Open Set Domain AdaptationKuniaki Saito*, The University of Tokyo; Shohei Yamamoto, The University of Tokyo; Yoshitaka Ushiku, The University of Tokyo; Tatsuya Harada, The University of Tokyo
P-2A-10Connecting Gaze, Scene and AttentionEunji Chong*, Georgia Institute of Technology; Nataniel Ruiz, Georgia Institute of Technology; Richard Wang, Georgia Institute of Technology; Yun Zhang, Georgia Institute of Technology
P-2A-11Multi-modal Cycle-consistent Generalized Zero-Shot LearningRAFAEL FELIX*, The University of Adelaide; Vijay Kumar B G, University of Adelaide; Ian Reid, University of Adelaide, Australia; Gustavo Carneiro, University of Adelaide
P-2A-12Understanding Degeneracies and Ambiguities in Attribute TransferAttila Szabo*, University of Bern; Qiyang Hu, University of Bern; Tiziano Portenier, University of Bern; Matthias Zwicker, University of Maryland; Paolo Favaro, Bern University, Switzerland
P-2A-13Start, Follow, Read: End-to-End Full Page Handwriting RecognitionCurtis Wigington*, Brigham Young University; Chris Tensmeyer, Brigham Young University; Brian Davis, Brigham Young University; Bill Barrett, Brigham Young University; Brian Price, Adobe; Scott Cohen, Adobe Research
P-2A-14Rethinking the Form of Latent States in Image CaptioningBo Dai*, the Chinese University of Hong Kong; Deming Ye, Tsinghua University; Dahua Lin, The Chinese University of Hong Kong
P-2A-15ConvNets and ImageNet Beyond Accuracy: Understanding Mistakes and Uncovering BiasesPierre Stock*, Facebook AI Research; Moustapha Cisse, Facebook AI Research
P-2A-16Deep Shape MatchingFilip Radenovic*, Visual Recognition Group, CTU Prague; Giorgos Tolias, Vision Recognition Group, Czech Technical University in Prague; Ondrej Chum, Vision Recognition Group, Czech Technical University in Prague
P-2A-17Neural Stereoscopic Image Style TransferXinyu Gong*, University of Electronic Science and Technology of China; Haozhi Huang, Tencent AI Lab; Lin Ma, Tencent AI Lab; Fumin Shen, UESTC; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
P-2A-18Semi-supervised FusedGAN for Conditional Image GenerationNavaneeth Bodla*, University of Maryland; Gang Hua, Microsoft Cloud and AI; Rama Chellappa, University of Maryland
P-2A-19Affine Correspondences between Central Cameras for Rapid Relative Pose EstimationIván Eichhardt*, MTA SZTAKI; Mitya Csetverikov, MTA SZTAKI & ELTE
P-2A-20Bi-directional Feature Pyramid Network with Recursive Attention Residual Modules For Shadow DetectionLei Zhu*, The Chinese University of Hong Kong; Zijun Deng, South China University of Technology; Xiaowei Hu, The Chinese University of Hong Kong; Chi-Wing Fu, The Chinese University of Hong Kong; Xuemiao Xu, South China University of Technology; Jing Qin, The Hong Kong Polytechnic University; Pheng-Ann Heng, The Chinese Univsersity of Hong Kong
P-2A-21Joint Learning of Intrinsic Images and Semantic SegmentationAnil Baslamisli*, University of Amsterdam; Thomas Tiel Groenestege, University of Amsterdam; Partha Das, University of Amsterdam; Hoang-An Le, University of Amsterdam; Sezer Karaoglu, University of Amsterdam; Theo Gevers, University of Amsterdam
P-2A-22Visual Reasoning with a Multi-hop FiLM GeneratorFlorian Strub*, University of Lille; Mathieu Seurin, University of Lille; Ethan Perez, Rice University; Harm De Vries, Montreal Institute for Learning Algorithms; Jeremie Mary, Criteo; Philippe Preux, INRIA; Aaron Courville, MILA, Université de Montréal; Olivier Pietquin, GoogleBrain
P-2A-23View-graph Selection Framework for SfMRajvi Shah*, IIIT Hyderabad; Visesh Chari, INRIA; P. J. Narayanan, IIIT-Hyderabad
P-2A-24Fine-grained Video Categorization with Redundancy Reduction AttentionChen Zhu, University of Maryland; Xiao Tan, Baidu Inc.; Feng Zhou, Baidu Inc.; Xiao Liu, Baidu Research; Kaiyu Yue*, Baidu Inc.; Errui Ding, Baidu Inc.; Yi Ma, UC Berkeley
P-2A-25Space-time Knowledge for Unpaired Image-to-Image TranslationAayush Bansal*, Carnegie Mellon University; Shugao Ma, Facebook / Occulus; Deva Ramanan, Carnegie Mellon University; Yaser Sheikh, CMU
P-2A-26Integral Human Pose RegressionXiao Sun*, Microsoft Research Asia; Bin Xiao, MSR Asia; Fangyin Wei, Peking University; Shuang Liang, Tongji University; Yichen Wei, MSR Asia
P-2A-27Recurrent Tubelet Proposal and Recognition Networks for Action DetectionDong Li, University of Science and Technology of China; Zhaofan Qiu, University of Science and Technology of China; Qi Dai, Microsoft Research; Ting Yao*, Microsoft Research; Tao Mei, JD.com
P-2A-28Learning to Predict Crisp EdgeRuoxi Deng*, Central South University; Chunhua Shen, University of Adelaide; Shengjun Liu, Central South University; Huibing Wang, Dalian University of Technology; Xinru Liu, Central South University
P-2A-29Open Set Learning with Counterfactual ImagesLawrence Neal*, Oregon State University; Matthew Olson, Oregon State University; Xiaoli Fern, Oregon State University; Weng-Keen Wong, Oregon State University; Fuxin Li, Oregon State University
P-2A-30Estimating the Success of Unsupervised Image to Image TranslationLior Wolf, Tel Aviv University, Israel; Sagie Benaim*, Tel Aviv University; Tomer Galanti, Tel Aviv University
P-2A-31Joint Map and Symmetry SynchronizationQixing Huang*, The University of Texas at Austin; Xiangru Huang, University of Texas at Austin; Zhenxiao Liang, Tsinghua University; Yifan Sun, The University of Texas at Austin
P-2A-32Single Image Water Hazard Detection using FCN with Reflection Attention UnitsXiaofeng Han, Nanjing University of Science and Technology; Chuong Nguyen*, CSIRO Data61; Shaodi You, Data61-CSIRO, Australia; Jianfeng Lu, Nanjing University of Science and Technology
P-2A-33Realtime Time Synchronized Event-based StereoAlex Zhu*, University of Pennsylvania; Yibo Chen, University of Pennsylvania; Kostas Daniilidis, University of Pennsylvania
P-2A-34Transferring GANs: generating images from limited datayaxing wang*, Computer Vision Center; Chenshen Wu, Computer Vision Center; Luis Herranz, Computer Vision Center (Ph.D.); Joost van de Weijer, Computer Vision Center; Abel Gonzalez-Garcia, Computer Vision Center; BOGDAN RADUCANU, Computer Version Center, Edifici
P-2A-35To learn image super-resolution, use a GAN to learn how to do image degradation firstAdrian Bulat*, University of Nottingham; Jing Yang, University of Nottingham; Georgios Tzimiropoulos, University of Nottingham
P-2A-36Unsupervised CNN-based co-saliency detection with graphical optimizationKuang-Jui Hsu*, Academia Sinica; Chung-Chi Tsai, Texas A&M University; Yen-Yu Lin, Academia Sinica; Xiaoning Qian, Texas A&M University; Yung-Yu Chuang, National Taiwan University
P-2A-37Fast Light Field Reconstruction With Deep Coarse-To-Fine Modeling of Spatial-Angular CluesHenry W. F. Yeung, the University of Sydney; Junhui Hou*, City University of Hong Kong, Hong Kong; Jie Chen, Nanyang Technological University; Yuk Ying Chung, the University of Sydney; Xiaoming Chen, University of Science and Technology of China
P-2A-38Unified Perceptual Parsing for Scene UnderstandingTete Xiao*, Peking University; Yingcheng Liu, Peking University; Yuning Jiang, Megvii(Face++) Inc; Bolei Zhou, MIT; Jian Sun, Megvii, Face++
P-2A-39PARN: Pyramidal Affine Regression Networks for Dense Semantic Correspondence EstimationSangryul Jeon*, Yonsei university; Seungryung Kim, Yonsei University; Dongbo Min, Ewha Womans University; Kwanghoon Sohn , Yonsei Univ.
P-2A-40Structural Consistency and Controllability for Diverse ColorizationSafa Messaoud*, University of Illinois at Urbana Champaign; Alexander Schwing, UIUC; David Forsyth, Univeristy of Illinois at Urbana-Champaign
P-2A-41Online Multi-Object Tracking with Dual Matching Attention NetworksJi Zhu, Shanghai Jiao Tong University; Hua Yang*, Shanghai Jiao Tong University; Nian Liu, Northwestern Polytechnical University; Minyoung Kim, Perceptive Automata; Wenjun Zhang, Shanghai Jiao Tong University; Ming-Hsuan Yang, University of California at Merced
P-2A-42MaskConnect: Connectivity Learning by Gradient DescentKarim Ahmed*, Dartmouth College; Lorenzo Torresani, Dartmouth College
P-2A-43FloorNet: A Unified Framework for Floorplan Reconstruction from 3D ScansChen Liu*, Washington University in St. Louis; Jiaye Wu, Washington University in St.Louis; Yasutaka Furukawa, Simon Fraser University
P-2A-44Image Manipulation with Perceptual DiscriminatorsDiana Sungatullina*, Skolkovo Institute of Science and Technology; Egor Zakharov, Skolkovo Institute of Science and Technology; Dmitry Ulyanov, Skolkovo Institute of Science and Technology; Victor Lempitsky, Skoltech
P-2A-45Transductive Centroid Projection for Semi-supervised Large-scale RecognitionYu Liu*, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong; Guanglu Song, Sensetime; Jing Shao, Sensetime
P-2A-46Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based LossesZheng Dang*, Xi'an Jiaotong University; Kwang Moo Yi, University of Victoria; Yinlin Hu, EPFL; Fei Wang, Xi'an Jiaotong University; Pascal Fua, EPFL, Switzerland; Mathieu Salzmann, EPFL
P-2A-47Self-supervised Knowledge Distillation Using Singular Value DecompositionSEUNG HYUN LEE, Inha University; Daeha Kim, Inha University ; Byung Cheol Song*, Inha University
P-2A-48Snap Angle Prediction for 360$^{\circ}$ PanoramasBo Xiong*, University of Texas at Austin; Kristen Grauman, University of Texas
P-2A-49Saliency Preservation in Low-Resolution Grayscale ImagesShivanthan Yohanandan*, RMIT University; Adrian Dyer, RMIT University; Dacheng Tao, University of Sydney; Andy Song, RMIT University
P-2A-50PPF-FoldNet: Unsupervised Learning of Rotation Invariant 3D Local DescriptorsTolga Birdal*, TU Munich; Haowen Deng, Technical University of Munich; Slobodan Ilic, Siemens AG
P-2A-51BusterNet: Detecting Copy-Move Image Forgery with Source/Target LocalizationRex Yue Wu*, USC ISI; Wael Abd-Almageed, Information Sciences Institute; Prem Natarajan, USC ISI
P-2A-52Double JPEG Detection in Mixed JPEG Quality Factors using Deep Convolutional Neural NetworkJin-Seok Park*, Korea Advanced Institute of Science and Technology (KAIST); Donghyeon Cho, KAIST; Wonhyuk Ahn, KAIST; Heung-Kyu Lee, Korea Advanced Institute of Science and Technology (KAIST)
P-2A-53Unsupervised holistic image generation from key local patchesDonghoon Lee*, Seoul National University; Sangdoo Yun, Clova AI Research, NAVER Corp.; Sungjoon Choi, Seoul National University; Hwiyeon Yoo, Seoul National University; Ming-Hsuan Yang, University of California at Merced; Songhwai Oh, Seoul National University
P-2A-54CrossNet: An End-to-end Reference-based Super Resolution Network using Cross-scale WarpingHaitian Zheng, HKUST; Mengqi Ji, HKUST; Haoqian Wang, Tsinghua University; Yebin Liu*, Tsinghua University; Lu Fang, Tsinghua University
P-2A-55DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene AdaptationZuxuan Wu*, UMD; Xintong Han, University of Maryland, USA; Yen-Liang Lin, GE Global Research ; Gokhan Uzunbas, Avitas Systems-GE Venture; Tom Goldstein, University of Maryland, College Park; Ser-Nam Lim, GE Global Research; Larry Davis, University of Maryland
P-2A-56YouTube-VOS: Sequence-to-Sequence Video Object SegmentationNing Xu*, Adobe Research; Linjie Yang, Snap Research; Dingcheng Yue, UIUC; Jianchao Yang, Snap; Brian Price, Adobe; Jimei Yang, Adobe; Scott Cohen, Adobe Research; Yuchen Fan, Image Formation and Processing (IFP) Group, University of Illinois at Urbana-Champaign; Yuchen Liang, UIUC; Thomas Huang, University of Illinois at Urbana Champaign
P-2A-57Selfie Video StabilizationJiyang Yu*, University of California San Diego; Ravi Ramamoorthi, University of California San Diego
P-2A-58Videos as Space-Time Region GraphsXiaolong Wang*, CMU; Abhinav Gupta, CMU
P-2A-59Parallel Feature Pyramid Network for Object DetectionSeung-Wook Kim*, Korea University; Hyong-Keun Kook, Korea University; Jee-Young Sun, Korea University; Mun-Cheon Kang, Korea University; Sung-Jea Ko, Korea University
P-2A-60Goal-Oriented Visual Question Generation via Intermediate RewardsJunjie Zhang, University of Technology, Sydney; Qi Wu*, University of Adelaide; Chunhua Shen, University of Adelaide; Jian Zhang, UTS; Jianfeng Lu, Nanjing University of Science and Technology; Anton Van Den Hengel, University of Adelaide
P-2A-61WildDash - Creating Hazard-Aware BenchmarksOliver Zendel*, AIT Austrian Institute of Technology; Katrin Honauer, Heidelberg University; Markus Murschitz, AIT Austrian Institute of Technology; Daniel Steininger, AIT Austrian Institute of Technology; Gustavo Fernandez, n/a
P-2A-62Reinforced Temporal Attention and Split-Rate Transfer for Depth-Based Person Re-identificationNikolaos Karianakis*, Microsoft; Zicheng Liu, Microsoft; Yinpeng Chen, Microsoft; Stefano Soatto, UCLA
P-2A-63DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Network ConsistencyYuliang Zou*, Virginia Tech; Zelun Luo, Stanford University; Jia-Bin Huang, Virginia Tech
P-2A-64Generating Multimodal Human Dynamics with a Transformation based RepresentationXinchen Yan*, University of Michigan; Akash Rastogi, UM; Ruben Villegas, University of Michigan; Eli Shechtman, Adobe Research, US; Sunkavalli Kalyan, Adobe Research; Sunil Hadap, Adobe; Ersin Yumer, Argo AI; Honglak Lee, UM
P-2A-65Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field EstimationZhaoyang Lv*, GEORGIA TECH; Kihwan Kim, NVIDIA; Alejandro Troccoli, NVIDIA; Deqing Sun, NVIDIA; Kautz Jan, NVIDIA; James Rehg, Georgia Institute of Technology
P-2A-66Learning Visual Question Answering by Bootstrapping Hard AttentionMateusz Malinowski*, DeepMind; Carl Doersch, DeepMind; Adam Santoro, DeepMind; Peter Battaglia, DeepMind
P-2A-67Image Reassembly Combining Deep Learning and Shortest Path ProblemMarie-Morgane Paumard*, ETIS; David Picard, ETIS/LIP6; Hedi Tabia, France
P-2A-68RESOUND: Towards Action Recognition without Representation BiasYingwei Li*, UCSD; Nuno Vasconcelos, UC San Diego; Yi Li, University of California San Diego
P-2A-69Key-Word-Aware Network for Referring Expression Image SegmentationHengcan Shi*, University of Electronic Science and Technology of China; Hongliang Li, University of Electronic Science and Technology of China; Fanman Meng, University of Electronic Science and Technology of China; Qingbo Wu, University of Electronic Science and Technology of China
P-2A-70Mutual Learning to Adapt for Joint Human Parsing and Pose EstimationXuecheng Nie*, NUS; Jiashi Feng, NUS; Shuicheng Yan, Qihoo/360
P-2A-71Simple Baselines for Human Pose Estimation and TrackingBin Xiao*, MSR Asia; Haiping Wu, MSR Asia; Yichen Wei, MSR Asia
P-2A-72Pose Partition Networks for Multi-Person Pose EstimationXuecheng Nie*, NUS; Jiashi Feng, NUS; Junliang Xing, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Shuicheng Yan, Qihoo/360
P-2A-73Wasserstein Divergence For GANsJiqing Wu*, ETH Zurich; Zhiwu Huang, ETH Zurich; Janine Thoma, ETH Zurich; Dinesh Acharya, ETH Zurich; Luc Van Gool, ETH Zurich
P-2A-74A Segmentation-aware Deep Fusion Network for Compressed Sensing MRIZhiwen Fan, Xiamen University; Liyan Sun, Xiamen University; Xinghao Ding*, Xiamen University; Yue Huang, Xiamen University; Congbo Cai, Xiamen University; John Paisley, Columbia University
P-2A-75Deep Metric Learning with Hierarchical Triplet LossWeifeng Ge*, The University of Hong Kong
P-2A-76Generative Adversarial Network with Spatial Attention for Face Attribute EditingGang Zhang*, Institute of Computing Technology, CAS; Meina Kan, Institute of Computing Technology, Chinese Academy of Sciences; Shiguang Shan, Chinese Academy of Sciences; Xilin Chen, China
P-2A-77Proxy Clouds for Live RGB-D Stream Processing and ConsolidationAdrien Kaiser*, Telecom ParisTech; Jose Alonso Ybanez Zepeda, Ayotle SAS; Tamy Boubekeur, Paris Telecom
P-2A-78Synthetically Supervised Feature Learning for Scene Text RecognitionYang Liu*, University of Cambridge; Zhaowen Wang, Adobe Research; Hailin Jin, Adobe Research; Ian Wassell, University of Cambridge
P-2A-79Scale Aggregation Network for Accurate and Efficient Crowd CountingXinkun Cao*, Beijing University of Posts and Telecommunications; Zhipeng Wang, School of Communication and Information Engineering, Beijing University of Posts and Telecommunications; Yanyun Zhao, Beijing Univiersity of Posts and Telecommunications; Fei Su, Beijing University of Posts and Telecommunications
P-2A-80PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalitiesLan Wang, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Chenqiang Gao*, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Luyu Yang, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Yue Zhao, Chongqing Key Laboratory of Signal and Information Processing, Chongqing University of Posts and Telecommunications; Wangmeng Zuo, Harbin Institute of Technology, China; Deyu Meng, Xi'an Jiaotong University
P-2A-81OmniDepth: Dense Depth Estimation for Indoors Spherical Panoramas.NIKOLAOS ZIOULIS*, CERTH / CENTRE FOR RESEARCH AND TECHNOLOGY HELLAS; Antonis Karakottas, CERTH / CENTRE FOR RESEARCH AND TECHNOLOGY HELLAS; Dimitrios Zarpalas, CERTH / CENTRE FOR RESEARCH AND TECHNOLOGY HELLAS; Petros Daras, ITI-CERTH, Greece
P-2A-82Hashing with Binary Matrix PursuitFatih Cakir*, Boston University; Kun He, Boston University; Stan Sclaroff, Boston University
P-2A-83Probabilistic Video Generation using Holistic Attribute ControlJiawei He*, Simon Fraser University; Andreas Lehrmann, Facebook; Joe Marino, California Institute of Technology; Greg Mori, Simon Fraser University; Leonid Sigal, University of British Columbia
P-2A-84Transductive Semi-Supervised Deep Learning using Min-Max FeaturesWeiwei Shi*, Xi'an Jiaotong University; Yihong Gong, Xi'an Jiaotong University; Chris Ding, UNIVERSITY OF TEXAS AT ARLINGTON; Zhiheng Ma, Xi'an Jiaotong University; Xiaoyu Tao, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University.; Nanning Zheng, Xi'an Jiaotong University
P-2A-85Deep Feature Pyramid Reconfiguration for Object DetectionTao Kong*, Tsinghua; Fuchun Sun, Tsinghua; Wenbing Huang, Tencent AI Lab; ? ??, ????
P-2A-86Quadtree Convolutional Neural NetworksPradeep Kumar Jayaraman*, Nanyang Technological University; Jianhan Mei, Nanyang Technological University; Jianfei Cai, Nanyang Technological University; Jianmin Zheng, Nanyang Technological University
P-2A-87Correcting the Triplet Selection Bias for Triplet LossBaosheng Yu*, The University of Sydney; Tongliang Liu, The University of Sydney; Mingming Gong, CMU & U Pitt; Changxing Ding, South China University of Technology; Dacheng Tao, University of Sydney
P-2A-88Adversarial Geometry-Aware Human Motion PredictionLiangyan Gui*, Carnegie Mellon University; Yu-Xiong Wang, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University

Poster session 2B
2BTuesday, September 11Poster session 04:00 PM - 06:00 PM
P-2B-013D Motion Sensing from 4D Light Field GradientsSizhuo Ma*, University of Wisconsin-Madison; Brandon Smith, University of Wisconsin-Madison; Mohit Gupta, University of Wisconsin-Madison, USA
P-2B-02A Trilateral Weighted Sparse Coding Scheme for Real-World Image DenoisingXU JUN, The Hong Kong Polytechnic University; Lei Zhang*, Hong Kong Polytechnic University, Hong Kong, China; D. Zhang, The Hong Kong Polytechnic University
P-2B-03Saliency Detection in 360$^\circ$ VideosZiheng Zhang, Shanghaitech University; Yanyu Xu*, Shanghaitech University; Shenghua Gao, Shanghaitech University; Jingyi Yu, Shanghai Tech University
P-2B-04Learning to Blend PhotosWei-Chih Hung*, University of California, Merced; Jianming Zhang, Adobe Research; Xiaohui Shen, Adobe Research; Zhe Lin, Adobe Research; Joon-Young Lee, Adobe Research; Ming-Hsuan Yang, University of California at Merced
P-2B-05Escaping from Collapsing Modes in a Constrained SpaceChieh Lin, National Tsing Hua University; Chia-Che Chang, National Tsing Hua University; Che-Rung Lee, National Tsing Hua University; Hwann-Tzong Chen*, National Tsing Hua University
P-2B-06Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in ScenesFangneng Zhan, Nanyang Technological University; Shijian Lu*, Nanyang Technological University; Chuhui Xue, Nanyang Technological University
P-2B-07Layer-structured 3D Scene Inference via View SynthesisShubham Tulsiani*, UC Berkeley; Richard Tucker, Google; Noah Snavely, -
P-2B-08Perturbation Robust Representations of Topological Persistence DiagramsAnirudh Som*, Arizona State University; Kowshik Thopalli, Arizona State University; Karthikeyan Natesan Ramamurthy, IBM Research; Vinay Venkataraman, Arizona State University; Ankita Shukla, Indraprastha Institute of Information Technology - Delhi; Pavan Turaga, Arizona State University
P-2B-09Analyzing Clothing Layer Deformation Statistics of 3D Human MotionsJinlong YANG*, Inria; Jean-Sebastien Franco, INRIA; Franck Hétroy-Wheeler, University of Strasbourg; Stefanie Wuhrer, Inria
P-2B-10Neural Nonlinear least Squares with Application to Dense Tracking and MappingRonald Clark*, Imperial College London; Michael Bloesch, Imperial; Jan Czarnowski, Imperial College London; Andrew Davison, Imperial College London; Stefan Leutenegger, Imperial College London
P-2B-11Propagating LSTM: 3D Pose Estimation based on Joint InterdependencyKyoungoh Lee*, Yonsei University; Inwoong Lee, Yonsei University; Sanghoon Lee, Yonsei University, Korea
P-2B-12Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image DehazingDong Yang, Xi'an Jiaotong University; JIAN SUN*, Xi'an Jiaotong University
P-2B-13Attend and Rectify: a gated attention mechanism for fine-grained recoveryPau Rodriguez Lopez*, Computer Vision Center, Universitat Autonoma de Barcelona; Guillem Cucurull, Computer Vision Center, Universitat Autonoma de Barcelona; Josep Gonfaus, Computer Vision Center; Jordi Gonzalez, UA Barcelona; Xavier Roca, Computer Vision Center, Universitat Autonoma de Barcelona
P-2B-14Learning to Capture Light Fields through A Coded Aperture CameraYasutaka Inagaki*, Nagoya University; Yuto Kobayashi, Nagoya University; Keita Takahashi, Nagoya University; Toshiaki Fujii, Nagoya University; Hajime Nagahara, Osaka University
P-2B-15AMC: AutoML for Model Compression and Acceleration on Mobile DevicesYihui He, Xian Jiaotong University; Ji Lin, MIT; Zhijian Liu, MIT; Hanrui Wang, MIT; Li-Jia Li, Google; Song Han, MIT
P-2B-16Extreme Network Compression via Filter Group ApproximationBo Peng*, Hikvision Research Institute; Wenming Tan, Hikvision Research Institute; Zheyang Li, Hikvision Research Institute; Shun Zhang, Hikvision Research Institute; Di Xie, Hikvision Research Institute; Shiliang Pu, Hikvision Research Institute
P-2B-17Retrospective Encoders for Video SummarizationKe Zhang*, USC; Kristen Grauman, University of Texas; Fei Sha, USC
P-2B-18Optimized Quantization for Highly Accurate and Compact DNNsDongqing Zhang, Microsoft Research; Jiaolong Yang*, Microsoft Research Asia (MSRA); Dongqiangzi Ye, Microsoft Research; Gang Hua, Microsoft Cloud and AI
P-2B-19Universal Sketch Perceptual GroupingKe LI*, Queen Mary University of London; Kaiyue Pang, Queen Mary University of London; Jifei Song, Queen Mary, University of London; Yi-Zhe Song, Queen Mary University of London; Tao Xiang, Queen Mary, University of London, UK; Timothy Hospedales, Edinburgh University; Honggang Zhang, Beijing University of Posts and Telecommunications
P-2B-20Uncertainty Estimates and Multi-Hypotheses Networks for Optical FlowEddy Ilg*, University of Freiburg; Özgün Çiçek, University of Freiburg; Silvio Galesso, University of Freiburg; Aaron Klein, Universität Freiburg; Osama Makansi, University of Freiburg; Frank Hutter, University of Freiburg; Thomas Brox, University of Freiburg
P-2B-21Learning 3D Keypoint Descriptors for Non-Rigid Shape MatchingHanyu Wang, NLPR, Institute of Automation, Chinese Academy of Sciences; Jianwei Guo*, NLPR, Institute of Automation, Chinese Academy of Sciences; Yan Dong-Ming, NLPR, CASIA; Weize Quan, NLPR, Institute of Automation, Chinese Academy of Sciences; Xiaopeng Zhang, Institute of Automation, Chinese Academy of Sciences
P-2B-22A Joint Sequence Fusion Model for Video Question Answering and RetrievalYoungjae Yu, Seoul National University Vision and Learning Lab; Jongseok Kim, Seoul National University Vision and Learning Lab; Gunhee Kim*, Seoul National University
P-2B-23Deformable Pose Traversal Convolution for 3D Action and Gesture RecognitionJunwu Weng*, Nanyang Technological University; Mengyuan Liu, Nanyang Technological University; Xudong Jiang, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
P-2B-24Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary DataYabin Zhang, South China University of Technology; Tang Hui, South China University of Technology; Kui Jia*, South China University of Technology
P-2B-25Stereo relative pose from line and point feature tripletsAlexander Vakhitov*, Skoltech; Victor Lempitsky, Skoltech; Yinqiang Zheng, National Institute of Informatics
P-2B-26Convolutional Block Attention ModuleSanghyun Woo*, KAIST; Jongchan Park, KAIST; Joon-Young Lee, Adobe Research; In So Kweon, KAIST
P-2B-27EC-Net: an Edge-aware Point set Consolidation NetworkLequan Yu*, The Chinese University of Hong Kong; Xianzhi Li, The Chinese University of Hong Kong; Chi-Wing Fu, The Chinese University of Hong Kong; Danny Cohen-Or, Tel Aviv University; Pheng-Ann Heng, The Chinese Univsersity of Hong Kong
P-2B-28Video Compression through Image InterpolationChao-Yuan Wu*, UT Austin; Nayan Singhal, UT Austin; Philipp Kraehenbuehl, UT Austin
P-2B-29Burst Image Deblurring Using Permutation Invariant Convolutional Neural NetworksMiika Aittala*, MIT; Fredo Durand, MIT
P-2B-30HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised LearningThomas Robert*, LIP6 / Sorbonne Universite; Nicolas Thome, CNAM, Paris; Matthieu Cord, Sorbonne University
P-2B-31Structure-from-Motion-Aware PatchMatch for Adaptive Optical Flow EstimationDaniel Maurer*, University of Stuttgart; Nico Marniok, Universität Konstanz; Bastian Goldluecke, University of Konstanz; Andrés Bruhn, University of Stuttgart
P-2B-32Joint & Progressive Learning from High-Dimensional Data for Multi-Label ClassificationDanfeng Hong*, Technical University of Munich (TUM); German Aerospace Center (DLR); Naoto Yokoya, RIKEN Center for Advanced Intelligence Project (AIP); Jian Xu, German Aerospace Center (DLR); Xiaoxiang Zhu, DLR&TUM
P-2B-33SDC-Net: Video prediction using spatially-displaced convolutionFitsum Reda*, NVIDIA; Guilin Liu, NVIDIA; Kevin Shih, NVIDIA; Robert Kirby, Nvidia; Jon Barker, Nvidia; David Tarjan, Nvidia; Andrew Tao, NVIDIA; Bryan Catanzaro, NVIDIA
P-2B-34Encoder-Decoder with Atrous Separable Convolution for Semantic Image SegmentationLiang-Chieh Chen*, Google Inc.; Yukun Zhu, Google Inc.; George Papandreou, Google; Florian Schroff, Google Inc.; Hartwig Adam, Google
P-2B-35VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual QuestionsQing Li*, University of Science and Technology of China; Qingyi Tao, Nanyang Techonological University; Shafiq Joty, Nanyang Technological University; Jianfei Cai, Nanyang Technological University; Jiebo Luo, U. Rochester
P-2B-36Image Super-Resolution Using Very Deep Residual Channel Attention NetworksYulun Zhang*, Northeastern University; Kunpeng Li, Northeastern University; kai li, northeastern university; Lichen Wang, Northeastern University; Bineng Zhong, Huaqiao University; YUN FU, Northeastern University
P-2B-37Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery DataTian Feng, University of New South Wales; Quang-Trung Truong, SUTD; Thanh Nguyen*, Deakin University, Australia; Jing Yu Koh, SUTD; Lap-Fai Yu, UMass Boston; Sai-Kit Yeung, Singapore University of Technology and Design; Alexander Binder, Singapore University of Technology and Design
P-2B-38Clustering Convolutional Kernels to Compress Deep Neural NetworksSanghyun Son, Seoul National University; Seungjun Nah, Seoul National University; Kyoung Mu Lee*, Seoul National University
P-2B-39Explainable Neural Computation via Stack Neural Module NetworksRonghang Hu*, University of California, Berkeley; Jacob Andreas, UC Berkeley; Trevor Darrell, UC Berkeley; Kate Saenko, Boston University
P-2B-40Quaternion Convolutional Neural NetworksXuanyu Zhu*, Shanghai Jiao Tong University; Yi Xu, Shanghai Jiao Tong University; Hongteng Xu, Duke University; Changjian Chen, Shanghai Jiao Tong University
P-2B-41Lip Movements Generation at a GlanceLele Chen*, University of Rochester; Zhiheng Li, WuHan University; Ross Maddox, University of Rochester; Zhiyao Duan, Unversity of Rochester; Chenliang Xu, University of Rochester
P-2B-42Toward Scale-Invariance and Position-Sensitive Object Proposal NetworksHsueh-Fu Lu, Umbo Computer Vision; Ping-Lin Chang*, Umbo Computer Vision; Xiaofei Du, Umbo Computer Vision
P-2B-43Constraints Matter in Deep Neural Network CompressionChangan Chen, Simon Fraser University; Fred Tung*, Simon Fraser University; Naveen Vedula, Simon Fraser University; Greg Mori, Simon Fraser University
P-2B-44MRF Optimization with Separable Convex Prior on Partially Ordered LabelsCsaba Domokos*, Technical University of Munich; Frank Schmidt, BCAI; Daniel Cremers, TUM
P-2B-45Switchable Temporal Propagation NetworkSifei Liu*, NVIDIA; Ming-Hsuan Yang, University of California at Merced; Guangyu Zhong, Dalian University of Technology; Jinwei Gu, Nvidia; Shalini De Mello, NVIDIA Research; Kautz Jan, NVIDIA; Varun Jampani, Nvidia Research
P-2B-46T2Net: Synthetic-to-Realistic Translation for Solving Single-Image Depth Estimation TasksChuanxia Zheng*, Nanyang Technological University; Tat-Jen Cham, Nanyang Technological University; Jianfei Cai, Nanyang Technological University
P-2B-47ArticulatedFusion: Real-time Reconstruction of Motion, Geometry and Segmentation Using a Single Depth CameraChao Li*, The University of Texas at Dallas; Zheheng Zhao, The University of Texas at Dallas; Xiaohu Guo, The University of Texas at Dallas
P-2B-48NNEval: Neural Network based Evaluation Metric for Image CaptioningNaeha Sharif*, University of Western Australia; Lyndon White, University of Western Australia; Mohammed Bennamoun, University of Western Australia; Syed Afaq Ali Shah, Department of Computer Science and Software Engineering, The University of Western Australia
P-2B-49Coreset-Based Convolutional Neural Network CompressionAbhimanyu Dubey*, Massachusetts Institute of Technology; Moitreya Chatterjee, University of Illinois at Urbana Champaign; Ramesh Raskar, Massachusetts Institute of Technology; Narendra Ahuja, University of Illinois at Urbana-Champaign, USA
P-2B-50Context Refinement for Object DetectionZhe Chen*, University of Sydney; Shaoli Huang, University of Sydney; Dacheng Tao, University of Sydney
P-2B-51Real-time ‘Actor-Critic’ TrackingBoyu Chen*, Dalian University of Technology; Dong Wang, Dalian University of Technology; Peixia Li, Dalian University of Technology; Huchuan Lu, Dalian University of Technology
P-2B-52Partial Adversarial Domain AdaptationZhangjie Cao, Tsinghua University; Lijia Ma, Tsinghua University; Mingsheng Long*, Tsinghua University; Jianmin Wang, Tsinghua University, China
P-2B-53Localization Recall Precision (LRP): A New Performance Metric for Object DetectionKemal Oksuz*, Middle East Technical University; Bar?? Can Çam, Roketsan; Emre Akbas, Middle East Technical University; Sinan Kalkan, Middle East Technical University
P-2B-54Improving Embedding Generalization via Scalable Neighborhood Component AnalysisZhirong Wu*, UC Berkeley; Alexei Efros, UC Berkeley; Stella Yu, UC Berkeley / ICSI
P-2B-55Leveraging Motion Priors in Videos for Improving Human SegmentationYu-Ting Chen*, NTHU; Wen-Yen Chang, NTHU; Hai-Lun Lu, NTHU; Tingfan Wu, Umbo Computer Vision; Min Sun, NTHU
P-2B-56Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image DerainingXia Li*, Peking University Shenzhen Graduate School; Jianlong Wu, Peking University; Zhouchen Lin, Peking University; Hong Liu, Peking University Shenzhen Graduate School; Hongbin Zha, Peking University, China
P-2B-57Statistically-motivated Second-order PoolingKaicheng Yu*, EPFL; Mathieu Salzmann, EPFL
P-2B-58SegStereo: Exploiting Semantic Information for Disparity EstimationGuorun Yang*, Tsinghua University; Hengshuang Zhao, The Chinese University of Hong Kong; Jianping Shi, Sensetime Group Limited; Jia Jiaya, Chinese University of Hong Kong
P-2B-59Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature AggregationTao Song, Hikvision Research Institute; Leiyu Sun, Hikvision Research Institute; Di Xie*, Hikvision Research Institute; Haiming Sun, Hikvision Research Institute; Shiliang Pu, Hikvision Research Institute
P-2B-60Object Detection with an Aligned Spatial-Temporal MemoryFanyi Xiao*, University of California Davis; Yong Jae Lee, University of California, Davis
P-2B-61Learning to Drive with 360° Surround-View Cameras and a MapSimon Hecker*, ETH Zurich; Dengxin Dai, ETH Zurich; Luc Van Gool, ETH Zurich
P-2B-62Monocular Scene Parsing and Reconstruction using 3D Holistic Scene GrammarSiyuan Huang*, UCLA; Siyuan Qi, UCLA; Yixin Zhu, UCLA; Yinxue Xiao, University of California, Los Angeles; Yuanlu Xu, University of California, Los Angeles; Song-Chun Zhu, UCLA
P-2B-63Coded Illumination and Imaging for Fluorescence Based ClassificationYuta Asano, Tokyo Institute of Technology; Misaki Meguro, Tokyo Institute of Technology; Chao Wang, Kyushu Institute of Technology; Antony Lam*, Saitama University; Yinqiang Zheng, National Institute of Informatics; Takahiro Okabe, Kyushu Institute of Technology; Imari Sato, National Institute of Informatics
P-2B-64Modality Distillation with Multiple Stream Networks for Action RecognitionNuno Garcia, IIT; Pietro Morerio*, IIT; Vittorio Murino, Istituto Italiano di Tecnologia
P-2B-65VideoMatch: Matching based Video Object SegmentationYuan-Ting Hu*, University of Illinois at Urbana-Champaign; Jia-Bin Huang, Virginia Tech; Alexander Schwing, UIUC
P-2B-66Superpixel Sampling NetworksVarun Jampani*, Nvidia Research; Deqing Sun, NVIDIA; Ming-Yu Liu, NVIDIA; Ming-Hsuan Yang, University of California at Merced; Kautz Jan, NVIDIA
P-2B-67Deep Bilinear Learning for RGB-D Action RecognitionHU Jian-Fang, Sun Yat-sen University; Jason Wei Shi Zheng*, Sun Yat Sen University; Pan Jiahui, Sun Yat-sen University; Jian-Huang Lai, Sun Yat-sen University; Jianguo Zhang, University of Dundee
P-2B-68Multi-object Tracking with Neural Gating using bilinear LSTMsChanho Kim*, Georgia Tech; Fuxin Li, Oregon State University; James Rehg, Georgia Institute of Technology
P-2B-69Direct Sparse Odometry With Rolling ShutterDavid Schubert*, Technical University of Munich; Vladyslav Usenko, TU Munich; Nikolaus Demmel, TUM; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM
P-2B-70Person Search via A Mask-guided Two-stream CNN ModelDi Chen*, Nanjing University of Science and Techonology; Shanshan Zhang, Max Planck Institute for Informatics; Wanli Ouyang, CUHK; Jian Yang, Nanjing University of Science and Technology; Ying Tai, Tencent
P-2B-71Imagine This! Scripts to Compositions to VideosTanmay Gupta*, UIUC; Dustin Schwenk, Allen Institute for Artificial Intelligence; Ali Farhadi, University of Washington; Derek Hoiem, University of Illinois at Urbana-Champaign; Aniruddha Kembhavi, Allen Institute for Artificial Intelligence
P-2B-72Multiresolution Tree Networks for Point Cloud ProcesingMatheus Gadelha*, University of Massachusetts Amherst; Subhransu Maji, University of Massachusetts, Amherst; Rui Wang, U Massachusetts
P-2B-73Quantization Mimic: Towards Very Tiny CNN for Object DetectionYi Wei*, Tsinghua University; Xinyu Pan, MMLAB, CUHK; Hongwei Qin, SenseTime; Junjie Yan, Sensetime; Wanli Ouyang, CUHK
P-2B-74Multi-scale Residual Network for Image Super-ResolutionJuncheng Li, East China Normal University; Faming Fang*, East China Normal University; Kangfu Mei, Jiangxi Normal University; Guixu Zhang, East China Normal University
P-2B-75BodyNet: Volumetric Inference of 3D Human Body ShapesGul Varol*, INRIA; Duygu Ceylan, Adobe Research; Bryan Russell, Adobe Research; Jimei Yang, Adobe; Ersin Yumer, Argo AI; Ivan Laptev, INRIA Paris; Cordelia Schmid, INRIA
P-2B-763D Recurrent Neural Networks with Context Fusion for Point Cloud Semantic SegmentationXiaoqing Ye*, SIMIT; Jiamao Li, SIMIT; Hexiao Huang, Shanghai Opening University; Xiaolin Zhang, SIMIT
P-2B-77Robust Anchor Embedding for Unsupervised Video Re-Identification in the WildMang YE*, Hong Kong Baptist University; Xiangyuan Lan, Department of Computer Science, Hong Kong Baptist University; PongChi Yuen, Department of Computer Science, Hong Kong Baptist University
P-2B-78Towards Robust Neural Networks via Random Self-ensembleXuanqing Liu, UC Davis Department of Computer Science; Minhao Cheng, University of California, Davis; Huan Zhang, UC Davis; Cho-Jui Hsieh*, UC Davis Department of Computer Science and Statistics
P-2B-79SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional FiltersYifan Xu, Tsinghua University; Tianqi Fan, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Mingye Xu, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Long Zeng, Tsinghua University; Yu Qiao*, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
P-2B-80CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-drivingXiaodan Liang*, Carnegie Mellon University; Tairui Wang, Petuum Inc; Luona Yang, Carnegie Mellon University; Eric Xing, Petuum Inc.
P-2B-81Normalized Blind DeconvolutionMeiguang Jin*, University of Bern; Stefan Roth, TU Darmstadt; Paolo Favaro, Bern University, Switzerland
P-2B-82Few-Shot Human Motion Prediction via Meta-LearningLiangyan Gui*, Carnegie Mellon University; Yu-Xiong Wang, Carnegie Mellon University; Deva Ramanan, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University
P-2B-83Learning to Segment via Cut-and-PasteTal Remez*, Tel-Aviv University; Matthew Brown, Google; Jonathan Huang, Google
P-2B-84Weakly-supervised 3D Hand Pose Estimation from Monocular RGB ImagesYujun Cai*, Nanyang Technological University; Liuhao Ge, NTU; Jianfei Cai, Nanyang Technological University; Junsong Yuan, State University of New York at Buffalo, USA
P-2B-85DeepIM: Deep Iterative Matching for 6D Pose EstimationYi Li*, Tsinghua University; Gu Wang, Tsinghua University; Xiangyang Ji, Tsinghua University; Yu Xiang, University of Michigan; Dieter Fox, University of Washington
P-2B-86Jointly Discovering Visual Objects and Spoken Words from Raw Sensory InputDavid Harwath*, MIT CSAIL; Adria Recasens, Massachusetts Institute of Technology; Dídac Surís, Universitat Politecnica de Catalunya; Galen Chuang, MIT; Antonio Torralba, MIT; James Glass, MIT
P-2B-87A Style-aware Content Loss for Real-time HD Style TransferArtsiom Sanakoyeu*, Heidelberg University; Dmytro Kotovenko, Heidelberg University; Bjorn Ommer, Heidelberg University
P-2B-88Implicit 3D Orientation Learning for 6D Object Detection from RGB ImagesMartin Sundermeyer*, German Aerospace Center (DLR); Zoltan Marton, DLR; Maximilian Durner, DLR; Rudolph Triebel, German Aerospace Center (DLR)
P-2B-89Scale-Awareness of Light Field Camera based Visual OdometryNiclas Zeller*, Karlsruhe University of Applied Sciences; Franz Quint, Karlsruhe University of Applied Sciences; Uwe Stilla, Technische Universitaet Muenchen
P-2B-90Audio-Visual Scene Analysis with Self-Supervised Multisensory FeaturesAndrew Owens*, UC Berkeley; Alexei Efros, UC Berkeley

Poster session 3A
3AWednesday, September 12Poster session 10:00 AM - 12:00 PM
P-3A-01Efficient Sliding Window Computation for NN-Based Template MatchingLior Talker*, Haifa University; Yael Moses, IDC, Israel; Ilan Shimshoni, University of Haifa
P-3A-02Active Stereo Net: End-to-End Self-Supervised Learning for Active Stereo SystemsYinda Zhang*, Princeton University; Sean Fanello, Google; Sameh Khamis, Google; Christoph Rhemann, Google; Julien Valentin, Google; Adarsh Kowdle, Google; Vladimir Tankovich, Google; Shahram Izadi, Google; Thomas Funkhouser, Princeton, USA
P-3A-03GAL: Geometric Adversarial Loss for Single-View 3D-Object ReconstructionLi Jiang*, The Chinese University of Hong Kong; Xiaojuan Qi, CUHK; Shaoshuai SHI, The Chinese University of Hong Kong; Jia Jiaya, Chinese University of Hong Kong
P-3A-04Learning to Reconstruct High-quality 3D Shapes with Cascaded Fully Convolutional NetworksYan-Pei Cao*, Tsinghua University; Zheng-Ning Liu, Tsinghua University; Zheng-Fei Kuang, Tsinghua University; Shi-Min Hu, Tsinghua University
P-3A-05Deep Reinforcement Learning with Iterative Shift for Visual TrackingLiangliang Ren, Tsinghua University; Xin Yuan, Tsinghua University; Jiwen Lu*, Tsinghua University; Ming Yang, Horizon Robotics; Jie Zhou, Tsinghua University, China
P-3A-06CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of MapsPaul Hongsuck Seo*, POSTECH; Tobias Weyand, Google Inc.; Jack Sim, Google LLC; Bohyung Han, Seoul National University
P-3A-07Bayesian Instance Segmentation in Open Set WorldTrung Pham*, NVIDIA; Vijay Kumar B G, University of Adelaide; Thanh-Toan Do, The University of Adelaide; Gustavo Carneiro, University of Adelaide; Ian Reid, University of Adelaide, Australia
P-3A-08Characterizing Adversarial Examples Based on Spatial Consistency Information for Semantic SegmentationChaowei Xiao, University of Michigan, Ann Arbor; Ruizhi Deng, Simon Fraser University; Bo Li*, University of Illinois at Urbana–Champaign and UC Berkeley; Fisher Yu, UC Berkeley;
Mingyan Liu, University of Michigan, Ann Arbor; Dawn Song, UC Berkeley
P-3A-09CubeNet: Equivariance to 3D Rotation and TranslationDaniel Worrall*, UCL; Gabriel Brostow, University College London
P-3A-103D Face Reconstruction from Light Field Images: A Model-free ApproachMingtao Feng, Hunan Unversity; Syed Zulqarnain Gilani*, The University of Western Australia; Yaonan Wang, Hunan University; Ajmal Mian, University of Western Australia
P-3A-11stagNet: An Attentive Semantic RNN for Group Activity RecognitionMengshi Qi*, Beihang University; Jie Qin, ETH Zurich; Annan Li, Beijing University of Aeronautics and Astronautics; Yunhong Wang, State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China; Jiebo Luo, U. Rochester; Luc Van Gool, ETH Zurich
P-3A-12Supervising the new with the old: learning SFM from SFMMaria Klodt*, University of Oxford; Andrea Vedaldi, Oxford University
P-3A-13PSANet: Point-wise Spatial Attention Network for Scene ParsingHengshuang Zhao*, The Chinese University of Hong Kong; Yi ZHANG, The Chinese University of Hong Kong; Shu Liu, CUHK; Jianping Shi, Sensetime Group Limited; Chen Change Loy, Chinese University of Hong Kong; Dahua Lin, The Chinese University of Hong Kong; Jia Jiaya, Chinese University of Hong Kong
P-3A-14FishEyeRecNet: A Multi-Context Collaborative Deep Network for Fisheye Image Recti_x000c_cationXiaoqing Yin*, University of Sydney; Xinchao Wang, Stevens Institute of Technology; Jun Yu, HDU; Maojun Zhang, National University of Defense Technology, China; Pascal Fua, EPFL, Switzerland; Dacheng Tao, University of Sydney
P-3A-15ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face AttributesTaihong Xiao*, Peking University; Jiapeng Hong, Peking University; Jinwen Ma, Peking University
P-3A-16Deep Bilevel LearningSimon Jenni*, Universität Bern; Paolo Favaro, Bern University, Switzerland
P-3A-17ADVIO: An Authentic Dataset for Visual-Inertial OdometrySantiago Cortes, Aalto University; Arno Solin*, Aalto University; Esa Rahtu, Tampere University of Technology; Juho Kannala, Aalto University, Finland
P-3A-18D2S: Densely Segmented Supermarket DatasetPatrick Follmann*, MVTec Software GmbH; Tobias Böttger, MVTec Software GmbH; Philipp Härtinger, MVTec Software GmbH; Rebecca König, MVTec Software GmbH; Markus Ulrich, MVTec Software GmbH
P-3A-19PyramidBox: A Context-assisted Single Shot Face DetectorXu Tang, Baidu; Daniel Du*, Baidu; Zeqiang He, Baidu; jingtuo liu, baidu
P-3A-20Structured Siamese Network for Real-Time Visual TrackingYunhua Zhang, Dalian University of Technology; Lijun Wang, Dalian University of Technology; Dong Wang, Dalian University of Technology; Mengyang Feng, Dalian University of Technology; Huchuan Lu*, Dalian University of Technology; Jinqing Qi, Dalian University of Technology
P-3A-21Probabilistic Signed Distance Function for On-the-fly Scene ReconstructionWei Dong*, Peking University; Qiuyuan Wang, Peking University; Xin Wang, Peking University; Hongbin Zha, Peking University, China
P-3A-223D Vehicle Trajectory Reconstruction in Monocular Video Data Using Environment Structure ConstraintsSebastian Bullinger*, Fraunhofer IOSB; Christoph Bodensteiner, Fraunhofer IOSB; Michael Arens, Fraunhofer IOSB; Rainer Stiefelhagen, Karlsruhe Institute of Technology
P-3A-23Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial NetworksMinjun Li*, Fudan University; Haozhi Huang, Tencent AI Lab; Lin Ma, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab; Yu-Gang Jiang, Fudan University
P-3A-24Pose-Normalized Image Generation for Person Re-identificationXuelin Qian, Fudan University; Yanwei Fu*, Fudan Univ.; Tao Xiang, Queen Mary, University of London, UK; Wenxuan Wang, Fudan University; Jie Qiu, Nara Institute of Science and Technology; Yang Wu, Nara Institute of Science and Technology; Yu-Gang Jiang, Fudan University; Xiangyang Xue, Fudan University
P-3A-25Action Anticipation with RBF Kernelized Feature Mapping RNNYuge Shi*, Australian National University; Basura Fernando, Australian National University; RICHARD HARTLEY, Australian National University, Australia
P-3A-26Rendering Portraitures from Monocular Camera and BeyondXiangyu Xu*, Tsinghua University; Deqing Sun, NVIDIA; Sifei Liu, NVIDIA; Wenqi Ren, Institute of Information Engineering, Chinese Academy of Sciences; Yu-Jin Zhang, Tsinghua University; Ming-Hsuan Yang, University of California at Merced; Jian Sun, Megvii, Face++
P-3A-27Recovering 3D Planes from a Single Image via Convolutional Neural NetworksFengting Yang*, Pennsylvania State University ; Zihan Zhou, Penn State University
P-3A-28The Devil of Face Recognition is in the NoiseLiren Chen*, Sensetime Group Limited; Fei Wang, SenseTime; Cheng Li, SenseTime Research; Shiyao Huang, SenseTime Co Ltd; Yanjie Chen, sensetime; Chen Qian, SenseTime; Chen Change Loy, Chinese University of Hong Kong
P-3A-293DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene SegmentationAngela Dai*, Stanford University; Matthias Niessner, Technical University of Munich
P-3A-30Joint optimization for compressive video sensing and reconstruction under hardware constraintsMichitaka Yoshida*, Kyushu University; Akihiko Torii, Tokyo Institute of Technology, Japan; Masatoshi Okutomi, Tokyo Institute of Technology; Kenta Endo, Hamamatsu Photonics K. K.; Yukinobu Sugiyama, Hamamatsu Photonics K. K.; Hajime Nagahara, Osaka University
P-3A-31Consensus-Driven Propagation in Massive Unlabeled Data for Face RecognitionXiaohang Zhan*, The Chinese University of Hong Kong; Ziwei Liu, The Chinese University of Hong Kong; Junjie Yan, Sensetime Group Limited; Dahua Lin, The Chinese University of Hong Kong; Chen Change Loy, Chinese University of Hong Kong
P-3A-32Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving CameraTimo von Marcard*, University of Hannover; Roberto Henschel, Leibniz University of Hannover; Michael Black, Max Planck Institute for Intelligent Systems; Bodo Rosenhahn, Leibniz University Hannover; Gerard Pons-Moll, MPII, Germany
P-3A-33Predicting Future Instance Segmentation by Forecasting Convolutional FeaturesPauline Luc*, Facebook AI Research; Camille Couprie, Facebook; yann lecun, Facebook; Jakob Verbeek, INRIA
P-3A-34PS-FCN: A Flexible Learning Framework for Photometric StereoGuanying Chen*, The University of Hong Kong; Kai Han, University of Oxford; Kwan-Yee Wong, The University of Hong Kong
P-3A-35Unsupervised Class-Specific DeblurringNimisha T M*, Indian Institute of Technology Madras; Sunil Kumar, Indian Institute of Technology Madras; Rajagopalan Ambasamudram, Indian Institute of Technology Madras
P-3A-36Face Super-resolution Guided by Facial Component HeatmapsXin Yu*, Australian National University; Basura Fernando, Australian National University; Bernard Ghanem, KAUST; Fatih Porikli, ANU; RICHARD HARTLEY, Australian National University, Australia
P-3A-37A Contrario Horizon-First Vanishing Point Detection Using Second-Order Grouping LawsGilles Simon*, Université de Lorraine; Antoine Fond, Université de Lorraine; Marie-Odile Berger, INRIA
P-3A-38Fast, Accurate, and, Lightweight Super-Resolution with Cascading Residual NetworkNamhyuk Ahn, Ajou University; Byungkon Kang, Ajou University; Kyung-Ah Sohn*, Ajou University
P-3A-39Face Recognition with Contrastive ConvolutionChunrui Han*, ICT, Chinese Academy of Sciences, China; Shiguang Shan, Chinese Academy of Sciences; Meina Kan, ICT, CAS; Shuzhe Wu, Chinese Academy of Sciences; xilin chen, ICT, Chinese Academy of Sciences, China
P-3A-40Deforming Autoencoders: Unsupervised Disentangling of Shape and AppearanceZhixin Shu*, Stony Brook University; Mihir Sahasrabudhe, CentraleSupelec; Alp Guler, INRIA; Dimitris Samaras, Stony Brook University; Nikos Paragios, Therapanacea; Iasonas Kokkinos , UCL
P-3A-41NetAdapt: Platform-Aware Neural Network Adaptation for Mobile ApplicationsTien-Ju Yang*, Massachusetts Institute of Technology; Andrew Howard, Google; Bo Chen, Google; Xiao Zhang, Google; Alec Go, Google; Vivienne Sze, Massachusetts Institute of Technology; Hartwig Adam, Google
P-3A-42ExFuse: Enhancing Feature Fusion for Semantic SegmentationZhenli Zhang*, Fudan University; Xiangyu Zhang, Megvii Inc; Chao Peng, Megvii(Face++) Inc; Jian Sun, Megvii, Face++
P-3A-43AugGAN: Cross Domain Adaptation with GAN-based Data AugmentationSheng-Wei Huang, National Tsing Hua University; Che-Tsung Lin*, National Tsing Hua University; Shu-Ping Chen, National Tsing Hua University; Yen-Yi Wu, NTHU CS; Po-Hao Hsu, National Tsing Hua University; Shang-Hong Lai , National Tsing Hua University
P-3A-44LAPCSR:A Deep Laplacian Pyramid Generative Adversarial Network for Scalable Compressive Sensing ReconstructionKai Xu*, Arizona State University; Zhikang Zhang, Arizona State University; Fengbo Ren, Arizona State University
P-3A-45U-PC: Unsupervised Planogram ComplianceArchan Ray, University of Massachusetts Amherst; Nishant Kumar, SMART-FM; Avishek Shaw*, Tata Consultancy Services Limited; Dipti Prasad Mukherjee, ISI, Kolkata
P-3A-46Seeing Tree Structure from VibrationTianfan Xue, MIT; Jiajun Wu*, MIT; Zhoutong Zhang, MIT; Chengkai Zhang, MIT; Joshua Tenenbaum, MIT; Bill Freeman, MIT
P-3A-47A Dataset of Flash and Ambient Illumination Pairs from the CrowdYagiz Aksoy*, ETH Zurich; Changil Kim, MIT CSAIL; Petr Kellnhofer, MIT; Sylvain Paris, Adobe Research; Mohamed A. Elghareb, Qatar Computing Research Institute; Marc Pollefeys, ETH Zurich; Wojciech Matusik, MIT
P-3A-48Compressing the Input for CNNs with the First-Order Scattering TransformEdouard Oyallon*, CentraleSupélec; Eugene Belilovsky, Inria Galen / KU Leuven; Sergey Zagoruyko, Inria; Michal Valko, Inria
P-3A-49Distractor-aware Siamese Networks for Visual Object TrackingZheng Zhu*, CASIA; Qiang Wang, University of Chinese Academy of Sciences; Bo Li, sensetime; Wu Wei, Sensetime; Junjie Yan, Sensetime Group Limited
P-3A-50"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention"Tianlang Chen*, University of Rochester; Zhongping Zhang, University of Rochester; Quanzeng You, Microsoft; CHEN FANG, Adobe Research, San Jose, CA; Zhaowen Wang, Adobe Research; Hailin Jin, Adobe Research; Jiebo Luo, U. Rochester
P-3A-51Constrained Optimization Based Low-Rank Approximation of Deep Neural NetworksChong Li*, University of Washington; C.J. Richard Shi, University of Washington
P-3A-52Extending Layered Models to 3D MotionDong Lao, KAUST; Ganesh Sundaramoorthi*, Kaust
P-3A-53ExplainGAN: Model Explanation via Decision Boundary Crossing TransformationsNathan Silberman*, Butterfly Network; Pouya Samangouei, Butterfly Network; Liam Nakagawa, Butterfly Network; Ardavan Saeedi, Butterfly Network Inc
P-3A-54Adding Attentiveness to the Neurons in Recurrent Neural NetworksPengfei Zhang, Xi'an Jiaotong University; Jianru Xue, Xi'an Jiaotong University; Cuiling Lan*, Microsoft Research; Wenjun Zeng, Microsoft Research; Zhanning Gao, Xi'an Jiaotong University; Nanning Zheng, Xi'an Jiaotong University
P-3A-55ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic SegmentationSachin Mehta*, University of Washington; Mohammad Rastegari, Allen Institute for Artificial Intelligence; Anat Caspi, University of Washington; Linda Shapiro, University of Washington; Hannaneh Hajishirzi, University of Washington
P-3A-56Learning Human-Object Interactions by Graph Parsing Neural NetworksSiyuan Qi*, UCLA; Wenguan Wang, Beijing Institute of Technology; Baoxiong Jia, UCLA; Jianbing Shen, Beijing Institute of Technology; Song-Chun Zhu, UCLA
P-3A-57BOP: Benchmark for 6D Object Pose Estimation Tomas Hodan*, Czech Technical University in Prague; Frank Michel, Technical University Dresden; Eric Brachmann, TU Dresden; Wadim Kehl, Toyota Research Institute; Anders Buch, University of Southern Denmark; Dirk Kraft, Syddansk Universitet; Bertram Drost, MVTec Software GmbH; Joel Vidal, National Taiwan University of Science and Technology; Stephan Ihrke , Fraunhofer ivi ; Xenophon Zabulis, FORTH; Caner Sahin, Imperial College London; Fabian Manhardt, TU Munich; Federico Tombari, Technical University of Munich, Germany; Tae-Kyun Kim, Imperial College London; Jiri Matas, CMP CTU FEE; Carsten Rother, University of Heidelberg
P-3A-58RCAA: Relational Context-Aware Agents for Person SearchXiaojun Chang*, Carnegie Mellon University; Po-Yao Huang, Carnegie Mellon University; Xiaodan Liang, Carnegie Mellon University; Yi Yang, UTS; Alexander Hauptmann, Carnegie Mellon University
P-3A-59DetNet: Design Backbone for Object DetectionZeming Li*, Tsinghua University;Megvii Inc; Chao Peng, Megvii(Face++) Inc; Gang Yu, Face++; Yangdong Deng, Tsinghua University; Xiangyu Zhang, Megvii Inc; Jian Sun, Megvii, Face++
P-3A-60Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial OdometryYonggen Ling*, Tencent AI Lab; Linchao Bao, Tencent AI Lab; Zequn Jie, Tencent AI Lab; Fengming Zhu, Tencent AI Lab; Ziyang Li, Tencent AI Lab; Shanmin Tang, Tencent AI Lab; YongSheng Liu, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
P-3A-61Exploiting temporal information for 3D human pose estimationMir Rayat Imtiaz Hossain*, University of British Columbia; Jim Little, University of British Columbia, Canada
P-3A-62Joint Representation and Truncated Inference Learning for Correlation Filter based TrackingYingjie Yao, Harbin Institute of technology; Xiaohe Wu, Harbin Institute of technology; Lei Zhang, University of Pittsburgh; Shiguang Shan, Chinese Academy of Sciences; Wangmeng Zuo*, Harbin Institute of Technology, China
P-3A-63Learning to Zoom: a Saliency-Based Sampling Layer for Neural NetworksAdria Recasens*, Massachusetts Institute of Technology; Petr Kellnhofer, MIT; Simon Stent, Toyota Research Institute; Wojciech Matusik, MIT; Antonio Torralba, MIT
P-3A-64Does Haze Removal Help Image Classification?Yanting Pei*, Beijing Jiaotong University; Yaping Huang, Beijing Jiaotong University; Qi Zou, Beijing Jiaotong University; Yuhang Lu, University of South Carolina; Song Wang, University of South Carolina
P-3A-65Learning Local Descriptors by Integrating Geometry ConstraintsZixin Luo*, HKUST; Tianwei Shen, HKUST; Lei Zhou, HKUST; Siyu Zhu, HKUST; Runze Zhang, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
P-3A-66Repeatability Is Not Enough: Learning Affine Regions via DiscriminabilityDmytro Mishkin*, Czech Technical University in Prague; Filip Radenovic, Visual Recognition Group, CTU Prague; Jiri Matas, CMP CTU FEE
P-3A-67Macro-Micro Adversarial Network for Human ParsingYawei Luo*, University of Technology Sydney; Zhedong Zheng, University of Technology Sydney; Liang Zheng, University of Technology Sydney; Yi Yang, UTS
P-3A-68Learning Class Prototypes via Structure Alignment for Zero-Shot RecognitionHuajie Jiang, ICT, CAS; Ruiping Wang*, ICT, CAS; Shiguang Shan, Chinese Academy of Sciences; Xilin Chen, China
P-3A-69SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional ImagesBenjamin Coors*, MPI Intelligent Systems, Bosch; Alexandru Condurache, Bosch; Andreas Geiger, MPI-IS and University of Tuebingen
P-3A-70A dataset and architecture for visual reasoning with a working memoryGuangyu Robert Yang*, Columbia University; Igor Ganichev, Google Brain; Xiao-Jing Wang, New York University; Jon Shlens, Google; David Sussillo, Google Brain
P-3A-71Flow-Grounded Spatial-Temporal Video Prediction from Still ImagesYijun Li*, University of California, Merced; CHEN FANG, Adobe Research, San Jose, CA; Jimei Yang, Adobe; Zhaowen Wang, Adobe Research; Xin Lu, Adobe; Ming-Hsuan Yang, University of California at Merced
P-3A-72The Unmanned Aerial Vehicle Benchmark: Object Detection and TrackingDawei Du*, University of Chinese Academy of Sciences; Yuankai Qi, Harbin Institute of Technology; Hongyang Yu, Harbin Institute of Technology; Yifang Yang, University of Chinese Academy of Sciences; Kaiwen Duan, University of Chinese Academy of Sciences; guorong Li, CAS; Weigang Zhang, Harbin Institute of Technology, Weihai; Qingming Huang, University of Chinese Academy of Sciences; Qi Tian , The University of Texas at San Antonio
P-3A-73Selective Zero-Shot Classification with Augmented AttributesJie Song, College of Computer Science and Technology, Zhejiang University; Chengchao Shen, Zhejiang University; Jie Lei, Zhejiang University; An-Xiang Zeng, Alibaba; Kairi Ou, Alibaba; Dacheng Tao, University of Sydney; Mingli Song*, Zhejiang University
P-3A-74Action Search: Spotting Actions in Videos and Its Application to Temporal Action LocalizationHumam Alwassel*, KAUST; Fabian Caba, KAUST; Bernard Ghanem, KAUST
P-3A-75A Principled Approach to Hard Triplet Generation via Adversarial NetsYiru Zhao*, Shanghai Jiao Tong University; Zhongming Jin, Alibaba Group; Guo-Jun Qi, University of Central Florida; Hongtao Lu, Shanghai Jiao Tong University; Xian-Sheng Hua, Alibaba Group
P-3A-76Pose Guided Human Video GenerationCeyuan Yang*, SenseTime Group Limited; Zhe Wang, Sensetime Group Limited; Xinge Zhu, Sensetime Group Limited; Chen Huang, Carnegie Mellon University; Jianping Shi, Sensetime Group Limited; Dahua Lin, The Chinese University of Hong Kong
P-3A-77Deep Directional Statistics: Pose Estimation with Uncertainty QuantificationSergey Prokudin*, Max Planck Institute for Intelligent Systems; Peter Gehler, Amazon; Sebastian Nowozin, Microsoft Research Cambridge
P-3A-78Learning 3D Human Pose from Structure and MotionRishabh Dabral*, IIT Bombay; Anurag Mundhada, IIT Bombay; Abhishek Sharma, Gobasco AI Labs
P-3A-79Learning Dynamic Memory Networks for Object TrackingTianyu Yang*, City University of Hong Kong; Antoni Chan, City University of Hong Kong, Hong, Kong
P-3A-80Faces as Lighting Probes via Unsupervised Deep Highlight ExtractionRenjiao Yi*, Simon Fraser University; Chenyang Zhu, Simon Fraser University; Ping Tan, Simon Fraser University; Stephen Lin, Microsoft Research
P-3A-81CurriculumNet: Learning from Large-Scale Web Images without Human AnnotationSheng Guo*, Malong Technologies; Weilin Huang, Malong Technologies; Haozhi Zhang, Malong Technologies
P-3A-82Joint Task-Recursive Learning for Semantic Segmentation and Depth EstimationZhenyu Zhang*, Nanjing University of Sci & Tech; Zhen Cui, Nanjing University of Science and Technology; Zequn Jie, Tencent AI Lab; Xiang Li, NJUST; Chunyan Xu, Nanjing University of Science and Technology; Jian Yang, Nanjing University of Science and Technology
P-3A-83HybridFusion: Real-Time Performance Capture Using a Single Depth Sensor and Sparse IMUsZerong Zheng*, Tsinghua University; Tao Yu, Beihang University; Hao Li, Pinscreen/University of Southern California/USC ICT; Kaiwen Guo, Google Inc.; Qionghai Dai, Tsinghua University; Lu Fang, Tsinghua University; Yebin Liu, Tsinghua University
P-3A-84Associating Inter-Image Salient Instances for Weakly Supervised Semantic SegmentationRuochen Fan*, Tsinghua University; Qibin Hou, Nankai University; Ming-Ming Cheng, Nankai University; Gang Yu, Face++; Ralph Martin, Cardiff University; Shimin Hu, Tsinghua University
P-3A-85Ask, Acquire and Attack: Data-free UAP generation using Class impressionsKonda Reddy Mopuri*, Indian Institute of Science, Bangalore; Phani Krishna Uppala, Indian Institute of Science; Venkatesh Babu RADHAKRISHNAN, Indian Institute of Science
P-3A-86A Scalable Exemplar-based Subspace Clustering Algorithm for Class-Imbalanced DataChong You*, Johns Hopkins University; Chi Li, Johns Hopkins University; Daniel Robinson, Johns Hopkins University; Rene Vidal, Johns Hopkins University
P-3A-87Find and Focus: Retrieve and Localize Video Events with Natural Language QueriesDian SHAO*, The Chinese University of Hong Kong; Yu Xiong, The Chinese University of HK; Yue Zhao, The Chinese University of Hong Kong; Qingqiu Huang, CUHK; Yu Qiao, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Dahua Lin, The Chinese University of Hong Kong
P-3A-88Graininess-Aware Deep Feature Learning for Pedestrian DetectionChunze Lin, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
P-3A-89MVSNet: Depth Inference for Unstructured Multi-view StereoYao Yao*, The Hong Kong University of Science and Technology; Zixin Luo, HKUST; Shiwei Li, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
P-3A-90PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D RegistrationYifei Shi, Princeton University; Kai Xu, Princeton University and National University of Defense Technology; Matthias Niessner, Technical University of Munich; Szymon Rusinkiewicz, Princeton University; Thomas Funkhouser*, Princeton, USA
P-3A-91Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse OdometryNan Yang*, Technical University of Munich; Rui Wang, Technical University of Munich; Joerg Stueckler, Technical University of Munich; Daniel Cremers, TUM

Poster session 3B
3BWednesday, September 12Poster session 02:30 PM - 04:00 PM
P-3B-01GANimation: Anatomically-aware Facial Animation from a Single ImageAlbert Pumarola*, Institut de Robotica i Informatica Industrial; Antonio Agudo, Institut de Robotica i Informatica Industrial, CSIC-UPC; Aleix Martinez, The Ohio State University; Alberto Sanfeliu, Industrial Robotics Institute; Francesc Moreno, IRI
P-3B-02Unsupervised Geometry-Aware Representation for 3D Human Pose EstimationHelge Rhodin*, EPFL; Mathieu Salzmann, EPFL; Pascal Fua, EPFL, Switzerland
P-3B-03Efficient Semantic Scene Completion Network with Spatial Group ConvolutionJiahui Zhang*, Tsinghua University; Hao Zhao, Intel Labs China; Anbang Yao, Intel Labs China; Yurong Chen, Intel Labs China; Hongen Liao, Tsinghua University
P-3B-04Deep Autoencoder for Combined Human Pose Estimation and Body Model UpscalingMatthew Trumble*, University of Surrey; Andrew Gilbert, University of Surrey; John Collomosse, Adobe Research; Adrian Hilton, University of Surrey
P-3B-05Highly-Economized Multi-View Binary Compression for Scalable Image ClusteringZheng Zhang*, Harbin Institute of Technology Shenzhen Graduate School; Li Liu, the inception institute of artificial intelligence; Jie Qin, ETH Zurich; Fan Zhu, the inception institute of artificial intelligence ; Fumin Shen, UESTC; Yong Xu, Harbin Institute of Technology Shenzhen Graduate School; Ling Shao, Inception Institute of Artificial Intelligence; Heng Tao Shen, University of Electronic Science and Technology of China (UESTC)
P-3B-06Asynchronous, Photometric Feature Tracking using Events and FramesDaniel Gehrig, University of Zurich; Henri Rebecq*, University of Zurich; Guillermo Gallego, University of Zurich; Davide Scaramuzza, University of Zurich& ETH Zurich, Switzerland
P-3B-07Deterministic Consensus Maximization with Biconvex ProgrammingZhipeng Cai*, The University of Adelaide; Tat-Jun Chin, University of Adelaide; Huu Le, University of Adelaide; David Suter, University of Adelaide
P-3B-08Depth-aware CNN for RGB-D SegmentationWeiyue Wang*, USC; Ulrich Neumann, USC
P-3B-09Object Detection in Video with Spatiotemporal Sampling NetworksGedas Bertasius*, University of Pennsylvania; Lorenzo Torresani, Dartmouth College; Jianbo Shi, University of Pennsylvania
P-3B-10Dependency-aware Attention Control for Unconstrained Face Recognition with Image SetsXiaofeng Liu*, Carnegie Mellon University; B. V. K. Vijaya Kumar, CMU, USA; Chao Yang, University of Southern California; Qingming Tang, TTIC; Jane You, The Hong Kong Polytechnic University
P-3B-11License Plate Detection and Recognition in Unconstrained ScenariosSérgio Silva*, UFRGS; Claudio Jung, UFRGS
P-3B-12Revisiting the Inverted Indices for Billion-Scale Approximate Nearest NeighborsDmitry Baranchuk*, MSU / Yandex; Artem Babenko, MIPT/Yandex; Yury Malkov, NTechLab
P-3B-13Zero-Annotation Object Detection with Web Knowledge TransferQingyi Tao*, Nanyang Techonological University; Hao Yang, NTU; Jianfei Cai, Nanyang Technological University
P-3B-14Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable ModelBaris Gecer*, Imperial College London; Binod Bhattarai, Imperial College London; Josef Kittler, University of Surrey, UK; Tae-Kyun Kim, Imperial College London
P-3B-15Improving Shape Deformation in Unsupervised Image-to-Image TranslationAaron Gokaslan*, Brown University; Vivek Ramanujan, Brown University; Daniel Ritchie, Brown University; Kwang In Kim, University of Bath; James Tompkin, Brown University
P-3B-16K-convexity shape priors for segmentationHossam Isack*, UWO; Lena Gorelick, University of Western Ontario; Karin nG, University of Western Ontario; Olga Veksler, University of Western Ontario; Yuri Boykov, University of Waterloo
P-3B-17Visual Question Generation for Class Acquisition of Unknown ObjectsKohei Uehara*, The University of Tokyo; Antonio Tejero-de-Pablos, The University of Tokyo; Yoshitaka Ushiku, The University of Tokyo; Tatsuya Harada, The University of Tokyo
P-3B-18Sampling Algebraic Varieties for Robust Camera AutocalibrationDanda Pani Paudel*, ETH Zürich; Luc Van Gool, ETH Zurich
P-3B-19Hand Pose Estimation via Latent 2.5D Heatmap RegressionUmar Iqbal*, University of Bonn; Pavlo Molchanov, NVIDIA; Thomas Breuel, NVIDIA; Jürgen Gall, University of Bonn; Kautz Jan, NVIDIA
P-3B-20HairNet: Single-View Hair Reconstruction using Convolutional Neural NetworksYi Zhou*, University of Southern California; Liwen Hu, University of Southern California; Jun Xing, Institute for Creative Technologies, USC; Weikai Chen, USC Institute for Creative Technology; Han-Wei Kung, University of California, Santa Barbara; Xin Tong, Microsoft Research Asia; Hao Li, Pinscreen/University of Southern California/USC ICT
P-3B-21Super-Identity Convolutional Neural Network for Face HallucinationKaipeng Zhang*, National Taiwan University; ZHANPENG ZHANG, SenseTime Group Limited; Chia-Wen Cheng, UT Austin; Winston Hsu, National Taiwan University; Yu Qiao, Multimedia Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
P-3B-22Receptive Field Block Net for Accurate and Fast Object DetectionSongtao Liu, BUAA; Di Huang*, Beihang University, China; Yunhong Wang, State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China
P-3B-23Interpretable Intuitive Physics ModelTian Ye*, Carnegie Mellon University; Xiaolong Wang, CMU; James Davidson, Google; Abhinav Gupta, CMU
P-3B-24Variable Ring Light Imaging: Capturing Transient Subsurface Scattering with An Ordinary CameraKo Nishino*, Kyoto University; Art Subpa-asa, Tokyo Institute of Technology; Yuta Asano, Tokyo Institute of Technology; Mihoko Shimano, National Institute of Informatics; Imari Sato, National Institute of Informatics
P-3B-25Facial Dynamics Interpreter Network: What are the Important Relations between Local Dynamics for Facial Trait Estimation?Seong Tae Kim*, KAIST; Yong Man Ro, KAIST
P-3B-26Coloring with Words: Guiding Image Colorization Through Text-based Palette GenerationHyojin Bahng, Korea University; Seungjoo Yoo, Korea University; Wonwoong Cho, Korea University; David Park, Korea University; Ziming Wu, Hong Kong University of Science and Technology; Xiaojuan MA, Hong Kong University of Science and Technology; Jaegul Choo*, Korea University
P-3B-27Sparsely Aggregated Convolutional NetworksLigeng Zhu*, Simon Fraser University; Ruizhi Deng, Simon Fraser University; Michael Maire, Toyota Technological Institute at Chicago; Zhiwei Deng, Simon Fraser University; Greg Mori, Simon Fraser University; Ping Tan, Simon Fraser University
P-3B-28Deep Attention Neural Tensor Network for Visual Question AnsweringYalong Bai*, Harbin Institute of Technology; Jianlong Fu, Microsoft Research; Tao Mei, JD.com
P-3B-29Diverse feature visualizations reveal invariances in early layers of deep neural networksSantiago Cadena*, University of Tübingen; Marissa Weis, University of Tübingen; Leon A. Gatys, University of Tuebingen; Matthias Bethge, University of Tübingen; Alexander Ecker, University of Tübingen
P-3B-30Sidekick Policy Learning for Active Visual ExplorationSanthosh Kumar Ramakrishnan*, University of Texas at Austin; Kristen Grauman, University of Texas
P-3B-31DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural ArchitecturesJin-Dong Dong*, National Tsing-Hua University; An-Chieh Cheng, National Tsing-Hua University; Da-Cheng Juan, Google; Wei Wei, Google; Min Sun, NTHU
P-3B-32Pixel2Mesh: Generating 3D Mesh Models from Single RGB ImagesNanyang Wang, Fudan University; Yinda Zhang*, Princeton University; Zhuwen Li, Intel Labs; Yanwei Fu, Fudan Univ.; Wei Liu, Tencent AI Lab; Yu-Gang Jiang, Fudan University
P-3B-33End-to-End Incremental LearningFrancisco M. Castro*, University of Málaga; Manuel J. Marín-Jiménez, University of Córdoba; Nicolás Guil, University of Málaga; Cordelia Schmid, INRIA; Karteek Alahari, Inria
P-3B-34CAR-Net: Clairvoyant Attentive Recurrent NetworkAmir Sadeghian*, Stanford; Maxime Voisin, Stanford University; Ferdinand Legros, Stanford University; Ricky Vesel, Race Optimal; Alexandre Alahi, EPFL; Silvio Savarese, Stanford University
P-3B-35Learning Data Terms for Image DeblurringJiangxin Dong*, Dalian University of Technology; Jinshan Pan, Dalian University of Technology; Deqing Sun, NVIDIA; Zhixun Su, Dalian University of Technology; Ming-Hsuan Yang, University of California at Merced
P-3B-36Image Inpainting for Irregular Holes Using Partial ConvolutionsGuilin Liu*, NVIDIA; Fitsum Reda, NVIDIA; Kevin Shih, NVIDIA; Ting-Chun Wang, NVIDIA; Andrew Tao, NVIDIA; Bryan Catanzaro, NVIDIA
P-3B-37SRDA: Generating Instance Segmentation Annotation Via Scanning, Reasoning And Domain AdaptionWenqiang Xu, Shanghai Jiaotong University; Yonglu Li, Shanghai Jiao Tong University; Jun Lv, SJTU; Cewu Lu*, Shanghai Jiao Tong Univercity
P-3B-38Learning Priors for Semantic 3D ReconstructionIan Cherabier*, ETH Zurich; Johannes Schoenberger, ETH Zurich; Martin R. Oswald, ETH Zurich; Marc Pollefeys, ETH Zurich; Andreas Geiger, MPI-IS and University of Tuebingen
P-3B-39Integrating Egocentric Videos in Top-view Surveillance Videos: Joint Identification and Temporal AlignmentShervin Ardeshir*, University of Central Florida; Ali Borji, University of Central Florida
P-3B-40Deep Boosting for Image DenoisingChang Chen, University of Science and Technology of China; Zhiwei Xiong*, University of Science and Technology of China; Xinmei Tian, USTC; Feng Wu, University of Science and Technology of China
P-3B-41Descending, lifting or smoothing: Secrets of robust cost optimizationChristopher Zach*, Toshiba Research; Guillaume Bourmaud, University of Bordeaux
P-3B-42MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual NetworkMuhammed Kocabas*, Middle East Technical University; Salih Karagoz, Middle East Technical University; Emre Akbas, Middle East Technical University
P-3B-43TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object DetectionYunchao Wei*, UIUC; Zhiqiang Shen, UIUC; Honghui Shi, UIUC; Bowen Cheng, UIUC; Jinjun Xiong, IBM Thomas J. Watson Research Center; Jiashi Feng, NUS; Thomas Huang, UIUC
P-3B-44End-to-End Deep Structured Models for Drawing CrosswalksJustin Liang*, Uber ATG; Raquel Urtasun, Uber ATG
P-3B-45Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation SearchYinlong Liu, Fudan University; Wang Chen*, Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Digital Medical Research Center, Fudan University; Zhijian Song, Fudan University; Manning Wang, Fudan University
P-3B-46Large Scale Urban Scene Modeling from MVS MeshesLingjie Zhu, University of Chinese Academy of Sciences; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Shuhan Shen*, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Zhanyi Hu, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
P-3B-47Sub-GAN: An Unsupervised Generative Model via SubspacesJie Liang, Nankai University; Jufeng Yang*, Nankai University ; Hsin-Ying Lee, University of California, Merced; Kai Wang, Nankai University; Ming-Hsuan Yang, University of California at Merced
P-3B-48Pseudo Pyramid Deeper Bidirectional ConvLSTM for Video Saliency DetectionHongmei Song, Beijing Institute of Technology; Sanyuan Zhao*, Beijing Institute of Technology ; Jianbing Shen, Beijing Institute of Technology; Kin-Man Lam, The Hong Kong Polytechnic University
P-3B-49Practical Black-box Attacks on Deep Neural Networks using Efficient Query MechanismsArjun Nitin Bhagoji*, Princeton University; Warren he, University of California, Berkeley; Bo Li, University of Illinois at Urbana–Champaign; Dawn Song, UC Berkeley
P-3B-50Learning 3D Shape Priors for Shape Completion and ReconstructionJiajun Wu*, MIT; Chengkai Zhang, MIT; Xiuming Zhang, MIT; Zhoutong Zhang, MIT; Joshua Tenenbaum, MIT; Bill Freeman, MIT
P-3B-51Comparator NetworksWeidi Xie*, University of Oxford; Li Shen, University of Oxford; Andrew Zisserman, University of Oxford
P-3B-52Improving Fine-Grained Visual Classification using Pairwise ConfusionAbhimanyu Dubey*, Massachusetts Institute of Technology; Otkrist Gupta, MIT; Pei Guo, Brigham Young University; Ryan Farrell, Brigham Young University; Ramesh Raskar, Massachusetts Institute of Technology; Nikhil Naik, MIT
P-3B-53Visual-Inertial Object Detection and MappingXiaohan Fei*, UCLA; Stefano Soatto, UCLA
P-3B-54Learning Region Features for Object DetectionJiayuan Gu, Peking University; Han Hu, Microsoft Research Asia; Liwei Wang, Peking University; Yichen Wei, MSR Asia; Jifeng Dai*, Microsoft Research Asia
P-3B-55Efficient Dense Point Cloud Object Reconstruction using Deformation Vector FieldsKejie Li*, University of Adelaide; Trung Pham, NVIDIA; Huangying Zhan, The University of Adelaide; Ian Reid, University of Adelaide, Australia
P-3B-56Evaluating Capability of Deep Neural Networks for Image Classification via Information PlaneHao Cheng*, Shanghaitech University; Dongze Lian, Shanghaitech University; Shenghua Gao, Shanghaitech University; Yanlin Geng, Shanghaitech University
P-3B-57Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship FeaturesXU YANG*, NTU; Hanwang Zhang, Nanyang Technological University; Jianfei Cai, Nanyang Technological University
P-3B-58Zero-Shot Deep Domain AdaptationKuan-Chuan Peng*, siemens corporation; Ziyan Wu, Siemens Corporation; Jan Ernst, Siemens Corporation
P-3B-59Deep Imbalanced Attribute Classification using Visual Attention AggregationNikolaos Sarafianos*, University of Houston; Xiang Xu, University of Houston; Ioannis Kakadiaris, University of Houston
P-3B-60Video Object Segmentation by Learning Location-Sensitive EmbeddingsHai Ci, Peking University; Chunyu Wang*, Microsoft Research asia; Yizhou Wang, PKU
P-3B-61Deep Multi-Task Learning to Recognise Subtle Facial Expressions of Mental StatesGuosheng Hu*, AnyVision; Li Liu, the inception institute of artificial intelligence; Yang Yuan, AnyVision; Zehao Yu, Xiamen University; Yang Hua, Queen's University Belfast; Zhihong Zhang, Xiamen University; Fumin Shen, UESTC; Ling Shao, Inception Institute of Artificial Intelligence; Timothy Hospedales, Edinburgh University; Neil Robertson, Queen's University Belfast; Yongxin Yang, University of Edinburgh
P-3B-62Where Will They Go? Predicting Fine-Grained Adversarial Multi-Agent Motion using Conditional Variational AutoencodersPanna Felsen*, University of California Berkeley; Patrick Lucey, STATS; Sujoy Ganguly, STATS
P-3B-63Video Summarization Using Fully Convolutional Sequence NetworksMrigank Rochan*, University of Manitoba; Linwei Ye, University of Manitoba; Yang Wang, University of Manitoba
P-3B-64Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density NetworkQi Ye*, Imperial College London; Tae-Kyun Kim, Imperial College London
P-3B-65Learning with Biased Complementary LabelsXiyu Yu*, The University of Sydney; Tongliang Liu, The University of Sydney; Mingming Gong, University of Pittsburgh; Dacheng Tao, University of Sydney
P-3B-66ConceptMask: Large-Scale Segmentation from Semantic ConceptsYufei Wang*, Facebook; Zhe Lin, Adobe Research; Xiaohui Shen, Adobe Research; Scott Cohen, Adobe Research; Jianming Zhang, Adobe Research
P-3B-67Conditional Image-Text Embedding NetworksBryan Plummer*, Boston University; Paige Kordas, University of Illinois at Urbana Champaign; Hadi Kiapour, eBay; Shuai Zheng, eBay; Robinson Piramuthu, eBay Inc.; Svetlana Lazebnik, UIUC
P-3B-68Geolocation Estimation of Photos using a Hierarchical Model and Scene ClassificationEric Müller-Budack*, Leibniz Information Centre of Science and Technology (TIB); Kader Pustu-Iren, Leibniz Information Center of Science and Technology (TIB); Ralph Ewerth, Leibniz Information Center of Science and Technology (TIB)
P-3B-69Lifting Layers: Analysis and ApplicationsMichael Moeller*, University of Siegen; Peter Ochs, Saarland University; Tim Meinhardt, Technical University of Munich; Laura Leal-Taixé, TUM
P-3B-70Progressive Neural Architecture SearchChenxi Liu*, Johns Hopkins University; Maxim Neumann, Google; Barret Zoph, Google; Jon Shlens, Google; Wei Hua, Google; Li-Jia Li, Google; Li Fei-Fei, Stanford University; Alan Yuille, Johns Hopkins University; Jonathan Huang, Google; Kevin Murphy, Google
P-3B-71Learning Deep Representations with Probabilistic Knowledge TransferNikolaos Passalis*, Aristotle University of Thessaloniki; Anastasios Tefas, Aristotle University of Thessaloniki
P-3B-72Robust fitting in computer vision: easy or hard?Tat-Jun Chin*, University of Adelaide; Zhipeng Cai, The University of Adelaide; Frank Neumann, The University of Adelaide, School of Computer Science, Faculty of Engineering, Computer and Mathematical Science
P-3B-73Dual-Agent Deep Reinforcement Learning for Deformable Face TrackingMinghao Guo, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China

Poster session 3C
3CWednesday, September 12Poster session 05:15 PM - 06:45 PM
P-3C-01Zero-Shot Object DetectionAnkan Bansal*, University of Maryland; Karan Sikka, SRI International; Gaurav Sharma, NEC Labs America; Rama Chellappa, University of Maryland; Ajay Divakaran, SRI, USA
P-3C-02ForestHash: Semantic Hashing With Shallow Random Forests and Tiny Convolutional NetworksQiang Qiu*, Duke University; Jose Lezama, Universidad de la Republica, Uruguay; Alex Bronstein, Tel Aviv University, Israel; Guillermo Sapiro, Duke University
P-3C-03ML-LocNet: Improving Object Localization with Multi-view Learning NetworkXiaopeng Zhang*, National University of Singapore; Jiashi Feng, NUS
P-3C-04MPLP++: Fast, Parallel Dual Block-Coordinate Ascent for Dense Graphical ModelsSiddharth Tourani*, Visual Learning Lab, HCI, Uni-Heidelberg; Alexander Shekhovtsov, Czech Technical University in Prague, Czech Republic; Carsten Rother, University of Heidelberg; Bogdan Savchynskyy, Heidelberg University
P-3C-05A Zero-Shot Framework for Sketch based Image RetrievalSasikiran Yelamarthi , IIT Madras; Shiva Krishna Reddy M, Indian Institute of Technology Madras; Ashish Mishra*, IIT Madras; Anurag Mittal, Indian Institute of Technology Madras
P-3C-06In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person VisionYin Li*, CMU; Miao Liu, Georgia Tech; James Rehg, Georgia Institute of Technology
P-3C-07SAN: Learning Relationship between Convolutional Features for Multi-Scale Object DetectionYongHyun Kim*, POSTECH
P-3C-08A Systematic DNN Weight Pruning Framework using Alternating Direction Method of MultipliersTianyun Zhang*, Syracuse University; Shaokai Ye, Syracuse University; Kaiqi Zhang, Syracuse University; Yanzhi Wang, Syracuse University; Makan Fardad, Syracuse Universtiy; Wujie Wen, Florida International University
P-3C-09Iterative Crowd CountingViresh Ranjan*, Stony Brook University; Hieu Le, Stony Brook University; Minh Hoai Nguyen, Stony Brook University
P-3C-10A Dataset for Lane Instance Segmentation in Urban EnvironmentsBrook Roberts, Five AI Ltd.; Sebastian Kaltwang*, Five AI Ltd.; Sina Samangooei, Five AI Ltd.; Mark Pender-Bare, Five AI Ltd.; Konstantinos Tertikas, Five AI Ltd.; John Redford, Five AI Ltd.
P-3C-11Out-of-Distribution Detection Using an Ensemble of Self Supervised Leave-out ClassifiersNataraj Jammalamadaka*, Intel Labs;
Xia Zhu, Intel Labs;
Dipankar Das, Intel Labs;
Bharat Kaul, Intel Labs;
Theodore Willke, Intel Labs
P-3C-12Penalizing Top Performers: Conservative Loss for Semantic Segmentation AdaptationXinge Zhu*, Sensetime Group Limited; Hui Zhou, Sensetime Group Limited.; Ceyuan Yang, SenseTime Group Limited; Jianping Shi, Sensetime Group Limited; Dahua Lin, The Chinese University of Hong Kong
P-3C-13Compound Memory Networks for Few-shot Video ClassificationLinchao Zhu*, University of Technology, Sydney; Yi Yang, UTS
P-3C-14Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question AnsweringMedhini Narasimhan*, University of Illinois at Urbana-Champaign ; Alexander Schwing, UIUC
P-3C-15Interpretable Basis Decomposition for Visual ExplanationAntonio Torralba, MIT; Bolei Zhou*, MIT; David Bau, MIT; Yiyou Sun, Harvard
P-3C-16How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video SummarizationYandong Li*, University of Central Florida; Boqing Gong, Tencent AI Lab; Tianbao Yang, University of Iowa; Liqiang Wang, University of Central Florida
P-3C-17Dividing and Aggregating Network for Multi-view Action RecognitionDongang Wang*, The University of Sydney; Wanli Ouyang, CUHK; Wen Li, ETHZ; Dong Xu, University of Sydney
P-3C-18Shape Reconstruction Using Volume Sweeping and Learned PhotoconsistencyVincent Leroy*, INRIA Grenoble Rhône-Alpes; Edmond Boyer, Inria; Jean-Sebastien Franco, INRIA
P-3C-19RT-GENE: Real-Time Eye Gaze Estimation in Natural EnvironmentsTobias Fischer*, Imperial College London; Hyung Jin Chang, University of Birmingham; Yiannis Demiris, Imperial College London
P-3C-20Pairwise Body-Part Attention for Recognizing Human-Object InteractionsHaoshu Fang, SJTU; Jinkun Cao, Shanghai Jiao Tong University; Yu-Wing Tai, Tencent YouTu; Cewu Lu*, Shanghai Jiao Tong Univercity
P-3C-21Motion Feature Network: Fixed Motion Filter for Action RecognitionMyunggi Lee, Seoul National University; Seung Eui Lee, Seoul National University; Sung Joon Son, Seoul National University; Gyutae Park, Seoul National University; Nojun Kwak*, Seoul National University
P-3C-22Reverse Attention for Salient Object DetectionShuhan Chen*, Yangzhou University; Xiuli Tan, Yangzhou University; Ben Wang, Yangzhou University; Xuelong Hu, Yangzhou University
P-3C-23Dynamic Sampling Convolutional Neural NetworksJialin Wu*, UT Austin; Dai Li, Tsinghua University; Yu Yang, Tsinghua University; Chandrajit Bajaj, University of Texas, Austin; Xiangyang Ji, Tsinghua University
P-3C-24DDRNet: Depth Map Denoising and Refinement for Consumer Depth Cameras Using Cascaded CNNsShi Yan, Tsinghua University; Chenglei Wu, Oculus Research; Lizheng Wang, Tsinghua University; Liang An, Tsinghua University; Feng Xu, Tsinghua University; Kaiwen Guo, Google Inc.; Yebin Liu*, Tsinghua University
P-3C-25Stereo Computation for a Single Mixture ImageYiran Zhong, Australian National University; Yuchao Dai*, Northwestern Polytechnical University; HONGDONG LI, Australian National University, Australia
P-3C-26Volumetric performance capture from minimal camera viewpointsAndrew Gilbert*, University of Surrey; Marco Volino, University of Surrey; John Collomosse, Adobe Research; Adrian Hilton, University of Surrey
P-3C-27Liquid Pouring Monitoring via Rich Sensory InputsTz-Ying Wu*, National Tsing Hua University; Juan-Ting Lin, National Tsing Hua University; Tsun-Hsuang Wang, National Tsing Hua University; Chan-Wei Hu, National Tsing Hua University; Juan Carlos Niebles, Stanford University; Min Sun, NTHU
P-3C-28Move Forward and Tell: A Progressive Generator of Video DescriptionsYilei Xiong*, The Chinese University of Hong Kong; Bo Dai, the Chinese University of Hong Kong; Dahua Lin, The Chinese University of Hong Kong
P-3C-29DYAN: A Dynamical Atoms-Based Network for Video PredictionWenqian Liu*, Northeastern University; Abhishek Sharma, Northeastern University ; Octavia Camps, Northeastern University; Mario Sznaier, Northeastern University
P-3C-30Deep Structure Inference Network for Facial Action Unit RecognitionCiprian Corneanu*, Universitat de Barcelona; Meysam Madadi, CVC; Sergio Escalera, Computer Vision Center (UAB) & University of Barcelona,
P-3C-31Physical Primitive DecompositionZhijian Liu, Shanghai Jiao Tong University; Jiajun Wu*, MIT; Bill Freeman, MIT; Joshua Tenenbaum, MIT
P-3C-32Boosted Attention: Leveraging Human Attention for Image CaptioningShi Chen*, University of Minnesota; Qi Zhao, University of Minnesota
P-3C-33Is Robustness the Cost of Accuracy? -- Lessons Learned from 18 Deep Image ClassifiersDong Su*, IBM Research T.J. Watson Center; Huan Zhang, UC Davis; Hongge Chen, MIT; Jinfeng Yi, JD AI Research; Pin-Yu Chen, IBM Research; Yupeng Gao, IBM Research AI
P-3C-34Dynamic Multimodal Instance Segmentation guided by natural language queriesEdgar Margffoy-Tuay*, Universidad de los Andes; Emilio Botero, Universidad de los Andes; Juan Pérez, Universidad de los Andes; PABLO ARBELÁEZ, Universidad de los Andes
P-3C-35Hierarchy of Alternating Specialists for Scene RecognitionHyo Jin Kim*, University of North Carolina at Chapel Hill; Jan-Michael Frahm, UNC-Chapel Hill
P-3C-36SwapNet: Garment Transfer in Single View ImagesAmit Raj*, Georgia Institute of Technology; Patsorn Sangkloy, Georgia Institute of Technology; Huiwen Chang, Princeton University; Jingwan Lu, Adobe Research ; Duygu Ceylan, Adobe Research; James Hays, Georgia Institute of Technology, USA
P-3C-37What do I Annotate Next? An Empirical Study of Active Learning for Action LocalizationFabian Caba*, KAUST; Joon-Young Lee, Adobe Research; Hailin Jin, Adobe Research; Bernard Ghanem, KAUST
P-3C-38Combining 3D Model Contour Energy and Keypoints for Object TrackingBogdan Bugaev*, Saint Petersburg Academic University; Anton Kryshchenko, Saint Petersburg Academic University; Roman Belov, KeenTools
P-3C-39AGIL: Learning Attention from Human for Visuomotor TasksRuohan Zhang*, University of Texas at Austin; Zhuode Liu, Google Inc.; Luxin Zhang, Peking University; Jake Whritner, University of Texas at Austin; Karl Muller, University of Texas at Austin; Mary Hayhoe, University of Texas at Austin; Dana Ballard, University of Texas at Austin
P-3C-40PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding ModelGeorge Papandreou*, Google; Tyler Zhu, Google; Liang-Chieh Chen, Google Inc.; Spyros Gidaris, Ecole des Ponts ParisTech; Jonathan Tompson, Google; Kevin Murphy, Google
P-3C-41Accelerating Dynamic Programs via Nested Benders Decomposition with Application to Multi-Person Pose EstimationShaofei Wang*, Baidu Inc.; Alexander Ihler, UC Irvine; Konrad Kording, Northwestern; Julian Yarkony, Experian Data Lab
P-3C-42Separating Reflection and Transmission Images in the WildPatrick Wieschollek*, University of Tuebingen; Orazio Gallo, NVIDIA Research; Jinwei Gu, Nvidia; Kautz Jan, NVIDIA
P-3C-43Point-to-Point Regression PointNet for 3D Hand Pose EstimationLiuhao Ge*, NTU; Zhou Ren, Snap Research, USA, ; Junsong Yuan, State University of New York at Buffalo, USA
P-3C-44Summarizing First-Person Videos from Third Persons' Points of ViewHSUAN-I HO*, National Taiwan University; Wei-Chen Chiu, National Chiao Tung University; Yu-Chiang Frank Wang, National Taiwan University
P-3C-45Learning Category-Specific Mesh Reconstruction from Image CollectionsAngjoo Kanazawa*, UC Berkeley; Shubham Tulsiani, UC Berkeley; Alexei Efros, UC Berkeley; Jitendra Malik, University of California at Berkley
P-3C-46StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth PredictionSameh Khamis*, Google; Sean Fanello, Google; Christoph Rhemann, Google; Julien Valentin, Google; Adarsh Kowdle, Google; Shahram Izadi, Google
P-3C-47Visual Question Answering as a Meta Learning TaskDamien Teney*, The Unversity of Adelaide; Anton van den Hengel, The University of Adelaide
P-3C-48SRFeat: Single Image Super Resolution with Feature DiscriminationSeong-Jin Park*, POSTECH; Hyeongseok Son, POSTECH; Sunghyun Cho, DGIST; Ki-Sang Hong, POSTECH; Seungyong Lee, POSTECH
P-3C-49Deep Factorised Inverse-SketchingKaiyue Pang*, Queen Mary University of London; Da Li, QMUL; Jifei Song, Queen Mary, University of London; Yi-Zhe Song, Queen Mary University of London; Tao Xiang, Queen Mary, University of London, UK; Timothy Hospedales, Edinburgh University
P-3C-50Multimodal image alignment through a multiscale chain of neural networks with application to remote sensingArmand Zampieri, Inria Sophia-Antipolis; Guillaume Charpiat, INRIA; Nicolas Girard, Inria Sophia-Antipolis; Yuliya Tarabalka*, Inria Sophia-Antipolis
P-3C-51Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language AssociationDapeng Chen*, The Chinese University of HongKong; Hongsheng Li, Chinese University of Hong Kong; Xihui Liu, The Chinese University of Hong Kong; Jing Shao, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
P-3C-52Robust Optical Flow Estimation in Rainy ScenesRuoteng Li*, National University of Singapore; Robby Tan, Yale-NUS College, Singapore; Loong Fah Cheong, NUS
P-3C-53Image Generation from Sketch Constraint Using Contextual GANYongyi Lu*, HKUST; Shangzhe Wu, HKUST; Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
P-3C-54Accurate Scene Text Detection through Border Semantics Awareness and BootstrappingChuhui Xue, Nanyang Technological University; Shijian Lu*, Nanyang Technological University; Fangneng Zhan, Nanyang Technological University
P-3C-55CNN-PS: CNN-based Photometric Stereo for General Non-Convex SurfacesSatoshi Ikehata*, National Institute of Informatics
P-3C-56Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose EstimationMarkus Oberweger*, TU Graz; Mahdi Rad, TU Graz; Vincent Lepetit, TU Graz
P-3C-57Recognition in Terra IncognitaSara Beery*, Caltech; Grant van Horn, Caltech; Pietro Perona, Caltech
P-3C-58Super-Resolution and Sparse View CT ReconstructionGuangming Zang, KAUST; Ramzi Idoughi, KAUST; Mohamed Aly, KAUST; Peter Wonka, KAUST; Wolfgang Heidrich*, KAUST
P-3C-59Modeling Visual Context is Key to Augmenting Object Detection DatasetsNIKITA DVORNIK*, INRIA; Julien Mairal, INRIA; Cordelia Schmid, INRIA
P-3C-60Occlusions, Motion and Depth Boundaries with a Generic Network for Optical Flow, Disparity, or Scene Flow Estimation Eddy Ilg*, University of Freiburg; Tonmoy Saikia, University of Freiburg; Margret Keuper, University of Mannheim; Thomas Brox, University of Freiburg
P-3C-61Unsupervised Domain Adaptation for 3D Keypoint Estimation via View ConsistencyXingyi Zhou, The University of Texas at Austin; Arjun Karpur, The University of Texas at Austin; Chuang Gan, MIT; Linjie Luo, Snap Inc; Qixing Huang*, The University of Texas at Austin
P-3C-62Improving DNN Robustness to Adversarial Attacks using Jacobian RegularizationDaniel Jakubovitz*, Tel Aviv University; Raja Giryes, Tel Aviv University
P-3C-63A Framework for Evaluating 6-DOF Object TrackersMathieu Garon, Université Laval; Denis Laurendeau, Laval University; Jean-Francois Lalonde*, Université Laval
P-3C-64Self-Supervised Relative Depth Learning for Urban Scene UnderstandingHuaizu Jiang*, UMass Amherst; Erik Learned-Miller, University of Massachusetts, Amherst; Gustav Larsson, University of Chicago; Michael Maire, Toyota Technological Institute at Chicago; Greg Shakhnarovich, Toyota Technological Institute at Chicago
P-3C-65Actor-centric Relation Network Chen Sun*, Google; Abhinav Shrivastava, UMD / Google; Carl Vondrick, MIT; Kevin Murphy, Google; Rahul Sukthankar, Google; Cordelia Schmid, Google
P-3C-66Self-produced Guidance for Weakly-supervised Object LocalizationXiaolin Zhang*, University of Technology Sydney; Yunchao Wei, UIUC; Guoliang Kang, UTS; Yi Yang, UTS; Thomas Huang, UIUC
P-3C-67Attribute-Guided Face Generation Using Conditional CycleGANYongyi Lu*, HKUST; Yu-Wing Tai, Tencent YouTu; Chi-Keung Tang, Hong Kong University of Science and Technology
P-3C-68Neural Network EncapsulationHongyang Li*, Chinese University of Hong Kong; Bo Dai, the Chinese University of Hong Kong; Wanli Ouyang, CUHK; Xiaoyang Guo, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
P-3C-69Deep Regionlets for Object DetectionHongyu Xu*, University of Maryland; Xutao Lv, Intellifusion; Xiaoyu Wang, -; Zhou Ren, Snap Inc.; Navaneeth Bodla, University of Maryland; Rama Chellappa, University of Maryland
P-3C-70Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation MaximizationGuoliang Kang*, UTS; Liang Zheng, Singapore University of Technology and Design; Yan Yan, UTS; Yi Yang, UTS
P-3C-71Fighting Fake News: Image Splice Detection via Learned Self-ConsistencyJacob Huh*, Carnegie Mellon University; Andrew Liu, University of California, Berkeley; Andrew Owens, UC Berkeley; Alexei Efros, UC Berkeley
P-3C-72Learning Monocular Depth by Distilling Cross-domain Stereo NetworksXiaoyang Guo*, The Chinese University of Hong Kong; Hongsheng Li, Chinese University of Hong Kong; Shuai Yi, The Chinese University of Hong Kong; Jimmy Ren, Sensetime Research; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
P-3C-73 Riemannian Walk for Incremental Learning: Understanding Forgetting and IntransigenceArslan Chaudhry*, University of Oxford; Puneet Dokania, University of Oxford; Thalaiyasingam Ajanthan, University of Oxford; Philip Torr, University of Oxford
P-3C-74Weakly Supervised Region Proposal Network and Object DetectionPeng Tang*, Huazhong University of Science and Technology; Xinggang Wang, Huazhong Univ. of Science and Technology; Angtian Wang, Huazhong University of Science and Technology ; Yongluan Yan, Huazhong University of Science and Technology ; Wenyu Liu, Huazhong University of Science and Technology; Junzhou Huang, Tencent AI Lab; Alan Yuille, Johns Hopkins University

Poster session 4A
4AThursday, September 13Poster session 10:00 AM - 12:00 PM
P-4A-01Viewpoint Estimation - Insights & ModelGilad Divon, Technion; Ayellet Tal*, Technion
P-4A-02Towards Realistic PredictorsPei Wang*, UC San Diego; Nuno Vasconcelos, UC San Diego
P-4A-03Group NormalizationYuxin Wu, Facebook; Kaiming He*, Facebook Inc., USA
P-4A-04Deep Expander Networks: Efficient Deep Networks from Graph TheoryAmeya Prabhu*, IIIT Hyderabad; Girish Varma, IIIT Hyderabad; Anoop Namboodiri, IIIT Hyderbad
P-4A-05Learning SO(3) Equivariant Representations with Spherical CNNsCarlos Esteves*, University of Pennsylvania; Kostas Daniilidis, University of Pennsylvania; Ameesh Makadia, Google Research; Christine Allec-Blanchette, University of Pennsylvania
P-4A-06Video Re-localization via Cross Gated Bilinear MatchingYang Feng*, University of Rochester; Lin Ma, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab; Jiebo Luo, U. Rochester
P-4A-07A Deeply-initialized Coarse-to-fine Ensemble of Regression Trees for Face AlignmentRoberto Valle*, Universidad Politécnica de Madrid; José Buenaposada, Universidad Rey Juan Carlos; Antonio Valdés, Universidad Complutense de Madrid; Luis Baumela, Universidad Politecnica de Madrid
P-4A-08Deep Kalman Filtering Network for Video Compression Artifact ReductionGuo Lu*, Shanghai Jiao Tong University; Wanli Ouyang, CUHK; Dong Xu, University of Sydney; Xiaoyun Zhang, Shanghai Jiao Tong University; Zhiyong Gao, Shanghai Jiao Tong University; Ming Ting Sun, -
P-4A-09Exploring Visual Relationship for Image CaptioningTing Yao*, Microsoft Research; Yingwei Pan, University of Science and Technology of China; Yehao Li, Sun Yat-Sen University; Tao Mei, JD.com
P-4A-10Sequential Clique Optimization for Video Object SegmentationYeong Jun Koh*, Korea University; Young-Yoon Lee, Samsung; Chang-Su Kim, Korea university
P-4A-11Spatial Pyramid Calibration for Image ClassificationYan Wang, Shanghai Jiao Tong University; Lingxi Xie*, JHU; Siyuan Qiao, Johns Hopkins University; Ya Zhang, Cooperative Medianet Innovation Center, Shang hai Jiao Tong University; Wenjun Zhang, Shanghai Jiao Tong University; Alan Yuille, Johns Hopkins University
P-4A-12Visual Text CorrectionAmir Mazaheri*, University of Central Florida; Mubarak Shah, University of Central Florida
P-4A-13X-ray Computed Tomography Through ScatterAdam Geva*, Technion; Yoav Y. Schechner, Technion; Jonathan Chernyak, Technion; Rajiv Gupta, MGH Harvard
P-4A-14Graph Distillation for Action Detection with Privileged Information in RGB-D VideosZelun Luo*, Stanford University; Lu Jiang, Google; Jun-Ting Hsieh, Stanford University; Juan Carlos Niebles, Stanford University; Li Fei-Fei, Stanford University
P-4A-15Modular Generative Adversarial NetworksBo Zhao*, University of British Columbia; Bo Chang, University of British Columbia; Zequn Jie, Tencent AI Lab; Leonid Sigal, University of British Columbia
P-4A-16R2P2: A ReparameteRized Pushforward Policy for Diverse, Precise Generative Path ForecastingNicholas Rhinehart*, CMU; Kris Kitani, CMU; Paul Vernaza, NEC Labs America
P-4A-17DFT-based Transformation Invariant Pooling Layer for Visual ClassificationJongbin Ryu*, Hanyang University; Ming-Hsuan Yang, University of California at Merced; Jongwoo Lim, Hanyang University
P-4A-18X2Face: A network for controlling face generation by using images, audio, and pose codesOlivia Wiles*, University of Oxford; A Koepke, University of Oxford; Andrew Zisserman, University of Oxford
P-4A-19Compositional Learning of Human Object InteractionsKeizo Kato, CMU; Yin Li*, CMU; Abhinav Gupta, CMU
P-4A-20Learning to Navigate for Fine-grained ClassificationZe Yang*, Peking University; Tiange Luo, Peking University; Dong Wang, Peking University; Zhiqiang Hu, Peking University; Jun Gao, Peking University; Liwei Wang, Peking University
P-4A-21Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T TrackingChenglong Li, Anhui University; Chengli Zhu, Anhui University; Yan Huang, Institute of Automation, Chinese Academy of Sciences; Jin Tang, Anhui University; Liang Wang*, NLPR, China
P-4A-22Light-weight CNN Architecture Design for Fast InferenceNingning Ma*, Tsinghua; Xiangyu Zhang, Megvii Inc; Hai-Tao Zheng, Tsinghua University; Jian Sun, Megvii, Face++
P-4A-23Fully Motion-Aware Network for Video Object DetectionShiyao Wang*, Tsinghua University; Yucong Zhou, Beihang University; Junjie Yan, Sensetime Group Limited
P-4A-24Shift-Net: Image Inpainting via Deep Feature RearrangementZhaoyi Yan, Harbin Institute of Technology; Xiaoming Li, Harbin Institute of Technology; Mu LI, The Hong Kong Polytechnic University; Wangmeng Zuo*, Harbin Institute of Technology, China; Shiguang Shan, Chinese Academy of Sciences
P-4A-25Choose Your Neuron: Incorporating Domain Knowledge through Neuron ImportanceRamprasaath Ramasamy Selvaraju*, Virginia Tech; Prithvijit Chattopadhyay, Georgia Institute of Technology; Mohamed Elhoseiny, Facebook; Tilak Sharma, Facebook; Dhruv Batra, Georgia Tech & Facebook AI Research; Devi Parikh, Georgia Tech & Facebook AI Research; Stefan Lee, Georgia Institute of Technology
P-4A-26Joint 3D tracking of a deformable object in interaction with a handAggeliki Tsoli*, FORTH; Antonis Argyros, CSD-UOC and ICS-FORTH
P-4A-27Interpolating Convolutional Neural Networks Using Batch NormalizationGratianus Wesley Putra Data*, University of Oxford; Kirjon Ngu, University of Oxford; David Murray, University of Oxford; Victor Prisacariu, University of Oxford
P-4A-28Learning Warped Guidance for Blind Face RestorationXiaoming Li, Harbin Institute of Technology; Ming Liu, Harbin Institute of Technology; Yuting Ye, Harbin Institute of Technology; Wangmeng Zuo*, Harbin Institute of Technology, China; Liang Lin, Sun Yat-sen University; Ruigang Yang, University of Kentucky, USA
P-4A-29Separable Cross-Domain TranslationYedid Hoshen*, Facebook AI Research (FAIR); Lior Wolf, Tel Aviv University, Israel
P-4A-30Task-driven Webpage SaliencyQuanlong Zheng*, City University of HongKong; Jianbo Jiao, City University of Hong Kong; Ying Cao, City University of Hong Kong; Rynson Lau, City University of Hong Kong
P-4A-31Appearance-Based Gaze Estimation via Evaluation-Guided Asymmetric RegressionYihua Cheng, Beihang University; Feng Lu*, U. Tokyo; Xucong Zhang, Max Planck Institute for Informatics and Saarland University
P-4A-32Pivot Correlational Neural Network for Multimodal Video CategorizationSunghun Kang*, KAIST; Junyeong Kim, KAIST; Hyunsoo Choi, SAMSUNG ELECTRONICS CO.,LTD; Sungjin Kim, SAMSUNG ELECTRONICS CO.,LTD; Chang D. Yoo, KAIST
P-4A-33Interactive Boundary Prediction for Object SelectionHoang Le, Portland State University; Long Mai*, Adobe Research; Brian Price, Adobe; Scott Cohen, Adobe Research; Hailin Jin, Adobe Research; Feng Liu, Portland State University
P-4A-34Scenes-Objects-Actions: A Multi-Task, Multi-Label Video DatasetHeng Wang*, Facebook Inc; Lorenzo Torresani, Dartmouth College; Matt Feiszli, Facebook Research; Manohar Paluri, Facebook; Du Tran, Facebook; Jamie Ray, Facebook Research; Yufei Wang, Facebook
P-4A-35Transferable Adversarial PerturbationsBruce Hou*, Tencent; Wen Zhou, Tencent
P-4A-36Incremental Non-Rigid Structure-from-Motion with Unknown Focal LengthThomas Probst, ETH Zurich; Danda Pani Paudel*, ETH Zürich; Ajad Chhatkuli , ETHZ; Luc Van Gool, ETH Zurich
P-4A-37Semantically Aware Urban 3D Reconstruction with Plane-Based RegularizationThomas Holzmann*, Graz University of Technology; Michael Maurer, Graz University of Technology; Friedrich Fraundorfer, Graz University of Technology; Horst Bischof, Graz University of Technology
P-4A-38Learning to Dodge A Bulletshi jin*, ShanghaiTech University; Jinwei Ye, Louisiana State University; Yu Ji, Plex-VR; RUIYANG LIU, ShanghaiTech University; Jingyi Yu, Shanghai Tech University
P-4A-39Training Binary Weight Networks via Semi-Binary DecompositionQinghao Hu*, Institute of Automation, Chinese Academy of Sciences; Gang Li, Institute of Automation, Chinese Academy of Sciences; Peisong Wang, Institute of Automation, Chinese Academy of Sciences; yifan zhang, Institute of Automation,Chinese Academy of Sciences; Jian Cheng, Chinese Academy of Sciences, China
P-4A-40Learnable PINs: Cross-Modal Embeddings for Person IdentitySamuel Albanie*, University of Oxford; Arsha Nagrani, Oxford University ; Andrew Zisserman, University of Oxford
P-4A-41Toward Characteristic-Preserving Image-based Virtual Try-On NetworkBochao Wang, Sun Yet-sen University; Huabin Zheng, Sun Yat-Sen University; Xiaodan Liang*, Carnegie Mellon University; Yimin Chen, sensetime; Liang Lin, Sun Yat-sen University
P-4A-42Deep Feature Factorization For Unsupervised Concept DiscoveryEdo Collins*, EPFL; Radhakrishna Achanta, EPFL; Sabine Süsstrunk, EPFL
P-4A-43SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial NetworkYongqiang Zhang*, Harbin institute of Technology/KAUST; Yancheng Bai, KAUST/ISCAS; Mingli Ding, Harbin institute of Technology; Bernard Ghanem, KAUST
P-4A-44Human Motion Analysis with Deep Metric LearningHUSEYIN COSKUN*, Technical University of Munich; David Joseph Tan, CAMP, TU Munich; Sailesh Conjeti, Technical University of Munich; Nassir Navab, TU Munich, Germany; Federico Tombari, Technical University of Munich, Germany
P-4A-45Dist-GAN: An Improved GAN using Distance ConstraintsNgoc-Trung Tran*, Singapore University of Technology and Design; Tuan Anh Bui, Singapore University of Technology and Design; Ngai-Man Cheung, Singapore University of Technology and Design
P-4A-46Cross-Modal and Hierarchical Modeling of Video and TextBowen Zhang*, University of Southern California; Hexiang Hu, University of Southern California; Fei Sha, USC
P-4A-47Deep Image Demosaicking using a Cascade of Convolutional Residual Denoising NetworksFilippos Kokkinos*, Skolkovo Institute of Science and Technology; Stamatis Lefkimmiatis, Skolkovo Institute of Science and Technology
P-4A-48Deep Clustering for Unsupervised Learning of Visual FeaturesMathilde Caron*, Facebook Artificial Intelligence Research; Piotr Bojanowski, Facebook; Armand Joulin, Facebook AI Research; Matthijs Douze, Facebook AI Research
P-4A-49Domain Adaptation through Synthesis for Unsupervised Person Re-identificationSlawomir Bak*, Argo AI; Jean-Francois Lalonde, Université Laval; Pete Carr, Argo AI
P-4A-50Facial Expression Recognition with Inconsistently Annotated DatasetsJiabei Zeng*, Institute of Computing Technology, Chinese Academy on Sciences; Shiguang Shan, Chinese Academy of Sciences; Chen Xilin, Institute of Computing Technology, Chinese Academy of Sciences
P-4A-51Single Shot Scene Text RetrievalLluis Gomez*, Universitat Autónoma de Barcelona; Andres Mafla, Computer Vision Center; Marçal Rossinyol, Universitat Autónoma de Barcelona; Dimosthenis Karatzas, Computer Vision Centre
P-4A-52DeepVS: A Deep Learning Based Video Saliency Prediction ApproachLai Jiang, BUAA; Mai Xu*, BUAA; Minglang Qiao, BUAA; Zulin Wang, BUAA
P-4A-53Generalizing A Person Retrieval Model Hetero- and HomogeneouslyZhun Zhong*, Xiamen University; Liang Zheng, Singapore University of Technology and Design; Shaozi Li, Xiamen University, China; Yi Yang, University of Technology, Sydney
P-4A-54A New Large Scale Dynamic Texture Dataset with Application to ConvNet UnderstandingIsma Hadji*, York University; Rick Wildes, York University
P-4A-55Deep Cross-modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-based 3D Shape RetrievalJiaxin Chen, New York University Abu Dhabi; Yi Fang*, New York University
P-4A-56BiSeNet: Bilateral Segmentation Network for Real-time Semantic SegmentationChangqian Yu*, Huazhong University of Science and Technology; Jingbo Wang, Peking University; Chao Peng, Megvii(Face++) Inc; Changxin Gao, Huazhong University of Science and Technology; Gang Yu, Face++; Nong Sang, School of Automation, Huazhong University of Science and Technology
P-4A-57Face De-spoofingYaojie Liu*, Michigan State University; Amin Jourabloo, Michigan State University; Xiaoming Liu, Michigan State University
P-4A-58Towards End-to-End License Plate Detection and Recognition: A Large Dataset and BaselineZhenbo Xu*, University of Science and Technology in China; Wei Yang, University of Science and Technology in China; Ajin Meng, University of Science and Technology in China; Nanxue Lu, University of Science and Technology in China; Huan Huang, Xingtai Financial Holdings Group Co., Ltd.
P-4A-59Self-supervised Tracking by ColorizationCarl Vondrick*, MIT; Abhinav Shrivastava, UMD / Google; Alireza Fathi, Google; Sergio Guadarrama, Google; Kevin Murphy, Google
P-4A-60Pose Proposal NetworksTaiki Sekii*, Konica Minolta, inc.
P-4A-61Incremental Multi-graph Matching via Diversity and Randomness based Graph ClusteringTianshu Yu*, Arizona State University; Junchi Yan, Shanghai Jiao Tong University; baoxin Li, Arizona State University; Wei Liu, Tencent AI Lab
P-4A-62Single Image Intrinsic Decomposition Without a Single Intrinsic ImageWei-Chiu Ma*, MIT; Hang Chu, University of Toronto; Bolei Zhou, MIT; Raquel Urtasun, University of Toronto; Antonio Torralba, MIT
P-4A-63Triplet Loss with Theoretical Analysis in Siamese Network for Real-Time Object TrackingXingping Dong, Beijing Institute of Technology; Jianbing Shen*, Beijing Institute of Technology
P-4A-64Learning to Learn Parameterized Image OperatorsQingnan Fan, Shandong University; Dongdong Chen*, university of science and technology of china; Lu Yuan, Microsoft Research Asia; Gang Hua, Microsoft Cloud and AI; Nenghai Yu, University of Science and Technology of China; Baoquan Chen, Shandong University
P-4A-65HBE: Hand Branch Ensemble network for real time 3D hand pose estimationYidan Zhou, Dalian University of Technology; Jian Lu, Laboratory of Advanced Design and Intelligent Computing, Dalian University; Kuo Du, Dalian University of Technology; Xiangbo Lin*, Dalian University of Technology; Yi Sun, Dalian University of Technology; Xiaohong Ma, Dalian University of Technology
P-4A-66Generative Semantic Manipulation with Mask-Contrasting GANXiaodan Liang*, Carnegie Mellon University
P-4A-67Learning to Fuse Proposals from Multiple Scanline Optimizations in Semi-Global MatchingJohannes Schoenberger*, ETH Zurich; Sudipta Sinha, Microsoft Research; Marc Pollefeys, ETH Zurich
P-4A-68Less is More: Picking Informative Frames for Video CaptioningYangyu Chen*, University of Chinese Academy of Sciences; Shuhui Wang, vipl,ict,Chinese academic of science; Weigang Zhang, Harbin Institute of Technology, Weihai; Qingming Huang, University of Chinese Academy of Sciences, China
P-4A-69Deep Pictorial Gaze EstimationSeonwook Park*, ETH Zurich; Adrian Spurr, ETH Zurich; Otmar Hilliges, ETH Zurich
P-4A-70SkipNet: Learning Dynamic Execution in Residual NetworksXin Wang*, UC Berkeley; Fisher Yu, UC Berkeley; Zi-Yi Dou, Nanjing University; Trevor Darrell, UC Berkeley; Joseph Gonzalez, UC Berkeley
P-4A-71Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesPengyuan Lyu*, Huazhong University of Science and Technology; Minghui Liao, Huazhong University of Science and Technology; Cong Yao, Megvii; Wenhao Wu, Megvii; Xiang Bai, Huazhong University of Science and Technology
P-4A-72Deep Adaptive Attention for Joint Facial Action Unit Detection and Face AlignmentZhiwen Shao*, Shanghai Jiao Tong University; Zhilei Liu, Tianjin University; Jianfei Cai, Nanyang Technological University; Lizhuang Ma, Shanghai Jiao Tong University
P-4A-73Semantic Scene Understanding under Dense Fog with Synthetic and Real DataChristos Sakaridis*, ETH Zurich; Dengxin Dai, ETH Zurich; Simon Hecker, ETH Zurich; Luc Van Gool, ETH Zurich
P-4A-74RIDI: Robust IMU Double IntegrationHang Yan*, Washington University in St. Louis; Qi Shan, Zillow Group; Yasutaka Furukawa, Simon Fraser University
P-4A-75Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web PriorSijia Cai*, The Hong Kong Polytechnic University; Wangmeng Zuo, Harbin Institute of Technology; Larry Davis, University of Maryland; Lei Zhang, Hong Kong Polytechnic University, Hong Kong, China
P-4A-76Transferring Common-Sense Knowledge for Object DetectionKrishna Kumar Singh*, University of California Davis; Santosh Divvala, Allen AI; Ali Farhadi, University of Washington; Yong Jae Lee, University of California, Davis
P-4A-77Person Search in Videos with One Portrait Through Visual and Temporal LinksQingqiu Huang*, CUHK; Wentao Liu, Sensetime; Dahua Lin, The Chinese University of Hong Kong
P-4A-78Eliminating the Dreaded Blind Spot: Adapting 3D Object Detection and Monocular Depth Estimation to 360° Panoramic ImageryGregoire Payen de La Garanderie*, Durham University; Toby Breckon, Durham University; Amir Atapour-Abarghouei, Durham University
P-4A-79Folded Recurrent Neural Networks for Future Video PredictionMarc Oliu*, Universitat Oberta de Catalunya; Javier Selva, Universitat de Barcelona; Sergio Escalera, Computer Vision Center (UAB) & University of Barcelona,
P-4A-80Deep Regression Tracking with Shrinkage LossXiankai Lu, Shanghai Jiao Tong University; Chao Ma*, University of Adelaide; Bingbing Ni, Shanghai Jiao Tong University; Xiaokang Yang, Shanghai Jiao Tong University of China; Ian Reid, University of Adelaide, Australia; Ming-Hsuan Yang, University of California at Merced
P-4A-81Stroke Controllable Fast Style Transfer with Adaptive Receptive FieldsYongcheng Jing, Zhejiang University; Yang Liu, Zhejiang University; Yezhou Yang, Arizona State University; Zunlei Feng, Zhejiang University; Yizhou Yu, The University of Hong Kong; Dacheng Tao, University of Sydney; Mingli Song*, Zhejiang University
P-4A-82Part-Aligned Bilinear Representations for Person Re-IdentificationYumin Suh, Seoul National University; Jingdong Wang, Microsoft Research; Kyoung Mu Lee*, Seoul National University
P-4A-83Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression NetworkYao Feng*, Shanghai Jiao Tong University; Fan Wu, CloudWalk Technology; Xiao-Hu Shao, Chongqing Institute of Green and Intelligent Technology,Chinese Academy of Sciences; Yan-Feng Wang, Shanghai Jiao Tong University; Xi Zhou, CloudWalk Technology
P-4A-84Learning Efficient Single-stage Pedestrian Detection by Asymptotic Localization FittingWei Liu*, National University of Defense Technology; Shengcai Liao, NLPR, Chinese Academy of Sciences, China; Weidong Hu, National University of Defence Technology; Xuezhi Liang, Center for Biometrics and Security Research & National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences; Xiao Chen, National University of Defense Technology
P-4A-85Unsupervised Hard-Negative Mining from Videos for Object DetectionSouYoung Jin*, UMASS Amherst; Huaizu Jiang, UMass Amherst; Aruni RoyChowdhury, University of Massachusetts, Amherst; Ashish Singh, UMASS Amherst; Aditya Prasad, UMASS Amherst; Deep Chakraborty, UMASS Amherst; Erik Learned-Miller, University of Massachusetts, Amherst
P-4A-86Focus, Segment and Erase: An Efficient Network for Multi-Label Brain Tumor SegmentationXuan Chen*, NUS; Jun Hao Liew, NUS; Wei Xiong, A*STAR Institute for Infocomm Research, Singapore; Chee-Kong Chui, NUS; Sim-Heng Ong, NUS
P-4A-87Maximum Margin Metric Learning Over Discriminative Nullspace for Person Re-identificationT M Feroz Ali*, Indian Institute of Technology Bombay, Mumbai; Subhasis Chaudhuri, Indian Institute of Technology Bombay
P-4A-88Efficient Relative Attribute Learning using Graph Neural NetworksZihang Meng*, University of Wisconsin Madison; Nagesh Adluru , WISC; Vikas Singh, University of Wisconsin-Madison USA
P-4A-89Object Level Visual Reasoning in VideosFabien Baradel, LIRIS; Natalia Neverova*, Facebook AI Research; Christian Wolf, INSA Lyon, France; Julien Mille, INSA Centre Val de Loire; Greg Mori, Simon Fraser University

Poster session 4B
4BThursday, September 13Poster session 04:00 PM - 06:00 PM
P-4B-01Deep Model-Based 6D Pose Refinement in RGBFabian Manhardt*, TU Munich; Wadim Kehl, Toyota Research Institute; Nassir Navab, Technische Universität München, Germany; Federico Tombari, Technical University of Munich, Germany
P-4B-02ContextVP: Fully Context-Aware Video PredictionWonmin Byeon*, NVIDIA; Qin Wang, ETH Zurich; Rupesh Kumar Srivastava, NNAISENSE; Petros Koumoutsakos, ETH Zurich
P-4B-03CornerNet: Detecting Objects as Paired KeypointsHei Law*, University of Michigan; Jia Deng, University of Michigan
P-4B-04RelocNet: Continous Metric Learning Relocalisation using Neural NetsVassileios Balntas*, University of Oxford; Victor Prisacariu, University of Oxford; Shuda Li, University of Oxford
P-4B-05Museum Exhibit Identification Challenge for the Supervised Domain Adaptation.Piotr Koniusz*, Data61/CSIRO, ANU; Yusuf Tas, Data61; Hongguang Zhang, Australian National University; Mehrtash Harandi, Monash University; Fatih Porikli, ANU; Rui Zhang, University of Canberra
P-4B-06Acquisition of Localization Confidence for Accurate Object DetectionBorui Jiang*, Peking University; Ruixuan Luo, Peking University; Jiayuan Mao, Tsinghua University; Tete Xiao, Peking University; Yuning Jiang, Megvii(Face++) Inc
P-4B-07The Contextual Loss for Image Transformation with Non-Aligned DataRoey Mechrez*, Technion; Itamar Talmi, Technion; Lihi Zelnik-Manor, Technion
P-4B-08Saliency Benchmarking Made Easy: Separating Models, Maps and MetricsMatthias Kümmerer*, University of Tübingen; Thomas Wallis, University of Tübingen; Matthias Bethge, University of Tübingen
P-4B-09Multi-Attention Multi-Class Constraint for Fine-grained Image RecognitionMing Sun, baidu; Yuchen Yuan, Baidu Inc.; Feng Zhou*, Baidu Research; Errui Ding, Baidu Inc.
P-4B-10Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language NavigationXin Wang*, University of California, Santa Barbara; Wenhan Xiong, University of California, Santa Barbara; Hongmin Wang, University of California, Santa Barbara; William Wang, UC Santa Barbara
P-4B-11HandMap: Robust Hand Pose Estimation via Intermediate Dense Guidance Map SupervisionXiaokun Wu*, University of Bath; Daniel Finnegan, University of Bath; Eamonn O'Neill, University of Bath; Yongliang Yang, University of Bath
P-4B-12LSQ++: lower runtime and higher recall in multi-codebook quantizationJulieta Martinez*, University of British Columbia; Shobhit Zakhmi, University of British Columbia; Holger Hoos, University of British Columbia; Jim Little, University of British Columbia, Canada
P-4B-13Multimodal Dual Attention Memory for Video Story Question AnsweringKyungmin Kim*, Seoul National University; Seong-Ho Choi, Seoul National University; Jin-Hwa Kim, Seoul National University; Byoung-Tak Zhang, Seoul National University
P-4B-14Hierarchical Bilinear Pooling for Fine-Grained Visual RecognitionChaojian Yu*, Huazhong University of Science and Technology; Qi Zheng, Huazhong University of Science and Technology; Xinyi Zhao, Huazhong University of Science and Technology; Peng Zhang, Huazhong University of Science and Technology; Xinge YOU, School of Electronic Information and Communications,Huazhong University of Science and Technology
P-4B-15Dense Semantic and Topological Correspondence of 3D Faces without LandmarksZhenfeng Fan*, Chinese Academy of Sciences; hu xiyuan, The Chinese academy of science; chen chen, The Chinese academy of science; peng silong, The Chinese academy of science
P-4B-16Real-Time Blind Video Temporal ConsistencyWei-Sheng Lai*, University of California, Merced; Jia-Bin Huang, Virginia Tech; Oliver Wang, Adobe Systems Inc; Eli Shechtman, Adobe Research, US; Ersin Yumer, Argo AI; Ming-Hsuan Yang, University of California at Merced
P-4B-17Depth Estimation via Affinity Learned with Convolutional Spatial Propagation NetworkXinjing Cheng, Baidu; Peng Wang*, Baidu USA LLC; Ruigang Yang, University of Kentucky, USA
P-4B-18Hierarchical Metric Learning and Matching for 2D and 3D Geometric CorrespondencesMohammed Fathy, University of Maryland College Park; Quoc-Huy Tran*, NEC Labs; Zeeshan Zia, Microsoft; Paul Vernaza, NEC Labs America; Manmohan Chandraker, NEC Labs America
P-4B-19GridFace: Face Rectification via Learning Local Homography TransformationsErjin Zhou*, Megvii Research
P-4B-20Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video ClassificationSaining Xie*, UCSD; Chen Sun, Google; Jonathan Huang, Google; Zhuowen Tu, UC San Diego; Kevin Murphy, Google
P-4B-21Deep Variational Metric LearningXudong Lin, Tsinghua University; Yueqi Duan, Tsinghua University; Qiyuan Dong, Tsinghua University; Jiwen Lu*, Tsinghua University; Jie Zhou, Tsinghua University, China
P-4B-22Multi-Class Model Fitting by Energy Minimization and Mode-SeekingDániel Baráth*, MTA SZTAKI, CMP Prague; Jiri Matas, CMP CTU FEE
P-4B-23A Unified Framework for Single-View 3D Reconstruction with Limited Pose SupervisionGuandao Yang*, Cornell University; Yin Cui, Cornell University; Bharath Hariharan, Cornell University
P-4B-24Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out CodesYang He*, MPI Informatics; Bernt Schiele, MPI; Mario Fritz, Max-Planck-Institut für Informatik
P-4B-25Orthogonal Deep Features Decomposition for Age-Invariant Face Recognitionyitong wang, Tencent AI Lab; dihong gong, Tencent AI Lab; zheng zhou, Tencent AI Lab; xing ji, Tencent AI Lab; Hao Wang, Tencent AI Lab; Zhifeng Li*, Tencent AI Lab; Wei Liu, Tencent AI Lab; Tong Zhang, Tecent AI Lab
P-4B-26HiDDeN: Hiding Data with Deep NetworksJiren Zhu*, Stanford University; Russell Kaplan, Stanford University; Justin Johnson, Stanford University; Li Fei-Fei, Stanford University
P-4B-27Learning and Matching Multi-View Descriptors for Registration of Point CloudsLei Zhou*, HKUST; Siyu Zhu, HKUST; Zixin Luo, HKUST; Tianwei Shen, HKUST; Runze Zhang, HKUST; Tian Fang, HKUST; Long Quan, Hong Kong University of Science and Technology
P-4B-28Deep Burst DenoisingClement Godard*, University College London; Kevin Matzen, Facebook; Matt Uyttendaele, Facebook
P-4B-29On Offline Evaluation of Vision-based Driving ModelsFelipe Codevilla, UAB; Antonio Lopez, CVC & UAB; Vladlen Koltun, Intel Labs; Alexey Dosovitskiy*, Intel Labs
P-4B-30Distortion-Aware Convolutional Filters for Dense Prediction in Panoramic ImagesKeisuke Tateno*, Technical University Munich; Nassir Navab, TU Munich, Germany; Federico Tombari, Technical University of Munich, Germany
P-4B-31Salient Objects in Clutter: Bringing Salient Object Detection to the ForegroundDeng-Ping Fan, Nankai University; Jiang-Jiang Liu, Nankai University; Shanghua Gao, Nankai University; Qibin Hou, Nankai University; Ming-Ming Cheng*, Nankai University; Ali Borji, University of Central Florida
P-4B-32Randomized Ensemble EmbeddingsHong Xuan*, The George Washington University; Robert Pless, George Washington University
P-4B-33Conditional Prior Networks for Optical FlowYanchao Yang*, UCLA; Stefano Soatto, UCLA
P-4B-34Adaptively Transforming Graph MatchingFudong Wang, Wuhan University; Nan Xue, Wuhan University; yi-peng Zhang, Syracuse University; Xiang Bai, Huazhong University of Science and Technology; Gui-Song Xia*, Wuhan University
P-4B-35Learning 3D shapes as multi-layered height maps using 2D convolutional neural networksKripasindhu Sarkar*, University of Kaiserslautern; Basavaraj Hampiholi, University of Kaiserslautern; Kiran Varanasi, German Research Center for Artificial Intelligence; Didier Stricker, DFKI
P-4B-36ISNN - Impact Sound Neural Network for Material and Geometry ClassificationAuston Sterling*, UNC Chapel Hill; Justin Wilson, UNC Chapel Hill; Sam Lowe, UNC Chapel Hill; Ming Lin, UNC Chapel Hill
P-4B-37Visual Psychophysics for Making Face Recognition Algorithms More ExplainableBrandon RichardWebster*, University of Notre Dame; So Yon Kwon, Perceptive Automata; Samuel Anthony, Perceptive Automata; Christopher Clarizio, University of Notre Dame; Walter Scheirer, University of Notre Dame
P-4B-38Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled DataXihui Liu*, The Chinese University of Hong Kong; Hongsheng Li, Chinese University of Hong Kong; Jing Shao, The Chinese University of Hong Kong; Dapeng Chen, The Chinese University of HongKong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
P-4B-39Using LIP to Gloss Over Faces in Single-Stage Face Detection NetworksSiqi Yang*, UQ ITEE; Arnold Wiliem, University of Queensland; Shaokang Chen, University of Queensland; Brian Lovell, University of Queensland
P-4B-40Variational Wasserstein ClusteringLiang Mi*, Arizona State University; wen zhang, ASU; Xianfeng GU, Stony Brook University; Yalin Wang, Arizona State University
P-4B-41ADVISE: Symbolism and External Knowledge for Decoding AdvertisementsKeren Ye*, University of Pittsburgh; Adriana Kovashka, University of Pittsburgh
P-4B-42Weakly- and Semi-Supervised Panoptic SegmentationAnurag Arnab*, University of Oxford; Philip Torr, University of Oxford; Qizhu Li, University of Oxford
P-4B-43Broadcasting Convolutional Network for Visual Relational ReasoningSimyung Chang, Seoul National University; John Yang, Seoul National University; Seonguk Park, Seoul National University; Nojun Kwak*, Seoul National University
P-4B-44A Unified Framework for Multi-View Multi-Class Object Pose EstimationChi Li*, Johns Hopkins University; Jin Bai, Johns Hopkins University; Gregory D. Hager, The Johns Hopkins University
P-4B-45Fast and Accurate Point Cloud Registration using Trees of Gaussian MixturesBenjamin Eckart*, NVIDIA; Kihwan Kim, NVIDIA; Kautz Jan, NVIDIA
P-4B-46Teaching Machines to Understand Baseball Games: Large Scale Baseball Video Database for Multiple Video Understanding TasksMinho Shim, Yonsei University; KYUNGMIN KIM, Yonsei University; Young Hwi Kim, Yonsei University; Seon Joo Kim*, Yonsei Univ.
P-4B-47Using Object Information for Spotting TextShitala Prasad*, NTU Singapore; Wai-Kin Adams Kong, Nanyang Technological University
P-4B-48Deep Domain Generalization via Conditional Invariant Adversarial NetworksYa Li, USTC; Xinmei Tian, USTC; Mingming Gong, CMU & U Pitt; Yajing Liu*, USTC; Tongliang Liu, The University of Sydney; Kun Zhang, Carnegie Mellon University; Dacheng Tao, University of Sydney
P-4B-49On the Solvability of Viewing GraphsMatthew Trager*, INRIA; Brian Osserman, UC Davis; Jean Ponce, Inria
P-4B-50Learning Type-Aware Embeddings for Fashion CompatibilityMariya Vasileva*, University of Illinois at Urbana-Champaign; Bryan Plummer, Boston University; Krishna Dusad, University of Illinois at Urbana-Champaign; Shreya Rajpal, University of Illinois at Urbana-Champaign; David Forsyth, Univeristy of Illinois at Urbana-Champaign; Ranjitha Kumar, UIUC: CS
P-4B-51Visual Coreference Resolution in Visual Dialog using Neural Module NetworksSatwik Kottur*, Carnegie Mellon University; José M. F. Moura, Carnegie Mellon University; Devi Parikh, Georgia Tech & Facebook AI Research; Dhruv Batra, Georgia Tech & Facebook AI Research; Marcus Rohrbach, Facebook AI Research
P-4B-52Hard-Aware Point-to-Set Deep Metric for Person Re-identificationRui Yu*, Huazhong University of Science and Technology; Zhiyong Dou, Huazhong University of Science and Technology; Song Bai, HUST; ZHAO-XIANG ZHANG, Chinese Academy of Sciences, China; Yongchao Xu, HUST; Xiang Bai, Huazhong University of Science and Technology
P-4B-53Gray box adversarial trainingVivek B S*, Indian Institute of Science; Konda Reddy Mopuri, Indian Institute of Science, Bangalore; Venkatesh Babu RADHAKRISHNAN, Indian Institute of Science
P-4B-54Exploiting Vector Fields for Geometric Rectification of Distorted Document ImagesGaofeng Meng*, Chinese Academy of Sciences; Yuanqi Su, Xi'an Jiaotong University; Ying Wu, Northwestern University; SHIMING XIANG, Chinese Academy of Sciences, China; Chunhong Pan, Institute of Automation, Chinese Academy of Sciences
P-4B-55Revisiting RCNN: On Awakening the Classification Power of Faster RCNNYunchao Wei*, UIUC; Bowen Cheng, UIUC; Honghui Shi, UIUC; Rogerio Feris, IBM Research; Jinjun Xiong, IBM Thomas J. Watson Research Center; Thomas Huang, UIUC
P-4B-56DeepTAM: Deep Tracking and MappingHuizhong Zhou*, University of Freiburg; Benjamin Ummenhofer, University of Freiburg; Thomas Brox, University of Freiburg
P-4B-57On Regularized Losses for Weakly-supervised CNN SegmentationMeng Tang*, University of Waterloo; Ismail Ben Ayed, ETS; Federico Perazzi, Disney Research; Abdelaziz Djelouah, Disney Research; Christopher Schroers, Disney Research; Yuri Boykov, University of Waterloo
P-4B-58ShapeCodes: Self-Supervised Feature Learning by Lifting Views to ViewgridsDinesh Jayaraman*, UC Berkeley; Ruohan Gao, University of Texas at Austin; Kristen Grauman, University of Texas
P-4B-59A Minimal Closed-Form Solution for Multi-Perspective Pose Estimation using Points and LinesPedro Miraldo*, Instituto Superior Técnico, Lisboa; Tiago Dias, Institute for systems and robotics; Srikumar Ramalingam, University of Utah
P-4B-60Interaction-aware Spatio-temporal Pyramid Attention Networks for Action ClassificationYang Du, NLPR; Chunfeng Yuan*, NLPR; Weiming Hu, Institute of Automation,Chinese Academy of Sciences
P-4B-61Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot StudyZhenyu Wu, Texas A&M University; Zhangyang Wang*, Texas A&M University; Zhaowen Wang, Adobe Research; Hailin Jin, Adobe Research
P-4B-62Polarimetric Three-View GeometryLixiong Chen, National Institute of Informatics; Yinqiang Zheng*, National Institute of Informatics; Art Subpa-asa, Tokyo Institute of Technology; Imari Sato, National Institute of Informatics
P-4B-63SketchyScene: Richly-Annotated Scene SketchesChangqing Zou*, University of Maryland (UMD); Qian Yu, Queen Mary University of London; Ruofei Du, UMD; Haoran Mo, sun yat sen university; Yi-Zhe Song, Queen Mary University of London; Tao Xiang, Queen Mary, University of London, UK; Chengying Gao, sun yat sen university; Baoquan Chen, Shandong University; Hao Zhang, SFU
P-4B-64Bi-Real Net: Enhancing the Performance of 1-bit CNNs with Improved Representational Capability and Advanced Training Algorithmzechun liu*, HKUST; Baoyuan Wu, Tencent AI Lab; Wenhan Luo, Tencent AI Lab; Xin Yang, Huazhong University of Science and Technology; Wei Liu, Tencent AI Lab; Kwang-Ting Cheng, Hong Kong University of Science and Technology
P-4B-65Deep Continuous Fusion for Multi-Sensor 3D Object DetectionMing Liang*, Uber; Shenlong Wang, Uber ATG, University of Toronto; Bin Yang, Uber ATG, University of Toronto; Raquel Urtasun, Uber ATG
P-4B-66Focus on the Hard Things: Dynamic Task Prioritization for Multitask LearningMichelle Guo*, Stanford University; Albert Haque, Stanford University; De-An Huang, Stanford University; Serena Yeung, Stanford University; Li Fei-Fei, Stanford University
P-4B-67Domain transfer through deep activation matchingHaoshuo Huang*, Tsinghua University; Qixing Huang, The University of Texas at Austin; Philipp Kraehenbuehl, UT Austin
P-4B-68Joint Blind Motion Deblurring and Depth Estimation of Light FieldDongwoo Lee, Seoul Ntional University; Haesol Park, Seoul National University; In Kyu Park, Inha University; Kyoung Mu Lee*, Seoul National University
P-4B-69Learning to Look around Objects for Top-View Representations of Outdoor ScenesSamuel Schulter*, NEC Labs; Menghua Zhai, University of Kentucky; Nathan Jacobs, University of Kentucky; Manmohan Chandraker, NEC Labs America
P-4B-70Data-Driven Sparse Structure Selection for Deep Neural NetworksZehao Huang*, TuSimple; Naiyan Wang, TuSimple
P-4B-71Reconstruction-based Pairwise Depth Dataset for Depth Image Enhancement Using CNNJunho Jeon, POSTECH; Seungyong Lee*, POSTECH
P-4B-72A Geometric Perspective on Structured Light CodingMohit Gupta*, University of Wisconsin-Madison, USA ; Nikhil Nakhate, University of Wisconsin-Madison
P-4B-733D Ego-Pose Estimation via Imitation LearningYe Yuan*, Carnegie Mellon University; Kris Kitani, CMU
P-4B-74Unsupervised Learning of Multi-Frame Optical Flow with OcclusionsJoel Janai*, Max Planck Institute for Intelligent Systems; Fatma Güney, University of Oxford; Anurag Ranjan, MPI for Intelligent Systems; Michael Black, Max Planck Institute for Intelligent Systems; Andreas Geiger, MPI-IS and University of Tuebingen
P-4B-75Dynamic Conditional Networks for Few-Shot LearningFang Zhao, National University of Singapore; Jian Zhao*, National University of Singapore; Yan Shuicheng, National University of Singapore; Jiashi Feng, NUS
P-4B-763DFeat-Net: Weakly Supervised Local 3D Features for Rigid Point Cloud RegistrationZi Jian Yew*, National University of Singapore; Gim Hee Lee, National University of SIngapore
P-4B-77Learning to Forecast and Refine Residual Motion for Image-to-Video GenerationLong Zhao*, Rutgers University; Xi Peng, Rutgers University; Yu Tian, Rutgers; Mubbasir Kapadia, Rutgers; Dimitris Metaxas, Rutgers
P-4B-78Learn-to-Score: Efficient 3D Scene Exploration by Predicting View UtilityBenjamin Hepp*, ETH Zurich; Debadeepta Dey, Microsoft; Sudipta Sinha, Microsoft Research; Ashish Kapoor, Microsoft; Neel Joshi, -; Otmar Hilliges, ETH Zurich
P-4B-79Deep Co-Training for Semi-Supervised Image RecognitionSiyuan Qiao*, Johns Hopkins University; Wei Shen, Shanghai University; Zhishuai Zhang, Johns Hopkins University; Bo Wang, Hikvision Research Institue; Alan Yuille, Johns Hopkins University
P-4B-80Attention-aware Deep Adversarial Hashing for Cross Modal RetrievalXi Zhang, Sun Yat-Sen University; Hanjiang Lai*, Sun Yat-Sen university; Jiashi Feng, NUS
P-4B-81Remote Photoplethysmography Correspondence Feature for 3D Mask Face Presentation Attack DetectionSiqi Liu*, Department of Computer Science, Hong Kong Baptist University; Xiangyuan Lan, Department of Computer Science, Hong Kong Baptist University; PongChi Yuen, Department of Computer Science, Hong Kong Baptist University
P-4B-82Semi-Supervised Generative Adversarial Hashing for Image RetrievalGuan'an Wang*, Chinese Academy of Sciences; Qinghao Hu, Chinese Academy of Sciences; Jian Cheng, Chinese Academy of Sciences, China; Zengguang Hou, Chinese Academy of Sciences
P-4B-83Improving Spatiotemporal Self-Supervision by Deep Reinforcement LearningUta Büchler*, Heidelberg University; Biagio Brattoli, Heidelberg University; Bjorn Ommer, Heidelberg University
P-4B-84AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed VideosZheng Shou*, Columbia University; Hang Gao, Columbia University; Lei Zhang, Microsoft Research; Kazuyuki Miyazawa, Mitsubishi Electric; Shih-Fu Chang, Columbia University
P-4B-85Revisiting Autofocus for Smartphone CamerasAbdullah Abuolaim*, York University; Abhijith Punnappurath, York University; Michael Brown, York University
P-4B-86Contour Knowledge Transfer for Salient Object DetectionXin Li, UESTC; Fan Yang*, UESTC; Hong Cheng, UESTC; Wei Liu, Digital Media Technology Key Laboratory of Sichuan Province, UESTC; Dinggang Shen, UNC
P-4B-87Deep Volumetric Video From Very Sparse Multi-View Performance CaptureZeng Huang*, University of Southern California; Tianye Li, University of Southern California; Weikai Chen, USC Institute for Creative Technology; Yajie Zhao, USC Institute for Creative Technology ; Jun Xing, Institute for Creative Technologies, USC; Chloe LeGendre, USC Institute for Creative Technology ; Linjie Luo, Snap Inc; Chongyang Ma, Snap Inc.; Hao Li, Pinscreen/University of Southern California/USC ICT
P-4B-88Person Re-identification with Deep Similarity-Guided Graph Neural NetworkYantao Shen*, The Chinese University of Hong Kong; Hongsheng Li, Chinese University of Hong Kong; Shuai Yi, The Chinese University of Hong Kong; Xiaogang Wang, Chinese University of Hong Kong, Hong Kong
P-4B-89Deep Component Analysis via Alternating Direction Neural NetworksCalvin Murdock*, Carnegie Mellon University; MingFang Chang, Carnegie Mellon University; Simon Lucey, CMU
P-4B-90Understanding Perceptual and Conceptual Fluency at a Large ScaleMeredith Hu*, Cornell University; Ali Borji, University of Central Florida
P-4B-91Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven LossJianbo Jiao*, City University of Hong Kong; Ying Cao, City University of Hong Kong; Yibing Song, Tencent AI Lab; Rynson Lau, City University of Hong Kong