2020 ICML Oral 論文

Oral Papers

38 - ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

Zhizhong Han (University of Maryland, College Park); Chao Chen (Tsinghua University); Yu-Shen Liu (Tsinghua University)*; Matthias Zwicker (University of Maryland)

 

53 - Image Inpainting Based on Multi-frequency Probabilistic Inference Model

Jin Wang (Beijing University of Technology)*; Chen Wang (Beijing University of Technology); Qingming Huang (University of Chinese Academy of Sciences); Yunhui Shi (Beijing University of Technology); Jian-Feng Cai (The Hong Kong University of Science and Technology); Qing Zhu (Beijing University of Technology); Baocai Yin (Beijing University of Technology)

 

60 - Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition

Wenbo Zheng (School of Software Engineering, Xi'an Jiaotong University); Lan Yan (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences); Chao Gou (School of Intelligent Systems Engineering, Sun Yat-sen University)*; Fei-Yue Wang (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences)

 

63 - Dual Adversarial Network for Unsupervised Ground/Satellite-to-Aerial Scene Adaptation

jianzhe peter lin (University of British Columbia)*; Lichao Mou (DLR&TUM); tianze yu (University of British Columbia); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR)); Z. Jane Wang (University of British Columbia)

 

68 - Scene-Aware Background Music Synthesis

Yujia Wang (Beijing Institute of Technology)*; Wei Liang (Beijing Institute of Technology); Wanwan Li (George Mason University); Dingzeyu Li (Adobe Research); Lap-Fai Yu (George Mason University)

 

75 - Adversarial Bipartite Graph Learning for Video Domain Adaptation

Yadan Luo (University of Queensland)*; Zi Huang (University of Queensland); Zijian Wang (University of Queensland); Zheng Zhang (Harbin Institute of Technology, Shenzhen); Mahsa Baktashmotlagh (University of Queensland)

 

111 - Domain Adaptive Person Re-Identification via Coupling Optimization

Xiaobin Liu (Peking University); Shiliang Zhang (Peking University)*

 

118 - Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge

Peng Wang (Northwestern Polytechnical University); Dongyang Liu (Northwestern Polytechnical University); Hui Li (the University of Adelaide)*; Qi Wu (University of Adelaide)

 

142 - Controllable Video Captioning with an Exemplar Sentence

Yitian Yuan (Tsinghua University)*; Lin Ma (Tencent AI Lab); Jingwen Wang (Tencent AI Lab); Wenwu Zhu (Tsinghua University)

 

143 - MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos

Guangyao Shen (Tsinghua University)*; Xin Wang (Tsinghua University); Xuguang Duan (Tsinghua University); Hongzhi Li (Microsoft Research); Wenwu Zhu (Tsinghua University)

 

162 - Single Image De-noising via Staged Memory Network

Weijiang Yu (SUN YAT-SEN UNIVERSITY)*; Jian Liang (Nanchang University); Lu Li (Zhejiang University); Nong Xiao (Sun Yat-sen University)

 

193 - Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing

Peipei Li ( Institute of Automation Chinese Academy of Sciences)*; Yinglu Liu (JD AI); Hailin Shi (JD AI); Xiang Wu (Reconova); Yibo Hu (Institute of Automation, Chinese Academy of Sciences); Ran He (Institute of Automation, Chinese Academy of Sciences); Zhenan Sun (Chinese of Academy of Sciences)

 

197 - A Human-Computer Duet System for Music Performance

Yuen-Jen Lin (Academia Sinica)*; Hsuan-Kai Kao (Academia Sinica); Yih-Chih Tseng (Academia Sinica); Ming Tsai (KoKo Lab); Li Su (Academia Sinica)

 

202 - Invisible: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages

Qiushi Li (Tsinghua University)*; Wenwu Zhu (Tsinghua University); Chao Wu (Tsinghua University); xinglin pan (University of Electronic Science and Technology of China); Fan Yang (Tsinghua University); Yuezhi Zhou (Tsinghua University); Yaoxue Zhang (Tsinghua University)

 

221 - Every Moment Matters: Detail-Aware Networks to Bring a Blurry Image Alive

Kaihao Zhang (Australian National University)*; Wenhan Luo (Tencent AI Lab); Bjorn Stenger (Rakuten Institute of Technology); Wenqi Ren (Institute of Information Engineering, Chinese Academy of Sciences); Lin Ma (Tencent AI Lab); HONGDONG LI (Australian National University, Australia)

 

230 - Self-supervised Dance Video Synthesis Conditioned on Music

Xuanchi Ren (HKUST); Haoran Li (The Hong Kong University of Science and Technology); Zijian HUANG (the Hong Kong University of Science and Technology); Qifeng Chen (HKUST)*

 

232 - Co-Attentive Lifting for Infrared-Visible Person Re-Identification

Xing Wei (Xi'an Jiaotong University)*; Diangang Li (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Wei Ke (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

 

291 - Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Fanfan Ye ( Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Qiaoyong Zhong (Hikvision Research Institute)*; Chao Li (Hikvision Research Institute); Di Xie (Hikvision Research Institute); Huiming Tang (Zhejiang University)

 

304 - Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Guohao Li (Tsinghua University)*; Xin Wang (Tsinghua University); Wenwu Zhu (Tsinghua University)

 

306 - Meta Parsing Networks: Towards Generalized Few-shot Scene Parsing with Adaptive Metric Learning

Peike Li (UTS)*; Yunchao Wei (University of Technology Sydney); Yi Yang (UTS)

 

312 - CODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes

Wei Li (Southwest Jiaotong University); Zhenting Wang (Southwest Jiaotong University); Xiao Wu (Southwest Jiaotong University)*; Ji Zhang (Southwest Jiaotong University); Qiang Peng (Southwest Jiaotong University); Hongliang Li (University of Electronic Science and Technology of China)

 

352 - Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations

Dong Zhang (Soochow University)*; Weisheng Zhang (Soochow University); Shoushan Li (Soochow University); Zhu Qiaoming (Soochow University); Zhou Guodong (Soochow University)

 

355 - WIKI Food-500: A dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences)*; Linhu Liu (ICT); Zhiling Wang (Institute of Computing Technology, Chinese Academy of Sciences); Zhengdong Luo (University of Chinese Academy of Sciences); Xiaoming Wei (MeituanDianping group ); Xiaolin Wei (MeituanDianping group ); Shuqiang Jiang (ICT, China Academy of Science)

 

358 - Learning Image Classifier from Only Web Labels and Metadata: Automatic Label Correction through Graph

Jingkang Yang (Sensetime Research)*; Weirong Chen (SenseTime Research); Litong Feng (Sensetime Research); Xiaopeng Yan (SenseTime Research); Huabin Zheng (SenseTime Research); Wayne Zhang (SenseTime Research)

 

373 - Photo Stand-Out: Photography with Virtual Character

Yujia Wang (Beijing Institute of Technology)*; Sifan Hou (Beijing Institute of Technology); Wei Liang (Beijing Institute of Technology); Bing Ning (Beijing Institute of Fashion Technology)

 

378 - Accurate UAV Tracking with Distance-Injected Overlap Maximization

Chunhui Zhang (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Kangkai Zhang (Chinese Academy of Sciences); Dan Zeng (Shanghai University)

 

383 - Context-Aware Multi-View Summarization Network for Image-Text Matching

Leigang Qu (Shandong University); Meng Liu (Shandong Jianzhu University); Da Cao (Hunan University); Liqiang Nie (Shandong University )*; Qi Tian (Huawei Cloud & AI)

 

391 - PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

Hongru Liang (Nankai University); Wenqiang Lei (National University of Singapore)*; Paul Yaozhu Chan (A∗STAR); Zhenglu Yang (Nankai University); Maosong Sun (Tsinghua University); Tat-Seng Chua (National Univ. of Singapore)

 

395 - An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis

Tianyu Zhang (ICT)*; Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences); Ying Zhu (University of Chinese Academy of Sciences); Yong Rui (Lenovo); Shuqiang Jiang (ICT, China Academy of Science)

 

436 - Label Embedding Online Hashing for Cross-Modal Retrieval

Yongxin Wang (Shandong University); Xin Luo (Shandong University); Xin-Shun Xu (Shandong University)*

 

444 - Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events

Guang Yu (National University of Defense Technology)*; Siqi Wang (National University of Defense Technology); Zhiping Cai (NUDT); En Zhu (National University of Defense Technology); Chuanfu Xu (National University of Defense Technology); Jianping Yin (National University of Defense Technology); Marius Kloft (TU Kaiserslautern)

 

484 - CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning

Zeren Sun (Nanjing University of Science and Technology ); Xian-Sheng Hua (Alibaba Group); Yazhou Yao (Nanjing University of Science and Technology)*; Xiu-Shen Wei (Nanjing University of Science and Technology); Guosheng Hu (AnyVision); Jian Zhang (UTS)

 

519 - MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Qing Lin (Fudan University); Bo Yan (Fudan University)*; Jichun Li (Fudan University); Weimin Tan (Fudan University)

 

531 - Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation

Yiheng Liu (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Mao Xi (University of Science and Technology of China); Sanjing Shen (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

 

541 - Learning From Music to Visual Storytelling of Shots: A Deep Interactive Learning Mechanism

Jen-Chun Lin (Academia Sinica)*; Wen-Li Wei (Academia Sinica); Yen-Yu Lin (National Chiao Tung University); Tyng-Luh Liu (Academia Sinica); Hong-Yuan Mark Liao (Institute of Information Science, Academia Sinica, Taiwan)

 

553 - Asymmetric Deep Hashing for Efficient Hash Code Compression

Shu Zhao (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Yu Zhou (Institute of Information Engineering, CAS); Bo Li ( Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China)

 

588 - Quaternion-Based Knowledge Graph Network for Recommendation

Zhaopeng Li (State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Yangbangyan Jiang (Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Xiaochun Cao (Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

 

601 - Multi-Person Action Recognition in Microwave Sensors

Diangang Li (Xi'an Jiaotong University); Jianquan Liu (NEC Corporation)*; Shoji Nishimura (NEC Corporation); Yuka Hayashi (NEC Corporation); Jun Suzuki (NEC Corporation); Yihong Gong (Xi'an Jiaotong University)

 

612 - Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

Dingquan Li (Peking University); Tingting Jiang (Peking University)*; Ming Jiang (Peking University)

 

639 - Coupling deep textural and shape features for sketch retrieval

Qi Jia (Dalian University of Technology); Xin Fan (Dalian University of Technology)*; Meiyu Yu (Didi Chuxing); Yuqing Liu (Dalian University of Technology); Dingrong Wang (Dalian University of Technology); Longin Jan Latecki (Temple University)

 

647 - Memory-Augmented Relation Network for Few-Shot Learning

He Jun (Hefei University of Technology)*; Richang Hong (Hefei University of Technology); Xueliang Liu (Hefei University of Technology); Mingliang Xu (Zhengzhou University); Zheng-Jun Zha (University of Science and Technology of China); Meng Wang (Hefei University of Technology)

 

668 - Performance Optimization of Federated Person Re-identification via Benchmark Analysis

Weiming Zhuang (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Xuesen Zhang (SenseTime); Xin Gan (SenseTime); Daiying Yin (SenseTime); Dongzhan Zhou (The University of Sydney); shuai zhang (Sensetime Ltd); Shuai Yi (SenseTime Group Limited)

 

691 - Guided Attention Network for Object Detection and Counting on Drones

CAI YuanQiang (UCAS); Dawei Du (University of Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences)*; Longyin Wen (JD Digit); Weiqiang Wang (University of Chinese Academy of Sciences); Yanjun Wu (Institute of Software Chinese Academy of Sciences ); Siwei Lyu (University at Albany)

 

696 - K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering

Yiyi Zhou (Xiamen University); Rongrong Ji (Xiamen University, China)*; Xiaoshuai Sun ( Xiamen University); Gen Luo (Xiamen University); Xiaopeng Hong (Xi'an Jiaotong University); Jinsong Su (Xiamen University); Xinghao Ding (Xiamen University); Ling Shao (Inception Institute of Artificial Intelligence)

 

701 - TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection

Fangfang Wang (Zhejiang University)*; Yifeng Chen (Zhejiang University); Fei Wu (Zhejiang University, China); Xi Li (Zhejiang University)

 

704 - Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification

Yongguo Ling (Xiamen University)*; Zhun Zhong (University of Trento); Zhiming Luo (Xiamen University); Paolo Rota (University of Trento); Shaozi Li (Xiamen University, China); Nicu Sebe (University of Trento)

 

707 - Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition

Yuan Xie (DarkMatter AI); Tianshui Chen (DarkMatter AI)*; Tao Pu (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Liang Lin (DarkMatter AI)

 

710 - Weakly Supervised Real-time Image Cropping based on Aesthetic Distributions

Peng Lu (Beijing University of Posts and Telecommunications)*; Jiahui Liu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Xiaojie Wang (Beijing University of Posts and Telecommunications)

 

732 - Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

Yuting Liu (Sichuan University)*; Zheng Wang (National Institute of Informatics); Miaojing Shi (King's College London); Shin'ichi Satoh (National Institute of Informatics); Qijun Zhao (Sichuan University); hongyu yang (sichuan university)

 

734 - KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue

Xiaoze Jiang (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Siyi Du (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Zengchang Qin (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University)*; Yajing Sun (Institute of Information Engineering,Chinese Academy of Sciences); Jing Yu ( Institute of Information Engineering,Chinese Academy of Sciences)

 

737 - Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module

Yanlu Wei (Beihang University); Renshuai Tao (Beihang University)*; Zhangjie Wu (Beihang University); Yuqing Ma (Beihang University); Libo Zhang (Institute of Software Chinese Academy of Sciences); Xianglong Liu (BUAA)

 

765 - Context-aware Attention Network for Predicting Image Aesthetic Subjectivity

Munan Xu (Shenzhen Graduate School, Peking University); Jia-Xing Zhong (School of Electronic and Computer Engineering, Peking University); Yurui Ren (Shenzhen Graduate School, Peking University); Shan Liu (Tencent America); Ge Li (SECE, Shenzhen Graduate School, Peking University)*

 

783 - PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Jingchen Sun (Zhejiang University); Jiming Chen (Zhejiang University); Tao Chen (Fudan University); jiayuan fan (Fudan University); Shibo He (Zhejiang University)*

 

787 - ChoreoNet: Torwards Music to Dance Synthesis with Choreographic Action Unit

Zijie Ye (Tsinghua University)*; Haozhe Wu (Tsinghua University); Jia Jia (Tsinghua University); Yaohua Bu (Tsinghua University); Wei Chen (Beijing Sougou Science and Technology Development Co., Ltd); Fanbo Meng (Sogou Corporation, Beijing, China); Yanfeng Wang ( Beijing Sougou Science and Technology Development Co., Ltd)

 

794 - Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Xiaochi Wei (Baidu Inc.); Liqiang Nie (Shandong University ); Richang Hong (Hefei University of Technology); Zheng Qin (Hunan University)

 

795 - Pose-native Network Architecture Search for Multi-person Human Pose Estimation

Qian Bao (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jun Hong (AI Research of JD.com); Lingyu Duan (Peking University); Tao Mei (AI Research of JD.com)

 

814 - Cascade Grouped Attention Network for Referring Expression Segmentation

Gen Luo (Xiamen University); Rongrong Ji (Xiamen University, China)*; Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Jinsong Su (Xiamen University); Chia-Wen Lin (National Tsing Hua University); Qi Tian (Huawei Cloud & AI)

 

816 - Temporally Guided Music-to-Body-Movement Generation

Hsuan-Kai Kao (Academia Sinica); Li Su (Academia Sinica)*

 

818 - Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

Yixiong Zou (Peking University)*; Shanghang Zhang (UC Berkeley); Ke Chen (South China University of Technology); José M. F. Moura (Carnegie Mellon University); Yaowei Wang (PengCheng Laboratory); Yonghong Tian (Peking University)

 

830 - InteractGAN: Learning to Generate Human-Object Interaction

Chen Gao (Institute of Information Engineering, CAS)*; si liu (Beihang University); Defa Zhu (Institute of Information Engineering, CAS); Quan Liu (Beihang University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Haoqian He (Beihang University); Ran He (Institute of Automation, Chinese Academy of Sciences); Shuicheng Yan (YITU Tech)

 

867 - Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

Jie Wu (Sun Yat-sen University)*; Guanbin Li (Sun Yat-sen University); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)); Liang Lin (DarkMatter AI)

 

876 - Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

Hung-Min Hsu (UW)*; Yizhou Wang (University of Washington); Jenq-Neng Hwang (University of WA�)

 

893 - VONAS: Network Design in Visual Odometry using Neural Architecture Search

Xing Cai (Peking University); Lanqing Zhang (Peking University); Chengyuan Li (Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University); Thomas H Li (Advanced Institute of Information Technology, Peking University)*

 

921 - Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

Shijie Wang (Dalian University of Technology); zhihui wang (Dalian University of Technology); Haojie Li (Dalian University of Technology)*; Wanli Ouyang (The University of Sydney)

 

961 - Poet: Product-oriented Video Captioner for E-commerce

Shengyu Zhang (Zhejiang University)*; Ziqi Tan (Zhejiang University); Jin Yu (Alibaba Group); Zhou Zhao (Zhejiang University); Kun Kuang (Zhejiang University); jie liu (Alibaba); Jingren Zhou (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)

 

976 - Beyond the Attention: Distinguish the Discriminative and Confusable Features For Fine-grained Image Classification

Xiruo Shi (Beijing University of Posts and Telecommunications ); Liutong Xu (Beijing University of Posts and Telecommunications); Pengfei Wang (School of Computer Science, Beijing University of Posts and Telecommunications); Yuanyuan Gao (Beihang Univeristy); Haifang Jian (Institute of Semiconductors, Chinese Academy of Sciences); Wu Liu (AI Research of JD.com)*

 

977 - BlockMix: Meta Regularization and Self-Calibrated Inference for Metric-Based Meta-Learning

Hao Tang (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology)*; Zhimao Peng (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

 

980 - Structural Semantic Adversarial Active Learning for Image Captioning

Beichen Zhang (University of Chinese Academy of Sciences)*; liang li (Institute of Computing Technology, Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)

 

988 - Scene-Aware Context Reasoning for Unsupervised Abnormal Event Detection in Videos

Che Sun (Beijing Institute of Technology); Yunde Jia (Beijing Institute of Technology); Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yuwei WU (Beijing Institute of Technology (BIT), China)*

 

1002 - Active Object Search

Jie Wu (Sun Yat-sen University)*; Tianshui Chen (DarkMatter AI); Lishan Huang (Sun Yat-Sen University); Hefeng Wu (Sun Yat-sen University); Guanbin Li (Sun Yat-sen University); Ling Tian (University of Electronic Science and Technology of China); Liang Lin (DarkMatter AI)

 

1009 - Deep-Modal: Real-Time Impact Sound Synthesis for Arbitrary Shapes

Xutong Jin (Peking University); Sheng Li (Peking University)*; Tianshu Qu (Peking University); Dinesh Manocha (UMD); Guoping Wang (Peking University)

 

1011 - Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID

Dechao Meng (vipl,ict,Chinese academic of science)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Xingyu Gao (Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)

 

1035 - Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection

xincheng Ju (Soochow University)*; Dong Zhang (Soochow University); Junhui Li (Soochow University); Zhou Guodong (Soochow University)

 

1038 - Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

Xinchen Liu (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jinkai Zheng (Hangzhou Dianzi University); Chenggang Yan (Hangzhou Dianzi University); Tao Mei (AI Research of JD.com)

 

1064 - Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning

Huaizheng Zhang (Nanyang Technological University)*; YONG LUO (Nanyang Technological University); Qiming Ai (Nanyang Technological University); Han Hu (Beijing Institute of Technology, China); Yonggang Wen (Nanyang Technological University)

 

1075 - Light Field Super-resolution via Attention-Guided Fusion of Hybrid Lenses

Jing Jin (City University of Hong Kong); Junhui Hou (City University of Hong Kong, Hong Kong)*; Jie Chen (Hong Kong Baptist University); Sam Kwong (City Univeristy of Hong Kong); Jingyi Yu (Shanghai Tech University)

 

1147 - Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification

Yanbin Hao (City University of Hong Kong); Hao Zhang (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Qiang Liu (DeepAIT (Hong Kong) Limited); Xiaojun Hu (DeepAIT (Hong Kong) Limited)

 

1195 - Semantic Image Analogy with a Conditional Single-Image GAN

Jiacheng Li (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Dong Liu (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

 

1196 - Trajectory Prediction in Heterogeneous Environment via Attended Ecology Embedding

Wei-Cheng Lai (National Chiao Tung University); Zi-Xiang Xia (National Chiao Tung University); Hao-Siang Lin (National Chiao Tung University); Lien-Feng Hsu (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); I-Hong Jhuo (IBM); Wen-Huang Cheng (EE, NCTU)*

 

1214 - A Structured Graph Attention Network for Vehicle Re-Identification

Yangchun Zhu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Tianzhu Zhang (University of Science and Technology of China); Jiawei Liu (University of Science and Technology of China); Jiebo Luo (U. Rochester)

 

1224 - Scoring High: Analysis and Prediction of Viewer Behavior and Engagement in the Context of 2018 FIFA WC Live Streaming

Nikolas Wehner (University of Würzburg)*; Michael Seufert (University of Würzburg); Sebastian Egger-Lampl (AIT Austrian Institute of Technology GmbH); Bruno Gardlo (AIT Austrian Institute of Technology GmbH); Pedro Casas (AIT Austrian Institute of Technology GmbH); Raimund Schatz (AIT)

 

1275 - Text-Guided Neural Image Inpainting

Lisai Zhang (Harbin Institute of Technology, Shenzhen)*; Qingcai Chen ( Harbin Institute of Technology, Shenzhen); Baotian Hu (Harbin Institute of Technology, Shenzhen); Shuoran Jiang (Harbin Institute of Technology, Shenzhen)

 

1319 - Weakly-supervised Image Hashing through Masked Visual Semantic Graph Reasoning

Lu Jin (Nanjing University of Science and Technology ); Zechao Li (Nanjing University of Science and Technology)*; Yonghua Pan (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

 

1344 - Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval

Heyu Zhou (Tianjin University, China); Weizhi Nie (Tianjin University)*; Dan Song (Tianjin University); Nian Hu (Tianjin University); Xuanya Li (Baidu); An-An Liu (Tianjin University)

 

1347 - Performance over Random: A robust evaluation protocol for video summarization methods

Evlampios Apostolidis (QMUL & CERTH-ITI)*; Eleni Adamantidou (CERTH); Alexandros I Metsai (CERTH-ITI); Vasileios Mezaris (Information Technologies Institute, Centre for Research and Technology Hellas, Greece); Ioannis Patras (Queen Mary University of London)

 

1355 - ARSketch: Sketch-Based User Interface for Augmented Reality Glasses

Zhaohui Zhang (Rokid); Haichao Zhu (The Chinese University of Hong Kong)*; Qian Zhang (California University, Los Angeles)

 

1367 - Text-Embedded Bilinear Model for Fine-Grained Visual Recognition

Liang Sun (University of Electronic Science and Technology of China); Xiang Guan (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)*; Lei Zhang (Chongqing University)

 

1384 - Learning Scales from Points: A Scale-aware Probabilistic Model for Crowd Counting

Zhiheng Ma (Xi'an Jiaotong University)*; Xing Wei (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

 

1394 - Learning Global Structure Consistency for Robust Object Tracking

Bi Li (Huazhong University of Science and Technology); Chengquan Zhang (Baidu Inc); Zhibin Hong (Baidu Inc.); Xu Tang (Baidu); jingtuo liu (baidu); Junyu Han (Baidu Inc.); Errui Ding (Baidu Inc.); Wenyu Liu (Huazhong University of Science and Technology)*

 

1399 - RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

Niluthpol c Mithun (SRI International)*; Karan Sikka (SRI International); Han-Pang Chiu (SRI International); Supun Samarasekera (SRI International); Rakesh Kumar (SRI International)

 

1418 - Multimodal Representation with Embedded Visual Guiding Objects for Named Entity Recognition in Social Media Posts

Zhiwei Wu (School of Software Engineering, South China University of Technology); Changmeng Zheng (South China University of Technology); Yi Cai (School of Software Engineering, South China University of Technology)*; Junying Chen (South China University of Technology); Ho-fung Leung (The Chinese University of Hong Kong); Qing Li (The Hong Kong Polytechnic University)

 

1453 - Contextual Multi-Scale Feature Learning for Person Re-Identification

Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.); Li Wang (inspur)*; Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.)

 

1456 - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene

Xinke Li (National University of Singapore); Chongshou Li (National University of Singapore)*; Zekun Tong (National University of Singapore); Andrew Lim (National University of Singapore); Junsong Yuan ("State University of New York at Buffalo, USA"); Yuwei Wu (National University of Singapore); Jing Tang (National University of Singapore); Raymond Huang (National University of Singapore)

 

1473 - Space-Time Video Super-Resolution using Temporal Profiles

Zeyu Xiao (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Xueyang Fu (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

 

1493 - Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions

Yu-Siang Huang (Academia Sinica)*; Yi-Hsuan Yang (Academia Sinica)

 

1541 - MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Devamanyu Hazarika (NUS, Singapore)*; Roger Zimmermann (NUS); Soujanya Poria (Singapore University of Technology and Design)

 

1549 - Instability of Successive Deep Image Compression

Jun-Hyuk Kim (Yonsei University); Soobeom Jang (Yonsei University); Jun-Ho Choi (Yonsei University); Jong-Seok Lee ("Yonsei University, Korea")*

 

1570 - DeepFacePencil: Creating Face Images from Freehand Sketches

Yuhang Li (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China)*; Binxin Yang (University of Science and Technology of China); Zihan Chen (University of Science and Technology of China); Zhihua Cheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

 

1576 - ALANET: Adaptive Latent Attention Network for Joint Video Deblurring and Interpolation

Akash Gupta (University of California, Riverside)*; Abhishek Aich (University of California, Riverside); Amit K. Roy-Chowdhury (University of California, Riverside)

 

1595 - CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis

Kaicheng Yang (Hebei University Of Science and Technology); Hua Xu (State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China)*; kai gao (Hebei University Of Science and Technology)

 

1598 - Single-Shot Two-Pronged Detector with Rectified IoU Loss

Keyang Wang (chongqing university); Lei Zhang (Chongqing University)*

 

1612 - Object-level Attention for Aesthetic Rating Distribution Prediction

Jingwen Hou (Nanyang Technological University)*; Sheng Yang (Nanyang Technological University); Weisi Lin (Nanyang Technological University, Singapore)

 

1633 - Not made for each other - Audio-Visual Dissonance-based Deepfake Detection and Localization

Komal Chugh (Indian Institute of Technology Ropar); Parul Gupta (Indian Institute of Technology Ropar); Abhinav Dhall (Monash University)*; Ramanathan Subramanian (Indian Institute of Technology Ropar)

 

1656 - Make your favorite music curative: music style transfer for anxiety reduction

Zhejing Hu (The Hong Kong Polytechnic University); Yan Liu (The Hong Kong Polytechnic University)*; Gong Chen (The Hong Kong Polytechnic University); Sheng-hua Zhong (Shenzhen University); Aiwei Zhang (St. Paul’s Co-educational College)

 

1685 - Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network

Kai Cheng (Huaqiao University); Xin Liu (Huaqiao University)*; Yiu-ming CHEUNG (Hong Kong Baptist University); Rui Wang (Huaqiao University); Xing Xu (University of Electronic Science and Technology of China); Bineng Zhong (Huaqiao University)

 

1702 - Concept Drift Detection for Multivariate Data Streams and Temporal Segmentation of Daylong Egocentric Videos

Pravin Nagar (IIIT Delhi)*; Mansi Khemka (Columbia University); Chetan Arora (Indian Institute of Technology Delhi)

 

1708 - Dynamic Context-guided Capsule Network for Multimodal Machine Translation

Huan Lin (Xiamen University)*; Fandong Meng (Tencent WeChat AI - Pattern Recognition Center Tencent Inc.); Jinsong Su (Xiamen University); Yongjing Yin (Xiamen University); Zhengyuan Yang (University of Rochester); Yubin Ge (University of Illinois at Urbana-Champaign); Jie Zhou (Tencent); Jiebo Luo (U. Rochester)

 

1710 - DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices

Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Yihao Huang (East China Normal University); Qing Guo (Nanyang Technological University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)

 

1717 - RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

Pengfei Chen (Xidian University / China University of Mining and Technology); Leida Li (Xidian University)*; Lei Ma (Hangzhou Multi-Color Optoelctronics Co., Ltd.); Jinjian Wu (Xidian University); Guangming Shi (Xidian University)

 

1719 - Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification

BOQIANG XU (University of Chinese Academy of Sciences;Institute of Automation,Chinese Academy of Sciences)*; Lingxiao He (AI Research of JD.com); Xingyu Liao (AI Research of JD.com); Wu Liu (AI Research of JD.com); Zhenan Sun (Chinese of Academy of Sciences); Tao Mei (AI Research of JD.com)

 

1722 - PopMAG: Pop Music Accompaniment Generation

Yi Ren (Zhejiang University)*; Jinzheng He (Zhejiang University); Xu Tan (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Zhou Zhao (Zhejiang University); Tie-Yan Liu (Microsoft)

 

1729 - PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation

Shaotian Yan (Zhejiang University)*; Chen Shen (Alibaba Group); Zhongming Jin (Alibaba Group); Jianqiang Huang (Alibaba Group); Rongxin Jiang (Zhejiang University); Yaowu Chen (Zhejiang University); Xian-Sheng Hua (Alibaba Group)

 

1761 - Differentiable Manifold Reconstruction for Point Cloud Denoising

Shitong Luo (Peking University)*; Wei Hu (Peking University)

 

1775 - Discriminative Spatial Feature Learning for Person Re-Identification

Peixi Peng (Peking University)*; Yonghong Tian (Peking University); Yangru Huang (Beijing University); Xiangqian Wang (Huawei); Huilong An (AI Application Research Center)

 

1781 - FakePolisher: Making DeepFakes More Detection-Evasive by Shallow Reconstruction

Yihao Huang (East China Normal University)*; Felix Juefei-Xu (Alibaba Group); Run Wang (Nanyang Technological University); Qing Guo (Nanyang Technological University); Lei Ma (Kyushu University); Xiaofei Xie (Nanyang Technological University); Jianwen Li (East China Normal University); Weikai Miao (East China Normal University); Yang Liu (Nanyang Technology University, Singapore); Geguang Pu (East China Normal University)

 

1784 - SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Haoran Lv (Shanghai Jiao Tong University)*; Qin Yang (Shanghai Jiao Tong University); Chenglin Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University)

 

1800 - AdaHGNN: Adaptive Hypergraph Neural Networks for Multi-Label Image Classification

Xiangping Wu (Harbin Institute of Technology, Shenzhen); Qingcai Chen ( Harbin Institute of Technology, Shenzhen)*; Wei Li (Harbin Institute of Technology, Shenzhen); Yulun Xiao (Harbin Institute of Technology, Shenzhen); Baotian Hu (University of Massachusetts)

 

1828 - Reinforced Similarity Learning: Siamese Relation Networks for Robust Object Tracking

Dawei Zhang (Zhejiang Normal University)*; Zhonglong Zheng (Zhejiang Normal University); Minglu Li (Zhejiang Normal University); Xiaowei He (Zhejiang Normal University); Tianxiang Wang (Zhejiang Normal University); Liyuan Chen (Zhejiang Normal University); Riheng Jia (Zhejiang Normal University); Feilong Lin (Zhejiang Normal University)

 

1832 - AffectI: A Game for Diverse, Reliable, and Efficient Affective Image Annotation

xingkun zuo (University of Yamanashi); Jiyi Li (University of Yamanashi / RIKEN AIP); qili zhou (hangzhou dianzi university); jianjun li (HangZhou Dianzi University); Xiaoyang mao (University of Yamanashi)*

 

1837 - Cognitive Representation Learning of Self-Media Online Article Quality

Yiru Wang (Tencent Inc.; Tsinghua University)*; Shen Huang (Tencent Inc.); Gongfu Li (Tencent Inc.); Qiang Deng (Tencent Inc.); Dongliang Liao (Data Quality Team, WeChat, Tencent Inc., China); Pengda Si (Tsinghua University); Yujiu Yang (Tsinghua University); Jin Xu (Tencent Inc.)

 

1852 - Describing Subjective Experiment Consistency by p-value qq-plot

Jakub Nawała (AGH University of Science and Technology)*; Lucjan Janowski (AGH University of Science and Technology); Bogdan Ćmiel (); Krzysztof Rusek (AGH University of Science and Technology)

 

1859 - Deep Structural Contour Detection

Ruoxi Deng (Central South University)*; Shengjun Liu (Central South University)

 

1874 - Multimodal Multi-Task Financial Risk Forecasting

Ramit Sawhney (Netaji Subhas Institute of Technology)*; Puneet Mathur (University of Maryland, College Park); Ayush Mangal (IIT Roorkee); Piyush Khanna (Delhi Technological University); Rajiv Ratn Shah ("Indraprastha Institute of Information Technology, Delhi"); Roger Zimmermann (NUS)

 

1893 - Cross-modal Non-linear Guided Attention and TemporalCoherence in Multi-modal Deep Video Models

Saurabh Sahu (); Palash Goyal (Samsung Research); Shalini Ghosh (Samsung Research)*; Chul Lee (Samsung Research America)

 

1946 - Multi-modal Cooking Workflow Construction for Food Recipes

Liang-Ming Pan (National University of Singapore)*; Jingjing Chen (Fudan University); Jianlong Wu (Fudan University); Shaoteng Liu (Xi'an Jiaotong University); Chong-Wah Ngo (City University of Hong Kong); Min-Yen Kan (National University of Singapore); Yu-Gang Jiang (Fudan University); Tat-Seng Chua (National university of Singapore)

 

1950 - Distributed Multi-agent Video Fast-forwarding

Shuyue Lan (Northwestern University)*; Zhilu Wang (Northwestern University); Amit K. Roy-Chowdhury (University of California, Riverside); Ermin Wei (); Zhu Qi (Northwestern University)

 

1988 - IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning

Zhenhuan Liu (Institute of Computing Technology, Chinese Academy of Sciences); liang li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shaofei Cai (Institute of Computing Technology, Chinese Academy of Sciences); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Qingming Huang (University of Chinese Academy of Sciences)

 

1994 - LIGHTEN: Learning Interactions with Graph and Heirarchical TEmporal Networks for HOI in videos

Sai Praneeth Reddy Sunkesula (Indian Institute of Technology, Bombay)*; Rishabh Dabral (IIT Bombay); Ganesh Ramakrishnan (IIT Bombay)

 

2014 - BS-MCVR: Binary-sensing based Mobile-cloud Visual Recognition

Hongyi Zheng (The Hong Kong Polytechnic University); Lei Zhang ("Hong Kong Polytechnic University, Hong Kong, China")*

 

2017 - Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition

Yuqian Fu (Fudan University)*; Yanwei Fu (Fudan University); junke wang (Fudan University); Li Zhang (University of Oxford); Xing Zhang (Fudan University); Yu-Gang Jiang (Fudan University)

 

2030 - Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

Jingjing Li (University of Electronic Science and Technology of China)*; Mengmeng Jing (University of Electronic Science and Technology of China); Lei Zhu (Shandong Normal Unversity); Zhengming Ding (Indiana University-Purdue University Indianapolis); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)

 

2032 - When Bitstream Prior Meets Deep Prior: Compressed Video Super-resolution with Learning from Decoding

Peilin Chen (City University of Hong Kong)*; Wenhan Yang (City University of Hong Kong); Long Sun (Huawei); Shiqi Wang (CityU)

 

2035 - Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach

Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Deng Cai (The Chinese University of Hong Kong); Huayang Li (Tencent AI Lab); Xavier Alameda-Pineda (INRIA); Nicu Sebe (University of Trento); Bruno Lepri (FBK, Trento, Italy)

 

2052 - Increasing Video Perceptual Quality with GANs and Semantic Coding

Leonardo Galteri (University of Florence); Marco Bertini (University of Florence)*; Lorenzo Seidenari (University of Florence); Tiberio Uricchio (University of Florence); Alberto Del Bimbo (University of Florence)

 

2053 - Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China); Cong Liang (University of Science and Technology of China)

 

2071 - Fine-Grained Similarity Measurement between Educational Videos and Exercises

Xin Wang (University of Science and Technology of China); Wei Huang (University of Science and Technology of China); Qi Liu (" University of Science and Technology of China, China")*; Yu Yin (University of Science and Technology of China); Zhenya Huang (University of Science and Technology of China ); Le Wu (Hefei University of Technology); Jianhui Ma (University of Science and Technology of China); Xue Wang (Nankai University)

 

2073 - One-shot Text Field labeling using Attention and Belief Propagation for Structure information extraction

Mengli Cheng (Alibaba Group)*; Minghui Qiu (Alibaba)

 

2081 - GRAD: Learning for Overhead-aware Adaptive Video Streaming with Scalable Video Coding

Yunzhuo Liu (Shanghai Jiao Tong University); Bo Jiang (Shanghai Jiao Tong University)*; Tian Guo (Worcester Polytechnic Institute); Ramesh K. Sitaraman (UMass Amherst & Akamai Technologies); Don Towsley (University of Massachusetts Amherst); Xinbing Wang (Shanghai Jiao Tong University)

 

2088 - Down to the Last Detail: Virtual Try-on with Fine-grained Details

Jiahang Wang (Huazhong University of Science and Technology)*; Tong Sha (Beihang University); Wei Zhang (JD AI Research); Zhoujun Li (Beihang University); Tao Mei (AI Research of JD.com)

 

2151 - Reduce the Influence of Stability in Content Delivery Network via Learning-Based Caching Algorithm

Gang Yan (Binghamton University-SUNY); Jian Li (Binghamton University-SUNY )*

 

2158 - Temporal Denoising Mask Synthesis Network for Learning Blind Video Temporal Consistency

Yifeng Zhou (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Fumin Shen (UESTC); Lianli Gao (The University of Electronic Science and Technology of China); Huimin Lu (Kyushu Institute of Technology); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

 

2174 - INCLUDE: A Large Scale Dataset for Indian Sign Language Recognition

Advaith Sridhar (IIT Madras)*; Rohith Gandhi G (IIT Madras); Pratyush Kumar (IIT Madras); Mitesh Khapra (IIT Madras)

 

2205 - A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild

Prajwal K R (International Institute of Information Technology, Hyderabad)*; Rudrabha Mukhopadhyay (IIIT Hyderabad); Vinay Namboodiri (University of Bath); C.V. Jawahar (IIIT-Hyderabad)

 

2237 - Efficient adaptation of neural network filter for video compression

Yat-Hong Lam (Nokia Technologies)*; Alireza Zare (Nokia Technologies); Francesco Cricri (Nokia Technologies); Jani Lainema (Nokia); Miska Hannuksela (Nokia Technologies)

 

2246 - An Analysis of Delay in Live 360° Video Streaming Systems

Jun Yi (Georgia State University)*; Md Reazul Islam (Georgia State University); Shivang Aggarwal (University at Buffalo, The State University of New York); Dimitrios Koutsonikolas (SUNY Buffalo); Y. Charlie Hu (Purdue University); Zhisheng Yan (Georgia State University)

 

2249 - Adaptive Temporal Triplet-loss for Cross-modal Embedding Learning

David Semedo (Universidade NOVA de Lisboa)*; Joao Magalhaes (Universidade NOVA Lisboa)

 

2257 - SonoSpace: Visual Feedback of Timbre with Unsupervised Learning

Naoki Kimura (The University of Tokyo)*; Keisuke Shiro (The University of Tokyo); Yota Takakura (Innoqua Inc.); Hiromi Nakamura (The University of Tokyo); Jun Rekimoto (The Univertsity of Tokyo)

 

2264 - Amora: Black-box Adversarial Morphing Attack

Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Qing Guo (Nanyang Technological University); Yihao Huang (East China Normal University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)

 

2323 - Single Image Deraining via Scale-space Invariant Attention Neural Network

Bo Pang (Harbin Institute of Technology); Deming Zhai (Harbin Institute of Technolgy); Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology)*

 

2342 - Concept-based Explanation for Fine-grained Images and Its Application in Infectious Keratitis Classification

Zhengqing Fang (Zhejiang University)*; Kun Kuang (Zhejiang University); Yuxiao Lin (Zhejiang University); Fei Wu (Zhejiang University); Yufeng Yao (Zhejiang University)

 

2448 - Visual Relation of Interest Detection

Fan Yu (Nanjing University); Haonan Wang (Nanjing University); Tongwei Ren (Nanjing University)*; Jinhui Tang (Nanjing University of Science and Technology); Gangshan Wu (Nanjing University)

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章