Skip to main content

Showing 251–300 of 346 results for author: Ji, R

  1. arXiv:2009.04247  [pdf, other

    cs.CV

    Binarized Neural Architecture Search for Efficient Object Recognition

    Authors: Hanlin Chen, Li'an Zhuo, Baochang Zhang, Xiawu Zheng, Jianzhuang Liu, Rongrong Ji, David Doermann, Guodong Guo

    Abstract: Traditional neural architecture search (NAS) has a significant impact in computer vision by automatically designing network architectures for various tasks. In this paper, binarized neural architecture search (BNAS), with a search space of binarized convolutions, is introduced to produce extremely compressed models to reduce huge computational cost on embedded devices for edge computing. The BNAS… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1911.10862

  2. arXiv:2008.11434  [pdf, other

    eess.IV cs.CV

    Better Than Reference In Low Light Image Enhancement: Conditional Re-Enhancement Networks

    Authors: Yu Zhang, Xiaoguang Di, Bin Zhang, Ruihang Ji, Chunhui Wang

    Abstract: Low light images suffer from severe noise, low brightness, low contrast, etc. In previous researches, many image enhancement methods have been proposed, but few methods can deal with these problems simultaneously. In this paper, to solve these problems simultaneously, we propose a low light image enhancement method that can combined with supervised learning and previous HSV (Hue, Saturation, Value… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: 10 pages, 8 figures

  3. arXiv:2008.00698  [pdf, other

    cs.CV cs.LG

    Anti-Bandit Neural Architecture Search for Model Defense

    Authors: Hanlin Chen, Baochang Zhang, Song Xue, Xuan Gong, Hong Liu, Rongrong Ji, David Doermann

    Abstract: Deep convolutional neural networks (DCNNs) have dominated as the best performers in machine learning, but can be challenged by adversarial attacks. In this paper, we defend against adversarial attacks using neural architecture search (NAS) which is based on a comprehensive search of denoising blocks, weight-free operations, Gabor filters and convolutions. The resulting anti-bandit NAS (ABanditNAS)… ▽ More

    Submitted 5 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  4. arXiv:2007.13264  [pdf, other

    cs.CV cs.IR cs.LG

    Learning Task-oriented Disentangled Representations for Unsupervised Domain Adaptation

    Authors: Pingyang Dai, Peixian Chen, Qiong Wu, Xiaopeng Hong, Qixiang Ye, Qi Tian, Rongrong Ji

    Abstract: Unsupervised domain adaptation (UDA) aims to address the domain-shift problem between a labeled source domain and an unlabeled target domain. Many efforts have been made to address the mismatch between the distributions of training and testing data, but unfortunately, they ignore the task-oriented information across domains and are inflexible to perform well in complicated open-set scenarios. Many… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: 9 pages, 6 figures

  5. arXiv:2007.13249  [pdf, other

    cs.IR cs.CV cs.LG

    Dual Distribution Alignment Network for Generalizable Person Re-Identification

    Authors: Peixian Chen, Pingyang Dai, Jianzhuang Liu, Feng Zheng, Qi Tian, Rongrong Ji

    Abstract: Domain generalization (DG) serves as a promising solution to handle person Re-Identification (Re-ID), which trains the model using labels from the source domain alone, and then directly adopts the trained model to the target domain without model updating. However, existing DG approaches are usually disturbed by serious domain variations due to significant dataset variations. Subsequently, DG highl… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: 8 pages, 3 figures

  6. arXiv:2007.01546  [pdf, other

    cs.CV

    Multiple Expert Brainstorming for Domain Adaptive Person Re-identification

    Authors: Yunpeng Zhai, Qixiang Ye, Shijian Lu, Mengxi Jia, Rongrong Ji, Yonghong Tian

    Abstract: Often the best performing deep neural models are ensembles of multiple base-level networks, nevertheless, ensemble learning with respect to domain adaptive person re-ID remains unexplored. In this paper, we propose a multiple expert brainstorming network (MEB-Net) for domain adaptive person re-ID, opening up a promising direction about model ensemble problem under unsupervised conditions. MEB-Net… ▽ More

    Submitted 13 July, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: Accepted by ECCV'20

  7. arXiv:2006.09142  [pdf, other

    cs.CV

    Cogradient Descent for Bilinear Optimization

    Authors: Li'an Zhuo, Baochang Zhang, Linlin Yang, Hanlin Chen, Qixiang Ye, David Doermann, Guodong Guo, Rongrong Ji

    Abstract: Conventional learning methods simplify the bilinear model by regarding two intrinsically coupled factors independently, which degrades the optimization procedure. One reason lies in the insufficient training due to the asynchronous gradient descent, which results in vanishing gradients for the coupled variables. In this paper, we introduce a Cogradient Descent algorithm (CoGD) to address the bilin… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 9 pages, 6 figures

  8. arXiv:2006.06419  [pdf, other

    cs.CR

    DEPOSafe: Demystifying the Fake Deposit Vulnerability in Ethereum Smart Contracts

    Authors: Ru Ji, Ningyu He, Lei Wu, Haoyu Wang, Guangdong Bai, Yao Guo

    Abstract: Cryptocurrency has seen an explosive growth in recent years, thanks to the evolvement of blockchain technology and its economic ecosystem. Besides Bitcoin, thousands of cryptocurrencies have been distributed on blockchains, while hundreds of cryptocurrency exchanges are emerging to facilitate the trading of digital assets. At the same time, it also attracts the attentions of attackers. Fake deposi… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 10 pages, 6 figures

  9. arXiv:2005.09917  [pdf, other

    cs.CV

    Rethinking Performance Estimation in Neural Architecture Search

    Authors: Xiawu Zheng, Rongrong Ji, Qiang Wang, Qixiang Ye, Zhenguo Li, Yonghong Tian, Qi Tian

    Abstract: Neural architecture search (NAS) remains a challenging problem, which is attributed to the indispensable and time-consuming component of performance estimation (PE). In this paper, we provide a novel yet systematic rethinking of PE in a resource constrained regime, termed budgeted PE (BPE), which precisely and effectively estimates the performance of an architecture sampled from an architecture sp… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

  10. arXiv:2005.03837  [pdf, other

    cs.CV

    Projection & Probability-Driven Black-Box Attack

    Authors: Jie Li, Rongrong Ji, Hong Liu, Jianzhuang Liu, Bineng Zhong, Cheng Deng, Qi Tian

    Abstract: Generating adversarial examples in a black-box setting retains a significant challenge with vast practical application prospects. In particular, existing black-box attacks suffer from the need for excessive queries, as it is non-trivial to find an appropriate direction to optimize in the high-dimensional space. In this paper, we propose Projection & Probability-driven Black-box Attack (PPBA) to ta… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: CVPR2020

  11. arXiv:2004.12311  [pdf, ps, other

    cs.LG cs.AI

    Filter Grafting for Deep Neural Networks: Reason, Method, and Cultivation

    Authors: Hao Cheng, Fanxu Meng, Ke Li, Yuting Gao, Guangming Lu, Xing Sun, Rongrong Ji

    Abstract: Filter is the key component in modern convolutional neural networks (CNNs). However, since CNNs are usually over-parameterized, a pre-trained network always contain some invalid (unimportant) filters. These filters have relatively small $l_{1}$ norm and contribute little to the output (\textbf{Reason}). While filter pruning removes these invalid filters for efficiency consideration, we tend to rea… ▽ More

    Submitted 14 January, 2021; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2001.05868

  12. arXiv:2004.08787  [pdf, other

    cs.CV

    AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-identification

    Authors: Yunpeng Zhai, Shijian Lu, Qixiang Ye, Xuebo Shan, Jie Chen, Rongrong Ji, Yonghong Tian

    Abstract: Domain adaptive person re-identification (re-ID) is a challenging task, especially when person identities in target domains are unknown. Existing methods attempt to address this challenge by transferring image styles or aligning feature distributions across domains, whereas the rich unlabeled samples in target domains are not sufficiently exploited. This paper presents a novel augmented discrimina… ▽ More

    Submitted 23 April, 2020; v1 submitted 19 April, 2020; originally announced April 2020.

    Comments: Accepted by CVPR'20

  13. arXiv:2004.08746  [pdf, other

    cs.SE

    Interactive Patch Filtering as Debugging Aid

    Authors: Jingjing Liang, Ruyi Ji, Jiajun Jiang, Yiling Lou, Yingfei Xiong, Gang Huang

    Abstract: It is widely recognized that program repair tools need to have a high precision to be useful, i.e., the generated patches need to have a high probability to be correct. However, it is fundamentally difficult to ensure the correctness of the patches, and many tools compromise other aspects of repair performance such as recall for an acceptable precision. In this paper we ask a question: can a rep… ▽ More

    Submitted 27 April, 2020; v1 submitted 18 April, 2020; originally announced April 2020.

  14. arXiv:2003.13268  [pdf, other

    cs.CV

    Architecture Disentanglement for Deep Neural Networks

    Authors: Jie Hu, Liujuan Cao, Qixiang Ye, Tong Tong, ShengChuan Zhang, Ke Li, Feiyue Huang, Rongrong Ji, Ling Shao

    Abstract: Understanding the inner workings of deep neural networks (DNNs) is essential to provide trustworthy artificial intelligence techniques for practical applications. Existing studies typically involve linking semantic concepts to units or layers of DNNs, but fail to explain the inference process. In this paper, we introduce neural architecture disentanglement (NAD) to fill the gap. Specifically, NAD… ▽ More

    Submitted 23 March, 2021; v1 submitted 30 March, 2020; originally announced March 2020.

  15. arXiv:2003.11865  [pdf

    physics.app-ph

    A compact flat solar still with high performance

    Authors: Guilong Peng, Swellam W. Sharshir, Rencai Ji, Zhixiang Hu, Jianqiang Ma, A. E. Kabeel, Huan Liu, Jianfeng Zang, Nuo Yang

    Abstract: Solar still is a convenient off-grid device for desalination, which can provide fresh water for families, ships, islands and so on. The conventional inclined solar still (ISS) suffers from low efficiency and low productivity. To improve the performance of solar still, a flat solar still (FSS) is proposed, which has a working principle similar to the solar cell. The condensate water in FSS is colle… ▽ More

    Submitted 18 February, 2021; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: 34

  16. arXiv:2003.11228  [pdf, other

    cs.CV

    ASFD: Automatic and Scalable Face Detector

    Authors: Bin Zhang, Jian Li, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yili Xia, Wenjiang Pei, Rongrong Ji

    Abstract: In this paper, we propose a novel Automatic and Scalable Face Detector (ASFD), which is based on a combination of neural architecture search techniques as well as a new loss design. First, we propose an automatic feature enhance module named Auto-FEM by improved differential architecture search, which allows efficient multi-scale feature fusion and context enhancement. Second, we use Distance-base… ▽ More

    Submitted 31 March, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: Ranked No.1 on WIDER Face (http://shuoyang1213.me/WIDERFACE/WiderFace_Results.html)

  17. arXiv:2003.08813  [pdf, other

    cs.CV

    Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

    Authors: Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji

    Abstract: Referring expression comprehension (REC) and segmentation (RES) are two highly-related tasks, which both aim at identifying the referent according to a natural language expression. In this paper, we propose a novel Multi-task Collaborative Network (MCN) to achieve a joint learning of REC and RES for the first time. In MCN, RES can help REC to achieve better language-vision alignment, while REC can… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: accpected by CVPR2020 (oral)

  18. arXiv:2003.07314  [pdf, other

    cs.CR cs.SE

    Characterizing Cryptocurrency Exchange Scams

    Authors: Pengcheng Xia, Bowen Zhang, Ru Ji, Bingyu Gao, Lei Wu, Xiapu Luo, Haoyu Wang, Guoai Xu

    Abstract: As the indispensable trading platforms of the ecosystem, hundreds of cryptocurrency exchanges are emerging to facilitate the trading of digital assets. While, it also attracts the attentions of attackers. A number of scam attacks were reported targeting cryptocurrency exchanges, leading to a huge mount of financial loss. However, no previous work in our research community has systematically studie… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 15 pages,18 figures

  19. arXiv:2003.06761  [pdf, ps, other

    cs.CV

    Siamese Box Adaptive Network for Visual Tracking

    Authors: Zedu Chen, Bineng Zhong, Guorong Li, Shengping Zhang, Rongrong Ji

    Abstract: Most of the existing trackers usually rely on either a multi-scale searching scheme or pre-defined anchor boxes to accurately estimate the scale and aspect ratio of a target. Unfortunately, they typically call for tedious and heuristic configurations. To address this issue, we propose a simple yet effective visual tracking framework (named Siamese Box Adaptive Network, SiamBAN) by exploiting the e… ▽ More

    Submitted 22 April, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2020

  20. arXiv:2003.06070  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Electric-field-induced modulation of thermal conductivity in poly(vinylidene fluoride)

    Authors: Shichen Deng, Jiale Yuan, Yuli Lin, Xiaoxiang Yu, Dengke Ma, Yuwen Huang, Rencai Ji, Guangzu Zhang, Nuo Yang

    Abstract: Phonon engineering focuses on heat transport modulation on atomic-scale. Different from reported methods, it is shown that electric field can also modulate heat transport in ferroelectric polymers, poly(vinylidene fluoride), by both simulation and measurement. Interestingly, thermal conductivities of poly(vinylidene fluoride) array can be enhanced by a factor of 3.25 along the polarization directi… ▽ More

    Submitted 24 November, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

  21. arXiv:2002.10179  [pdf, other

    cs.CV

    HRank: Filter Pruning using High-Rank Feature Map

    Authors: Mingbao Lin, Rongrong Ji, Yan Wang, Yichen Zhang, Baochang Zhang, Yonghong Tian, Ling Shao

    Abstract: Neural network pruning offers a promising prospect to facilitate deploying deep neural networks on resource-limited devices. However, existing methods are still challenged by the training inefficiency and labor cost in pruning designs, due to missing theoretical guidance of non-salient network components. In this paper, we propose a novel filter pruning method by exploring the High Rank of feature… ▽ More

    Submitted 16 March, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

  22. arXiv:2002.03662  [pdf, other

    cs.CV

    Improving Face Recognition from Hard Samples via Distribution Distillation Loss

    Authors: Yuge Huang, Pengcheng Shen, Ying Tai, Shaoxin Li, Xiaoming Liu, Jilin Li, Feiyue Huang, Rongrong Ji

    Abstract: Large facial variations are the main challenge in face recognition. To this end, previous variation-specific methods make full use of task-related prior to design special network losses, which are typically not general among different tasks and scenarios. In contrast, the existing generic methods focus on improving the feature discriminability to minimize the intra-class distance while maximizing… ▽ More

    Submitted 18 July, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: ECCV2020

  23. arXiv:2001.08565  [pdf, other

    cs.CV

    Channel Pruning via Automatic Structure Search

    Authors: Mingbao Lin, Rongrong Ji, Yuxin Zhang, Baochang Zhang, Yongjian Wu, Yonghong Tian

    Abstract: Channel pruning is among the predominant approaches to compress deep neural networks. To this end, most existing pruning methods focus on selecting channels (filters) by importance/optimization or regularization based on rule-of-thumb designs, which defects in sub-optimal pruning. In this paper, we propose a new channel pruning method based on artificial bee colony algorithm (ABC), dubbed as ABCPr… ▽ More

    Submitted 28 June, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted by IJCAI2020. SOLO copyright holder is IJCAI (International Joint Conferences on Artificial Intelligence)

  24. arXiv:2001.08514  [pdf, other

    cs.CV

    Filter Sketch for Network Pruning

    Authors: Mingbao Lin, Liujuan Cao, Shaojie Li, Qixiang Ye, Yonghong Tian, Jianzhuang Liu, Qi Tian, Rongrong Ji

    Abstract: We propose a novel network pruning approach by information preserving of pre-trained network weights (filters). Network pruning with the information preserving is formulated as a matrix sketch problem, which is efficiently solved by the off-the-shelf Frequent Direction method. Our approach, referred to as FilterSketch, encodes the second-order information of pre-trained weights, which enables the… ▽ More

    Submitted 24 May, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS)

  25. arXiv:2001.05868  [pdf, ps, other

    cs.CV cs.LG

    Filter Grafting for Deep Neural Networks

    Authors: Fanxu Meng, Hao Cheng, Ke Li, Zhixin Xu, Rongrong Ji, Xing Sun, Gaungming Lu

    Abstract: This paper proposes a new learning paradigm called filter grafting, which aims to improve the representation capability of Deep Neural Networks (DNNs). The motivation is that DNNs have unimportant (invalid) filters (e.g., l1 norm close to 0). These filters limit the potential of DNNs since they are identified as having little effect on the network. While filter pruning removes these invalid filter… ▽ More

    Submitted 26 February, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: Accepted by CVPR2020

  26. arXiv:1912.09622  [pdf, other

    cs.CV

    Learning Semantic Neural Tree for Human Parsing

    Authors: Ruyi Ji, Dawei Du, Libo Zhang, Longyin Wen, Yanjun Wu, Chen Zhao, Feiyue Huang, Siwei Lyu

    Abstract: The majority of existing human parsing methods formulate the task as semantic segmentation, which regard each semantic category equally and fail to exploit the intrinsic physiological structure of human body, resulting in inaccurate results. In this paper, we design a novel semantic neural tree for human parsing, which uses a tree architecture to encode physiological structure of human body, and d… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

  27. arXiv:1912.03478  [pdf, other

    cs.CV

    A Real-time Global Inference Network for One-stage Referring Expression Comprehension

    Authors: Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-wen Lin, Qi Tian

    Abstract: Referring Expression Comprehension (REC) is an emerging research spot in computer vision, which refers to detecting the target region in an image given an text description. Most existing REC methods follow a multi-stage pipeline, which are computationally expensive and greatly limit the application of REC. In this paper, we propose a one-stage model towards real-time REC, termed Real-time Global I… ▽ More

    Submitted 7 December, 2019; originally announced December 2019.

  28. arXiv:1912.01349  [pdf, other

    cs.CV

    Asymmetric Co-Teaching for Unsupervised Cross Domain Person Re-Identification

    Authors: Fengxiang Yang, Ke Li, Zhun Zhong, Zhiming Luo, Xing Sun, Hao Cheng, Xiaowei Guo, Feiyue Huang, Rongrong Ji, Shaozi Li

    Abstract: Person re-identification (re-ID), is a challenging task due to the high variance within identity samples and imaging conditions. Although recent advances in deep learning have achieved remarkable accuracy in settled scenes, i.e., source domain, few works can generalize well on the unseen target domain. One popular solution is assigning unlabeled target images with pseudo labels by clustering, and… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted by AAAi 2020

  29. arXiv:1911.11419  [pdf, other

    cs.CV

    Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning

    Authors: Kekai Sheng, Weiming Dong, Menglei Chai, Guohui Wang, Peng Zhou, Feiyue Huang, Bao-Gang Hu, Rongrong Ji, Chongyang Ma

    Abstract: Visual aesthetic assessment has been an active research field for decades. Although latest methods have achieved promising performance on benchmark datasets, they typically rely on a large number of manual annotations including both aesthetic labels and related image attributes. In this paper, we revisit the problem of image aesthetic assessment from the self-supervised feature learning perspectiv… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: AAAI Conference on Artificial Intelligence, 2020, accepted

    Journal ref: Proceedings of AAAI Conference on Articial Intelligence 2020

  30. arXiv:1911.10862  [pdf, other

    cs.CV

    Binarized Neural Architecture Search

    Authors: Hanlin Chen, Li'an Zhuo, Baochang Zhang, Xiawu Zheng, Jianzhuang Liu, David Doermann, Rongrong Ji

    Abstract: Neural architecture search (NAS) can have a significant impact in computer vision by automatically designing optimal neural network architectures for various tasks. A variant, binarized neural architecture search (BNAS), with a search space of binarized convolutions, can produce extremely compressed models. Unfortunately, this area remains largely unexplored. BNAS is more challenging than NAS due… ▽ More

    Submitted 11 February, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

  31. arXiv:1911.04127  [pdf, other

    cs.CV

    Fast Learning of Temporal Action Proposal via Dense Boundary Generator

    Authors: Chuming Lin, Jian Li, Yabiao Wang, Ying Tai, Donghao Luo, Zhipeng Cui, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

    Abstract: Generating temporal action proposals remains a very challenging problem, where the main issue lies in predicting precise temporal proposal boundaries and reliable action confidence in long and untrimmed real-world videos. In this paper, we propose an efficient and unified framework to generate temporal action proposals named Dense Boundary Generator (DBG), which draws inspiration from boundary-sen… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020. Ranked No. 1 on ActivityNet Challenge 2019 on Temporal Action Proposals (http://activity-net.org/challenges/2019/evaluation.html)

  32. arXiv:1910.14184  [pdf, other

    cs.CV

    Beyond Universal Person Re-ID Attack

    Authors: Wenjie Ding, Xing Wei, Rongrong Ji, Xiaopeng Hong, Qi Tian, Yihong Gong

    Abstract: Deep learning-based person re-identification (Re-ID) has made great progress and achieved high performance recently. In this paper, we make the first attempt to examine the vulnerability of current person Re-ID models against a dangerous attack method, \ie, the universal adversarial perturbation (UAP) attack, which has been shown to fool classification models with a little overhead. We propose a \… ▽ More

    Submitted 13 December, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

  33. arXiv:1910.10853  [pdf, other

    cs.CV

    Circulant Binary Convolutional Networks: Enhancing the Performance of 1-bit DCNNs with Circulant Back Propagation

    Authors: Chunlei Liu, Wenrui Ding, Xin Xia, Baochang Zhang, Jiaxin Gu, Jianzhuang Liu, Rongrong Ji, David Doermann

    Abstract: The rapidly decreasing computation and memory cost has recently driven the success of many applications in the field of deep learning. Practical applications of deep learning in resource-limited hardware, such as embedded devices and smart phones, however, remain challenging. For binary convolutional networks, the reason lies in the degraded representation caused by binarizing full-precision filte… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: Published in CVPR2019

    Journal ref: ]Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 2691-2699

  34. arXiv:1910.09182  [pdf, other

    cs.CV cs.IR

    Hadamard Codebook Based Deep Hashing

    Authors: Shen Chen, Liujuan Cao, Mingbao Lin, Yan Wang, Xiaoshuai Sun, Chenglin Wu, Jingfei Qiu, Rongrong Ji

    Abstract: As an approximate nearest neighbor search technique, hashing has been widely applied in large-scale image retrieval due to its excellent efficiency. Most supervised deep hashing methods have similar loss designs with embedding learning, while quantizing the continuous high-dim feature into compact binary space. We argue that the existing deep hashing schemes are defective in two issues that seriou… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: 8 pages, 7 figures, conference

  35. arXiv:1910.03853  [pdf, ps, other

    cs.CV

    Semantic-aware Image Deblurring

    Authors: Fuhai Chen, Rongrong Ji, Chengpeng Dai, Xiaoshuai Sun, Chia-Wen Lin, Jiayi Ji, Baochang Zhang, Feiyue Huang, Liujuan Cao

    Abstract: Image deblurring has achieved exciting progress in recent years. However, traditional methods fail to deblur severely blurred images, where semantic contents appears ambiguously. In this paper, we conduct image deblurring guided by the semantic contents inferred from image captioning. Specially, we propose a novel Structured-Spatial Semantic Embedding model for image deblurring (termed S3E-Deblur)… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  36. arXiv:1909.11378  [pdf, other

    cs.CV

    Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

    Authors: Ruyi Ji, Longyin Wen, Libo Zhang, Dawei Du, Yanjun Wu, Chen Zhao, Xianglong Liu, Feiyue Huang

    Abstract: Fine-grained visual categorization (FGVC) is an important but challenging task due to high intra-class variances and low inter-class variances caused by deformation, occlusion, illumination, etc. An attention convolutional binary neural tree architecture is presented to address those problems for weakly supervised FGVC. Specifically, we incorporate convolutional operations along edges of the tree… ▽ More

    Submitted 14 March, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: accepted by CVPR 2020

  37. arXiv:1909.04366  [pdf, other

    cs.CV

    Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection

    Authors: Yingyue Xu, Dan Xu, Xiaopeng Hong, Wanli Ouyang, Rongrong Ji, Min Xu, Guoying Zhao

    Abstract: Recent saliency models extensively explore to incorporate multi-scale contextual information from Convolutional Neural Networks (CNNs). Besides direct fusion strategies, many approaches introduce message-passing to enhance CNN features or predictions. However, the messages are mainly transmitted in two ways, by feature-to-feature passing, and by prediction-to-prediction passing. In this paper, we… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV

  38. arXiv:1909.02466  [pdf, other

    cs.CV cs.LG

    FreeAnchor: Learning to Match Anchors for Visual Object Detection

    Authors: Xiaosong Zhang, Fang Wan, Chang Liu, Rongrong Ji, Qixiang Ye

    Abstract: Modern CNN-based object detectors assign anchors for ground-truth objects under the restriction of object-anchor Intersection-over-Unit (IoU). In this study, we propose a learning-to-match approach to break IoU restriction, allowing objects to match anchors in a flexible manner. Our approach, referred to as FreeAnchor, updates hand-crafted anchor assignment to "free" anchor matching by formulating… ▽ More

    Submitted 12 November, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: Accepted by NeurIPS 2019

  39. arXiv:1908.08433  [pdf, other

    cs.CV cs.DB

    Scoot: A Perceptual Metric for Facial Sketches

    Authors: Deng-Ping Fan, ShengChuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng, Bo Ren, Paul L. Rosin, Rongrong Ji

    Abstract: Human visual system has the strong ability to quick assess the perceptual similarity between two facial sketches. However, existing two widely-used facial sketch metrics, e.g., FSIM and SSIM fail to address this perceptual similarity in this field. Recent study in facial modeling area has verified that the inclusion of both structure and texture has a significant positive benefit for face sketch s… ▽ More

    Submitted 4 September, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: Code & dataset:http://mmcheng.net/scoot/, 11 pages, ICCV 2019, First one good evaluation metric for facial sketh that consistent with human judgment. arXiv admin note: text overlap with arXiv:1804.02975

  40. arXiv:1908.06314  [pdf, other

    cs.CV

    Bayesian Optimized 1-Bit CNNs

    Authors: Jiaxin Gu, Junhe Zhao, Xiaolong Jiang, Baochang Zhang, Jianzhuang Liu, Guodong Guo, Rongrong Ji

    Abstract: Deep convolutional neural networks (DCNNs) have dominated the recent developments in computer vision through making various record-breaking models. However, it is still a great challenge to achieve powerful DCNNs in resource-limited environments, such as on embedded devices and smart phones. Researchers have realized that 1-bit CNNs can be one feasible solution to resolve the issue; however, they… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

  41. arXiv:1908.02632  [pdf, other

    cs.CV

    Scene-based Factored Attention for Image Captioning

    Authors: Chen Shen, Rongrong Ji, Fuhai Chen, Xiaoshuai Sun, Xiangming Li

    Abstract: Image captioning has attracted ever-increasing research attention in the multimedia community. To this end, most cutting-edge works rely on an encoder-decoder framework with attention mechanisms, which have achieved remarkable progress. However, such a framework does not consider scene concepts to attend visual information, which leads to sentence bias in caption generation and defects the perform… ▽ More

    Submitted 2 September, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

    Comments: 10 pages

  42. arXiv:1908.02126  [pdf, other

    cs.CV

    Semi-Supervised Adversarial Monocular Depth Estimation

    Authors: Rongrong Ji, Ke Li, Yan Wang, Xiaoshuai Sun, Feng Guo, Xiaowei Guo, Yongjian Wu, Feiyue Huang, Jiebo Luo

    Abstract: In this paper, we address the problem of monocular depth estimation when only a limited number of training image-depth pairs are available. To achieve a high regression accuracy, the state-of-the-art estimation methods rely on CNNs trained with a large number of image-depth pairs, which are prohibitively costly or even infeasible to acquire. Aiming to break the curse of such expensive data collect… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

  43. arXiv:1906.01288  [pdf, other

    cs.LG cs.CV stat.ML

    Information Competing Process for Learning Diversified Representations

    Authors: Jie Hu, Rongrong Ji, ShengChuan Zhang, Xiaoshuai Sun, Qixiang Ye, Chia-Wen Lin, Qi Tian

    Abstract: Learning representations with diversified information remains as an open problem. Towards learning diversified representations, a new approach, termed Information Competing Process (ICP), is proposed in this paper. Aiming to enrich the information carried by feature representations, ICP separates a representation into two parts with different mutual information constraints. The separated parts are… ▽ More

    Submitted 27 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Accept as a NeurIPS 2019 paper

  44. arXiv:1906.01166  [pdf, other

    cs.CV

    Interpretable Neural Network Decoupling

    Authors: Yuchao Li, Rongrong Ji, Shaohui Lin, Baochang Zhang, Chenqian Yan, Yongjian Wu, Feiyue Huang, Ling Shao

    Abstract: The remarkable performance of convolutional neural networks (CNNs) is entangled with their huge number of uninterpretable parameters, which has become the bottleneck limiting the exploitation of their full potential. Towards network interpretation, previous endeavors mainly resort to the single filter analysis, which however ignores the relationship between filters. In this paper, we propose a nov… ▽ More

    Submitted 25 August, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 20 pages, 12 figures

  45. DDPNAS: Efficient Neural Architecture Search via Dynamic Distribution Pruning

    Authors: Xiawu Zheng, Chenyi Yang, Shaokun Zhang, Yan Wang, Baochang Zhang, Yongjian Wu, Yunsheng Wu, Ling Shao, Rongrong Ji

    Abstract: Neural Architecture Search (NAS) has demonstrated state-of-the-art performance on various computer vision tasks. Despite the superior performance achieved, the efficiency and generality of existing methods are highly valued due to their high computational complexity and low generality. In this paper, we propose an efficient and unified NAS framework termed DDPNAS via dynamic distribution pruning,… ▽ More

    Submitted 10 March, 2023; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: A update version of this work. 19 pages

    Journal ref: International Journal of Computer Vision, 1-16 (2023)

  46. arXiv:1905.13382  [pdf, other

    cs.CV cs.AI cs.MM

    Supervised Online Hashing via Similarity Distribution Learning

    Authors: Mingbao Lin, Rongrong Ji, Shen Chen, Feng Zheng, Xiaoshuai Sun, Baochang Zhang, Liujuan Cao, Guodong Guo, Feiyue Huang

    Abstract: Online hashing has attracted extensive research attention when facing streaming data. Most online hashing methods, learning binary codes based on pairwise similarities of training instances, fail to capture the semantic relationship, and suffer from a poor generalization in large-scale applications due to large variations. In this paper, we propose to model the similarity distributions between the… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  47. arXiv:1905.07529  [pdf, other

    cs.LG cs.CV cs.NE

    Multinomial Distribution Learning for Effective Neural Architecture Search

    Authors: Xiawu Zheng, Rongrong Ji, Lang Tang, Baochang Zhang, Jianzhuang Liu, Qi Tian

    Abstract: Architectures obtained by Neural Architecture Search (NAS) have achieved highly competitive performance in various computer vision tasks. However, the prohibitive computation demand of forward-backward propagation in deep neural networks and searching algorithms makes it difficult to apply NAS in practice. In this paper, we propose a Multinomial Distribution Learning for extremely effective NAS,wh… ▽ More

    Submitted 13 August, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

  48. arXiv:1905.04454  [pdf, other

    cs.IR cs.LG

    Hadamard Matrix Guided Online Hashing

    Authors: Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Shen Chen, Qi Tian

    Abstract: Online image hashing has attracted increasing research attention recently, which receives large-scale data in a streaming manner to update the hash functions on-the-fly. Its key challenge lies in the difficulty of balancing the learning timeliness and model accuracy. To this end, most works follow a supervised setting, i.e., using class labels to boost the hashing performance, which defects in two… ▽ More

    Submitted 22 January, 2020; v1 submitted 11 May, 2019; originally announced May 2019.

  49. arXiv:1905.03694  [pdf, other

    cs.CV cs.MM

    Supervised Online Hashing via Hadamard Codebook Learning

    Authors: Mingbao Lin, Rongrong Ji, Hong Liu, Yongjian Liu

    Abstract: In recent years, binary code learning, a.k.a hashing, has received extensive attention in large-scale multimedia retrieval. It aims to encode high-dimensional data points to binary codes, hence the original high-dimensional metric space can be efficiently approximated via Hamming space. However, most existing hashing methods adopted offline batch learning, which is not suitable to handle increment… ▽ More

    Submitted 24 May, 2019; v1 submitted 27 April, 2019; originally announced May 2019.

  50. arXiv:1904.12428  [pdf, other

    cs.CV

    Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning

    Authors: Xinyang Li, Jie Hu, Shengchuan Zhang, Xiaopeng Hong, Qixiang Ye, Chenglin Wu, Rongrong Ji

    Abstract: Unpaired Image-to-Image Translation (UIT) focuses on translating images among different domains by using unpaired data, which has received increasing research focus due to its practical usage. However, existing UIT schemes defect in the need of supervised training, as well as the lack of encoding domain information. In this paper, we propose an Attribute Guided UIT model termed AGUIT to tackle the… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.