REVERSE ATTENTION-BASED RESIDUAL NETWORK FOR SALIENT OBJECT DETECTION

Abstract

Benefiting from the quick development of deep convolutional neural networks, especially fully convolutional neural networks (FCNs), remarkable progresses have been achieved on salient object detection recently. Nevertheless, these FCNs based methods are still challenging to generate high resolution saliency maps, and also not applicable for subsequent applications due to their heavy model weights. In this project propose a compact and efficient deep network with high accuracy for salient object detection. Firstly, we propose two strategies for initial prediction, one is a new designed multi-scale context module, and the other is incorporating hand-crafted saliency priors. Secondly, we employ residual learning to refine it progressively by only learning the residual in each side-output, which can be achieved with few convolutional parameters, therefore leads to high compactness and high efficiency. Finally, we further design a novel top down reverse attention block to guide the above side-output residual learning. Specifically, the current predicted salient regions are used to erase its side output feature, thus the missing object parts and details can be efficiently learned from these un erased regions, which results in more complete detection and high accuracy. Extensive experimental results on seven benchmark datasets demonstrate that the proposed network performs favorably against the state of the art approaches, and shows advantages in simplicity, compactness and efficiency. This project is implemented with MATLAB software.

Let's Talk