Deep residual hashing network for image retrieval
Abstract:
Conventional methods in Content-Based Image Retrieval use hand-crafted visual features as input but sometimes such feature vectors do not preserve the similarity between images. Taking advantage of the improvements in the Convolutional Neural Networks (CNN) area we propose a Deep Residual Hashing Network (DRHN) based in the work of [1] that generates binary hash codes based on the features learned. The base of our model is a residual block which is formed by the following operations: Convolution, Rectified Linear Unit (ReLU), Batch normalization (BN) [2] and Element-wise addition. A Residual Group is the join of n Residual Blocks. The architecture of the proposed model is the next: Input → (Convolution → ReLU → BN) → six residual blocks → Average Pooling Layer (APL) → Hash Layer → Fully Connected Layer with Softmax, where the APL calculates the average of all values in a channel and Hash Layer length is denoted by h. We exploit a previous proposed idea in the field [3]: in a supervised manner the binary codes can be learned adding an extra hidden layer to represent the main features that identifies the classes in a database, hence to generate the binary code of h bits related to an image we binarize the activation of the Hash Layer according to some threshold t. The experimental results outperforms the state-of-the-art hashing algorithms on the CIFAR-10 dataset, using a DRHN with n = 15 and h = 48 we obtained a mean average precision (mAP) of 92.91 in the image retrieval task, previous best known result for the same number of bits was 89.73 [3].
Año de publicación:
2017
Keywords:
- Content-based Image Retrieval
- deep learning
- Computer Vision
- convolutional neural networks
Fuente:

Tipo de documento:
Conference Object
Estado:
Acceso restringido
Áreas de conocimiento:
- Visión por computadora
- Ciencias de la computación
Áreas temáticas:
- Métodos informáticos especiales