An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals

Emmanuel Okafor*, Lambert Schomaker, Marco A. Wiering

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

24 Scopus citations

Abstract

In this paper, we examine a novel data augmentation (DA) method that transforms an image into a new image containing multiple rotated copies of the original. The DA method creates a grid of n × n cells, in which each cell contains a different randomly rotated copy of the image, and introduces a natural background in the newly created image. We investigate the use of deep learning to assess classification performance on the rotation matrix or original dataset, together with colour constancy versions of the datasets. For colour constancy, we use two well-known retinex techniques, the multi-scale retinex and the multi-scale retinex with colour restoration, to enhance both original (ORIG) and rotation matrix (ROT) images. We perform experiments on three datasets containing images of animals; the first, which we collected ourselves, contains aerial images of cows or non-cow backgrounds. To classify the Aerial UAV images, we use a convolutional neural network (CNN) architecture and compare two loss functions (hinge loss and cross-entropy loss). Additionally, we compare the CNN to classical feature-based techniques combined with a k-nearest neighbour classifier or a support vector machine. The best approach is then used to examine the colour constancy DA variants, as well as ORIG and ROT-DA alone, on three datasets (Aerial UAV, Bird-600 and Croatia fish). The results show that rotation matrix data augmentation is very helpful for the Aerial UAV dataset, and that colour constancy data augmentation is helpful for the Bird-600 dataset. Finally, the results show that fine-tuned CNNs significantly outperform CNNs trained from scratch on the Croatia fish and Bird-600 datasets, and obtain very high accuracies on the Aerial UAV and Bird-600 datasets.
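The grid construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `rotate_nearest` and `rotation_matrix_image` are hypothetical, the nearest-neighbour rotation is a simplification chosen to keep the example dependent on NumPy only, and filling uncovered pixels from a same-sized `background` array is an assumption about how the natural background might be introduced.

```python
import numpy as np


def rotate_nearest(image, angle_deg, background):
    """Rotate `image` about its centre using nearest-neighbour sampling.

    Output pixels whose source falls outside the image are taken from
    `background` (assumed to have the same shape as `image`).
    """
    h, w = image.shape[:2]
    theta = np.deg2rad(angle_deg)
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.mgrid[0:h, 0:w]
    dx, dy = xs - cx, ys - cy
    # Inverse mapping: for every output pixel, locate its source pixel.
    src_x = np.rint(np.cos(theta) * dx + np.sin(theta) * dy + cx).astype(int)
    src_y = np.rint(-np.sin(theta) * dx + np.cos(theta) * dy + cy).astype(int)
    inside = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.array(background, dtype=image.dtype)  # copy of the background
    out[inside] = image[src_y[inside], src_x[inside]]
    return out


def rotation_matrix_image(image, background, n=2, rng=None):
    """Build an n x n grid in which each cell holds a randomly rotated copy."""
    rng = np.random.default_rng() if rng is None else rng
    h, w = image.shape[:2]
    grid = np.empty((n * h, n * w) + image.shape[2:], dtype=image.dtype)
    for row in range(n):
        for col in range(n):
            angle = rng.uniform(0.0, 360.0)  # assumed angle range
            cell = rotate_nearest(image, angle, background)
            grid[row * h:(row + 1) * h, col * w:(col + 1) * w] = cell
    return grid


if __name__ == "__main__":
    # Toy stand-ins for a real photograph and a natural background patch.
    img = np.full((32, 32, 3), 200, dtype=np.uint8)
    bg = np.full((32, 32, 3), 50, dtype=np.uint8)
    out = rotation_matrix_image(img, bg, n=3, rng=np.random.default_rng(0))
    print(out.shape)  # (96, 96, 3)
```

In practice one would likely use an interpolating rotation from an image library and resize the resulting grid to the network's input size; both choices are left out here because the abstract does not specify them.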

Original language: English
Pages (from-to): 465-491
Number of pages: 27
Journal: Journal of Information and Telecommunication
Volume: 2
Issue number: 4
State: Published - 2018
Externally published: Yes

Bibliographical note

Publisher Copyright:
© Taylor and Francis Ltd. All rights reserved.

Keywords

  • Colour constancy
  • Convolutional neural networks
  • Data augmentation
  • Feature descriptors
  • Image recognition

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Computer Science (miscellaneous)
  • Electrical and Electronic Engineering
