Abstract
Most research in image classification has focused on applications such as face, object, scene and character recognition. This paper examines a comparative study between deep convolutional neural networks (CNNs) and bag of visual words (BOW) variants for recognizing animals. We developed two variants of the bag of visual words (BOW and HOG-BOW) and examine the use of gray and color information as well as different spatial pooling approaches. We combined the final feature vectors extracted from these BOW variants with a regularized L2 support vector machine (L2-SVM) to distinguish between classes within our datasets. We modified existing deep CNN architectures: AlexNet and GoogleNet, by reducing the number of neurons in each layer of the fully connected layers and last inception layer for both scratch and pre-trained versions. Finally, we compared the existing CNN methods, our modified CNN architectures and the proposed BOW variants on our novel wild-animal dataset (Wild-Anim). The results show that the CNN methods significantly outperform the BOW techniques.
Original language | English |
---|---|
Title of host publication | 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781509042401 |
DOIs | |
State | Published - 9 Feb 2017 |
Externally published | Yes |
Event | 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 - Athens, Greece Duration: 6 Dec 2016 → 9 Dec 2016 |
Publication series
Name | 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 |
---|
Conference
Conference | 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 |
---|---|
Country/Territory | Greece |
City | Athens |
Period | 6/12/16 → 9/12/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
ASJC Scopus subject areas
- Computer Networks and Communications
- Information Systems and Management
- Control and Optimization
- Artificial Intelligence