Close
Metadata

%0 Conference Proceedings
%4 sid.inpe.br/sibgrapi/2015/06.19.01.44
%2 sid.inpe.br/sibgrapi/2015/06.19.01.44.22
%T Improving Spatial Feature Representation from Aerial Scenes by Using Convolutional Networks
%D 2015
%A Nogueira, Keiller,
%A Miranda, Waner O.,
%A Santos, Jefersson A. dos,
%@affiliation Universidade Federal de Minas Gerais
%@affiliation Universidade Federal de Minas Gerais
%@affiliation Universidade Federal de Minas Gerais
%E Papa, Joćo Paulo,
%E Sander, Pedro Vieira,
%E Marroquim, Ricardo Guerra,
%E Farrell, Ryan,
%B Conference on Graphics, Patterns and Images, 28 (SIBGRAPI)
%C Salvador
%8 Aug. 26-29, 2015
%I IEEE Computer Society
%J Los Alamitos
%S Proceedings
%K Deep Learning, Remote Sensing, Feature Learning, Image Classification, Machine Learning, High-resolution Images.
%X The performance of image classification is highly dependent on the quality of extracted features. Concerning high resolution remote image images, encoding the spatial features in an efficient and robust fashion is the key to generating discriminatory models to classify them. Even though many visual descriptors have been proposed or successfully used to encode spatial features of remote sensing images, some applications, using this sort of images, demand more specific description techniques. Deep Learning, an emergent machine learning approach based on neural networks, is capable of learning specific features and classifiers at the same time and adjust at each step, in real time, to better fit the need of each problem. For several task, such image classification, it has achieved very good results, mainly boosted by the feature learning performed which allows the method to extract specific and adaptable visual features depending on the data. In this paper, we propose a novel network capable of learning specific spatial features from remote sensing images, with any pre-processing step or descriptor evaluation, and classify them. Specifically, automatic feature learning task aims at discovering hierarchical structures from the raw data, leading to a more representative information. This task not only poses interesting challenges for existing vision and recognition algorithms, but also brings huge opportunities for urban planning, crop and forest management and climate modelling. The propose convolutional neural network has six layers: three convolutional, two fully-connected and one classifier layer. So, the five first layers are responsible to extract visual features while the last one is responsible to classify the images. We conducted a systematic evaluation of the proposed method using two datasets: (i) the popular aerial image dataset UCMerced Land-use and, (ii) a multispectral high-resolution scenes of the Brazilian Coffee Scenes. The experiments show that the proposed method outperforms state-of-the-art algorithms in terms of overall accuracy.
%@language en
%3 sibgrapi2015.pdf


Close