Open Access Open Access  Restricted Access Subscription or Fee Access

Place Recognition with DAG-CNN

Javier Orlando Pinzon-Arenas(1), Robinson Jimenez-Moreno(2*), Cesar Giovany Pachon-Suescun(3)

(1) Faculty of Engineering, Universidad Militar Nueva Granada, Colombia
(2) Faculty of Engineering, Universidad Militar Nueva Granada, Colombia
(3) Faculty of Engineering, Universidad Militar Nueva Granada, Colombia
(*) Corresponding author


DOI: https://doi.org/10.15866/ireaco.v13i2.17053

Abstract


This paper presents the development of a convolutional neural network with a directed acyclic graph architecture (DAG-CNN) focused on the recognition of places. The network is focused on identifying six types of rooms in various houses. For this purpose, five houses have been built in a virtual environment from which the training and validation database has been obtained through an on-site panning camera. In order to select the number of filters required for the proposed architecture, the internal behavior of each training has been verified through neuron activation heat maps in order to reduce the learning repetitions of little relevant objects or the characteristics of the scene as much as possible, obtaining a network capable of recognizing 96.5% of the individual images from room sequence photographs and 100% individual recognition of each room (complete sequence). Thus, the capacity and the robustness of the selected architecture for recognizing indoor places are demonstrated.
Copyright © 2020 Praise Worthy Prize - All rights reserved.

Keywords


Convolutional Neural Networks; DAG-CNN; Place Recognition; Indoor Places; Feature Activations

Full Text:

PDF


References


S. Lowry, N. Sünderhauf, P. Newman, J. J. Leonard, D. Cox, P. Corke and M. J. Milford, Visual place recognition: A survey, IEEE Transactions on Robotics, vol. 32, no 1, pp.1-19, 2016.
https://doi.org/10.1109/tro.2015.2496823

A. Ranganathan, Honda Motor Co Ltd, Detecting and labeling places using runtime change-point detection, U.S. Patent 8,565,538, 2013.

P. Espinace, T. Kollar, A. Soto and N. Roy, Indoor scene recognition through object detection, In 2010 IEEE International Conference on Robotics and Automation, IEEE, 2010, pp. 1406-1413.
https://doi.org/10.1109/robot.2010.5509682

S. Gupta, P. Arbelaez and J. Malik, Perceptual organization and recognition of indoor scenes from RGB-D images, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013, pp. 564-571.
https://doi.org/10.1109/cvpr.2013.79

W. Liu, X. Ma, Y. Zhou, D. Tao and J. Cheng, p-Laplacian regularization for scene recognition, IEEE transactions on cybernetics, vol. 49, no 8, pp. 2927-2940, 2018.
https://doi.org/10.1109/tcyb.2018.2833843

C. Chen, Y. Ren and C.C.J. Kuo, Outdoor scene classification using labeled segments, In Big Visual Data Analysis, Springer, Singapore, 2016, pp. 65-92.
https://doi.org/10.1007/978-981-10-0631-9_4

M.D. Zeiler and R. Fergus, 2014, Visualizing and understanding convolutional networks, In European conference on computer vision, Springer, Cham, 2014, pp. 818-833.

A. Krizhevsky, I. Sutskever and G. E. Hinton, Imagenet classification with deep convolutional neural networks, In Advances in neural information processing systems, 2012, pp. 1097-1105.
https://doi.org/10.1145/3065386

C. M. Bautista, C. A. Dy, M. I. Mañalac, R. A. Orbe and M. Cordel, Convolutional neural network for vehicle detection in low resolution traffic videos, In Region 10 Symposium (TENSYMP), International Conference on, IEEE, 2016, pp. 277-281.
https://doi.org/10.1109/tenconspring.2016.7519418

J. Li, X. Liang, S. Shen, T. Xu, J. Feng and S. Yan, Scale-aware fast R-CNN for pedestrian detection, IEEE Transactions on Multimedia, vol. 20, no 4, pp. 985-96, 2018.
https://doi.org/10.1109/tmm.2017.2759508

J. O. P. Arenas, R. J. Moreno and P. C. U. Murillo, Faster R-CNN for object location in a Virtual Environment for sorting task, International Journal of Online Engineering (iJOE), vol. 14, no 07, pp.4-14, 2018.
https://doi.org/10.3991/ijoe.v14i07.8465

S. Yang and D. Ramanan, Multi-scale recognition with DAG-CNNs, In Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1215-1223.
https://doi.org/10.1109/iccv.2015.144

K. Thulasiraman and M.N. Swamy, Graphs: theory and algorithms (John Wiley & Sons, 2011).

Z. Golrizkhatami, S. Taheri and A. Aca, Multi-scale features for heartbeat classification using directed acyclic graph CNN, Applied Artificial Intelligence, pp.1-16, 2018.
https://doi.org/10.1080/08839514.2018.1501910

Z. Chen, A. Jacobson, N. Sünderhauf, B. Upcroft, L. Liu, C. Shen, I. Reid and M. Milford, Deep learning features at scale for visual place recognition, In 2017 International Conference on Robotics and Automation (ICRA), IEEE, 2017, pp. 3223-3230.
https://doi.org/10.1109/icra.2017.7989366

Z. Chen, O. Lam, A. Jacobson and M. Milford, Convolutional neural network-based place recognition, arXiv preprint arXiv:1411.1509, 2014.

P. Tang, H. Wang and S. Kwong, G-MS2F: GoogLeNet based multi-stage feature fusion of deep CNN for scene recognition, Neurocomputing, vol. 225, pp. 188-197, 2017.
https://doi.org/10.1016/j.neucom.2016.11.023

J. Gao, J. Yang, G. Wang and M. Li, A novel feature extraction method for scene recognition based on centered convolutional restricted Boltzmann machines, Neurocomputing, vol. 214, pp. 708-717, 2016.
https://doi.org/10.1016/j.neucom.2016.06.055

A. Yashwanth, T. Nadu, S. Shammer, R. Sairam and G. Chamundeeswari, A novel approach for indoor-outdoor scene classification using transfer learning, International Journal of Advance Research, Ideas and Innovations in Technology, vol. 5, no 2, pp. 1756-1762, 2019.

H. Zhu, J.B. Weibel and S. Lu, Discriminative multi-modal feature fusion for rgbd indoor scene recognition, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 2969-2976.
https://doi.org/10.1109/cvpr.2016.324

S.H. Khan, M. Hayat, M. Bennamoun, R. Togneri and F.A. Sohel, A discriminative representation of convolutional features for indoor scene recognition, IEEE Transactions on Image Processing, vol. 25, no 7, pp. 3372-3383, 2016.
https://doi.org/10.1109/tip.2016.2567076

X. Cheng, J. Lu, J. Feng, B. Yuan and J. Zhou, Scene recognition with objectness. Pattern Recognition, 2018, vol. 74, pp. 474-487.
https://doi.org/10.1016/j.patcog.2017.09.025

S. Yang and D. Ramanan, Multi-scale recognition with DAG-CNNs, In Proceedings of the IEEE International Conference on Computer Vision. 2015, pp. 1215-1223.
https://doi.org/10.1109/iccv.2015.144

I. Goodfellow, Y. Bengio and A. Courville, Deep learning (MIT press, 2016).

Tolebi, G., Dairbekov, N., Kurmankhojayev, D., Link Flow Estimation on an Isolated Intersection Based on Deep Learning Models, (2020) International Review of Automatic Control (IREACO), 13 (1), pp. 19-26.
https://doi.org/10.15866/ireaco.v13i1.18213

Jimenez-Moreno, R., Martinez, D., A Novel Parallel Convolutional Network Architecture for Depth-Dependent Object Recognition, (2019) International Review of Automatic Control (IREACO), 12 (2), pp. 76-81.
https://doi.org/10.15866/ireaco.v12i2.16467

Pinzón-Arenas, J., Jiménez-Moreno, R., Pachón-Suescún, C., Handwritten Word Searching by Means of Speech Commands Using Deep Learning Techniques, (2019) International Review on Modelling and Simulations (IREMOS), 12 (4), pp. 253-263.
https://doi.org/10.15866/iremos.v12i4.17166


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2021 Praise Worthy Prize