A Novel Parallel Convolutional Network Architecture for Depth-Dependent Object Recognition

Robinson Jimenez-Moreno; Diana Ovalle Martinez

doi:10.15866/ireaco.v12i2.16467

Corresponding author: Robinson Jimenez-Moreno

Abstract

This article evaluates a novel parallel convolutional neural network (CNN) designed to recognize objects at different distances. It addresses the variability in the confidence with which an object is recognized as the capture distance between the camera and the object changes. To test its performance, two additional CNN architectures are implemented: a conventional one with multiple identification branches in parallel, and a Directed Acyclic Graph (DAG) convolutional network with the same parameters as the proposed one. The architectures differ in the training database used and in the structure of the network's output. The problem arises when developing assistive robotic systems that must recognize a particular object within a group of objects so that it can be picked up by an end effector capable of changing trajectories and avoiding possible collisions in human-machine work environments. Here, four different tools must be recognized at four distances (20, 40, 60 and 80 cm). The conventional CNN obtains the lowest accuracy (80.6%). The DAG and the parallel CNN perform closely, but the proposed architecture obtains better results, with 93% average accuracy. It is concluded that this network can operate in environments where the reference positions of the objects change dynamically, which allows it to be implemented in mobile agents that need to relocate an object and establish a new path when an obstacle must be evaded.
Copyright © 2019 Praise Worthy Prize - All rights reserved.
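
The general idea of parallel convolutional branches mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' architecture, which the abstract does not specify: the two branches, their kernel sizes (3x3 and 7x7), the random untrained weights, the 32x32 grayscale input and all function names are assumptions chosen for illustration. Only the four output classes (four tools) come from the abstract. The sketch runs every branch on the same image, concatenates the pooled features, and applies a softmax classifier.

```python
import numpy as np


def conv2d_valid(image, kernel):
    """Single-channel 'valid' 2D cross-correlation, as computed by CNN layers."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out


def branch_features(image, kernels):
    """One branch: convolve, apply ReLU, then global-average-pool each map."""
    return np.array(
        [np.maximum(conv2d_valid(image, k), 0.0).mean() for k in kernels]
    )


def softmax(z):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(z - z.max())
    return e / e.sum()


def parallel_cnn_forward(image, branches, weights, bias):
    """Run every branch on the same input, concatenate features, classify."""
    features = np.concatenate([branch_features(image, ks) for ks in branches])
    return softmax(weights @ features + bias)


rng = np.random.default_rng(0)
NUM_CLASSES = 4  # four tools, as stated in the abstract

# Hypothetical branches with different receptive fields (an assumption):
# one with small kernels, one with larger kernels, two filters each.
branches = [
    [rng.standard_normal((3, 3)) for _ in range(2)],
    [rng.standard_normal((7, 7)) for _ in range(2)],
]
num_features = sum(len(ks) for ks in branches)
weights = rng.standard_normal((NUM_CLASSES, num_features))  # untrained
bias = np.zeros(NUM_CLASSES)

image = rng.random((32, 32))  # stand-in for a grayscale capture
probs = parallel_cnn_forward(image, branches, weights, bias)
print("class confidences:", probs)
print("predicted class index:", int(probs.argmax()))
```

Because the weights are random, the printed confidences carry no meaning. The point is only the data flow: each branch sees the same capture, and the classifier decides from the combined features rather than from a single scale.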

Keywords

Machine Vision; Convolutional Neural Network; Object Recognition; MATLAB; RGB-D Image
