Advanced Model for Human Action Annotation Based on Background Subtraction Using Learning Vector Quantitation with Co-occurrence Matrix Features

Moch. Arief Soeleman; Eko Mulyanto Yuniarno; Mochamad Hariadi; Mauridhy Hery Purnomo; Masanori Kakimoto

doi:10.15866/irecos.v11i11.9851

Advanced Model for Human Action Annotation Based on Background Subtraction Using Learning Vector Quantitation with Co-occurrence Matrix Features

Moch. Arief Soeleman^(1*), Eko Mulyanto Yuniarno⁽²⁾, Mochamad Hariadi⁽³⁾, Mauridhy Hery Purnomo⁽⁴⁾, Masanori Kakimoto⁽⁵⁾

^(*) Corresponding author

Authors' affiliations

DOI: https://doi.org/10.15866/irecos.v11i11.9851

Abstract

This paper presents an advanced model for human action annotation. The proposed technique is splitting human objects in two parts, upper and under part of human beings. From these two parts, a model to extract the feature vectors by GLCM was proposed as feature for classification. The first method is to extract the Haralick features out of GLCM and the next step is the normalization process for converting co-occurrence feature matrix into various vectors as feature for classification. The research employs learning vector quantification to classify all feature vectors. Finally, the experiment conducted by utilizing Weizmann dataset shows that this approach method achieves an accuracy of 84.7%.
Copyright © 2016 Praise Worthy Prize - All rights reserved.

Keywords

Annotation; GLCM; Classification; Learning Vector Quantification

Full Text:

PDF

References

Bobick A and Davis J, "The recognition of human movement using temporal template," Pattern Anlysis and Machine Intelligent, IEEE , vol. 23, no. 3, pp. 257-267, 2001.
http://dx.doi.org/10.1109/34.910878

G. L. Blank M, Shectman E, Irani M and Basri R, "Actions as space-times shapes," in Interational Conference Computer Vision, ICCV IEEE, 2005.
http://dx.doi.org/10.1109/iccv.2005.28

Ke Y, Sukthanka R and Herbert M, "Efficient visual event detection using volumetric features," in Int Conference Computer Vision, ICCV IEEE, 2005.
http://dx.doi.org/10.1109/iccv.2005.85

Sheikh Y, Sheikh M and Shah M, "Exploring the space of a human actions," in Int. Conference Computer Vision, ICCv IEEE, 2005.
http://dx.doi.org/10.1109/iccv.2005.90

Fathi A and Mori G, "Action recognition by learning mid-level motion features," in Computer Vision Pattern Recognition CVPR IEEE, 2008.
http://dx.doi.org/10.1109/cvpr.2008.4587735

Schuldt C, Laptev I and Caputo B, "Recognizing human actions : a local SVM approach," in Int. Conf Pattern Recognition, ICPR IEEE, 2004.
http://dx.doi.org/10.1109/icpr.2004.1334462

Weinland D, Ronfard R and Boyer E, "Free viewpoint action recognition using motion history volumes," Computer Vision IU, vol. 104, no. 2-3, pp. 249-257, 2006.
http://dx.doi.org/10.1016/j.cviu.2006.07.013

Yilmaz A and Shah M, "Action scetch : a novel action representation," in Proc. CVPR 1, 2005.
http://dx.doi.org/10.1109/cvpr.2005.58

S. Abburu, "Multi level Semantic Extraction for Cricket Video by Text processing," International Journal of Engineering Science and Technology, vol. 2, no. 10, pp. 5377-5384, 2010.
http://dx.doi.org/10.1109/icbecs.2010.5462313

D. Palma, J. Ascenso and F. Pereira, "Automatic text extraction in digital video based on Motion analysis," in Int. Conf. on Image Analysis and Recognition (ICIAR), Porto, 2004.
http://dx.doi.org/10.1007/978-3-540-30125-7_73

J. Lu, Y. Tian, Y. Li, Y. Zhang and Z. Lu, "A framework for video event detection using weigthed SVM classifiers," in Artificial Intelligence and Computational Intelligence, AICI, International Conference, 2009.
http://dx.doi.org/10.1109/aici.2009.77

C. Yang and M. Dong , "Region-based image annotation using Asymmetrical Support vector Machine-based Multiple-instance Learning," in Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition , 2006.
http://dx.doi.org/10.1109/cvpr.2006.250

S. Barrat and S. Tabbone, "Classification and Automatic annotation extension of images using Bayesian network," in SSPR International Conference, 2008.
http://dx.doi.org/10.1007/978-3-540-89689-0_97

T. Gruber, "A translation approach to portable ontology specifications," Knowledge Acquisition , vol. 5, no. 2, pp. 199-220, 1993.
http://dx.doi.org/10.1006/knac.1993.1008

B. Vrusias, D. Makris and J. Renno, "A framework for ontology enriched semantic annotation of CCTV Video," in Eight International workshop on image analysis for multimedia interactive services,IEEE, 2007.
http://dx.doi.org/10.1109/wiamis.2007.4

J. Daugman, "Complete discrete 2-D Gabor Transform by Neural Network for image analysis and compression," IEEE Trans Accoust Speech SIgnal for Image analysis and Compression, vol. 36, pp. 1169-1179, 1988.
http://dx.doi.org/10.1109/29.1644

L. Soh and Tsatsoulis, "Texture analysis of SAR sea ice imagery using gray level co-occurrence matrices," Geoscience and Remote Sensing, IEEE Transactions , vol. 37, no. 2, pp. 780-795, 1999.
http://dx.doi.org/10.1109/36.752194

S. Raja and Shanmugam, "ANN and SVM Based War Scene Classification Using Invariant Moments and GLCM Features," Machine Learning, vol. 2, no. 6, pp. 869-873, 2012.
http://dx.doi.org/10.7763/ijmlc.2012.v2.255

R. Jardon , S. Chaudhurry and K. Biswas, "Generic video classification : An Evolutionary learning based fuzzy theoretic approach," in Int. Conf. Indian Computer Vision Graphics and Image Processing, 2002.
http://dx.doi.org/10.1145/2425333.2425342

A. Dorado, J. Calic and E. Izquierdo, "A Rule-based video annotation System," IEEE Transactions on Circuits and System for Video Technology, vol. 14, no. 5, 2004.
http://dx.doi.org/10.1109/tcsvt.2004.826764

M. Detyniecki and C. Marsala, "Automatic Video annotation with forests of Fuzzy Decision Trees," in Mathware and Soft Computing, 2000.
http://dx.doi.org/10.1145/1456223.1456308

M. Hosseini and M. Moghadam, "Fuzzy rule-based reasoning approach for event detection and annotation of broadcast soccer video," Appl. Soft. Computer , 2012.
http://dx.doi.org/10.1016/j.asoc.2012.10.007

H. Tong, J. He, J. Li, S. Zhang and W. Ma, "Graph Based multi modality learning," in ACM Multimedia, Singapore, 2005.
http://dx.doi.org/10.1145/1101149.1101337

M. Wang , X. Hua, R. Hong , J. Tang and Y. Song, "Unified Video annotation via Multigraph Learning," IEEE Trans. Circuits Syst. Video Tech. , vol. 19, no. 5, 2009.
http://dx.doi.org/10.1109/tcsvt.2009.2017400

M. Weng and Y. Chuang, "Multi-cue fusion for semantic video indexing," in ACM Multimedia, 2008.
http://dx.doi.org/10.1145/1459359.1459370

M. A. Soeleman, M. Hariadi and M. H. Purnomo, "Adaptive threshold for background subtraction in moving object detection using fuzzy c-means," in IEEE Tencon Philippine Section, Philippine, 2012.
http://dx.doi.org/10.1109/tencon.2012.6412265

P. Spagnolo, T. Orazio, Distante and M. L. A, "Robust foreground segmentation from color video sequence using background subtraction with multiple threshold," Journal Image and Vision, vol. 24, pp. 441-423, 2006.
http://dx.doi.org/10.1016/j.imavis.2006.01.001

H. M. Robert, S. K and D. Its'Hak, "Texture Features for Image Classification," IEEE Transactions On Systems, Man and Cybernetics, vol. 6, no. 3, pp. 610-621, 1973.
http://dx.doi.org/10.1109/tsmc.1973.4309314

Z. Jian, L. Chuan-Cai, Z. Yue and L. Gui-Fu, "Object recognition using Gabor co-occurrence similarity," Pattern Recognition, Elsevier, vol. 46, pp. 434 - 448, 2013.
http://dx.doi.org/10.1016/j.patcog.2012.06.018

B. Marcin and D. Włodzisław , "LVQ algorithm with instance weighting for generation of prototype-based rules," Elsevier, Neural Network, vol. 24, p. 824–830, 2011.
http://dx.doi.org/10.1016/j.neunet.2011.05.013

O. M. Jafar and R. Sivakumar, "Distance Based Hybrid Approach for Cluster Analysis Using Variants of K-means and Evolutionary Algorithm," Research Journal of Applied Sciences, Engineering and Technology, vol. 8, no. 11, pp. 1355-1362, 2014.
http://dx.doi.org/10.19026/rjaset.8.1107

G. Wenzong and C. Guolong, "Human action recognition via multi-task learning base on spatial-temporal feature," Information Sciecne, Elsevier, vol. 320, pp. 418-428, 2015.
http://dx.doi.org/10.1016/j.ins.2015.04.034

A.-A. A. Haiam and H. Elsayed E, "Human action recognition using trajectory-based representation," Egyptian Informatics Journal, Elsevier, vol. 16, pp. 187-198, 2015.
http://dx.doi.org/10.1016/j.eij.2015.05.002

S. Manel, M. Mahmoud and A. B. Chokri, "Human action recognition based on multi-layer Fisher vector encoding method," Pattern Recognition Letters, Elsevier, vol. 65, pp. 37-43, 2015.
http://dx.doi.org/10.1016/j.patrec.2015.06.029

N. Jalal A, "Energy-based model of least squares twin Support Vector Machines for human action recognition," Signal Processing, Elsevier, vol. 104, pp. 248-257, 2014.
http://dx.doi.org/10.1016/j.sigpro.2014.04.010

K. Villi, Z. Guoying and P. Matti, "Recognition of human actions using texture descriptors," Machine Vision and Application, Springer, pp. 26-39, 2009.
http://dx.doi.org/10.1007/s00138-009-0233-8

M. Mona M, H. Elsayed, F. Magda B and E. N. Heba A, "An enhanced method for human action recognition," Journal of Advanced Research, Elsevier, vol. 6, pp. 163-169, 2015.
http://dx.doi.org/10.1016/j.jare.2013.11.007

Csurka G, Dance C, Fan L, Willamowski J and Bray C, "Visual categorization with bags of key points," in ECCV International Workshop on Statistical Learning in Computer Vision, 2004.
http://dx.doi.org/10.1007/11744085_36

Refbacks

There are currently no refbacks.

Username
Password
Remember me