Effect of Over-Sampling Versus Under-Sampling for SVM and LDA Classifiers for Activity Recognition

Effect of Over-Sampling Versus Under-Sampling for SVM and LDA Classifiers for Activity Recognition

M.B Abidine B. Fergani F.J Ordóñez 

Laboratoire d‘Ingénierie des Systèmes Intelligents et Communicants, Faculty of Electronics and Computer Sciences, USTHB, Algiers, Algeria

Laboratoire d‘Ingénierie des Systèmes Intelligents et Communicants, Faculty of Electronics and Computer Sciences, USTHB, Algiers, Algeria

Computer Science Department, University Carlos III of Madrid, Leganes, Madrid, Spain

Page: 
306-316
|
DOI: 
https://doi.org/10.2495/DNE-V11-N3-306-316
Received: 
N/A
| |
Accepted: 
N/A
| | Citation

OPEN ACCESS

Abstract: 

Accurately recognizing the rare activities from sensor network-based smart homes for monitoring the elderly person is a challenging task. Activity recognition datasets are generally imbalanced, meaning certain activities occur more frequently than others. Not incorporating this class imbalance results in an evaluation that may lead to disastrous consequences for elderly persons. To overcome this problem, we evaluate two resampling methods using Over-sampling (OS) and Under-sampling (US). Then, these methods were combined with the discriminative classifiers named support vector machines (SVM) and linear discriminant analysis (LDA). experimental results carried out on multiple real-world smart home datasets demonstrate the feasibility of the proposal. Besides, a comparison with some state–of-the-art techniques based on Conditional Random Field (CRF) and Hidden Markov Model (HMM), we demonstrate that the US-SVM and OS-LDA are able to surpass HMM, CRF, SVM, LDA, OS-SVM and US-LDA. However, OS-LDA is the most effective method in terms of recognition of activities.

Keywords: 

humanactivity recognition, imbalanced data, LDA, machine learning, SVM

  References

[1] Tapia, E.,Intille, S. & Larson, K., Activity recognition in the home using simple and ubiquitous sensors. In Pervasive Computing Lecture Notes in Computer Science, Springer Heidelberg, 3001, pp. 158–175, 2004. http://dx.doi.org/10.1007/978-3-540-24646-6_10

[2] van Kasteren, T., Noulas, A., Englebienne, G. & Krose, B., Accurate activity recognition in a home setting. In UbiComp, 08, New York, NY, USA: ACM, pp. 1–9, 2008.

http://dx.doi.org/10.1145/1409635.1409637

[3] Abidine, M.B., Fergani, L., Fergani, B. & Fleury, A., Improving human activity recognition in smart homes. International Journal of E-Health and Medical Communications (IJEHMC), 6(3), pp. 19–37, 2015.

http://dx.doi.org/10.4018/IJEHMC.2015070102

[4] Fleury, A., Vacher, M. & Noury, N., SVM-based multi-modal classification of activities of daily living in health smart homes: sensors, algorithms and first experimental results. IEEE Transactions on Information Technology in Biomedicine, 14(2), pp. 274–283, 2010. http://dx.doi.org/10.1109/TITB.2009.2037317

[5] Aggrawal, J.K. & Ryoo, M.S., Human activity analysis: a review. ACM Computing Surveys, 43(3), pp. 1–16, 2011. http://dx.doi.org/10.1145/1922649.1922653

[6] Chen, L., Nugent, C.D. & Wang, H., A knowledge-driven approach to activity recognition in smart homes. IEEE Transactions on Knowledge and Data Engineering, 24, pp. 961–974, 2012. http://dx.doi.org/10.1109/TKDE.2011.51

[7] Bishop, C., Pattern Recognition and Machine Learning, Springer: New York, ISBN: 978-0387-31073-2, 2006.

[8] Logan, B., Healey, J., Philipose, M., Tapia, E.M. & Intille, S.S., A long-term evaluation of sensing modalities for activity recognition In Ubicomp ’07, pp. 483–500, 2007.

[9] Huang, W., Zhang, J. & Liu, Z., Activity recognition based on hidden Markov models. In Knowledge Science, Engineering and Management, Springer Berlin Heidelberg, pp. 532–537, 2007. http://dx.doi.org/10.1007/978-3-540-76719-0_54

[10] Vail, D.L., Veloso, M.M. & Lafferty, J.D., Conditional random fields for activity recognition. 

In International Conference on Autonomous Agents and Multi-agent Systems (AAMAS), 2007. http://dx.doi.org/10.1145/1329125.1329409

[11] Abidine, M.B. & Fergani, B., Evaluating C-SVM, CRF and LDA classification for daily  activity recognition. In Proceeding of IEEE ICMCS, Tangier-Morocco, pp. 272–277, 2012.

[12] Liu, S., Gao, R.X., John, D., Staudenmayer, J. & Freedson, P.S., Classification of physical  activities based on sparse representation. In 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 6200–6203, 2012.

[13] Fang, H. & He, L., BP neural network for human activity recognition in smart home. In 2012 International Conference on Computer Science & Service System (CSSS), pp. 1034–1037, 2012. http://dx.doi.org/10.1109/CSSS.2012.262

[14] Fleury, A., Vacher, M. & Noury, N., SVM-based multi-modal classification of activities of daily living in health smart homes: sensors, algorithms and first experimental results. IEEE Transactions on Information Technology in Biomedicine, 14(2), pp. 274–283, 2010. http://dx.doi.org/10.1109/TITB.2009.2037317

[15] Abidine, M.B., Fergani, B. & Clavier, L., Importance-Weighted the imbalanced data for c-svm classifier to human activity recognition. Proceeding of IEEE WOSSPA’13, pp. 330–335, 2013. http://dx.doi.org/10.1109/wosspa.2013.6602386

[16] Chawla, N.V., Data mining for imbalanced datasets: an overview. In Data Mining and Knowledge Discovery Handbook, Springer, pp. 875–886, 2010.

[17] Akbani, R., Kwek, S. & Japkowicz, N., Applying support vector machines to imbalanced  datasets. In Machine Learning: ECML, Springer Berlin Heidelberg, pp. 39–50, 2004.

http://dx.doi.org/10.1007/978-3-540-30115-8_7

[18] Huang, Y.M. & Du, S.X., Weighted support vector machine for classification with uneven training class sizes. In Proceedings of the IEEE ICMLC, 7, pp. 4365–4369, 2005.

[19] Bishop, C., Pattern Recognition and Machine Learning, Springer: New York, ISBN: 978-0387-31073-2, 2006.

[20] Vapnik, V.N., The Nature of Statistical Learning Theory, (Statistics for Engineering and Information Science), Springer Verlag, 2000. http://dx.doi.org/10.1007/978-1-4757-3264-1

[21] Hastie, T., Tibshirani, R. & Friedman, J.H., The Elements of Statistical Learning, Springer, Heidelberg: NewYork, 2001. http://dx.doi.org/10.1007/978-0-387-21606-5

[22] Chang, C.C. & Lin, C.J., LIBSVM, available at http://www.csie.ntu.edu.tw/~cjlin-/libsvm/

[23] Datasets for Activity Recognition, available at http://sites.google.com/site/tim0306/ (accessed February 09, 2012)

[24] Van Kasteren, T., Alemdar, H. & Ersoy, C., Effective performance metrics for evaluating  activity recognition methods. In Proceedings of the 24th ARCS 2011, Comot, Italy, p. 10, 2011.