Formal Descriptive Study for the Extraction and Comparison of Tourist Spending Patterns

Formal Descriptive Study for the Extraction and Comparison of Tourist Spending Patterns

A. Rabasa N. Mollá-campello A. Pérez-torregrosa 

Operation Research Center, University Miguel Hernández of Elche, Spain

30 August 2018
| Citation



This paper presents the design of an in-depth descriptive analysis of data collected from public surveys at tourist information points. It uses a dataset that compiles different information related to trips to the valencian community (spain). the aim of this study is to describe the patterns (association rules) that a certain type of expense has, and how this could be used to improve the services offered to tourists.

There  are different kinds of expenses to analyze: transport, accommodation, leisure as well as total daily expenses and total daily expenses per person. those cases where expenses are especially high or low are considered as particularly important because of their strategic interest for the public administration of tourism.

The study starts with data preprocessing, followed by pattern extraction for the sub-samples with very high and very low expenses, and in some cases, zero expenses are not considered as outliers but as a particular group of individuals. after this, the study aims to extract the most important attributes (feature selection) to create a classification model and compare its efficiency with the models that compute the complete set of attributes.

To conclude, this paper presents the possible future predictive models that could lead to an improvement in planning for public tourist services in the valencian community (spain).


feature selection, pattern discovery, predictive tourism analysis

1. Introduction and Objectives
2. Data Set and Methodology
3. Computational Experiments
4. Conclusions
5. Future Research Lines

[1] Rabasa, A., Pérez-Martín, A. & Giner, D., Optimal clustering techniques for the segmentation of tourist spending. Analysis of tourist surveys in the Valencian community (Spain): a case study. International Journal of Design & Nature and Ecodynamics, 12(4), pp. 482–491, 2018.

[2] ‘EGATUR ’, available at distica_C&cid=1254736177002&menu=ultiDatos&idp=1254735576863 (accessed March 2018)

[3] Breiman, L. (ed), Classification and regression trees, Repr. Boca Raton: Chapman & Hall [u.a.], 1998.

[4] Aksoy, S. & Ozbuk, M.Y., Multiple criteria decision making in hotel location: does it relate to postpurchase consumer evaluations?. Tourism Management Perspectives, 22, pp. 73–81, 2017.

[5] Nilashi, M., Bagherifard, K., Rahmani, M. & Rafe, V., A recommender system for tourism industry using cluster ensemble and prediction machine learning techniques. Computers & Industrial Engineering, 109, pp. 357–368, 2017.

[6] Pronoza, E., Yagunova, E. & Volskaya, S., Aspect-Based Restaurant Information Extraction for the Recommendation System. Conference: 6th Language and Technology Conference. Lecture Notes in Computer Science, 9561, pp. 371–385, 2016.

[7] Sinclair, M.T. & Stabler, M., The Economics of Tourism. Library of Congress Cataloguing in Publication Data, 2002.

[8] Pol, A.P., Pascual, M.B. & Vazquez, P.C., Robust estimators and bootstrap confidence intervals applied to tourism spending. Tourism Management, 27(1), pp. 42–50, 2006.