Clustering Approach Applied on an Artificial Neural Network Model to Predict PM10 In Mega Cities of Mexico

Clustering Approach Applied on an Artificial Neural Network Model to Predict PM10 In Mega Cities of Mexico

E. Magaña-Villegas J.M. Carrera-Velueta  S. Ramos-Herrera  J.R. Hernández-Barajas  C. González-Figueredo  J.R. Laines-Canepa  A. Valdés-Manzanilla  R.G. Bautista-Margulis 

Professor at the Universidad Juárez Autónoma de Tabasco, México

Professor at the Instituto Tecnológico y de Estudios Superiores de Occidente, Jalisco, México

31 August 2016
| Citation



A cluster-based artificial neural network model called CLASO (Classification-Assemblage-Association) has been proposed to predict the maximum of the 24-h moving average of PM10 concentration on the next day in the three largest metropolitan areas of Mexico. The model is a self-organised, real-time learning neural network, which builds its topology via a process of pattern classification by using an historical database. This process is based on a supervised clustering technique, assigning a class to each centroid of the hidden layer, employing the Euclidean distance as a hierarchical criterion. A set of ARIMA models was compared with CLASO model in the forecast performance of the 24-h average PM10 concentration on the next day. In general, CLASO model produced more accurate predictions of the maximum of the 24-h moving average of PM10 concentration than the ARIMA models, although the latter showed a minor tendency to underpredict the results. The CLASO model solely requires to be built a historical database of the air quality parameter, an initial radius of classification and the learning factor. CLASO has demonstrated acceptable predictions of 24-h average PM10 concentration by using exclusively regressive PM10 concentrations. The forecasting capabilities of the model were found to be satisfactory compared to the classical models, demonstrating its potential application to the other major pollutants used in the Mexican air quality index.


air quality, artificial neural network, clustering, PM10 modelling


[1] Jordan, M.I. & Bishop, C.M., Neural networks. ACM Computing Surveys, 28(1), pp.73–75, 1996.

[2] Ordieres, J.B., Vergara, E.P., Capuz, R.S. & Salazar, R.E., Neural network prediction model for fine particulate matter (PM2.5) on the US-Mexico border in El Paso (Texas) and Ciudad Juárez (Chihuahua). Environmental Modelling & Software, 20, pp. 547–559, 2005.

[3] Alexandridis, A., Patrinos, P., Sarimveis, H. & Tsekouras, G., A two-stage evolutionary algorithm for variable selection in the development of RBF neural network models. Chemometrics and Intelligent Laboratory Systems, 75, pp. 149–162, 2005.

[4] Cherkassky, V., Krasnopolsky, V., Solomatine, D.P. & Valdes, J., Computational intelligence in earth sciences and environmental applications: issues and challenges. Neural Networks, 19, pp. 113–121, 2006.

[5] Gómez-Sanchís, J., Martín-Guerrero, J.D., Soria-Olivas S., Vila-Francés, J., Carrasco, J.L. & Del Valle-Tascón, S., Neural networks for analysing the relevance of input variables in the prediction of tropospheric ozone concentration. Atmospheric Environment, 40, pp. 6173–6180, 2006.

[6] Kurt, A. & Oktay, A.B., Forecasting air pollutant indicator levels with geographic models 3 days in advance using neural networks. Expert Systems with Applications, 37, pp. 7986–7992, 2010.

[7] Pfeiffer, H., Baumbach, G., Sarachaga-Ruiz, L., Kleanthous, S., Poulida, O. & Beyaz, E., Neural modelling of the spatial distribution of air pollutants. Atmospheric Environment, 43, pp. 3289–3297, 2009.

[8] Hooyberghs, J., Mensink, C., Dumont, G., Fierens, F. & Brasseur, O., A neural network forecast for daily average PM10 concentrations in Belgium. Atmospheric Environment, 39, pp. 3279–3289, 2005.

[9] Slini, T., Kaprara, A., Karatzas, K. & Moussiopoulos, N., PM10 forecasting for Thessaloniki, Greece. Environmental Modelling & Software 21, pp. 559–565, 2006.

[10] Caselli, M., Trizio, L., De Gennaro, G. & Lelpo, P., A simple feedforward neural network for the PM10 forecasting: comparison with a radial basis function network and a multivariate linear regression model. Water Air & Soil Pollution, 201, pp. 365–377, 2009.

[11] Grivas, G. & Chaloulakou, A., Artificial neural network models for prediction of PM10 hourly concentrations, in the Greater area of Athens, Greece. Atmospheric Environment, 40, pp. 1216–1229, 2006.

[12] Nejadkoorki, F. & Baroutian, S., Forecasting extreme PM10 concentrations using artificial neural networks. International Journal of Environmental Research, 6(1), pp. 277–284, 2012.

[13] Díaz-Robles, L.A., Ortega, J.C., Fu, J.S., Reed, G.D., Chow, J.C., Watson, J.G. & Moncada-Herrera, J.A., A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: the case of temuco, Chile. Atmospheric Environment, 42, pp. 8331–8340, 2008.

[14] Voukantsis, D., Karatzas, K., Kukkonen, J., Räsänen, T., Karppinen, A. & Kolehmainen, M., Intercomparison of air quality data using principal component analysis, and forecasting of PM10 and PM2.5 concentrations using artificial neural networks, in Thessaloniki and Helsinki. Science of the Total Environment, 409, pp. 1266–1276, 2011.

[15] Paschalidou, A.K., Karakitsios, S., Kleanthous, S. & Kassomenos, P.A., Forecasting hourly PM10 concentration in cyprus through artificial neural networks and multiple regression models: implications to local environmental management. Environmental Science & Pollution Research, 18, pp. 316–327, 2011.