Artificial intelligence

Scientific journal

ISSN 2710-1673

ONLINE: ISSN 2710-1681

Select your language


Ensuring Accuracy and Stability of Multidimensional Data Clustering Using Kohonen Self-Organizing Maps Based on Automatic Data Reduction

Ivashchenko O.1, Fedin S.1
1 National Transport University
alexander.ivashchenko@gmail.com; sergey.fedin1975@gmail.com

Full text (PDF)

UDC: 004.855.5:519.237.8
Publication Language: Ukrainian
Stuc. intelekt. 2026; 31(1):71-89

Abstract: The article investigates a method for improving the accuracy and stability of multidimensional data clustering by combining Kohonen self-organizing maps with an automatic feature reduction procedure. A method for removing insignificant features using an interquartile range algorithm integrated into a software application for building SOMs is proposed. A comparison of the clustering results before and after reduction is performed with unchanged training parameters and a fixed seed value, a parameter that determines the initial state of the random number generator and ensures the reproducibility of the initialization of the neuron weight coefficients. It has been proven that the removal of features with low variability does not violate the topological structure of SOM, preserves the stability of cluster separation, and reduces computational costs. A high degree of correspondence between the data before and after reduction has been confirmed based on statistical analysis. It has been established that the proposed method increases the efficiency of SOM and is suitable for analyzing large multidimensional samples.

Keywords: SOM, data reduction, insignificant features, clusterization, interquartile range, software application

References:

  1. Dimensionality Reduction using Self Organizing Maps. [Online]. Available:https://stackoverflow.com/questions/27086222/dimensionality-reduction-using-self-organizing-maps/
  2. Zabielin S. Big Data Analysis via Model Reduction Methods // System Research and Information Technologies. – 2018. – № 2. – pp. 35-41. https://doi.org/10.20535/SRIT.2308-8893.2018.2.04
  3. Vayssieres M. Master Kohonen Self-Organizing Maps: A Hands-On Guide to Data Exploration with Python. [Online]. Available:https://medium.com/@MahounaVAYSSIERES/master-kohonen-self-organizing-maps-a-hands-on-guide-to-data-exploration-with-python-fb92f8ebd6f6
  4. Agboka K. M., Abdel-Rahman E. M., Salifu D., Kanji B., Ndjomatchoua F. T., Guimapi R. A. Y., Ekesi S., Landmann T. Towards combining self-organizing maps (SOM) and convolutional neural network (CNN) for improving model accuracy: Application to malaria vectors phenotypic resistance // MethodsX. – 2025. – Vol. 14. – 103198. https://doi.org/10.1016/j.mex.2025.103198
  5. Salap-Ayca S. Self-organizing maps as a dimension reduction approach for spatial global sensitivity analysis visualization // Transactions in GIS. – 2025. – Vol. 26. – № 4. – pp. 1718-1734. https://doi.org/10.1111/tgis.12963
  6. Silva R. G., Wilcox S. J. Feature evaluation and selection for condition monitoring using a self-organizing map and spatial statistics // Artificial Intelligence for Engineering Design, Analysis and Manufacturing. – 2018. – Vol. 33. – pp. 1-10. doi: 10.1017/S0890060417000518.
  7. Khacef L., Rodriguez L., Miramond B. Improving Self-Organizing Maps with Unsupervised Feature Extraction // The International Conference on Neural Information Processing. – 2020. https://doi.org/10.48550/arXiv.2009.02174
  8. Fan X., Zhang S., Xue X., Jiang R., Fan S., Kou H. An Improved Self-Organizing Map (SOM) Based on Virtual Winning Neurons // Symmetry. – 2025. – Vol. 17. – № 3. – pp. 449. doi: 10.3390/sym17030449
  9. Starkey A., Akpan U. I., Al Hosni O., Pullissery Y. Class-Level Feature Selection Method Using Feature Weighted Growing Self-Organising Maps // arXiv Preprint. – 2025. https://doi.org/10.48550/arXiv.2503.11732
  10. Benabdeslem K., Lebbah M. Feature Selection for Self-Organizing Map // 29th International Conference on Information Technology Interfaces. – Cavtat, Croatia. – 2007. – pp. 45-50. https://doi.org/10.1109/ITI.2007.4283742
  11. Ceylan O., Taskin Kaya G. Feature Selection Using Self Organizing Map Oriented Evolutionary Approach // IEEE International Geoscience and Remote Sensing Symposium (IGARSS). – Brussels, Belgium. – 2021. – pp. 4003-4006. https://doi.org/10.1109/IGARSS47720.2021.9553491
  12. Pratiwi D. The Use of Self Organizing Map Method and Feature Selection in Image Database Classification System // International Journal of Computer Science. – 2012. – № 9.
  13. Khelil M. I., Ladjal M., Brik Y., Ouali M. A. Self-Organizing Maps-Based Features Selection with Deep LSTM and SVM Classification Approaches for Advanced Water Quality Monitoring // International Journal of Intelligent Engineering and Systems. – 2022. – Vol. 15. – № 3. – pp. 90-102. https://doi.org/10.22266/ijies2022.0630.09
  14. Zang Z., Xu Y., Lu L., Geng Y., Yang S., Li S. Z. UDRN: Unified Dimensional Reduction Neural Network for feature selection and feature projection // Neural Networks. – 2023. – Vol. 161. – pp. 626-637. https://doi.org/10.1016/j.neunet.2023.02.018
  15. Mwangi B., Tian S. T., Jair C. S. A Review of Feature Reduction Techniques in Neuroimaging // Neuroinformatics. – 2014. – Vol. 12. – № 2. – pp. 229-244. https://doi.org/10.1007/s12021-013-9204-3
  16. Cheng X. A Comprehensive Study of Feature Selection Techniques in Machine Learning Models // Insights in Computer, Signals and Systems. – 2024. – Vol. 1. – pp. 65-78. https://doi.org/10.70088/xpf2b276
  17. Rossi R., Murari A., Gelfusa M. A deep learning framework for feature selection and dimensional analysis: Variational explainable neural networks // Knowledge-Based Systems. – 2025. – Vol. 324. – 113940. https://doi.org/10.1016/j.knosys.2025.113940
  18. Passimier A., Folco P., Raimondi D., Birolo G., Moreau Y., Fariselli P. A quantitative benchmark of neural network feature selection methods onomics data // Scientific Reports. – 2024. – Vol. 14. – 31180. https://doi.org/10.1038/s41598-024-82583-5
  19. Trelina A., Procházka A. Binary Stochastic Filtering: a Solution for Supervised Feature Selection and Neural Network Shape Optimization // arXiv Preprint. – 2019. https://doi.org/10.48550/arXiv.1902.04510
  20. Ougiaroglou S., Diamantaras K. I., Evangelidis G. Exploring the effect of data reduction on neural network and support vector machine classification // Neurocomputing. – 2018. – Vol. 280. – pp. 101-110. https://doi.org/10.1016/j.neucom.2017.08.076
  21. Jia W., Sun M., Liang J., Hou S. Feature dimensionality reduction: a review // Complex & Intelligent Systems. – 2022. – Vol. 8. – pp. 2663-2693. https://doi.org/10.1007/s40747-021-00650-0
  22. Ivashchenko O., Fedin S. Improving The Som Algorithm To Ensure Stability And Reproducibility Of Data Clustering Results // System Research and Information Technologies. – 2025. – № 4.
  23. Ivashchenko, O. V., & Fedin, S. S. Optymizatsiia alhorytmu Kokhonena dlia zabezpechennia vidtvoriuvanosti rezultativ klasteryzatsii // Shtuchnyi intelekt ta informatsiini tekhnolohii: materialy Pershoi mizhnarodnoi naukovo-praktychnoi konferentsii (June 3–4, 2024, Kyiv, Ukraine). Kyiv: NUFT, 2024, pp. 226–227.
  24. Bhandari P. How to Find Interquartile Range (IQR) | Calculator & Examples. [Online]. Available: https://www.scribbr.com/statistics/interquartile-range/
  25. Ivashchenko O., Fedin S. Supporting Decision-Making in the Segmentation of Telecommunications Company Customers Using Specialized Software // Modern Engineering and Innovative Technologies. – 2025. – № 41-01. – pp. 116-136. https://doi.org/10.30890/2567-5273.2025-41-01-002
  26. Jonatasv. Metrics Evaluation: MSE, RMSE, MPE and MAPE. – 2024. [Online]. Available: https://medium.com/@jonatasv/metrics-evaluation-mse-rmse-mae-and-mape-317cab85a26b
  27. Shaun Turney. Coefficient of Determination (R²) | Calculation & Interpretation. – 2023. [Online]. Available:https://www.scribbr.com/statistics/coefficient-of-determination/
  28. Ugur Turan. A Correlation Coefficients Analysis on Innovative Sustainable Development Groups // EUREKA: Social and Humanities. – 2020. – Vol. 1. – № 1. – pp. 46-55. https://doi.org/10.21303/2504-5571.2020.001130

View full text (PDF)