HYBRID CLUSTERING OPTIMIZATION MODEL FOR INTELLIGENT DECISION SUPPORT SYSTEMS
DOI: https://doi.org/10.31548/itees.2025.01.009
Keywords: Clustering, Big Data, Decision Support Systems, Algorithm Optimization, Parallel Computing, Hybrid Models, Machine Learning
Abstract
The article presents an approach to optimizing clustering algorithms within decision support systems (DSS) in big data environments. It analyzes the scalability, computational complexity, and result stability problems typical of classical methods such as K-Means, DBSCAN, and Agglomerative Clustering. An improved hybrid algorithm, K-Means++ Hybrid, is proposed that combines parallel computing mechanisms, adaptive parameter tuning, and dynamic control of the iterative search process. The research methodology rests on systems analysis, mathematical modeling, and experimental testing using datasets from the UCI Repository and GPU acceleration technologies (CUDA). Experimental results indicate that the proposed approach reduces clustering execution time by approximately 43% compared to baseline algorithms, while increasing the silhouette coefficient to 0.73 and reducing CPU energy consumption by 20–25%. The resulting model is robust when processing heterogeneous datasets and can be integrated into systems for traffic flow analysis, financial risk assessment, and environmental monitoring. The approach provides a foundation for building adaptive intelligent data analysis modules that support scalability, result interpretability, and real-time operation in streaming analytics systems. Future research should focus on integrating hybrid clustering with deep learning models and evolutionary optimization algorithms.
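The abstract names the ingredients of the method but does not give its procedure, so the following is only a minimal sketch of two of them under stated assumptions: standard K-Means++ seeding, and a simple form of dynamic control of the iterative search (stopping once the largest centroid shift falls below a tolerance), followed by the silhouette coefficient used as the quality measure. It is not the authors' K-Means++ Hybrid. It omits the parallel and GPU parts entirely, and the function names, the `tol` parameter, and the toy data are hypothetical choices made for illustration.

```python
import math
import random


def kmeans_pp_init(points, k, rng):
    """Pick k starting centroids with K-Means++ seeding (D^2 weighting).

    Assumes the data contain at least k distinct points.
    """
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        # Squared distance from each point to its nearest chosen centroid.
        d2 = [min(math.dist(p, c) ** 2 for c in centroids) for p in points]
        centroids.append(rng.choices(points, weights=d2, k=1)[0])
    return centroids


def kmeans(points, k, max_iter=100, tol=1e-4, seed=0):
    """Lloyd iterations that stop early once centroids move less than tol."""
    rng = random.Random(seed)
    centroids = kmeans_pp_init(points, k, rng)
    labels = [0] * len(points)
    iteration = 0
    for iteration in range(1, max_iter + 1):
        # Assignment step: nearest centroid for every point.
        labels = [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                  for p in points]
        # Update step: mean of each cluster (an empty cluster keeps its centroid).
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centroids.append(centroids[j])
        shift = max(math.dist(a, b) for a, b in zip(centroids, new_centroids))
        centroids = new_centroids
        if shift < tol:  # early stop: the search has stabilised
            break
    return labels, centroids, iteration


def silhouette(points, labels):
    """Mean silhouette coefficient; singleton clusters contribute 0."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for other_lab, other in clusters.items() if other_lab != lab)
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)


# Toy data (hypothetical): two small, well-separated groups in 2-D.
demo_points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
               (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
demo_labels, demo_centroids, demo_iters = kmeans(demo_points, k=2)
print("iterations:", demo_iters)
print("silhouette:", round(silhouette(demo_points, demo_labels), 3))
```

The tolerance-based stop is one common way to cut iterations without changing the final partition on well-separated data. Whether the article's dynamic control works this way is not stated in the abstract.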
License
Copyright (c) 2025 Information technologies in economics and environmental sciences

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.