ALGORITHMIC METHODS OF PARALLEL PROCESSING OF BIG DATA IN PREDICTIVE ANALYTICS SYSTEMS

Authors

  • N. Zaplatynskyi Lviv National Environmental University
  • V. Fiialkovskyi Lviv National Environmental University
  • S. Shtrohryn Lviv National Environmental University
  • Квасниця Kvasnytsia Lviv National Environmental University
  • А. Tatomyr Lviv National Environmental University

DOI:

https://doi.org/10.32718/agroengineering2025.29.232-241

Keywords:

algorithmic methods, predictive analytics, GPU computing, scalability, artificial intelligence, machine learning, distributed computing

Abstract

The rapid growth of data volumes and the increasing complexity of predictive and analytical models necessitate the efficient use of parallel processing in modern information systems. The aim of this paper is to substantiate algorithmic and architectural approaches to parallel processing of big data in predictive analytics systems, taking into account performance, scalability, and adaptability requirements. The study provides an analytical comparison of parallelism models, software frameworks, and computing architectures, and examines system-level constraints that affect the efficiency of predictive analytics systems in distributed and heterogeneous environments. It is shown that isolated scaling of computational resources does not ensure proportional performance gains, whereas the highest efficiency is achieved by hybrid configurations that combine multiple parallelism models with hardware accelerators. The scientific novelty of the study lies in the systematization of algorithmic and architectural approaches to parallel big data processing in predictive analytics with consideration of adaptability and system constraints, as well as in the formulation of an integrated analytical approach to combining software platforms and specialized computing architectures. The obtained results can be applied to the design of high-performance predictive analytics systems. The article also outlines directions for the further development of parallel predictive and analytical systems, in particular in the context of integrating machine learning methods, stream data processing, and dynamic management of computational resources. Special attention is paid to aligning algorithmic solutions with the characteristics of the hardware platform to minimize overhead costs and improve the energy efficiency of computations. The proposed analytical approach can serve as a methodological basis for building adaptive high-performance predictive analytics systems under conditions of variable workloads and heterogeneous computational environments.

References

Bu, L., Zhang, H., Xing, H., & Wu, L. (2021). Research on parallel data processing of data mining platform in the background of cloud computing. Journal of Intelligent Systems, 30, 479–486. https://doi.org / 10.1515 / jisys – 2020–0113.

Dritsas, E., & Trigka, M. (2025). Exploring the intersection of machine learning and big data: A survey. Machine Learning and Knowledge Extraction, 7 (1), 13. https://doi.org / 10.3390 / make7010013.

Ghimire, A., & Amsaad, F. (2024). A parallel approach to enhance the performance of supervised machine learning realized in a multicore environment. Machine Learning and Knowledge Extraction, 6 (3), 1840–1856. https://doi.org / 10.3390 / make6030090.

Jamarani, A., Haddadi, S., Sarvizadeh, R., et al. (2024). Big data and predictive analytics: A systematic review of applications. Artificial Intelligence Review, 57, 176–253. https://doi.org / 10.1007 / s10462–024–10811–5.

Kolisetty, V. V., & Rajput, D. S. (2020). A review on the significance of machine learning for data analysis in big data. Jordanian Journal of Computer and Information Technology, 6, 155–171.

Laouni, D. (2021). Dynamic distributed and parallel machine learning algorithms for big data mining processing. Data Technologies and Applications, 4, 558–601. https://doi.org / 10.1108 / dta – 06–2021–0153

Laouni, D., Bensaber, D. A., & Adjoudj, R. (2018). Big data analytics for prediction: Parallel processing of the big learning base with the possibility of improving the final result of the prediction. Information Discovery and Delivery, 46 (2), 147–160. https://doi.org / 10.1108 / IDD–02–2018–0002.

Naeem, M., Jamal, T., Diaz-Martinez, J., Butt, S. A., Montesano, N., Tariq, M. I., De-la Hoz-Franco, E., & De-La-Hoz-Valdiris, E. (2022). Trends and future perspective challenges in big data. In Advances in Intelligent Data Analysis and Applications: Proceedings of the Sixth Euro-China Conference on Intelligent Data Analysis and Applications (pp. 309–325). Springer.

Natesan, P., Sathishkumar, V. E., Mathivanan, S., Venkatasen, V., Maheshwari, J., Jayagopal, P., & Shaikh Muhammad, A. (2023). A distributed framework for predictive analytics using big data and MapReduce parallel programming. Mathematical Problems in Engineering, 2023, Article 6048891. https://doi.org / 10.1155 / 2023 / 604889.

Oo, M. C. M., & Thein, T. (2022). An efficient predictive analytics system for high dimensional big data. Journal of King Saud University – Computer and Information Sciences, 34, 1521–1532. https://doi.org / 10.1016 / j. jksuci. 2019.09.001.

Raghavendra, S. (2024). Scalability of data science algorithms: Empowering big data analytics. Journal of Artificial Intelligence and Computing Techniques, 1, 1–9.

Rakhimov, M., Ochilov, M., Javliev, S., & Nasimov, R. (2024). Analysis and possibilities of parallel approach in big data processing. In Proceedings of the 8th International Conference on Future Networks & Distributed Systems (ICFNDS '24) (pp. 20–25). ACM. https://doi.org / 10.1145 / 3726122.3726126.

Rakhimov, M., Zaripova, D., Javliev, S., & Karimberdiyev, J. (2024). Deep learning parallel approach using CUDA technology. AIP Conference Proceedings, 3244 (1), 030003. https://doi.org / 10.1063 / 5.0241439.

Silva, J., Hernández Palma, H., Niebles Núñez, W., & Ovallos-Gazabon, D. (2020). Parallel algorithm for reduction of data processing time in big data. Journal of Physics: Conference Series, 1432 (1), 012095. https://doi.org / 10.1088 / 1742–6596 / 1432 / 1 / 012095.

Tamilselvan, K., M. N. S., Saranya, A., Abdul Jaleel, D., Rajani Kanth, T. V., & Govardhan, S. D. (2025). Optimizing data processing in big data systems using hybrid machine learning techniques. International Journal of Computational and Experimental Science and Engineering, 11 (1), 775–782. https://doi.org / 10.22399 / ijcesen. 9361.

Tyagi, A. K., & G, R. (2019). Machine learning with big data. In Proceedings of the International Conference on Sustainable Computing in Science, Technology and Management (SUSCOM) (Amity University Rajasthan, Jaipur, India, 26–28 February 2019). Amity University Rajasthan.

Xu, R., Baracaldo, N., & Joshi, J. (2021). Privacy-preserving machine learning: Methods, challenges and directions. arXiv preprint. https://arxiv.org / abs / 2108.04417

Published

2025-12-01

How to Cite

Zaplatynskyi Н., Fiialkovskyi В., Shtrohryn С., Kvasnytsia Т., & Tatomyr А. (2025). ALGORITHMIC METHODS OF PARALLEL PROCESSING OF BIG DATA IN PREDICTIVE ANALYTICS SYSTEMS. Bulletin of Lviv National Environmental University. Series Agroengineering Research, (29), 232–241. https://doi.org/10.32718/agroengineering2025.29.232-241

Issue

Section

INFORMATION TECHNOLOGIES AND SYSTEMS. PROJECT MANAGEMENT IN AGRO ENGINEERING

Most read articles by the same author(s)