Approach to hybrid load management in Fat-Tree web clusters
DOI:
https://doi.org/10.20535/2786-8729.7.2025.338564Keywords:
load forecasting, web cluster, Fat-Tree topology, fault tolerance, traffic balancing, stochastic failuresAbstract
The paper presents an approach to hybrid load management in a web cluster that is capable of providing adaptive request balancing based on load prediction and resilience to random web server failures. The proposed architecture is built upon the Fat-Tree topology, which ensures high scalability, structural redundancy, and efficient routing within the cluster network. The developed system performs load forecasting using moving average methods and Erlang-based queueing models, enabling the estimation of overload probabilities and proactive redistribution of computational resources. Four representative simulation scenarios were analyzed: baseline load, peak load, dynamic traffic variations, and random server failures. The obtained results demonstrate enhanced system reliability, reduced average response time, and more balanced utilization of cluster resources. In the context of rapidly growing web services and user traffic volumes, the issue of maintaining high reliability and efficiency of clustered infrastructures becomes increasingly significant. Even with robust topologies such as Fat-Tree, irregular traffic patterns and sudden surges in client requests can cause local overloads and performance degradation. Random node failures further complicate cluster management, necessitating the use of adaptive and predictive control mechanisms. The proposed model integrates Fat-Tree network simulation with statistical forecasting algorithms, forming the basis for proactive load management. This integration allows for minimizing service degradation risks, dynamically responding to workload changes, and maintaining stable operation of web infrastructures under partial node failures. The architecture shows strong potential for real-time implementation in large-scale distributed web systems. It can be further enhanced by incorporating machine learning or wavelet-based forecasting methods to improve the accuracy of load estimation and system adaptability.
References
C. E. Leiserson, “Fat-trees: universal networks for hardware-efficient supercomputing,” IEEE Transactions on Computers, vol. C-34, no. 10, pp. 892–901, 1985. https://doi.org/10.1109/TC.1985.6312192 .
M. Al-Fares, A. Loukissas, and A. Vahdat, “A scalable, commodity data center network architecture,” ACM SIGCOMM Computer Communication Review, vol. 38, no. 4, pp. 63–74, 2008. https://doi.org/10.1145/1402946.1402967 .
A. Greenberg et al., “VL2: A scalable and flexible data center network,” Communications of the ACM, vol. 54, no. 3, pp. 95–104, 2009. https://doi.org/10.1145/1897852.1897877 .
C. Guo et al., “BCube: A high performance, server-centric network architecture for modular data centers,” Computer Communication Review, vol. 39, pp. 63–74, 2009. https://doi.org/10.1145/1592568.1592577 .
Y. Lohumi, D. Gangodkar, P. Srivastava, M. Khan, A. Alahmadia and A. Alahmadia, “Load Balancing in Cloud Environment: A State-of-the-Art Review,” IEEE Access, pp. 1–14, 2023. https://doi.org/10.1109/ACCESS.2023.3337146 .
L. Wang et al., “Cloud computing: A perspective study,” New Generation Computing, vol. 28, no. 2, pp. 137–146, 2010. https://doi.org/10.1007/s00354-008-0081-5 .
M. Mitzenmacher, “The power of two choices in randomized load balancing,” IEEE Transactions on Parallel and Distributed Systems, vol. 12, no. 10, pp. 1094–1104, 2001. https://doi.org/10.1109/71.963420 .
J. Dean and L. A. Barroso, “The tail at scale,” Communications of the ACM, vol. 56, no. 2, pp. 74–80, 2013. https://doi.org/10.1145/2408776.2408794 .
V. Cristea, C. Dobre, C. Stratan, F. Pop and A. Costan, “Architectures for large scale distributed systems,” in Large-Scale Distributed Computing and Applications, IGI Global, 2010. https://doi.org/10.4018/978-1-61520-703-9.ch002 .
M. Ferriol-Galmés et al., “RouteNet-Erlang: A Graph Neural Network for Network Performance Evaluation,” in IEEE INFOCOM 2022–IEEE Conference on Computer Communications, 2022. https://doi.org/10.1109/INFOCOM48880.2022.9796944 .
K. Vimhala and L. Gowrishankar, “Modelling Dynamic Traffic Loads in Multiserver Queues using G/G/k Queue,” E3S Web of Conferences, vol. 484, 01023, 2024. https://doi.org/10.1051/e3sconf/202448401023 .
C. Y. Chew, G. Teng, and Y. S. Lai, “Simulation of Erlang and Negative Binomial Distributions using the Generalized Lambert W Function,” Journal of Computational Mathematics and Data Science, vol. 10, 100092, 2024. https://doi.org/10.1016/j.jcmds.2024.100092 .
A. U. Rehman, R. Aguiar, and J. Barraca, “Fault-Tolerance in the Scope of Cloud Computing,” IEEE Access, vol. 10, pp. 1–21, 2022. https://doi.org/10.1109/ACCESS.2022.3182211 .
J. Kumar, A. Singh, and R. Buyya, “Self Directed Learning based Workload Forecasting Model for Cloud Resource Management,” Information Sciences, vol. 543, 2021. https://doi.org/10.1016/j.ins.2020.07.012 .
S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.
F. Agull'o, A. Gutierrez-Torre, J. Torres and J. Ll. Berral, “Enhancing the output of time series forecasting algorithms for cloud resource provisioning,” Future Generation Computer Systems, vol. 170, article 107833, 2025. https://doi.org/10.1016/j.future.2025.107833 .
K. O. Radchenko, “Application of discrete wavelet transformations for forecasting the level of load on the web server of general purpose computer networks,” Computer-Integrated Technologies: Education, Science, Production, no. 45, pp. 90–96, 2021. https://doi.org/10.36910/6775-2524-0560-2021-45-13 .
K. O. Radchenko, “Conceptual model for ensuring the effectiveness of web server load forecasting,” Scientific Notes of the V.I. Vernadsky Taurida National University. Series: Technical Sciences, vol. 31(70), no. 6, pp. 135–141, 2020. https://doi.org/10.32838/TNU-2663-5941/2020.6-1/23 .
K. O. Radchenko, “Peculiarities of predicting the level of web traffic in general purpose computer networks,” Problems of Informatization and Management, no. 3(71), pp. 41–50, 2022. https://doi.org/10.18372/2073-4751.71.17002 .
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Information, Computing and Intelligent systems

This work is licensed under a Creative Commons Attribution 4.0 International License.