Method of horizontal pod scaling in Kubernetes to omit overregulation
DOI:
https://doi.org/10.20535/2786-8729.5.2024.315877Keywords:
Kubernetes, Microservices, Horizontal Pod Autoscaling, Proportional Regulation, Cloud ComputingAbstract
This paper describes the method of omitting the over-regulation effect that occurs under certain conditions by horizontal pod autoscaling microservices in the container application orchestration system Kubernetes. The effect was initially observed only for long-term HTTP WebSocket sessions, where it led to excessive use of computing resources, which reduced the efficiency of IT infrastructure management, and caused service failure. It was found that the overregulation effect is reproduced not only for connections with long-term HTTP sessions, such as HTTP WebSocket, but also for shorter-term REST HTTP sessions in case of increased delay in the metric collection cycle used for horizontal pod autoscaling. It is assumed that this effect happens due to the approach of implementing horizontal scaling controllers similar to the principles of proportional regulators in systems with negative feedback from the theory of automation and control. It is proposed to extend one of the methods used for optimizing the proportional controller to the problem consisting of reducing the time delay between scaling metrics collecting and upscale applied by the controller in Kubernetes. The applied method demonstrated its effectiveness, therefore, within the same methodology, an experiment was conducted on using the proportional-integral-differential controller for automatic horizontal scaling of pods. The results obtained showed why the proportional-integral-differential controller is not widespread among the overviewed Kubernetes solutions for horizontal automatic scaling. An assumption was made about the limitations of studying the downscaling process in Kubernetes due to the need to consider the quality of service when stopping pods and the need to collect indicator metrics using quality-of-service object management tools such as ISTIO.
References
E. Truyen, D.V. Landuyt, D. Preuveneers, B. Lagaisse W. Joosen “A Comprehensive Feature Comparison Study of Open-Source Container Orchestration Frameworks”, Appl. Sci. 2019, 9(5), pp. 35–43; https://doi.org/10.3390/app9050931.
B. Burns, J. Beda, K. Hightower and L. Evenson “Kubernetes: Up and Running”, O’Reilly Media inc, 2022 pp. 56–112 ; ISBN 978-1-098-11020-8.
D. C. Marinescu. “Chapter 8 – Cloud Hardware and Software”, Cloud Computing (Second Edition), Morgan Kaufmann, pp. 281–319, https://doi.org/10.1016/b978-0-12-812810-7.00011-x, ISBN 978-0-12-812810-7.
D.V. Martins “Scaling of Applications in Containers”. Master's thesis, 2023. pp. 13–15.
T. -T., Nguyen, Y. -J. Yeom, T. Kim, D. -H. Park, and S. Kim, “Horizontal Pod Autoscaling in Kubernetes for Elastic Container Orchestration”. Sensors, 20(16), 4621. 2020 pp. 13-15; https://doi.org/10.3390/s20164621.
O.І. Rolіk, S.F. Telenik, M.V. Yasochka, Upravlіnnya korporativnoyu іnfrastrukturoyu. Kyiv, Naukova Dumka, 2018, 576 p., ISBN 098-966-00-1665-1.
D.K. Alqahtani, A.N. Toosi, “Container Orchestration in Heterogeneous Edge Computing Environments. InResource Management in Distributed Systems”, 2024 pp. 151–168. https://doi.org/10.1007/978-981-97-2644-8_8, ISBN: 978-981-97-2643-1.
E. Casalicchio and V. Perciballi, "Auto-Scaling of Containers: The Impact of Relative and Absolute Metrics," 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS*W), Tucson, AZ, USA, 2017, pp. 207–214, https://doi.org/10.1109/FAS-W.2017.149.
G. Quattrocchi, E. Incerto, R. Pinciroli, C. Trubiani and L. Baresi, “Autoscaling Solutions for Cloud Applications Under Dynamic Workloads”, in IEEE Transactions on Services Computing, vol. 17, no. 3, pp. 804–820, May-June 2024, https://doi.org/10.1109/TSC.2024.3354062.
K. Johan, Å. Richard, M. Murray. “Feedback Control Systems”, 1963, p. 295 ISBN-13: 978-0-691-13576-2.
D. Skvorc, M. Horvat and S. Srbljic, “Performance evaluation of WebSocket protocol for implementation of full-duplex web streams”, 2014, 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia, 2014, pp. 1003–1008, https://doi.org/10.1109/MIPRO.2014.6859715.
S. Singh, C. H. Muntean and S. Gupta, “Boosting Microservice Resilience: An Evaluation of Istio’s Impact on Kubernetes Clusters Under Chaos”, 2024, 9th International Conference on Fog and Mobile Edge Computing (FMEC), Malmö, Sweden, 2024, pp. 245–252, https://doi.org/10.1109/FMEC62297.2024.10710237.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Information, Computing and Intelligent systems
This work is licensed under a Creative Commons Attribution 4.0 International License.