Research Article
Open access
Published on 10 September 2024
Zheng, H.; Xu, K.; Zhang, M.; Tan, H.; Li, H. (2024). Efficient resource allocation in cloud computing environments using AI-driven predictive analytics. Applied and Computational Engineering, 82, 17-23.

Efficient resource allocation in cloud computing environments using AI-driven predictive analytics

Haotian Zheng *,1, Kangming Xu 2, Mingxuan Zhang 3, Hao Tan 4, Hanzhe Li 5
  • 1 Electrical & Computer Engineering, New York University, NY, USA
  • 2 Computer Science and Engineering, Santa Clara University, CA, USA
  • 3 Computer Science, University of California San Diego, CA, USA
  • 4 Data Science, New York University, NY, USA
  • 5 Computer Engineering, New York University, NY, USA

* Author to whom correspondence should be addressed.

https://doi.org/10.54254/2755-2721/82/2024GLG0055

Abstract

This paper proposes an AI-driven approach to efficient resource allocation in cloud computing environments using predictive analytics. The study addresses the challenge of optimizing resource utilization while maintaining high quality of service in dynamic cloud infrastructures. A hybrid predictive model combining XGBoost and LSTM networks is developed to forecast workload patterns across various time horizons. The model uses historical data from a large-scale cloud environment comprising 1000 servers and over 52 million data points. A dynamic resource scaling algorithm integrates the predictive model's outputs with real-time system state information to make proactive allocation decisions. The framework also incorporates workload consolidation, resource oversubscription, and elastic resource pools to maximize utilization efficiency. Experimental results show improvements in key performance indicators: resource utilization rises from 65% to 83%, the service level agreement (SLA) violation rate falls from 2.5% to 0.8%, and energy efficiency improves, with power usage effectiveness (PUE) decreasing from 1.4 to 1.18. Comparative analysis shows that the proposed model outperforms existing methods in both prediction accuracy and resource allocation efficiency. The study contributes a comprehensive, AI-driven solution that addresses the complexities of modern cloud environments and paves the way for more intelligent and autonomous cloud resource management systems.
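
The idea of combining a workload forecast with the real-time system state to make a proactive scaling decision can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the names `SystemState`, `blend_forecasts`, and `servers_needed`, the fixed-weight blending rule, and the sizing formula are all hypothetical. The defaults of 0.83 target utilization and 1000 servers are borrowed from the figures reported above purely for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class SystemState:
    """Hypothetical snapshot of the cluster at decision time."""
    active_servers: int   # servers currently powered on
    current_load: float   # current demand, in server-equivalents


def blend_forecasts(tree_forecast: float, lstm_forecast: float,
                    tree_weight: float = 0.5) -> float:
    """Combine two forecasts with a fixed weight.

    Assumption: a simple weighted average stands in for the hybrid
    XGBoost + LSTM model, whose actual combination rule is not given here.
    """
    return tree_weight * tree_forecast + (1.0 - tree_weight) * lstm_forecast


def servers_needed(state: SystemState, forecast_load: float,
                   target_utilization: float = 0.83,
                   max_servers: int = 1000) -> int:
    """Return a server count that keeps utilization at or below the target.

    Plans for the larger of the current and forecast load, so a predicted
    drop never under-provisions demand that is already present.
    """
    planning_load = max(state.current_load, forecast_load)
    required = math.ceil(planning_load / target_utilization)
    # Clamp to the physical pool: at least one server, at most max_servers.
    return max(1, min(required, max_servers))


if __name__ == "__main__":
    state = SystemState(active_servers=600, current_load=390.0)
    forecast = blend_forecasts(tree_forecast=500.0, lstm_forecast=480.0)
    target = servers_needed(state, forecast)
    print(f"forecast load: {forecast}, scale {state.active_servers} -> {target} servers")
```

In this sketch the controller scales toward the forecast before the load arrives, which is the sense in which the allocation is proactive rather than reactive. A production controller would also need cooldown periods and SLA-aware safety margins, which are omitted here.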

Keywords

Cloud Computing, Artificial Intelligence, Resource Allocation, Predictive Analytics

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of the 2nd International Conference on Machine Learning and Automation

Conference website: https://2024.confmla.org/
ISBN: 978-1-83558-565-8 (Print) / 978-1-83558-566-5 (Online)
Conference date: 21 November 2024
Editors: Mustafa ISTANBULLU, Anil Fernando
Series: Applied and Computational Engineering
Volume number: Vol. 82
ISSN: 2755-2721 (Print) / 2755-273X (Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish in this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).