
Comparative Study of Multi-Armed Bandit Algorithms in Clinical Trials
- 1 Universtiy of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093
- 2 Qingdao Hengxing University of Science and Technology, 588 Jiushui East Street, Qingdao, Shandong, China
- 3 University of Washington, 1410 NE Campus Pkwy, Seattle, WA 98195
- 4 Universtiy of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093
* Author to whom correspondence should be addressed.
Abstract
In recent years, with the rapid development of the information age, the influence of Multi Armed Bandit Algorithms (MAB) models in clinical trials for disease prevention has been increasing. In this study, based on Python programming language, Multi-Armed Bandit Algorithms (MAB) algorithm, Upper Confidence Bound (UCB) algorithm, Adaptive Epsilon-Greedy Algorithm, and Thompson Sampling (TS) algorithms to validate the idea of preventing, controlling and predicting the occurrence of diseases. The results show that the MAB model can effectively solve various decision-making problems in clinical trials, improve the efficiency of access to medical care, save doctors ‘diagnosis time, and at the same time achieve the prevention and treatment of diseases while minimising patients’ pain. This study is dedicated to proposing a more effective decision-making method and verifies that the method has a wide range of applications and great potential for development today.
Keywords
Multi-Armed Bandit Algorithms, Python, Clinical Trial
[1]. Rafferty, A., Ying, H., & Williams, J. (2019). "Statistical consequences of using multi-armed bandits to conduct adaptive educational experiments". Journal of Educational Data Mining, 11(1), 47-79.
[2]. Fouché, E., Komiyama, J., & Böhm, K. (2019). "Scaling multi-armed bandit algorithms". Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining.
[3]. Avadhanula, V. et al. (2021). "Stochastic Bandits for Multi-Platform Budget Optimization in Online Advertising". Proceedings of the Web Conference 2021. pp. 1472-1483. URL: https://dl.acm.org/doi/abs/10.1145/3442381.3450074.
[4]. Zhang, Q. (2024). "Real-world Applications of Bandit Algorithms: Insights and Innovations". Transactions on Computer Science and Intelligent Systems Research, 5, 753-758.
[5]. Aziz, M., Kaufmann, E., & Riviere, M.-K. (2021). "On multi-armed bandit designs for dose-finding trials". Journal of Machine Learning Research, 22(14), 1-38.
[6]. Bulucu, C. (2019). "Personalizing treatments via contextual multi-armed bandits by identifying relevance". MS thesis. Bilkent Universitesi (Turkey).
[7]. Chen, T., Gangrade, A., & Saligrama, V. (2022). "Strategies for safe multi-armed bandits with logarithmic regret and risk". International Conference on Machine Learning. PMLR, pp. 3198-3210.
[8]. Shin, J., Ramdas, A., & Rinaldo, A. (2019). "Are sample means in multi-armed bandits positively or negatively biased?". Advances in Neural Information Processing Systems, Vol. 32.
[9]. Huang, J., Dai, Y., & Huang, L. (2022). "Adaptive best-of-both-worlds algorithm for heavy-tailed multi-armed bandits". International Conference on Machine Learning. PMLR.
[10]. Shi, L., Wang, J., & Wu, T. (2023). "Statistical inference on multi-armed bandits with delayed feedback". International Conference on Machine Learning. PMLR.
[11]. Bouneffouf, D., & Rish, I. (2019). "A survey on practical applications of multi-armed and contextual bandits". arXiv preprint arXiv:1904.10040.
[12]. Chen, N. (2021). "Multi-armed bandit requiring monotone arm sequences". Advances in Neural Information Processing Systems, Vol. 34, 16093-16103.
Cite this article
Huang,W.;Wang,W.;Wu,Y.;Xi,C. (2024). Comparative Study of Multi-Armed Bandit Algorithms in Clinical Trials. Applied and Computational Engineering,83,45-51.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of CONF-MLA 2024 Workshop: Semantic Communication Based Complexity Scalable Image Transmission System for Resource Constrained Devices
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).