Practical applications of advanced cloud services and generative AI systems in medical image analysis

Research Article
Open access

Practical applications of advanced cloud services and generative AI systems in medical image analysis

Jingyu Xu 1* , Binbin Wu 2 , Jiaxin Huang 3 , Yulu Gong 4 , Yifan Zhang 5 , Bo Liu 6
  • 1 Computer Information Technology,Northern Arizona University    
  • 2 Heating Ventilation and Air Conditioning Engineering,Tsinghua University    
  • 3 Information Studies,Trine University    
  • 4 Computer & Information Technology,Northern Arizona University    
  • 5 Executive Master of Business Administration, Amazon Connect Technology Services (Beijing) Co., Ltd    
  • 6 Software Engineering,Zhejiang University    
  • *corresponding author jyxu01@outlook.com
ACE Vol.64
ISSN (Print): 2755-273X
ISSN (Online): 2755-2721
ISBN (Print): 978-1-83558-425-5
ISBN (Online): 978-1-83558-426-2

Abstract

The medical field is one of the important fields in the application of artificial intelligence technology. With the explosive growth and diversification of medical data, as well as the continuous improvement of medical needs and challenges, artificial intelligence technology is playing an increasingly important role in the medical field. Artificial intelligence technologies represented by computer vision, natural language processing, and machine learning have been widely penetrated into diverse scenarios such as medical imaging, health management, medical information, and drug research and development, and have become an important driving force for improving the level and quality of medical services. The article explores the transformative potential of generative AI in medical imaging, emphasizing its ability to generate synthetic data, enhance images, aid in anomaly detection, and facilitate image-to-image translation. Despite challenges like model complexity, the applications of generative models in healthcare, including Med-PaLM 2 technology, show promising results. By addressing limitations in dataset size and diversity, these models contribute to more accurate diagnoses and improved patient outcomes. However, ethical considerations and collaboration among stakeholders are essential for responsible implementation. Through experiments leveraging GANs to augment brain tumor MRI datasets, the study demonstrates how generative AI can enhance image quality and diversity, ultimately advancing medical diagnostics and patient care.

Keywords:

Artificial intelligence in medicine, Generative AI in medical imaging, Medical data diversity, Ethical considerations in healthcare AI

Xu,J.;Wu,B.;Huang,J.;Gong,Y.;Zhang,Y.;Liu,B. (2024). Practical applications of advanced cloud services and generative AI systems in medical image analysis. Applied and Computational Engineering,64,81-86.
Export citation

1. Introduction

Generative AI is a series of AI techniques and models designed to learn the underlying patterns and structures of data sets and generate new data points that may belong to the original data set. Combined with the current status of China's primary medical and health service ability, the author discusses the potential application of generative artificial intelligence technology represented by ChatGPT in the improvement of primary medical and health service ability. The ability to generate synthetic data has been one of the main goals of generating models in medical imaging, as it offers a viable solution for sharing data while protecting patient privacy. Recent medical imaging studies have demonstrated the promising ability to generate high-quality medical images. In addition to generating synthetic data, generative models have a wide range of other applications in medical imaging. One of these is anomaly detection, generating models that can be used to identify anomalies in medical images. This task is particularly useful for diagnosing diseases and detecting potential health risks. Another application is image-to-image translation (style transfer), where generative models can be trained to convert images from one mode to another. For example, these models can be used to convert a CT [1-4]scan to an MRI scan, a label map to an MRI image, or an MRI scan to an anomaly map. This is of great significance for medical imaging and can speed up human imaging. Overall, the potential applications of generative models in medical imaging are very broad, and their use is likely to continue to grow. However, this area is accompanied by challenges, including the use of various quality assessment indicators and the complexity of the models. These issues may hinder the progress and implementation of the model.

2. Related Work

2.1. Generative AI

Generated type artificial intelligence, also known as" artificial intelligence to generate content, "the goal is to learn a great deal of training data to understand the probability distribution of the data, then use this understanding to generate new, similar to the content of the training data, such as images, text, Audio, etc. The application scenarios for generative AI are very broad, and it has shown great potential and value in many areas. Compared with traditional "analytical AI", generative AI has unique advantages. The most important feature is its ability to create new content, not just the identification or classification of existing data. Generative AI systems describe the distribution of training data by building probabilistic models. The goal of the model is to capture the internal structure and regularity of the training data.[5-7] Once a generative AI model learns the distribution of the training data, it can be used to generate new data samples that are somewhat similar to the training data, but with certain differences. In view of its potential, it is necessary for us to think deeply about its development prospects in the field of stomatology.

2.2. Traditional medical imaging challenges

In 2018, OpenAI introduced the first generation of generative pre-trained transformer (GPT), bringing natural language processing into the "pre-training" era. By highlighting the enormous potential, benefits, challenges, and ethical considerations of AI, this study aims to foster an ongoing dialogue about responsibly using AI's transformative power to improve medical practice and patient health. For now, most MRI and PET-CT are currently operating at full capacity, and patients often need at least three days or several weeks to book an MRI. The core reasons are mainly reflected in two aspects: one is the huge gap in the number of diagnostic doctors and technicians; Second, the detection capacity of image detection equipment is insufficient, and a scan takes 30 minutes to 1 hour. For most of the current hospitals, in order to solve the problem of patient queuing mentioned earlier, it is necessary to purchase imaging equipment with higher scanning speed and quality, but a high-end imaging equipment costs tens of millions, or upgrading existing equipment also needs to invest millions, but this is only the cost of money, and the time cost is high.[8] It is reported that the cycle of upgrading or purchasing imaging equipment is about six months to one year, and in the process, it will cause the limited service capacity of the hospital, and cause some patients to choose other hospitals because they do not want to continue to wait. Therefore, taking into account the downstream problem, when the image quality is low in the actual situation, it will affect the quantitative diagnosis effect based on the image in the later stage, and then affect the final treatment effect.

2.3. Generative artificial intelligence and healthcare

The application of generative intelligence in the field of medical imaging has achieved some remarkable results. For example, generative models such as generative adversarial networks (GANs) are used to synthesize photorealistic medical image data to increase the diversity and number of data sets, thereby improving the generalization ability and performance of the model. In this context, Med-PaLM 2 technology introduces advanced generative intelligent algorithms, combined with expertise and technology in the field of medical imaging, aiming to further advance the analysis, diagnosis and treatment of medical imaging. Med-PaLM 2 [9]May include features such as more efficient medical image synthesis, more accurate lesion detection and segmentation, and more reliable pathological image analysis to provide healthcare workers with a more powerful and comprehensive image analysis tool. Relevant research reports that Med-PaLM 2 is one of the latest medical image detection technologies in the field of generative intelligence, and the biggest difference from the existing traditional AI image analysis system, which provides conclusions after analyzing the image, but does not explain the reason.

3. Experiment and Methodology

Medical imaging datasets are often limited in size and diversity, posing challenges for training accurate and robust machine learning models. Generative AI addresses this issue by synthesizing realistic and diverse medical images, thereby augmenting the available dataset. [10]This augmentation not only increases the quantity of data but also enhances its diversity, enabling more comprehensive training of machine learning models. By learning the underlying patterns of normal and abnormal anatomy, generative AI can assist radiologists and clinicians in detecting and localizing anomalies with greater accuracy and efficiency.

3.1. Experimental design

In this study, we harness the power of generative AI, specifically GANs, to address the challenges associated with limited and heterogeneous brain tumor MRI datasets. By generating synthetic brain tumor MRI images, we aim to augment the existing dataset and improve the performance of tumor detection and classification algorithms. Through this approach, we seek to leverage the strengths of generative AI to enhance the accuracy, efficiency, and reliability of brain tumor diagnosis, ultimately contributing to improved patient outcomes and clinical decision-making in neuro-oncology.

3.2. Generative Modelling

At the current population level of the country (1.417 billion), this means only 0.0035 percent are diagnosed with Brain Tumor! Let's assume that all MRI scans produce 100% accurate results. This would mean that for every 10,000 MRI scans, we only get 35 samples showing Brain Tumor versus many more that don't This, combined with other problems in accessing medical data, would lead to Machine Learning problems such as Class Imbalance and Bias. Generative adversarial networks are implicit likelihood models that generate data samples from the statistical distribution of the data. They’re used to copy variations within the dataset. They use a combination of two networks: generator and discriminator.

3.3. loss function or the value function

\( \mathrm{\nabla}_{\theta_d\frac{1}{m}} \) \( \sum_{i=1}^{m} [\log{D}(x^{(i)})+\log{(1 - D(G(Z^{(i)})))] } \) (1)

It measures the distance between the distribution of the data generated and the distribution of the real data. Both the generator and the discriminator have their own loss functions. The generator tries to minimize the loss function while the discriminator tries to maximize.

3.4. Image data preprocessing

fig1

Figure 1. Raw image data processing results

This section loads a dataset of MRI images containing brain tumors through the load_images function and converts them to grayscale images, then adjusts them to the specified size (128x128). Then, the image is normalized, the pixel values are scaled to the [-1, 1] range, and the image data is realigned to fit the model input. In addition, the Matplotlib library was used to draw 20 real brain tumor MRI image samples from the data set and display them in the form of gray color maps.

3.5. Model building and training

Step 1: Build a Generator model: Define a generator model that takes a random noise vector as input and converts the noise vector into an image through multiple Conv2DTranspose layers. The final output of the generator is a composite image that resembles the real image.

Step 2: Build a Discriminator model: A discriminator model is defined that distinguishes the synthetic image generated by the generator from the real image. The discriminator classifies the input image as real or false through multiple convolution layers (Conv2D) and fully connected layers (Dense). Combine generator and discriminator into GAN model:[11] Combine generator and discriminator into one overall generative adversarial network (GAN) model. In a GAN model, the output of the generator is fed into a discriminator that evaluates the authenticity of the generated image.

Step 3: Define a generator sample visualization function: Define a function, sample_images, to generate and visualize the image samples generated by the generator model. The function takes a random noise vector as input and uses the generator model to generate a corresponding number of composite images, which are displayed in a specified format.

fig2

Figure 2. Training image results

The following can be learned from the training image results:

1) Generator Loss: With the progress of training, the loss of the generator gradually decreases, indicating that the generator gradually learns to generate more realistic images. 2) Discriminator Loss: The discriminator loss may fluctuate during training, but the overall trend is gradually decreasing, indicating that the discriminator gradually learns to distinguish between real and generated images. 3) Training progress: As the training progresses, the steps within each epoch (batch) are gradually completed, while the loss of generators and discriminators is also changing. To a certain extent, the training of generator and discriminator is mutually influenced, and their optimization goals are competing with each other. 4) Generated image quality: Finally, the image sample generated by the sample_images function can be used to evaluate the effect of the generator training. If the generated image quality is high and similar to the real image, it means that the generator training effect is good.

3.6. Generation distribution map

By comparing the distribution of the generated image with the real sample, the training effect of the generator is evaluated. Using the sns.distplot function, the distribution curves of pixel values of the real image and the generated image are plotted respectively, and then compared and displayed. If the two distributions overlap more, it means that the generated image has a higher distribution similarity to the real image.

fig3

Figure 3. Generate a distribution map of the sample

Combining generative intelligence with medical image processing technology, we can draw the following experimental conclusions: By using generative models, we can achieve the enhancement, reconstruction and synthesis of medical images, thereby improving the quality and quantity of images. [12-13] The distribution similarity between the generated image and the real image is high, indicating that the generated image can truly reflect the patient's condition, provide doctors with more comprehensive diagnostic information, promote the development of precision medicine, and improve the level of medical service and the treatment experience of patients. Therefore, the application of the combination of generative intelligence and medical image processing technology will bring significant improvements and sustainable benefits to the real-world medical field.

4. Conclusion

The integration of generative intelligence with medical image processing technology presents a transformative opportunity in the medical field. Through the synthesis, enhancement, and reconstruction of medical images, this technology enhances both the quality and quantity of available diagnostic data. The high similarity between generated and real images reflects the patient's condition accurately, offering healthcare professionals comprehensive diagnostic insights. This advancement promotes precision medicine, improves medical service quality, and enhances patient treatment experiences, promising significant and sustainable benefits in real-world medical practice. Experimental results demonstrate the effectiveness of generative models in augmenting medical image datasets and improving the quality of generated images. By leveraging techniques like Generative Adversarial Networks (GANs), these models address challenges associated with limited and heterogeneous datasets in medical imaging. Through image synthesis, enhancement, and reconstruction, generative AI facilitates more accurate diagnoses, personalized medicine approaches, and enhanced surgical planning. The high distribution similarity between generated and real images underscores the potential of this technology to advance medical diagnostics and patient care, heralding a new era in healthcare innovation. In the future trend, generative artificial intelligence will bring exponential growth to Al medical imaging enterprises, and the combination of comprehensive medical artificial intelligence models and medical imaging fields will release great potential. Therefore, Al medical imaging enterprises will rely on the ecological route to accelerate their own hematopoietic speed and improve the ability to commercialize.


References

[1]. Chen, Yizhou, et al. "Generative adversarial networks in medical image augmentation: a review." Computers in Biology and Medicine 144 (2022): 105382.

[2]. Singh, N. K., & Raza, K. (2021). Medical image generation using generative adversarial networks: A review. Health informatics: A computational perspective in healthcare, 77-96.

[3]. Xiao, J., Chen, Y., Ou, Y., Yu, H., & Xiao, Y. (2024). Baichuan2-Sum: Instruction Finetune Baichuan2-7B Model for Dialogue Summarization. arXiv preprint arXiv:2401.15496.

[4]. Huo, Shuning, et al. "Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research." arXiv preprint arXiv:2402.16038 (2024).

[5]. Xiang, Yafei, et al. "Text Understanding and Generation Using Transformer Models for Intelligent E-commerce Recommendations." arXiv preprint arXiv:2402.16035 (2024).

[6]. Zhu, Mengran, et al. "Utilizing GANs for Fraud Detection: Model Training with Synthetic Transaction Data." arXiv preprint arXiv:2402.09830 (2024).

[7]. Gong, Yulu, et al. "Enhancing Cybersecurity Resilience in Finance with Deep Learning for Advanced Threat Detection." arXiv preprint arXiv:2402.09820 (2024).

[8]. Cheng, Qishuo, et al. "Optimizing Portfolio Management and Risk Assessment in Digital Assets Using Deep Learning for Predictive Analysis." arXiv preprint arXiv:2402.15994 (2024).

[9]. Wu, Jiang, et al. "Data Pipeline Training: Integrating AutoML to Optimize the Data Flow of Machine Learning Models." arXiv preprint arXiv:2402.12916 (2024).

[10]. Zhou, Y., Tan, K., Shen, X., & He, Z. (2024). A Protein Structure Prediction Approach Leveraging Transformer and CNN Integration. arXiv preprint arXiv:2402.19095.

[11]. Zhang, Chenwei, et al. "Enhanced User Interaction in Operating Systems through Machine Learning Language Models." arXiv preprint arXiv:2403.00806 (2024)

[12]. Chen, Xiaoran, et al. "Deep generative models in the real-world: An open challenge from medical imaging." arXiv preprint arXiv:1806.05452 (2018).

[13]. Yi, Xin, Ekta Walia, and Paul Babyn. "Generative adversarial network in medical imaging: A review." Medical image analysis 58 (2019): 101552.


Cite this article

Xu,J.;Wu,B.;Huang,J.;Gong,Y.;Zhang,Y.;Liu,B. (2024). Practical applications of advanced cloud services and generative AI systems in medical image analysis. Applied and Computational Engineering,64,81-86.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of the 6th International Conference on Computing and Data Science

ISBN:978-1-83558-425-5(Print) / 978-1-83558-426-2(Online)
Editor:Alan Wang, Roman Bauer
Conference website: https://www.confcds.org/
Conference date: 12 September 2024
Series: Applied and Computational Engineering
Volume number: Vol.64
ISSN:2755-2721(Print) / 2755-273X(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).

References

[1]. Chen, Yizhou, et al. "Generative adversarial networks in medical image augmentation: a review." Computers in Biology and Medicine 144 (2022): 105382.

[2]. Singh, N. K., & Raza, K. (2021). Medical image generation using generative adversarial networks: A review. Health informatics: A computational perspective in healthcare, 77-96.

[3]. Xiao, J., Chen, Y., Ou, Y., Yu, H., & Xiao, Y. (2024). Baichuan2-Sum: Instruction Finetune Baichuan2-7B Model for Dialogue Summarization. arXiv preprint arXiv:2401.15496.

[4]. Huo, Shuning, et al. "Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research." arXiv preprint arXiv:2402.16038 (2024).

[5]. Xiang, Yafei, et al. "Text Understanding and Generation Using Transformer Models for Intelligent E-commerce Recommendations." arXiv preprint arXiv:2402.16035 (2024).

[6]. Zhu, Mengran, et al. "Utilizing GANs for Fraud Detection: Model Training with Synthetic Transaction Data." arXiv preprint arXiv:2402.09830 (2024).

[7]. Gong, Yulu, et al. "Enhancing Cybersecurity Resilience in Finance with Deep Learning for Advanced Threat Detection." arXiv preprint arXiv:2402.09820 (2024).

[8]. Cheng, Qishuo, et al. "Optimizing Portfolio Management and Risk Assessment in Digital Assets Using Deep Learning for Predictive Analysis." arXiv preprint arXiv:2402.15994 (2024).

[9]. Wu, Jiang, et al. "Data Pipeline Training: Integrating AutoML to Optimize the Data Flow of Machine Learning Models." arXiv preprint arXiv:2402.12916 (2024).

[10]. Zhou, Y., Tan, K., Shen, X., & He, Z. (2024). A Protein Structure Prediction Approach Leveraging Transformer and CNN Integration. arXiv preprint arXiv:2402.19095.

[11]. Zhang, Chenwei, et al. "Enhanced User Interaction in Operating Systems through Machine Learning Language Models." arXiv preprint arXiv:2403.00806 (2024)

[12]. Chen, Xiaoran, et al. "Deep generative models in the real-world: An open challenge from medical imaging." arXiv preprint arXiv:1806.05452 (2018).

[13]. Yi, Xin, Ekta Walia, and Paul Babyn. "Generative adversarial network in medical imaging: A review." Medical image analysis 58 (2019): 101552.