Application of large language models in the field of education

Shiyi Shen

doi:10.54254/2753-8818/34/20241163

1. Introduction

1.1. Background

With the rapid development of generative artificial intelligence, the application of large language models in the field of education has gradually become a focal point of attention in both the academic and industrial sectors. The emergence of this technology has brought unprecedented opportunities to education but is also accompanied by a series of potential challenges. Educators and researchers are actively exploring how to integrate large language models to enhance student learning experiences and teaching effectiveness.

1.2. Purpose

The purpose of this paper is to delve into the application of generative artificial intelligence in the field of education, with a particular focus on the integration of large language models. We aim to reveal the opportunities presented by this technological trend while concurrently examining challenges that may have adverse effects on learning. Through analysis of the latest research and collaborative projects, we seek to provide a comprehensive understanding and valuable insights for decision-makers, educational technology developers, and researchers in the field of education.

1.3. Research Questions and Objectives

In exploring the integration of large language models in education, we will focus on the following core questions: How do large language models alter the landscape of educational technology? Can the integration of large language models effectively enhance student learning outcomes? While exploring opportunities, how should we address potential challenges? By answering these questions, we aim to provide decision-makers and practitioners in the education field with in-depth and substantive insights to guide their decisions and practices in adopting large language models in educational technology.

1.4. Overview of Paper Structure

This paper will be divided into several sections, systematically exploring the application of generative artificial intelligence in education. The second section will provide an overview of the basic concepts of generative artificial intelligence and current application cases in the education sector. The third section will focus on the opportunities and challenges of large language models in education, categorized into two aspects: opportunities and challenges. Subsequent sections will concentrate on the integration of large language models in intelligent tutoring systems, analyzing collaborative projects through case studies to explore their potential value in cultivating student curiosity and question posing. Following that, we will conduct an in-depth study of curiosity-driven dialogue agents, combining empirical research cases to highlight the application of large language models in promoting curiosity-driven learning. The last two sections will perform a technical survey, evaluating the possibilities of multimodal integrated educational systems and the synergies between large language models and other technologies. Finally, the seventh section will summarize research findings and provide insights into future directions for integrating large language models in the field of education.

2. Application of Generative Artificial Intelligence in Education

2.1. Overview of Generative Artificial Intelligence

Generative artificial intelligence, as an emerging technology in the field of artificial intelligence, has gained prominence in the education sector in recent years. The core feature of this technology lies in its ability to autonomously generate text, understand language, and engage in interactions, providing educators and learners with a new paradigm for learning. The fundamental concept of generative artificial intelligence is to simulate the creative thinking processes of humans, enabling it to produce complex, context-aware natural language outputs [1].

2.2. Current Application Cases in Education

Currently, generative artificial intelligence has made significant strides in the field of education. Innovators in educational technology actively explore the integration of generative artificial intelligence into schools, online education platforms, and tutoring systems to enhance the personalization and effectiveness of learning. Real-time intelligent tutoring, smart Q&A systems, personalized learning recommendations, and other application cases are gradually becoming highlights in educational practices, offering students more intelligent and personalized learning support.

2.3. Technological Trends and Future Prospects

With the widespread application of generative artificial intelligence in education, technological trends continue to evolve. Particularly noteworthy is the rise of large language models, injecting greater language understanding and generation capabilities into generative artificial intelligence. In the future, we can expect to see generative artificial intelligence demonstrating its potential in various fields, including but not limited to more intelligent educational tutoring, instant problem-solving, and more creative learning experiences.

In this section, we will delve into the key application areas of generative artificial intelligence in education to reveal its potential advantages in enhancing educational effectiveness and driving learning innovation. Through the exploration of current technological trends and future prospects, we aim to present readers with a comprehensive overview of generative artificial intelligence in education, laying the foundation for in-depth analysis in subsequent chapters.

3. Large Language Models in the Education Sector: Opportunities and Challenges

3.1. Opportunities

The integration of large language models in education presents unprecedented opportunities for personalized learning. Leveraging deep learning and natural language processing techniques, large language models can comprehend students’ learning needs, interests, and subject proficiency, offering tailored learning content and support to each student. This opportunity allows educators to better address the diverse learning needs of students, thereby improving learning outcomes and nurturing the individual potential of each student.

The integration of large language models also empowers teachers with greater teaching authority. Through intelligent assistance systems, teachers can access real-time feedback, personalized teaching suggestions, and customized teaching resources, better meeting the needs of each student in the classroom. Teachers can focus more on individualized guidance, enhance classroom efficiency, and provide more targeted support during critical moments in student development [2].

3.2. Challenges

Despite providing new opportunities for personalized learning, the reliance on large language models may pose potential risks to children’s learning. Excessive dependence on automated learning systems may result in students losing their ability for independent learning and becoming overly reliant on machine models. Additionally, the accuracy of the content generated by the models needs careful consideration to prevent misinformation from negatively impacting students’ academic development.

The widespread adoption of large language models in education also brings challenges in terms of data privacy and ethics. The extensive collection and analysis of students’ personal information and learning data may raise concerns about privacy. Moreover, the training data for the models may contain inherent biases, potentially leading to unfair impacts on student groups. Addressing these issues requires the establishment of rigorous data privacy regulations and ethical frameworks to ensure the security of student information and equitable treatment.

In the following chapters, we will conduct a detailed analysis of these opportunities and challenges, aiming to provide educators, decision-makers, and researchers with a more comprehensive understanding and offering possible methods and recommendations for addressing these issues.

4. Integration of Intelligent Tutoring Systems and Large Language Models

4.1. Overview of Intelligent Tutoring Systems

Intelligent Tutoring Systems are educational tools that utilize artificial intelligence technology to provide intelligent learning support. These systems analyze students’ learning data, identify learning needs, and offer personalized teaching suggestions. Their goal is to optimize students’ learning experiences, helping educators better meet the subject-specific needs of students. In recent years, with the rise of large language models, Intelligent Tutoring Systems are gradually integrating this technology, further enhancing the systems’ level of intelligence [3].

4.2. Role of Large Language Models in Tutoring Systems

Large language models play a crucial role in Intelligent Tutoring Systems. Firstly, these models can deeply understand questions posed by students, infer potential subject knowledge, and provide specific and personalized answers. Secondly, based on students’ learning history and performance, large language models can adjust teaching content and difficulty, achieving more accurate personalized learning support. By simulating the human teaching process, large language models not only provide efficient learning paths for students but also offer real-time feedback and guidance to educators.

4.3. Case Study: Collaboration between EvidenceB and Inria Flowers

In-depth examination of the integration of large language models in Intelligent Tutoring Systems involves exploring the technical details through the collaborative project between EvidenceB and Inria Flowers. This project, founded on technological innovation, aims to enhance students’ learning outcomes in mathematics and science.

4.3.1. Exquisite Application of Natural Language Processing Technology. In this collaborative project, natural language processing technology serves as a crucial technical foundation. Through deep learning methods, large language models are trained to precisely understand questions posed by students. Employing state-of-the-art natural language processing algorithms, the models can capture the grammar structure, context, and potential knowledge points within questions, achieving more accurate semantic understanding [4].

The successful application of this technology involves training on a large-scale corpus and fine-tuning the model parameters. By using rich and diverse subject texts, the model gains a comprehensive understanding of terms and knowledge structures in various subject areas. Additionally, the fine-tuning process requires meticulous adjustments to adapt the model to specific subject domains, ensuring higher accuracy and adaptability in answering student questions [5].

4.3.2. Construction and Optimization of the Inference Engine. The inference engine of large language models is another technical highlight in this project. By combining the model’s profound understanding of subject knowledge with its logical reasoning capabilities, the inference engine can generate high-quality answers. This inference process may involve extracting information from knowledge graphs and designing logical reasoning rules.

On the technical front, building the inference engine requires handling large-scale knowledge graphs, ensuring the model can efficiently retrieve and comprehend information within them. Additionally, to enhance the performance of the inference engine, advanced technical means such as parallel computing and distributed computing may be needed to address computational demands when dealing with complex problems.

4.3.3. Personalized Algorithm for Dynamic Learning Paths. Achieving dynamic personalized learning paths involves advanced personalized algorithms. This project may employ reinforcement learning or recommendation system techniques from deep learning to dynamically adjust learning paths based on students’ subject proficiency and learning history.

On the technical front, this requires establishing a system capable of processing student data in real-time and providing personalized recommendations. The model may need to update estimates of students’ subject proficiency in real-time while considering the interrelatedness between subject areas for more accurate adjustments to the learning path. This process may involve online updates of model parameters and rapid recommendation algorithms.

4.3.4. Technical Challenges and Optimization Paths. While this project showcases the powerful application of large language models in Intelligent Tutoring Systems, it also faces some technical challenges. Real-time responsiveness and computational efficiency are crucial aspects that require model optimization to meet fast response requirements. Additionally, the construction of personalized learning paths necessitates careful handling of student data privacy and security, employing clever privacy protection techniques to balance personalization and privacy relationships [6].

On the technical front, further optimization paths may include streamlining model structures and applying hardware acceleration to enhance computational efficiency. Moreover, adopting privacy protection measures such as differential privacy or federated learning is essential to ensure the proper protection of student data.

4.3.5. Technological Prospects and Future Development Directions. Looking ahead, this collaborative project opens up new possibilities for the technological application of large language models in Intelligent Tutoring Systems. The technical team can further explore the applicability of the model to a broader range of subject areas and increase the system’s intelligence by introducing more complex algorithms and model structures. As technology continues to evolve, this project also provides valuable experience for introducing more innovative and efficient solutions in the field of educational technology in the future.

5. Implementing Curiosity-Based Learning Dialogue Agents

5.1. Importance of Curiosity-Based Learning

Curiosity-based learning, as a crucial learning mode, plays a key role in cultivating students’ spirit of active inquiry and problem-solving abilities. This learning approach not only encourages students to actively pose questions but also sparks their desire for in-depth exploration of knowledge. Curiosity-based learning emphasizes students’ proactive exploration of subjects, contributing to the development of innovative thinking and critical reasoning skills.

5.2. Application of Large Language Models in Facilitating Curiosity-Based Learning

The application of large language models in the field of education opens up new possibilities for promoting curiosity-based learning. These models possess the capability to understand and generate knowledge across multiple domains, enabling them to provide insightful and inspiring answers when learners pose questions. Through simulating dialogue, large language models can stimulate students’ curiosity, guide them to actively engage in subject discussions, and encourage them to pose more profound questions.

5.3. Abdelghani et al.’s Case Study (2023)

To further explore the practical effects of large language models in fostering curiosity-based learning, we will introduce the research case study conducted by Abdelghani et al. in 2023. This study focuses on how combining dialogue agents with large language models can stimulate curiosity-based learning in students. By analyzing the interaction between students and the dialogue agents, the research team gained insights into the topics, depth of questions posed by students, and feedback on the model-generated answers. This case study will provide us with practical insights into how large language models can promote curiosity-based learning, enhancing our understanding of the potential value of this technology in the educational process [7].

In the subsequent sections, we will delve deeper into the application of large language models in dialogue agents and how they influence students’ curiosity-based learning. Through the analysis of specific cases, our aim is to provide educators with practical recommendations on how to maximize the use of large language models in promoting students’ curiosity-based learning.

6. Technical Survey: Integration of Multi-Module Technologies in Education

6.1. Integrating Various Technological Modules in Educational Systems

The rapid development of educational technology prompts us to explore how to integrate different technological modules to construct a more comprehensive and efficient educational system. Integrating multi-module technologies such as virtual reality, artificial intelligence, and large language models holds the potential to provide students with a richer learning experience. This comprehensive approach aims to optimize the teaching process, personalize learning, and enhance student engagement and depth of understanding.

6.2. Improving the Synergy between Large Language Models and Other Technologies

In this technical survey, we will focus on exploring how to improve the synergy between large language models and other educational technologies. This includes integration with technologies like virtual reality and machine learning algorithms to optimize the generation of instructional content and personalized learning paths for students. By enhancing collaborative efforts, we aim to better leverage the strengths of various technologies, elevating the overall intelligence and effectiveness of the educational system.

6.3. Case Study by Kasneci et al. (2023)

In this study, we delve into a research case conducted by Kasneci et al. in 2023, focusing on technological innovations in multi-module integrated education. This case provides a technological perspective, emphasizing how different technological modules collaborate to improve the overall performance of the educational system.

6.3.1. Technological Framework for Multi-Module Integration. Kasneci et al.’s research case first focuses on constructing a multi-module technological framework that integrates several key educational technology modules. These modules may include technologies from areas such as speech recognition, natural language processing, computer vision, and machine learning. By synergistically utilizing these technological modules, the research team aims to achieve a more intelligent and comprehensive educational system [8].

In the design of the technological framework, key issues involve collaboration and communication between modules. Possible technical means may include establishing unified data exchange standards, designing effective interfaces, and employing distributed computing. This helps ensure that each module can efficiently share information, enabling effective decision-making and operations.

6.3.2. Integration of Speech Recognition and Natural Language Processing. Kasneci et al.’s research emphasizes the deep integration of speech recognition and natural language processing modules. By combining these two key technologies, the research team attempts to enhance the system’s understanding of student voice inputs and achieve more natural, seamless interactions. This may involve real-time processing of speech signals, semantic analysis of text, and adaptability of the model to different dialects and accents, posing technical challenges.

At the level of technical integration, advanced technologies like deep neural networks may be employed to establish robust correlations between speech and text. This deep integration enables the system to more accurately understand students’ questions and responses, providing finer support for personalized learning.

6.3.3. Application of Computer Vision in Education. Another crucial technological module is computer vision, which Kasneci et al. include as an integral part of the research case in education. This module may involve technologies such as facial expression recognition, attention detection, and gesture analysis. By combining information from computer vision and other modules, the system can gain a more comprehensive understanding of students’ states and behaviors during the learning process.

At the technical level, the integration of computer vision may require advanced technologies like convolutional neural networks in deep learning. This helps extract key information from image and video data, enabling more accurate assessment of students’ emotional states and learning behaviors. Simultaneously, the real-time processing of data and rapid inference by the model are focal points for technical optimization.

6.3.4. Real-time Decision Support by Machine Learning Algorithms. Kasneci et al.’s research further emphasizes the critical role of machine learning algorithms in real-time decision support. Through the analysis and learning from data integrated across multiple modules, the system can achieve real-time decision-making to adjust teaching strategies and personalize recommended learning content.

On the technical side, machine learning algorithms such as online learning and incremental learning may be employed to enable the system to adapt to students’ needs in a continuously changing learning environment. This may also include optimization and parameter adjustments to ensure the system maintains efficient performance over prolonged usage.

6.3.5. Technical Challenges and Future Directions. The research case by Kasneci et al. reveals technological challenges faced by multi-module integrated education systems. One such challenge is the coordination and communication between modules, requiring the design of an efficient technological framework. Additionally, addressing real-time processing and handling large-scale data are technical challenges.

In future technological development directions, further optimization of each module’s performance may be necessary, along with exploring more efficient methods of technological integration and researching new machine learning algorithms to enhance the system’s intelligence and adaptability. This research case provides valuable insights for constructing more intelligent multi-module education systems in the future.

7. Conclusion and Prospects

Through an in-depth exploration of generative artificial intelligence and large language models in the field of education, this paper has achieved a series of key research findings. Firstly, we discussed the application trends of generative artificial intelligence in education, with a particular focus on the rise of large language models. This trend has brought opportunities to education, including personalized learning and teacher empowerment, while also presenting a series of challenges, such as potential risks to children’s learning and ethical considerations regarding data privacy.

In the subsequent sections of the paper, we delved into the integration of large language models in intelligent tutoring systems, exploring their advantages and potential impacts on personalized learning support. We further highlighted the application of large language models in fostering curiosity-driven learning, inspiring student initiative through dialogue agents. Additionally, we introduced practical cases, such as the collaborative project between EvidenceB and Inria Flowers, as well as the research by Abdelghani et al., to showcase the practical application effects of large language models in educational environments.

In the Technical Survey section, we discussed how to integrate different technological modules to construct more intelligent educational systems and how to improve the synergy between large language models and other technologies. We introduced the research case by Kasneci et al., providing an in-depth analysis of the practical effects of multi-module technological integration in enhancing students’ academic achievements.

Looking ahead, we believe that the education sector will continue to benefit from the development of generative artificial intelligence and large language models. With ongoing technological advancements, we anticipate the emergence of more intelligent and personalized educational solutions. However, at the same time, it is essential to continuously address ethical and privacy issues arising from technology and take measures to ensure the fair and just application of these technologies.

Future research directions may include a deeper exploration of the actual teaching effects of large language models in education and further improvement in understanding students’ learning processes. Simultaneously, research on multi-module technological integration is expected to be refined to realize the collaborative advantages of different technological modules. We8 look forward to educators, technology developers, and researchers collectively striving to propel the education sector towards a more intelligent and inclusive future in this ever-evolving field.

References

[1]. Kasneci, E., Seßler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., … Kasneci, G. (2023, January 30). ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education. https://doi.org/10.35542/osf.io/5er8f

[2]. FLOWERS - 2022 - Annual activity report. (n.d.). https://radar.inria.fr/report/2022/flowers/index. html

[3]. Ayanwale, M. A., Sanusi, I. T., Adelana, O. P., Aruleba, K., & Oyelere, S. S. (2022). Teachers’ readiness and intention to teach artificial intelligence in schools. Computers & Education: Artificial Intelligence, 3, 100099.

[4]. https://doi.org/10.1016/j.caeai.2022.100099

[5]. Katsarou, E. (2021). The Effects of Computer Anxiety and Self-Efficacy on L2 Learners’ Self-Perceived Digital Competence and Satisfaction in Higher Education. Journal of Education and E-learning Research, 8(2), 158–172. https://doi.org/10.20448/journal.509.2021.82.158.172

[6]. Oyelere, S. S., Sanusi, I. T., Agbo, F. J., Oyelere, A. S., Omidiora, J. O., Adewumi, A. E., & Ogbebor, C. (2022). Artificial intelligence in African schools: Towards a contextualized approach. 2022 IEEE Global Engineering Education Conference (EDUCON). https://doi.org/10.1109/educon52537.2022.9766550

[7]. Kasneci, E., Sessler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., Gasser, U., Groh, G., Günnemann, S., Hüllermeier, E., Krusche, S., Kutyniok, G., Michaeli, T., Nerdel, C., Pfeffer, J., Poquet, O., Sailer, M., Schmidt, A., Seidel, T., . . . Kasneci, G. (2023). ChatGPT for good? On opportunities and challenges of large language models for education. Learning and Individual Differences, 103, 102274. https://doi.org/10.1016/j.lindif.2023.102274

[8]. Karch, T. (2022, October 3). Contrastive multimodal learning for emergence of Graphical Sensory-Motor Communication. arXiv.org. https://arxiv.org/abs/2210.06468

[9]. Abdelghani, R., Wang, Y. H., Yuan, X., Wang, T., Lucas, P., Sauzéon, H., & Oudeyer, P. Y. (2023). GPT-3 Driven Pedagogical Agent for Cultivating Curiosity in Children’s Questioning Skills. International Journal of Educational Artificial Intelligence, 1-36.

Cite this article

Shen,S. (2024). Application of large language models in the field of education. Theoretical and Natural Science,34,147-154.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of the 3rd International Conference on Computing Innovation and Applied Physics

ISBN：978-1-83558-369-2(Print) / 978-1-83558-370-8(Online)

Editor：Yazeed Ghadi

Conference website: https://www.confciap.org/

Conference date: 27 January 2024

Series: Theoretical and Natural Science

Volume number: Vol.34

ISSN：2753-8818(Print) / 2753-8826(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).

References

[2]. FLOWERS - 2022 - Annual activity report. (n.d.). https://radar.inria.fr/report/2022/flowers/index. html

[4]. https://doi.org/10.1016/j.caeai.2022.100099

[8]. Karch, T. (2022, October 3). Contrastive multimodal learning for emergence of Graphical Sensory-Motor Communication. arXiv.org. https://arxiv.org/abs/2210.06468