1. Introduction
Generative artificial intelligence (AI) has experienced significant advancements in recent years, particularly in the development of large language models (LLMs). These models possess the ability to process and generate human-quality text, enabling them to undertake a wide array of complex tasks. Through exposure to extensive datasets during pre-training, LLMs acquire the capacity to develop a nuanced understanding of language and its intricacies.
Artificial intelligence (AI) is recognized as a key driver of the Fourth Industrial Revolution [
1], impacting diverse sectors [
2] including education. Integrating AI into education presents both challenges and opportunities, particularly in transforming teaching methodologies and enhancing learning efficiency [
3,
4]. The term AIEd is now widely used to denote the application of these technologies within educational contexts [
5].
UNESCO [
6] emphasizes the potential of AI to democratize knowledge and personalize learning experiences. However, it cautions that the implementation of AI in education must be guided by principles of inclusion and equity to mitigate the risk of exacerbating existing educational inequalities. This is particularly crucial in contexts with significant technological disparities, privacy concerns, and potential data biases.
In the ongoing pursuit of effective and motivating educational methods, active methodologies have been increasingly implemented in classrooms. By placing students at the center of their learning journey, these methodologies foster a more engaging and impactful educational experience. Active methodologies are defined as pedagogical approaches that integrate traditional pedagogical principles with innovative practices, enabling students to experiment, reflect, and internalize knowledge through meaningful and contextualized processes. These processes often utilize real or simulated experiences [
7,
8,
9,
10]. This student-centered approach fosters autonomy, creativity, and critical thinking in future professionals, empowering them to actively engage with and transform their reality [
11].
Hwang et al. [
12] have identified four primary roles for AI in education: intelligent tutor, tutee, learning tool or companion, and decision advisor. Through various AI models and applications, educators can generate diverse resources, including text, images, and videos. AI also enables educators to automate tasks such as delivering tutorials, creating personalized content, and conducting assessments [
5,
6]. This technology offers opportunities to adapt educational content to individual student needs, particularly for learners with disabilities or those in remote areas with limited access to specialized teachers [
6]. Furthermore, AI facilitates personalized learning pathways, improves accessibility, customizes teaching methodologies, fosters collaboration, supports adaptive learning environments, and promotes the development of essential skills, including critical thinking and problem-solving [
4,
13].
While the possibilities of AI in education are promising and seemingly endless, achieving optimal outcomes hinges on educators possessing a thorough understanding of both pedagogical and technical aspects [
12,
14]. Specifically, effective communication with AI models through prompts necessitates strong language skills, a nuanced understanding of the model’s capabilities and limitations, and a deep knowledge of the educational context [
15,
16], and adequate digital skills are essential [
6]. A poorly formulated prompt can lead to inaccurate, irrelevant, or ambiguous information [
17]. Therefore, prompt design is critical for ensuring optimal model performance, and a proper understanding of the model’s characteristics is key to crafting effective prompts. However, this remains a significant challenge for non-specialist educators, who often struggle to achieve satisfactory results when interacting with AI models [
18,
19].
Prominent active learning methodologies include project-based learning, case- and challenge-based learning, collaborative learning, the flipped classroom, and game-based learning [
8,
9]. Within game-based learning, two key strategies are particularly noteworthy: gamification and serious games.
Gamification is defined as the application of game design mechanics, elements, and techniques to non-game contexts, such as health, marketing, or education, to engage users and solve problems [
20,
21,
22]. Key components of this technique include points, badges, leaderboards, levels, avatars, challenges, storytelling, and other interactive elements.
In contrast to gamification, which integrates specific game elements into non-game contexts, serious games are complete games (digital or otherwise) designed with primary purposes beyond entertainment [
23]. From a simulation perspective, educational escape rooms and breakout EDU activities have gained significant popularity in recent years, yielding positive learning outcomes [
24]. Educational escape rooms are immersive learning environments where students, typically organized in teams, must collaborate to solve challenges (often called puzzles) within a time limit to “escape” a designated space. These activities incorporate interactivity and engaging narratives to enhance learning [
25]. Educational breakouts, or breakout EDUs, was similar to escape rooms but with a different objective: instead of escaping a physical space, participants solve a series of problems within a time limit to decipher codes and unlock one or more locked boxes [
26].
AI serves as a powerful tool to assist educators in managing and designing their courses. Whether an experienced educator seeking support and innovative ideas or a novice looking to implement new strategies in instructional design, AI models offer valuable support [
4,
27]. However, effectively leveraging these AI tools requires careful consideration of prompt design and construction. As previously discussed, a well-crafted prompt is crucial for achieving optimal results and avoiding inaccurate or irrelevant outputs.
One of the most recent and significant advancements in ChatGPT technology is the ability for users to design their own specialized AI assistants, known as GPTs or custom GPTs. This functionality allows for the creation of tailored AI models for more controlled generative text environments, trained on specific datasets and designed for specific purposes. An AI assistant is an entity (machine) capable of understanding user instructions, thanks to its underlying LLM, and applying pre-defined parameters and knowledge to achieve its designated objectives [
28].
Sajja et al. [
29] define an AI assistant as a computer system that utilizes AI techniques, such as natural language processing and machine learning, to understand, interpret, and respond to user requests. These assistants can perform a diverse array of tasks, including providing information, automating processes, offering personalized assistance, and facilitating interaction with other systems.
A significant limitation of utilizing LLMs in academia is their occasional generation of irrelevant or incorrect information. This can stem from the model’s lack of specific knowledge, nuanced understanding, or analytical reasoning capabilities regarding the given prompt. Such inaccuracies can undermine the perceived usefulness, credibility, and trustworthiness of this technology in academic contexts [
30,
31]. However, this issue can be mitigated by creating a dedicated knowledge base for the AI assistant, as demonstrated by Castleman et al. [
32], this knowledge base can be populated with relevant documents and data deemed essential by the creator, thereby enhancing the assistant’s accuracy and effectiveness.
Training the AI assistant with this curated documentation significantly enhances the precision and relevance of its responses, ensuring better alignment with the knowledge contained within those resources. This, in turn, fosters greater user satisfaction and trust in the AI assistant’s capabilities.
This method of creating customized AI applications significantly expands the possibilities of applying artificial intelligence in education [
33]. These pre-designed AI assistants, with their specialized knowledge bases, reduce the need for users to possess advanced prompt engineering skills. This is a critical advantage, as educators with limited technical expertise can readily utilize these tools to enhance their productivity and teaching effectiveness [
30]. In the educational context, AI assistants can support teachers in designing effective pedagogical strategies by providing relevant ideas, guidelines, and even customized learning content tailored to specific student needs.
Two primary challenges emerge in this context. Firstly, due to the novelty of AI in education, many educators lack the digital literacy and understanding required to effectively utilize AI tools, particularly in formulating effective prompts. Secondly, less experienced teachers often lack the pedagogical knowledge and confidence to design and implement game-based learning strategies within their subject areas. To address these challenges, an AI assistant was developed, following an extensive analysis of existing AI models, specifically ChatGPT [
34,
35,
36]. This AI assistant incorporates a dedicated knowledge base and precise instructions (a “megaprompt”) to facilitate effective AI utilization in education. The primary objective was to provide educators with a user-friendly tool that empowers them to leverage the benefits of AI without requiring expertise in prompt engineering, gamification, or serious game design. This tool aims to guide and assist teachers in creating engaging, gamified classroom scenarios, enabling them to design effective teaching strategies based on various game-based learning approaches.
This educational research presents the results of evaluating the effectiveness of the AI assistant. A rubric serves as the primary evaluation instrument. The study encompasses five distinct user interaction scenarios, with four tests conducted for each scenario. These scenarios feature diverse learning situations contextualized within various subjects of university degree and master’s programs.
This educational research investigates the effectiveness of a specialized AI assistant, ‘GamifIcA Edu’, designed to democratize the implementation of gamification and serious games in educational settings. By removing the need for specialized technical knowledge in AI and game-based learning, this tool enables educators with varying levels of expertise to readily integrate innovative pedagogical approaches into their teaching practices. This study evaluates the efficacy of ‘GamifIcA Edu’ in supporting educators and enhancing student learning experiences through gamified activities.
Based on this experience, the following research questions are proposed:
In what ways does the AI assistant ensure alignment with educational objectives when designing gamified activities?
How does the AI assistant maintain a consistent and coherent approach throughout the process of generating gamified learning activities?
How does the AI assistant balance consistency and flexibility when generating gamified activities?
The primary motivation of this research is to develop a GPT assistant that serves as a tutor for educators in designing and implementing gamified learning activities. This paper is organized as follows:
Section 2 describes the functionalities of ‘GamifIcA Edu’ and the testing methodology.
Section 3 from evaluating the AI assistant’s performance are then presented.
Section 4 analyses these findings, considering the potential applications of ‘GamifIcA Edu’ within diverse educational frameworks.
Section 5 are also discussed. Finally,
Section 6 summarizes the key findings and contributions.
4. Discussion
This section presents an analysis of the AI assistant (custom GPT)’s performance in supporting educators with gamification design, based on the application of a general evaluation rubric. The evaluation reveals a comprehensive understanding of the assistant’s capabilities, strengths, and limitations. Through this analysis, we aim to address the research questions proposed for this study. The following discussion examines key findings across critical dimensions, including alignment, coherence, consistency, and flexibility.
4.1. In What Ways Does the AI Assistant Ensure Alignment with Educational Objectives When Designing Gamified Activities?
Across all evaluated scenarios, the assistant demonstrated adequate performance without requiring advanced prompt engineering knowledge. This finding aligns with previous research by Sánchez-Prieto et al. [
38], which also observed user-friendly interaction patterns with AI assistants.
The alignment of text-generative AI assistants, such as GPTs, is crucial for ensuring that they meet their intended objectives and provide practical value. A well-aligned assistant generates accurate, relevant, and contextually appropriate responses while minimizing biases, errors, and undesirable behaviors. This alignment not only enhances the quality of interactions but also mitigates risks associated with misinformation, inappropriate content generation, and misinterpretation of instructions. In scientific, educational, and professional contexts, where accuracy and reliability are paramount, proper alignment enables these models to serve as dependable tools for complex problem-solving, decision support, and communication tasks. In education specifically, alignment is essential for providing effective guidance to teachers in developing and implementing gamified learning activities. Conversely, misalignment can undermine trust in the technology and limit its adoption in critical educational settings.
Fulgencio’s [
39] research indicates that assistant alignment depends on three key elements: a robust knowledge base, well-defined behavioral parameters in the assistant’s core instructions, and clear response protocols for various user requests.
The evaluation rubric results demonstrated the AI assistant’s exceptional alignment with these principles, achieving scores above 96/100 across all five test scenarios. The assistant consistently provided accurate information without errors, a crucial aspect achieved through well-designed instructions and a comprehensive knowledge base [
32].
Detailed scenario analysis reveals that the assistant’s alignment extends beyond narrative integration to encompass both pedagogical understanding and practical implementation. In Scenario 1, the breakout EDU activities demonstrated this comprehensive alignment by not only meeting established objectives but also providing teachers with specific tools to develop students’ collaborative skills, critical thinking, and problem-solving abilities. The activities showcased innovative approaches to learning, particularly in implementing team-based problem-solving dynamics where students collaborated using digital tools to progress through sequential challenges.
However, Scenario 5 revealed some limitations in the assistant’s alignment, particularly in Tests 1 and 3. This scenario was designed to evaluate the model’s ability to redirect non-gamification queries toward gamified approaches without directly addressing the original requests. In Test 1, the assistant’s responses were overly general and failed to establish clear connections with pedagogical objectives. Similarly, Test 3, while providing detailed proposals, did not successfully redirect interactions toward a gamified framework, indicating potential difficulties in contextual query interpretation.