The reduction of excessive academic burden in China’s basic education system has been established as a central objective of national education reform and has become a subject of intense policy debate. To elucidate the complex strategic interactions that shape the implementation of the “Double Reduction” policy, a multi-agent evolutionary game model was constructed incorporating three principal stakeholder groups: government authorities, schools and teachers, and students and parents. Replicator dynamic equations were employed to examine the evolutionary stability of stakeholder strategies and the conditions under which equilibrium outcomes emerge. Through numerical simulations, the influence of regulatory enforcement intensity on behavioral trajectories and convergence patterns was evaluated. The results reveal that asymptotically stable equilibria exist, with optimal system performance achieved when government bodies maintain active and credible regulatory oversight, educational institutions engage in substantive and sustained burden-reduction efforts, and families adopt cooperative and adaptive responses. By clarifying the mechanisms through which stakeholder interactions determine collective outcomes, this study provides theoretical support for the refinement of policy coordination and the long-term enhancement of education governance capacity. These findings contribute not only to the understanding of the “Double Reduction” policy’s systemic impact but also to broader discussions on the role of evolutionary game theory in evaluating multi-agent policy interventions in education systems.
The psychometric validity of multiple-choice questions (MCQs) generated by an advanced Artificial Intelligence (AI) language model (ChatGPT) was evaluated in comparison with those developed by experienced human instructors, with a focus on mathematics teacher education. Two parallel 30-item MCQ tests—one human-designed and one AI-generated—were administered to 30 mathematics teacher trainees. A comprehensive psychometric analysis was conducted using six metrics: item difficulty index (Pi), discrimination index (D), point-biserial correlation, item-test correlation (Rit), Cronbach’s alpha (α) for internal consistency, and score variance. The analysis was facilitated by the Analysis of Didactic Items with Excel (AnDIE) tool. Results indicated that the human-authored MCQs exhibited acceptable difficulty (mean Pi = 0.55), moderate discrimination power (mean D = 0.31), and strong internal consistency (Cronbach’s α = 0.752). In contrast, the AI-generated MCQs were found to be substantially more difficult (mean Pi = 0.22), demonstrated weak discrimination (mean D = 0.16), and yielded negative internal consistency reliability (Cronbach’s α = −0.1), raising concerns about their psychometric quality. While AI-generated assessments offer advantages in terms of scalability and speed, the findings underscore the necessity of expert human review to ensure content validity, construct alignment, and pedagogical appropriateness. These results suggest that AI, in its current form, is not yet equipped to autonomously generate assessment instruments of sufficient quality for high-stakes educational settings. A hybrid test design model is therefore advocated, wherein AI is leveraged for initial item drafting, followed by rigorous human refinement. This approach may enhance both efficiency and quality in the development of educational assessments. The implications extend to educators, assessment designers, and developers of educational AI systems, highlighting the need for collaborative human-AI frameworks to achieve reliable, valid, and pedagogically sound testing instruments.
This study investigates the role of Artificial Intelligence (AI) in sustainable education through a bibliometric analysis, aiming to explore research trends, key contributors, citation analysis, co-authorship, and thematic developments in the field. As AI becomes increasingly integrated into Education, it is crucial to understand its impact on learning personalization, institutional efficiency, and sustainability. The study also identifies research gaps and provides recommendations for future exploration. The study employs a bibliometric and content analysis methodology using Scopus data. Two hundred seventy-six documents (2016-2025) were analyzed through descriptive statistics, citation analysis, co-word analysis, and co-authorship networks, utilizing VOSviewer and Biblioshiny for data visualization. The analysis examines publication trends, top-cited articles, leading institutions, and international collaborations to map the intellectual landscape of AI in sustainable education. The findings indicate a significant increase in AI-related publications after 2019, reflecting growing global interest. India, the USA, and China lead research output, while Sustainability (Switzerland) and Lecture Notes in Networks and Systems are the most prominent publication sources. The co-authorship analysis highlights strong global research collaborations, with the UK, Brazil, and China playing key roles. Thematic clustering reveals four major research areas: AI-driven Environmental Education, AI in Education, sustainable education frameworks, and AI's technical advancements in learning systems. This study provides a comprehensive, macro-level bibliometric analysis that maps global research dynamics, identifies intellectual structures, and visualizes collaborative networks in AI and sustainable education. Despite its contributions, the study has several limitations. First, while Scopus offers broad and reputable coverage of peer-reviewed literature, the exclusive reliance on this database limits the inclusion of potentially relevant studies indexed in other databases such as Web of Science (WoS). This may restrict the diversity and comprehensiveness of the findings. Future research should consider cross-validating results using multiple databases to ensure a more holistic understanding of AI in sustainable education. Second, the exclusion of non-English publications may limit the diversity of perspectives. Third, the study primarily focuses on journal articles and conference papers, excluding books and institutional reports that might offer more profound insights.
The extent to which students’ cultural values influence mathematical problem-solving skills was investigated, with emphasis placed on the moderating effect of prior mathematical knowledge and the mediating role of student motivation. A mixed-methods design was employed to ensure both quantitative and qualitative dimensions of the inquiry were addressed, enabling a comprehensive understanding of the underlying relationships. A purposive sample of 370 students from culturally diverse regions of Ghana was selected to ensure contextual validity and sociocultural relevance. It was found that students’ cultural values significantly shaped their problem-solving performance, particularly in relation to cognitive processing strategies and the selection of problem-solving heuristics. The relationship between cultural values and problem-solving skills was moderated by students’ prior knowledge, with students possessing a stronger foundational understanding of mathematics deriving greater benefit from cultural alignment. Contrary to expectations, the mediating effect of student motivation on this relationship was not supported, suggesting that while motivation is influenced by cultural values, it does not serve as a direct conduit for enhanced problem-solving capability. These findings underscore the necessity of incorporating culturally responsive pedagogical strategies that recognize and harness cultural value systems as cognitive assets. Furthermore, implications for curriculum development and teacher training were discussed, with a recommendation that future research explore context-specific interventions that operationalize cultural capital to improve mathematical learning trajectories.
Although methodological innovations have reshaped many aspects of scientific writing, literature reviews remain one of its most structurally underdeveloped and conceptually inconsistent components. Existing approaches often fail to communicate the functional role of individual sources within the research argument, leaving both readers and reviewers without a transparent sense of contribution, coherence, or originality. This paper introduces the Pyramid of Contribution Review (PCR), a novel framework that visually and functionally maps references according to their role in the manuscript (Introduction, Methodology, Results, Discussion, Gap) and their level of relevance. Through a mixed-methods validation process, including expert-based Delphi design (n = 28) and a large-scale evaluation survey (n = 118), the method was rigorously tested across disciplines. Statistical analyses reveal that manuscripts perceived to employ the PCR model are 3.45 times more likely to be rated as publishable compared to those relying on conventional narrative reviews. Experts overwhelmingly endorsed the model for its clarity, strategic value, and pedagogical utility. This study positions the PCR framework not only as a solution to a long-standing structural gap in scientific writing but as a forward-looking standard for literature organization in high-impact research. The future of scholarly communication requires not just citation density but citation precision, exactly what the PCR model provides.
The effectiveness of a microlearning-supported flipped classroom model in improving learning achievement and student attitudes was investigated among vocational college students enrolled in Information Technology (IT) courses in Zibo City, China. While the flipped classroom model—characterized by pre-class engagement with instructional content and in-class participatory learning—has been widely adopted in vocational education, concerns regarding cognitive overload and limited student engagement persist. To address these challenges, microlearning was integrated to deliver content in concise, targeted segments intended to enhance comprehension and reduce extraneous cognitive load. A quasi-experimental design was employed involving 60 first-year students, who were randomly assigned to either an experimental group (microlearning-supported flipped classroom) or a control group (traditional flipped classroom). Learning outcomes were evaluated using a 50-item IT achievement test, while student attitudes were assessed through a 20-item Likert-scale questionnaire covering four attitudinal dimensions. High instrument validity, i.e., average Scale-level Content Validity Index (SCVI/ave) = 0.977, and internal reliability (Cronbach’s α = 0.958) were established. No significant differences were observed in the pre-test scores between groups, confirming baseline equivalence. Post-intervention results demonstrated a statistically significant improvement in the experimental group (M = 52.733, SD = 3.805) compared to the control group (M = 49.600, SD = 3.838), t (58) = 3.376, p = 0.002), indicating enhanced academic performance. Favorable shifts in learning attitudes were also observed among students exposed to the microlearning-enhanced model, although the four-week intervention period constrained the generalizability of these attitudinal outcomes. These findings suggest that the incorporation of microlearning elements into flipped classroom pedagogies can foster more effective engagement and lead to measurable improvements in academic performance within vocational IT education contexts. Future research involving extended implementation periods and larger, more diverse sample populations is recommended to further validate the durability and scalability of these effects.
A scientifically rigorous evaluation of college teachers’ engagement is essential for enhancing human resource management practices within higher education institutions. In this study, a fuzzy decision-making framework was constructed to facilitate the objective assessment of college teachers’ engagement in Chinese universities. Drawing on Rich’s tripartite conceptualization of engagement—comprising cognitive focus, emotional involvement, and behavioral effort—a hierarchical evaluation index system was developed, consisting of three primary dimensions and seventeen secondary indicators. To address the inherent uncertainty and subjectivity in engagement assessment, a fuzzy entropy weight method was employed to determine the relative importance of each indicator. Subsequently, a novel fuzzy-grey correlation evaluation method was proposed, integrating grey and fuzzy mathematics theories to comprehensively assess engagement levels. The applicability and robustness of the proposed framework were demonstrated through a case study involving four university instructors from a higher education institution in Tianjin. The results revealed that the fuzzy-grey correlation approach effectively distinguished varying levels of engagement among the sampled instructors, offering a more nuanced and data-driven foundation for engagement-based performance evaluations. This methodological framework provides a practical tool for decision-makers and education administrators aiming to implement equitable and evidence-based faculty management strategies.
In the context of rapidly advancing digital technology, where touchscreen interactions dominate, the tactile sensory development of children is increasingly compromised. This shift towards digital media can hinder the ability of children to effectively engage with and observe their surroundings. One promising solution to this issue is the integration of art appreciation into educational practices. However, in regions such as Indonesia, there is a noticeable scarcity of comprehensive learning kits aimed at teaching art appreciation. This study addresses this gap by designing and developing an art appreciation learning kit intended for children aged 7 to 11, aiming to teach art appreciation through artful thinking (AT). The kit employs the “see, think, wonder” (STW) thinking routine, a structured three-step process that encourages children to observe art, analyze their observations, and engage with the art through inquiry. The analysis, design, development, implementation, and evaluation (ADDIE) framework was employed as the instructional design model. Additionally, a qualitative research through design (RtD) methodology was adopted to guide the design process and ensure the creation of an innovative educational tool. The developed learning kit integrates physical components, multimedia resources, and hands-on arts and crafts activities that complement the STW routine, thereby fostering deeper engagement and critical thinking skills among young learners. The study emphasizes the value of employing the ADDIE model to assess the learning needs and challenges faced by children, particularly during the STW activity. Collaboration with educators during the design and development phases was identified as crucial for refining the learning kit. Key recommendations include the integration of graphic visualizations, clear demonstrations, and interactive activities to enhance children’s engagement and enthusiasm for art appreciation. The findings offer empirical evidence supporting the effective use of the ADDIE model in educational kit design, providing a valuable reference for future product designers in the educational technology field.