EVALUATION OF MOBILE EDUCATION APPS BY POPULAR GENERATIVE ARTIFICIAL INTELLIGENCE (GENAI) SYSTEMS: AN EXTENDED STUDY ON ITS CONSISTENCY AND THUS CONVERGENT VALIDITY

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As an extension to a previous pilot study, this article aims to analyze the consistency between and thus the convergent validity of a few popular generative artificial intelligence (GenAI) systems in evaluating popular mobile education apps’ usability, effectiveness, and efficiency. The three GenAI systems examined were Microsoft Copilot, Google PaLM, and Assistant, which were individually prompted to award rating scores to the eight major dimensions of usability, effectiveness, and efficency, namely, (1) content/course quality, (2) pedagogical design, (3) learner support, (4) technology infrastructure, (5) social interaction, (6) learner engagement, (7) instructor support, and (8) cost-effectiveness of 100 popular mobile education apps. A paired sample t-test was applied to the rating score difference in each of the above eight dimensions between each GenAI system pair out of the above three GenAI systems over all the 100 apps. Then, Cronbach’s coefficient alpha of the rating scores was computed for each of the above eight dimensions between all the three GenAI systems across all the 100 apps. The computational results were to confirm whether the GenAI systems, with respect to each other, systematically overrated or underrated any dimension over the 100 apps and whether there were high consistency between and thus convergent validity of the three GenAI systems in evaluating each dimension across the 100 apps. Among other collateral finding, it was revealed that the consistency between and thus the convergent validity of the three GenAI systems was basically sufficiently high in evaluating all the eight dimensions across the 100 apps, with those in the dimension (8) cost-effectiveness being at least marginally high enough, and thus the three GenAI systems may be rather reliable in evaluating all the eight usability, effectiveness, and efficiency dimensions across mobile education apps.

Original languageEnglish
Title of host publicationProceedings of the International Conferences on Mobile Learning 2025 and Educational Technologies 2025
EditorsInmaculada Arnedillo Sanchez, Piet Kommers, Tomayess Issa, Pedro Isaias, Luis Rodrigues
PublisherIADIS
Pages151-161
Number of pages11
ISBN (Electronic)9789898704665
Publication statusPublished - 2025
Event21st International Conferences on Mobile Learning, ML 2025 and 10th International Conference on Educational Technologies, ICEduTech 2025 - Madeira Island, Portugal
Duration: 1 Mar 20253 Mar 2025

Publication series

NameProceedings of the International Conferences on Mobile Learning 2025 and Educational Technologies 2025

Conference

Conference21st International Conferences on Mobile Learning, ML 2025 and 10th International Conference on Educational Technologies, ICEduTech 2025
Country/TerritoryPortugal
CityMadeira Island
Period1/03/253/03/25

Keywords

  • Convergent Validity
  • Evaluation
  • Generative Artificial Intelligence (Genai)
  • Mobile Education Apps

Fingerprint

Dive into the research topics of 'EVALUATION OF MOBILE EDUCATION APPS BY POPULAR GENERATIVE ARTIFICIAL INTELLIGENCE (GENAI) SYSTEMS: AN EXTENDED STUDY ON ITS CONSISTENCY AND THUS CONVERGENT VALIDITY'. Together they form a unique fingerprint.

Cite this