GAI Versus Teacher Scoring: Which is Better for Assessing Student Performance?

Zappatore M.
2025-01-01

Abstract

The integration of generative artificial intelligence (GAI) into educational settings offers unprecedented opportunities to enhance the efficiency of teaching and the effectiveness of learning, particularly within online platforms. This study evaluates the development and application of a customized GAI-powered teaching assistant, trained specifically to enhance teaching efficiency for educators and improve learning outcomes for students in online education. Using four Grade 12 courses (i.e., English, Mathematics, Financial Accounting, and Simplified Chinese), we assessed the performance of generative pretrained transformer (GPT)-4, GPT-4o, and the Trained-GPT model. Results demonstrate that the Trained-GPT achieved grading accuracy and consistency comparable to human teachers, with strong correlations observed in Mathematics (0.996) and English (0.874). While GPT-4o performed well in specific cases, its variability highlights areas for improvement. These findings underscore the potential of AI-powered teaching assistants to streamline grading, deliver timely feedback, and support scalable, high-quality online education.
Files in this record:
2025 - IEEE TLT.pdf (authorized users only)
Description: Article
Type: Publisher's version
License: NOT PUBLIC - Private/restricted access
Size: 762.42 kB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11587/566206