ChatGPT를 활용한 서술형 문항 생성 프로토콜과 문항의 질 평가: 국어과 사례를 중심으로
Protocol for Developing Constructed-Response Items Using ChatGPT and Quality Evaluation: Focusing on Korean Language Arts

Abstract

This study explores prompt guidelines for generating constructed-response items with ChatGPT and compares the quality of the AI-generated items with that of human-developed items. Based on a literature review and trials of various prompting strategies, we developed a three-step prompt guideline for generating both constructed-response items and their scoring rubrics. We then compared the ChatGPT-generated items with human-developed items on 20 evaluation criteria. The ChatGPT-generated items were of significantly lower quality than the human-developed items, particularly in task clarity, alignment with reading materials, compliance with the national curriculum, and reliability of scoring criteria. However, no significant difference was observed between the two sets of items in areas such as vocabulary appropriateness and the alignment among learning objectives, task content, and scoring criteria. Finally, we discuss implications for using AI in item generation and for designing human-AI collaborative assessments.

Keywords

constructed-response items, item quality, ChatGPT, human-AI collaborative assessment, formative assessment
Authors
함은혜, 박소영, 이병윤, 김기동, 이대형
DOI
10.30916/KERA.62.8.63
Publication date
2024-12
Journal
교육학연구, vol. 62, no. 8
Pages
63-93