Automated Grading of Students' Handwritten Graphs: A Comparison of Meta-Learning and Vision-Large Language Models

πŸ“… 2025-07-03
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study addresses the challenge of automated grading of handwritten diagram-based assignments in STEM courses. We propose a novel assessment framework integrating multimodal meta-learning with vision-language large models (VLLMs). To our knowledge, this is the first systematic comparative analysis of these two paradigms on handwritten graph recognition and classification tasks, revealing their complementary strengths: meta-learning achieves higher accuracy on binary classification, whereas VLLMs slightly outperform on ternary classification but exhibit lower stability. Methodologically, we jointly leverage image processing and textual understanding techniques to enable end-to-end feature modeling and fine-grained scoring of handwritten diagram–text hybrid submissions. Evaluated on a real-world educational dataset, our approach establishes an interpretable and scalable paradigm for AI-assisted assessment, significantly improving consistency and efficiency in online mathematics education evaluation.

Technology Category

Application Category

πŸ“ Abstract
With the rise of online learning, the demand for efficient and consistent assessment in mathematics has significantly increased over the past decade. Machine Learning (ML), particularly Natural Language Processing (NLP), has been widely used for autograding student responses, particularly those involving text and/or mathematical expressions. However, there has been limited research on autograding responses involving students' handwritten graphs, despite their prevalence in Science, Technology, Engineering, and Mathematics (STEM) curricula. In this study, we implement multimodal meta-learning models for autograding images containing students' handwritten graphs and text. We further compare the performance of Vision Large Language Models (VLLMs) with these specially trained metalearning models. Our results, evaluated on a real-world dataset collected from our institution, show that the best-performing meta-learning models outperform VLLMs in 2-way classification tasks. In contrast, in more complex 3-way classification tasks, the best-performing VLLMs slightly outperform the meta-learning models. While VLLMs show promising results, their reliability and practical applicability remain uncertain and require further investigation.
Problem

Research questions and friction points this paper is trying to address.

Autograding handwritten graphs in STEM education
Comparing meta-learning and VLLMs for graph grading
Evaluating model performance on real-world student datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal meta-learning for handwritten graph grading
Comparison of meta-learning with Vision-LLMs
Performance evaluation on real-world student dataset
πŸ”Ž Similar Papers
No similar papers found.
Behnam Parsaeifard
Behnam Parsaeifard
University of Basel
Computational physicsMachine learningStatistical physics
Martin Hlosta
Martin Hlosta
Swiss Distance University of Applied Sciences
Machine LearningLearning AnalyticsEducational Data MiningAI in education
P
Per Bergamin
Institute for Research in Open-, Distance- and eLearning, Swiss Distance University of Applied Sciences, Brig, CH-3900, Switzerland; North-West University, Potchefstroom, 2531, South Africa