Digitization of Printed Multiple Choice Questions Using Object Detection Methods: A Yolov7-Based Approach

Digitization of Printed Exam Questions and Data Extraction Using the YOLOv7 Algorithm

  • Yiğit Çetinkaya Havelsan
  • Adem Tekerek Gazi University, Faculty of Technology, Department of Computer Engineering, Ankara, Turkey https://orcid.org/0000-0002-0880-7955
Keywords: Convolutional Neural Networks, Real-Time Object Detection, YOLO

Abstract

In education, multiple-choice questions are widely used in exams. Hard copies of multiple-choice questions are usually used to prepare students for exams. However, when hard-copy questions are intended to be used in electronic environments, difficulties are experienced because editing cannot be done, and statistical and mathematical operations cannot be performed with hard-copy questions. This study uses object detection methods to transfer hard-copy multiple-choice questions to digital media. Within the scope of the study, a new approach is presented to determine the locations of multiple-choice questions and convert them to text using the YOLOv7 algorithm. There are 16 different class labels, such as question stem, question number, and question options, in the proposed YOLO model. During the data preprocessing phase, 1,140 images were manually labeled using the Roboflow image data labeling system, resulting in the creation of a unique dataset. As a result of model training, it was seen that the model classified approximately 94% of the questions correctly. The data detected with the developed model was converted to XML format using the algorithms of the YOLOv7 library. Educational institutions can use this approach to extract data and understand visual information.

Author Biography

Adem Tekerek, Gazi University, Faculty of Technology, Department of Computer Engineering, Ankara, Turkey

Assoc. Prof. Adem Tekerek is a faculty member in the Department of Computer Engineering at Gazi University, specializing in artificial intelligence, machine learning, and data mining. He has a strong research background, with numerous publications focused on cybersecurity and advanced computational techniques. His work often addresses real-world problems, contributing significantly to the fields of engineering and technology. Prof. Tekerek's research and publications reflect his commitment to developing innovative solutions in the rapidly evolving landscape of computer science.

Published
29-12-2024
How to Cite
Çetinkaya, Y., & Tekerek, A. (2024). Digitization of Printed Multiple Choice Questions Using Object Detection Methods: A Yolov7-Based Approach. Artificial Intelligence Studies, 7(2), 66-82. https://doi.org/10.30855/AIS.2024.07.02.04
Section
Articles