Skip to main navigation Skip to search Skip to main content

XES3G5M: A Knowledge Tracing Benchmark Dataset with Auxiliary Information

  • Zitao Liu
  • , Qiongqiong Liu*
  • , Teng Guo*
  • , Jiahao Chen
  • , Shuyan Huang
  • , Xiangyu Zhao
  • , Jiliang Tang
  • , Weiqi Luo
  • , Jian Weng
  • *Corresponding author for this work
  • Jinan University
  • TAL Education Group
  • City University of Hong Kong
  • Michigan State University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Knowledge tracing (KT) is a task that predicts students' future performance based on their historical learning interactions.With the rapid development of deep learning techniques, existing KT approaches follow a data-driven paradigm that uses massive problem-solving records to model students' learning processes.However, although the educational contexts contain various factors that may have an influence on student learning outcomes, existing public KT datasets mainly consist of anonymized ID-like features, which may hinder the research advances towards this field.Therefore, in this work, we present, XES3G5M, a large-scale dataset with rich auxiliary information about questions and their associated knowledge components (KCs)2.The XES3G5M dataset is collected from a real-world online math learning platform, which contains 7, 652 questions, and 865 KCs with 5, 549, 635 interactions from 18, 066 students.To the best of our knowledge, the XES3G5M dataset not only has the largest number of KCs in math domain but contains the richest contextual information including tree structured KC relations, question types, textual contents and analysis and student response timestamps.Furthermore, we build a comprehensive benchmark on 19 state-of-the-art deep learning based knowledge tracing (DLKT) models.Extensive experiments demonstrate the effectiveness of leveraging the auxiliary information in our XES3G5M with DLKT models.We hope the proposed dataset can effectively facilitate the KT research work.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
EditorsA. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
PublisherNeural information processing systems foundation
ISBN (Electronic)9781713899921
StatePublished - 2023
Externally publishedYes
Event37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, United States
Duration: 10 Dec 202316 Dec 2023

Publication series

NameAdvances in Neural Information Processing Systems
Volume36
ISSN (Print)1049-5258

Conference

Conference37th Conference on Neural Information Processing Systems, NeurIPS 2023
Country/TerritoryUnited States
CityNew Orleans
Period10/12/2316/12/23

Fingerprint

Dive into the research topics of 'XES3G5M: A Knowledge Tracing Benchmark Dataset with Auxiliary Information'. Together they form a unique fingerprint.

Cite this