Semi-Automatic Labeling of Online Course Discussion Posts

Dharamjit Parmar, Ali Dewan, Dunwei Wen, Oscar Lin

Research output: Contribution to conferencePaperpeer-review

Abstract

Automatic text labeling is crucial in diverse text analysis applications, which require high precision. This paper presents a semi-automatic approach for text labeling by combining deep learning methods with human interventions. We employ bidirectional long short-term memory (Bi-LSTM) and convolutional neural network (CNN) to generate initial labels of online course discussion posts, which we then refine through a structured human-in-the-loop feedback mechanism. This semi-automatic and iterative process in labeling the forum posts reduces time for the labeling task and enhances the reliability of the training data to be used in real application of text classification. We evaluated our framework on a MOOC course dataset, demonstrating significant improvement in model performance. The results under-score the potential of integrating human expertise to complement and augment machine learning in automating the labeling tasks of the training data, paving the way for more reliable and robust applications of text analysis, especially in education.
Original languageCanadian English
Publication statusPublished - 22 Jun. 2025
EventInternational Conference on Human-Computer Interaction, HCII 2025 - Gothenburg, Sweden
Duration: 22 Jun. 202527 Jun. 2025

Conference

ConferenceInternational Conference on Human-Computer Interaction, HCII 2025
Abbreviated titleHCII 2025
Country/TerritorySweden
CityGothenburg
Period22/06/2527/06/25

Keywords

  • Text labeling, cognitive engagement, deep learning, Bi-LSTM, CNN, text classification, online learning, course discussion posts

Fingerprint

Dive into the research topics of 'Semi-Automatic Labeling of Online Course Discussion Posts'. Together they form a unique fingerprint.

Cite this