Written by Human or ChatGPT - Authorship Forensics in the Era of Generative AI

Robert Schmidt, Greg Fredin, Kevin Haghighat, Rita Kuo, Maiga Chang

    Research output: Chapter in Book/Report/Conference proceedingPublished Conference contributionpeer-review

    Abstract

    In this evolving landscape of text generation, distinguishing between human-written and ChatGPT-generated content has become increasingly important. This paper presents a novel approach to authorship attribution, leveraging both Statistical Natural Language Processing (SNLP) and Convolutional Neural Networks (CNN) techniques to differentiate between documents written by humans and ChatGPTs. The research uses 212 abstracts of academic papers written and published by the research group as the human-written set and asks both ChatGPT 3.5 and 4 to generate corresponding abstracts based on paper titles as AI-written set. Models are trained on a laptop to classify human and AI-written abstract texts in 2-class (i.e., human and ChatGPT) and 3-class (i.e., human, ChatGPT 3.5, and ChatGPT 4) based on their part-of-speech tag frequency distribution patterns. The 2-class model is well-trained in less than ONE minute (i.e., 56.82 seconds) and the 3 -class model is welltrained in 7 minutes and 26.076 seconds. The results demonstrate a significant ability of the models to distinguish between human and AI-written text, with precision 0.9682 (F0.5 score 0.95) for the 2-class (human and ChatGPT) testing subset and precision 0.9806 (F0.5 score 0.96) in the 3-class (human, ChatGPT 3.5, and ChatGPT 4) testing subset. The proposed 3 -stage Authorship Forensics approach has been implemented as an open access web application to allow teachers and users to either train their own models or use the existing trained model to get some advice on how the model considers a piece of given text written by human or AI.

    Original languageEnglish
    Title of host publication2025 13th International Conference on Information and Education Technology, ICIET 2025
    Pages441-445
    Number of pages5
    ISBN (Electronic)9798331537845
    DOIs
    Publication statusPublished - 2025
    Event13th International Conference on Information and Education Technology, ICIET 2025 - Fukuyama, Japan
    Duration: 18 Apr. 202520 Apr. 2025

    Publication series

    Name2025 13th International Conference on Information and Education Technology, ICIET 2025

    Conference

    Conference13th International Conference on Information and Education Technology, ICIET 2025
    Country/TerritoryJapan
    CityFukuyama
    Period18/04/2520/04/25

    Keywords

    • ChatGPT
    • Convolutional Neural Networks
    • Natural Language Processing
    • Neural NLP
    • Part of Speech
    • Statistical NLP

    Fingerprint

    Dive into the research topics of 'Written by Human or ChatGPT - Authorship Forensics in the Era of Generative AI'. Together they form a unique fingerprint.

    Cite this