A Framework for Detecting AI-Generated Text in Research Publications

Authors

  • Paria Sarzaeim
  • Arya Doshi
  • Qusay Mahmoud Ontario Tech University

DOI:

https://doi.org/10.58190/icat.2023.28

Keywords:

Generative artificial intelligence, research papers, machine learning, AI-generated text

Abstract

The use of generative artificial intelligence is becoming increasingly prevalent in creating content in various formats such as text, video, and image. However, there is a need to distinguish between content that has been generated by humans and content that has been generated by AI as misuse of these technologies can raise scientific and social challenges. Moreover, there are concerns about the reliability and comprehensiveness of the content generated by AI without human validation. This paper presents a framework for AI-generated text. The prototype implementation of the proposed approach is to train a model using predefined datasets and deploy this model on a cloud-based service to predict whether a text was created by a human or AI. This approach is specifically focused on assessing the accuracy of scientific writings and research papers rather than general text. The proposed framework is compared with recently developed tools such as OpenAI Text Classifier, ZeroGPT, and Turnitin. The results show that training a text classifier can be highly useful in detecting whether a text is written by a human or AI. The source code and dataset are made open source so others can experiment with the prototype implementation and use it for future research.

Downloads

Published

2023-09-26

How to Cite

Sarzaeim, P., Doshi, A., & Mahmoud, Q. (2023). A Framework for Detecting AI-Generated Text in Research Publications. Proceedings of the International Conference on Advanced Technologies, 11, 121–127. https://doi.org/10.58190/icat.2023.28