Skip to main navigation Skip to search Skip to main content

The Consistent Lack of Variance of Psychological Factors Expressed by LLMs and Spambots

  • Vasudha Varadarajan
  • , Salvatore Giorgi
  • , Siddharth Mangalik
  • , Nikita Soni
  • , David M. Markowitz
  • , H. Andrew Schwartz
  • Stony Brook University
  • University of Pennsylvania
  • Michigan State University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In recent years, the proliferation of chatbots like ChatGPT and Claude has led to an increasing volume of AI-generated text. While the text itself is convincingly coherent and human-like, the variety of expressed of human attributes may still be limited. Using theoretical individual differences, the fundamental psychological traits which distinguish people, this study reveals a distinctive characteristic of such content: AI-generations exhibit remarkably limited variation in inferrable psychological traits compared to human-authored texts. We present a review and study across multiple datasets spanning various domains. We find that AI-generated text consistently models the authorship of an "average" human with such little variation that, on aggregate, it is clearly distinguishable from human-written texts using unsupervised methods (i.e., without using ground truth labels). Our results show that (1) fundamental human traits are able to accurately distinguish human- and machine-generated text and (2) current generation capabilities fail to capture a diverse range of human traits.

Original languageEnglish
Title of host publicationGenAIDetect 2025 - Proceedings of the 1st Workshop on GenAI Content Detection, Proceedings of the Workshop - 31st International Conference on Computational Linguistics, COLING 2025
EditorsFiroj Alam, Preslav Nakov, Nizar Habash, Iryna Gurevych, Shammur Chowdhury, Artem Shelmanov, Yuxia Wang, Ekaterina Artemova, Mucahid Kutlu, George Mikros
PublisherAssociation for Computational Linguistics (ACL)
Pages111-119
Number of pages9
ISBN (Electronic)9798891762053
StatePublished - 2025
Event1st Workshop on GenAI Content Detection, GenAIDetect 2025 - Abu Dhabi, United Arab Emirates
Duration: Jan 19 2025 → …

Publication series

NameProceedings - International Conference on Computational Linguistics, COLING

Conference

Conference1st Workshop on GenAI Content Detection, GenAIDetect 2025
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period01/19/25 → …

Fingerprint

Dive into the research topics of 'The Consistent Lack of Variance of Psychological Factors Expressed by LLMs and Spambots'. Together they form a unique fingerprint.

Cite this