Skip to main navigation Skip to search Skip to main content

PASTA: A Dataset for Modeling PArticipant STAtes in Narratives

  • Stony Brook University
  • University of Maryland, Baltimore County
  • United States Naval Academy

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The events in a narrative are understood as a coherent whole via the underlying states of their participants. Often, these participant states are not explicitly mentioned, instead left to be inferred by the reader. A model that understands narratives should likewise infer these implicit states, and even reason about the impact of changes to these states on the narrative. To facilitate this goal, we introduce a new crowdsourced English-language, Participant States dataset, PASTA. This dataset contains inferable participant states; a coun-terfactual perturbation to each state; and the changes to the story that would be necessary if the counterfactual were true. We introduce three state-based reasoning tasks that test for the ability to infer when a state is entailed by a story, to revise a story conditioned on a counterfactual state, and to explain the most likely state change given a revised story. Ex-periments show that today’s LLMs can reason about states to some degree, but there is large room for improvement, especially in problems requiring access and ability to reason with diverse types of knowledge (e.g., physical, numerical, factual).

Original languageEnglish
Pages (from-to)1283-1300
Number of pages18
JournalTransactions of the Association for Computational Linguistics
Volume11
DOIs
StatePublished - Nov 2 2023

Fingerprint

Dive into the research topics of 'PASTA: A Dataset for Modeling PArticipant STAtes in Narratives'. Together they form a unique fingerprint.

Cite this