Skip to main navigation Skip to search Skip to main content

A Report on the FigLang 2024 Shared Task on Multimodal Figurative Language

  • Columbia University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

We present the outcomes of the Multimodal Figurative Language Shared Task held at the 4th Workshop on Figurative Language Processing (FigLang 2024) co-located at NAACL 2024. The task utilized the V-FLUTE dataset (Saakyan et al., 2024) which is comprised of <image, text> pairs that use figurative language and includes detailed textual explanations for the entailment or contradiction relationship of each pair. The challenge for participants was to develop models capable of accurately identifying the visual entailment relationship in these multimodal instances and generating persuasive free-text explanations. The results showed that the participants’ models significantly outperformed the initial baselines in both automated and human evaluations. We also provide an overview of the systems submitted and analyze the results of the evaluations. All participating systems outperformed the LLaVA-ZS baseline, provided by us in F1-score.

Original languageEnglish
Title of host publicationFigLang 2024 - 4th Workshop on Figurative Language Processing, Proceedings of the Workshop
EditorsDebanjan Ghosh, Smaranda Muresan, Anna Feldman, Tuhin Chakrabarty, Emmy Liu
PublisherAssociation for Computational Linguistics (ACL)
Pages115-119
Number of pages5
ISBN (Electronic)9798891761100
DOIs
StatePublished - 2024
Event4th Workshop on Figurative Language Processing, FigLang 2024 - Hybrid, Mexico City, Mexico
Duration: Jun 21 2024 → …

Publication series

NameFigLang 2024 - 4th Workshop on Figurative Language Processing, Proceedings of the Workshop

Conference

Conference4th Workshop on Figurative Language Processing, FigLang 2024
Country/TerritoryMexico
CityHybrid, Mexico City
Period06/21/24 → …

Fingerprint

Dive into the research topics of 'A Report on the FigLang 2024 Shared Task on Multimodal Figurative Language'. Together they form a unique fingerprint.

Cite this