Skip to main navigation Skip to search Skip to main content

Discourse relation prediction: Revisiting word pairs with convolutional networks

  • Columbia University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

28 Scopus citations

Abstract

Word pairs across argument spans have been shown to be effective for predicting the discourse relation between them. We propose an approach to distill knowledge from word pairs for discourse relation classification with convolutional neural networks by incorporating joint learning of implicit and explicit relations. Our novel approach of representing the input as word pairs achieves state-of-the-art results on four-way classification of both implicit and explicit relations as well as one of the binary classification tasks. For explicit relation prediction, we achieve around 20% error reduction on the four-way task. At the same time, compared to a two-layered Bi-LSTM-CRF model, our model is able to achieve these results with half the number of learnable parameters and approximately half the amount of training time.

Original languageEnglish
Title of host publicationSIGDIAL 2019 - 20th Annual Meeting of the Special Interest Group Discourse Dialogue - Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages442-452
Number of pages11
ISBN (Electronic)9781950737611
DOIs
StatePublished - 2019
Event20th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2019 - Stockholm, Sweden
Duration: Sep 11 2019Sep 13 2019

Publication series

NameSIGDIAL 2019 - 20th Annual Meeting of the Special Interest Group Discourse Dialogue - Proceedings of the Conference

Conference

Conference20th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2019
Country/TerritorySweden
CityStockholm
Period09/11/1909/13/19

Fingerprint

Dive into the research topics of 'Discourse relation prediction: Revisiting word pairs with convolutional networks'. Together they form a unique fingerprint.

Cite this