Skip to main navigation Skip to search Skip to main content

STT: Soft Template Tuning for Few-Shot Adaptation

  • SUNY Buffalo
  • Microsoft USA
  • Adobe Systems Incorporated

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Prompt tuning has been an extremely effective tool to adapt a pre-trained model to downstream tasks. However, standard prompt-based methods mainly consider the case of sufficient data of downstream tasks. It is still unclear whether the advantage can be transferred to the few-shot regime, where only limited data are available for each downstream task. Although some works have demonstrated the potential of prompt-tuning under the few-shot setting, the main stream methods via searching discrete prompts or tuning soft prompts with limited data are still very challenging. Through extensive empirical studies, we find that there is still a gap between prompt tuning and fully fine-tuning for few-shot learning. To bridge the gap, we propose a new prompt-tuning framework, called Soft Template Tuning (STT) 1. STT combines manual and auto prompts, and treats down-stream classification tasks as a masked language modeling task. Comprehensive evaluation on different settings suggests STT can close the gap between fine-tuning and prompt-based methods without introducing additional parameters. Significantly, it can even outperform the time- and resource-consuming fine-tuning method on sentiment classification tasks.

Original languageEnglish
Title of host publicationProceedings - 22nd IEEE International Conference on Data Mining Workshops, ICDMW 2022
EditorsK. Selcuk Candan, Thang N. Dinh, My T. Thai, Takashi Washio
PublisherIEEE Computer Society
Pages941-946
Number of pages6
ISBN (Electronic)9798350346091
DOIs
StatePublished - 2022
Event22nd IEEE International Conference on Data Mining Workshops, ICDMW 2022 - Orlando, United States
Duration: Nov 28 2022Dec 1 2022

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume2022-November

Conference

Conference22nd IEEE International Conference on Data Mining Workshops, ICDMW 2022
Country/TerritoryUnited States
CityOrlando
Period11/28/2212/1/22

Keywords

  • NLP
  • few-shot learning
  • language model
  • prompt-tuning

Fingerprint

Dive into the research topics of 'STT: Soft Template Tuning for Few-Shot Adaptation'. Together they form a unique fingerprint.

Cite this