Skip to main navigation Skip to search Skip to main content

Deep metric learning with data summarization

  • Wenlin Wang
  • , Changyou Chen
  • , Wenlin Chen
  • , Piyush Rai
  • , Lawrence Carin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

We present Deep Stochastic Neighbor Compression (DSNC), a framework to compress training data for instance-based methods (such as k-nearest neighbors). We accomplish this by inferring a smaller set of pseudo-inputs in a new feature space learned by a deep neural network. Our framework can equivalently be seen as jointly learning a nonlinear distance metric (induced by the deep feature space) and learning a compressed version of the training data. In particular, compressing the data in a deep feature space makes DSNC robust against label noise and issues such as within-class multi-modal distributions. This leads to DSNC yielding better accuracies and faster predictions at test time, as compared to other competing methods. We conduct comprehensive empirical evaluations, on both quantitative and qualitative tasks, and on several benchmark datasets, to show its effectiveness as compared to several baselines.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2016, Proceedings
EditorsJilles Giuseppe, Niels Landwehr, Giuseppe Manco, Paolo Frasconi
PublisherSpringer Verlag
Pages777-794
Number of pages18
ISBN (Print)9783319461274
DOIs
StatePublished - 2016
Event15th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2016 - Riva del Garda, Italy
Duration: Sep 19 2016Sep 23 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9851 LNAI

Conference

Conference15th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2016
Country/TerritoryItaly
CityRiva del Garda
Period09/19/1609/23/16

Fingerprint

Dive into the research topics of 'Deep metric learning with data summarization'. Together they form a unique fingerprint.

Cite this